Nonparametric distributed learning under general designs

Liu, Meimei; Shang, Zuofeng; Cheng, Guang

Nonparametric distributed learning under general designs

Files

euclid.ejs.1597975224.pdf (471.81 KB)

Downloads: 154

Date

2020-08-21

Authors

Liu, Meimei

Shang, Zuofeng

Cheng, Guang

Abstract

This paper focuses on the distributed learning in nonparametric regression framework. With sufficient computational resources, the efficiency of distributed algorithms improves as the number of machines increases. We aim to analyze how the number of machines affects statistical optimality. We establish an upper bound for the number of machines to achieve statistical minimax in two settings: nonparametric estimation and hypothesis testing. Our framework is general compared with existing work. We build a unified frame in distributed inference for various regression problems, including thin-plate splines and additive regression under random design: univariate, multivariate, and diverging-dimensional designs. The main tool to achieve this goal is a tight bound of an empirical process by introducing the Green function for equivalent kernels. Thorough numerical studies back theoretical findings.

Keywords

Computational limit, divide and conquer, kernel ridge regression, minimax optimality, nonparametric testing

Persistent link

http://hdl.handle.net/10919/101855

Collections

Scholarly Works, Statistics

Full item page

Nonparametric distributed learning under general designs

Files

TR Number

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

Persistent link

Collections