Open AccessPosted Content
Is your function low-dimensional?
TL;DR: It is shown that the class of linear $k$-juntas is not testable, but adding a surface area constraint makes it testable: a $\mathsf{poly}(k \cdot s/\epsilon)$-query non-adaptive tester for linear £k-junta with surface area at most $s$.
read more
Abstract: We study the problem of testing if a function depends on a small number of linear directions of its input data. We call a function $f$ a linear $k$-junta if it is completely determined by some $k$-dimensional subspace of the input space. In this paper, we study the problem of testing whether a given $n$ variable function $f : \mathbb{R}^n \to \{0,1\}$, is a linear $k$-junta or $\epsilon$-far from all linear $k$-juntas, where the closeness is measured with respect to the Gaussian measure on $\mathbb{R}^n$. Linear $k$-juntas are a common generalization of two fundamental classes from Boolean function analysis (both of which have been studied in property testing) $\textbf{1.}$ $k$- juntas which are functions on the Boolean cube which depend on at most k of the variables and $\textbf{2.}$ intersection of $k$ halfspaces, a fundamental geometric concept class.
We show that the class of linear $k$-juntas is not testable, but adding a surface area constraint makes it testable: we give a $\mathsf{poly}(k \cdot s/\epsilon)$-query non-adaptive tester for linear $k$-juntas with surface area at most $s$. We show that the polynomial dependence on $s$ is necessary. Moreover, we show that if the function is a linear $k$-junta with surface area at most $s$, we give a $(s \cdot k)^{O(k)}$-query non-adaptive algorithm to learn the function up to a rotation of the basis. In particular, this implies that we can test the class of intersections of $k$ halfspaces in $\mathbb{R}^n$ with query complexity independent of $n$.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Journal Article
An Invariance Principle for Polytopes.
TL;DR: In this paper, it was shown that for any polytope P formed by the intersection of k halfspaces, it is possible to learn a pseudorandom generator in O(log n poly(log k, 1/δ) time.
Junta Correlation is Testable
Anindya De,Elchanan Mossel,Joe Neeman +2 more
- 01 Nov 2019
TL;DR: An algorithm which given distance parameters c, d, and oracle access to a Boolean function f on the hypercube, has query complexity exp(k).poly(1/(c-d)) and distinguishes between the following cases: 1. The distance of f from any k-junta is at least c; 2. There is a k-Junta g which has distance at most d from f.
23
•Posted Content
Learning Polynomials of Few Relevant Dimensions
Sitan Chen,Raghu Meka +1 more
TL;DR: A new filtered PCA approach is introduced to get a warm start for the true subspace and geodesic SGD to boost to arbitrary accuracy; the techniques may be of independent interest, especially for problems dealing with subspace recovery or analyzing SGD on manifolds.
20
•Posted Content
Distribution-Free Testing of Linear Functions on R^n.
Noah Fleming,Yuichi Yoshida +1 more
TL;DR: It is shown that, given query access to f, sampling access to the unknown distribution as well as the standard Gaussian, and eps>0, it is possible to distinguish additive functions from functions that are eps-far from additive functions with O((1/eps)log( 1/eps)) queries, independent of n.
9
•Journal Article
Learning functions of halfspaces using prefix covers
TL;DR: This work identifies a new structural property of Boolean functions that yields learnability with queries: that of having a small prefix cover, which gives the first poly(n, 1/e) algorithm for learning even the intersection of 2 halfspaces under the uniform distribution on {0, 1}; previously known algorithms had an exponential dependence either on the accuracy parameter e or the dimension n.
7
References
Model compression
Cristian Buciluǎ,Rich Caruana,Alexandru Niculescu-Mizil +2 more
- 20 Aug 2006
TL;DR: This work presents a method for "compressing" large, complex ensembles into smaller, faster models, usually without significant loss in performance.
3K
•Book
Introduction to matrix computations
G. W. Stewart
- 11 Jun 1973
TL;DR: Rounding-Error Analysis of Solution of Triangular Systems and of Gaussian Elimination.
2.5K
•Posted Content
Do Deep Nets Really Need to be Deep
Lei Jimmy Ba,Rich Caruana +1 more
TL;DR: This paper showed that shallow feed-forward networks can learn the complex functions previously learned by deep networks and achieve accuracies previously only achievable with deep models, and in some cases the shallow neural nets can learn these deep functions using a total number of parameters similar to the original deep model.
1.1K
Robust Characterizations of Polynomials withApplications to Program Testing
Ronitt Rubinfeld,Madhu Sudan +1 more
TL;DR: The characterizations provide results in the area of coding theory by giving extremely fast and efficient error-detecting schemes for some well-known codes and play a crucial role in subsequent results on the hardness of approximating some NP-optimization problems.
1.1K