1 of 1

Error Estimation for Random Fourier Features

1

Scientific Achievement

Our work develops a bootstrap approach to estimate the errors of Random Fourier Features (RFF) approximations. Advantages of our approach are: (1) error estimates are specific to the problem at hand; and (2) the approach enables adaptive computation, so that the user can quickly inspect the error of a rough kernel approximation and then predict how much extra work is needed.

Significance and Impact

Developing and improving methods for error estimation and uncertainty quantification for machine learning (ML) models is highly relevant to the DOE’s mission, and for society. Our proposed error estimation method will be applicable to a broad range of problems which involve kernel methods.

This figure illustrates a hypothetical scenario where it is possible to track the random variable that represents the RFF approximation error of as the number of features is increased over a grid ranging from 1 to 1600. The result is displayed with the red curve. Similarly, by repeating this experiment many times, a large collection of such random curves can be generated, and these are displayed in blue. In addition, the 90% quantile of the curves at each value of s is plotted in black. Hence, if the user had access to the black curve it would be possible to know if a given number of features is adequate. Despite the fact that none of the curves in Figure 1 are available to the user in practice, our work will show that, that we can compute a close approximation for the black curve.

Technical Approach

  • Conceptually, the proposed bootstrap method for error estimation is based on generating a collection of ``pseudo error variables'' that behave approximately like i.i.d. samples of the true unknown error variable.

  • Once the pseudo error variables have been generated, their empirical (1-α)-quantile can then be used to define the error estimate.

Yao, Junwen, N. Benjamin Erichson, and Miles E. Lopes. "Error Estimation for Random Fourier Features." In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics.

approximation error

number of features, s