A Hilbert space embedding of a distribution—in short, a kernel mean embedding—has recently emerged as a powerful tool for machine learning and statistical inference. The basic idea behind this framework is to map distributions into a reproducing kernel Hilbert space (RKHS) in which the whole arsenal of kernel methods can be extended to probability measures. It can be viewed as a generalization of the original “feature map” common to support vector machines (SVMs) and other kernel methods. In addition to the classical applications of kernel methods, the kernel mean embedding has found novel applications in fields ranging from probabilistic modeling to statistical inference, causal discovery, and deep learning. This survey aims to give a comprehensive review of existing work and recent advances in this research area, and to discuss challenging issues and open problems that could potentially lead to new research directions. The survey begins with a brief introduction to the RKHS and positive definite kernels which forms the backbone of this survey, followed by a thorough discussion of the Hilbert space embedding of marginal distributions, theoretical guarantees, and a review of its applications. The embedding of distributions enables us to apply RKHS methods to probability measures which prompts a wide range of applications such as kernel two-sample testing, independent testing, and learning on distributional data. Next, we discuss the Hilbert space embedding for conditional distributions, give theoretical insights, and review some applications. The conditional mean embedding enables us to perform sum, product, and Bayes' rules—which are ubiquitous in graphical model, probabilistic inference, and reinforcement learning— in a non-parametric way using this new representation of distributions. We then discuss relationships between this framework and other related areas. Lastly, we give some suggestions on future research directions.
Article navigation
28 June 2017
Research Article|
June 28 2017
Kernel Mean Embedding of Distributions: A Review and Beyond
Krikamol Muandet;
Krikamol Muandet
Department of Mathematics, Faculty of Science, 272 Rama VI Road, Ratchathewi, Bangkok 10400,
Thailand
Empirical Inference Department,
Max Planck Institute for Intelligent Systems
Spemannstraße 38, Tübingen 72076, Germany
Search for other works by this author on:
Kenji Fukumizu;
Kenji Fukumizu
Institute of Statistical Mathematics
, 10-3 Midoricho, Tachikawa, Tokyo 190-8562, Japan
Search for other works by this author on:
Bharath Sriperumbudur;
Bharath Sriperumbudur
Department of Statistics, Pennsylvania State University, University Park
, PA 16802, USA
Search for other works by this author on:
Bernhard Schölkopf
Bernhard Schölkopf
Max Planck Institute for Intelligent Systems
, Spemannstraße 38, Tübingen 72076, Germany
Search for other works by this author on:
Online ISSN: 1935-8245
Print ISSN: 1935-8237
© 2017 K. Muandet, K. Fukumizu, B. Sriperumbudur and B. Schölkopf
2017
K. Muandet, K. Fukumizu, B. Sriperumbudur and B. Schölkopf
Licensed re-use rights only
Foundations and Trends in Machine Learning (2017) 10 (1-2): 1–141.
Citation
Muandet K, Fukumizu K, Sriperumbudur B, Schölkopf B (2017), "Kernel Mean Embedding of Distributions: A Review and Beyond". Foundations and Trends in Machine Learning, Vol. 10 No. 1-2 pp. 1–141, doi: https://doi.org/10.1561/2200000060
Download citation file:
Suggested Reading
Instrumental identification of interventional distribution via moment recovery in a parametric outcome model
Asian Journal of Economics and Banking (November,2025)
Performance and accuracy analysis of semantic kernel functions
Program (February,2016)
Related Chapters
Smoothness: Bias and Efficiency of Nonparametric Kernel Estimators
Essays in Honor of Aman Ullah
35. Fuel kernel materials for the thorium cycle HTGR
Nuclear Fuel Performance
The Pricing Kernel Puzzle: Reconciling Index Option Data and Economic Theory
Derivative Securities Pricing and Modelling
Recommended for you
These recommendations are informed by your reading behaviors and indicated interests.
