9 Gaussian mixture models
To be added.
Bertsimas, D., A. King, and A. Mazumder. 2016. “Best Subset Selection via a Modern Optimization Lens.” The Annals of Statistics 44 (2): 813–52.
Bertsimas, D., and B. Van Parys. 2020. “Sparse High-Dimensional Regression: Exact Scalable Algorithms and Phase Transitions.” Annals of Statistics 48 (1): 300–323.
Bühlmann, P., and S. van de Geer. 2011. Statistics for High-Dimensional Data. New York: Springer.
Calon, A., E. Espinet, S. Palomo-Ponce, D. V. F. Tauriello, M. Iglesias, M. V. Céspedes, M. Sevillano, et al. 2012. “Dependency of Colorectal Cancer on a TGF-Beta-Driven Programme in Stromal Cells for Metastasis Initiation.” Cancer Cell 22 (5): 571–84.
Carbonetto, Peter, and Matthew Stephens. 2012. “Scalable Variational Inference for Bayesian Variable Selection in Regression, and Its Accuracy in Genetic Association Studies.” Bayesian Analysis 7 (1): 73–108.
Castillo, I., J. Schmidt-Hieber, and A. W. van der Vaart. 2015. “Bayesian Linear Regression with Sparse Priors.” The Annals of Statistics 43 (5): 1986–2018.
Chang, Hyunwoong, and Quan Zhou. 2025. “Dimension-Free Relaxation Times of Informed MCMC Samplers on Discrete Spaces.” Bernoulli (forthcoming): 1–28.
Chen, J., and Z. Chen. 2008. “Extended Bayesian Information Criteria for Model Selection with Large Model Spaces.” Biometrika 95 (3): 759–71.
Cover, Thomas M, and Jan M Van Campenhout. 2007. “On the Possible Orderings in the Measurement Selection Problem.” IEEE Transactions on Systems, Man, and Cybernetics 7 (9): 657–61.
Fan, J., and R. Li. 2001. “Variable Selection via Nonconcave Penalized Likelihood and Its Oracle Properties.” Journal of the American Statistical Association 96: 1348–60.
Foster, D., H. Karloff, and J. Thaler. 2015. “Variable Selection Is Hard.” In Conference on Learning Theory, 696–709.
Fúquene, J., M. F. J. Steel, and D. Rossell. 2019. “On Choosing Mixture Components via Non-Local Priors.” Journal of the Royal Statistical Society B 81 (5): 809–37.
Giannone, Domenico, Michele Lenza, and Giorgio E Primiceri. 2021. “Economic Predictions with Big Data: The Illusion of Sparsity.” Econometrica 89 (5): 2409–37.
Hazimeh, Hussein, Rahul Mazumder, and Tim Nonet. 2023. “L0learn: A Scalable Package for Sparse Learning Using L0 Regularization.” Journal of Machine Learning Research 24 (205): 1–8.
Hoeting, Jennifer A., David Madigan, Adrian E. Raftery, and Chris T. Volinsky. 1999. “Bayesian Model Averaging: A Tutorial.” Statistical Science 14: 382–401.
Johnson, V. E., and D. Rossell. 2010. “On the Use of Non-Local Prior Densities for Default Bayesian Hypothesis Tests.” Journal of the Royal Statistical Society B 72: 143–70.
———. 2012. “Bayesian Model Selection in High-Dimensional Settings.” Journal of the American Statistical Association 24 (498): 649–60.
Kass, R. E., L. Tierney, and J. B. Kadane. 1990. “The Validity of Posterior Expansions Based on Laplace’s Method.” Bayesian and Likelihood Methods in Statistics and Econometrics 7: 473–88.
Kiefer, Jack, and Jacob Wolfowitz. 1952. “Stochastic Estimation of the Maximum of a Regression Function.” The Annals of Mathematical Statistics 23 (3): 462–66.
Lindley, D. V. 1957. “A Statistical Paradox.” Biometrika 44: 187–92.
Linnainmaa, Seppo. 1970. “The Representation of the Cumulative Rounding Error of an Algorithm as a Taylor Expansion of the Local Rounding Errors.” PhD thesis, Master’s Thesis (in Finnish), University of Helsinki.
Madigan, D., and A. E. Raftery. 1994. “Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam’s Window.” Journal of the American Statistical Association 89 (428): 1535–46.
Natarajan, B. K. 1995. “Sparse Approximate Solutions to Linear Systems.” SIAM Journal on Computing 24 (2): 227–34.
Raskutti, G., M. Wainwright, and B. Yu. 2011. “Minimax Rates of Estimation for High-Dimensional Linear Regression over Balls.” Information Theory, IEEE Transactions on 57 (10): 6976–94.
Robbins, Herbert, and Sutton Monro. 1951. “A Stochastic Approximation Method.” The Annals of Mathematical Statistics 22 (9): 400–407.
Rognon-Vael, Paul, and David Rossell. 2025. “Empirical Bayes for Data Integration.” arXiv 2508.08336: 1–51.
Rognon-Vael, Paul, David Rossell, and Piotr Zwiernik. 2025. “Improving Variable Selection Properties by Leveraging External Data.” arXiv 2502.15584: 1–75.
Rosenblatt, Frank. 1958. “The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain.” Psychological Review 65 (6): 386–408.
Rossell, D., O. Abril, and A. Bhattacharya. 2021. “Approximate Laplace Approximations for Scalable Model Selection.” Journal of the Royal Statistical Society B 83 (4): 853–79.
Rossell, David. 2022. “Concentration of Posterior Model Probabilities and Normalized L0 Criteria.” Bayesian Analysis 17 (2): 565–91.
Rossell, D., and F. J. Rubio. 2018. “Tractable Bayesian Variable Selection: Beyond Normality.” Journal of the American Statistical Association 113 (524): 1742–58.
———. 2021. “Additive Bayesian Variable Selection Under Censoring and Misspecification.” Statistical Science 38 (1): 13–29.
Rossell, D., and D. Telesca. 2017. “Non-Local Priors for High-Dimensional Estimation.” Journal of the American Statistical Association 112: 254–65.
Rumelhart, David E, Geoffrey E Hinton, and Ronald J Williams. 1986. “Learning Representations by Back-Propagating Errors.” Nature 323 (6088): 533–36.
Schwarz, G. 1978. “Estimating the Dimension of a Model.” The Annals of Statistics 6: 461–64.
Scott, J. G., and J. O Berger. 2010. “Bayes and Empirical Bayes Multiplicity Adjustment in the Variable Selection Problem.” The Annals of Statistics 38 (5): 2587–2619.
Stone, M. 1977. “An Asymptotic Equivalence of Choice of Model by Cross-Validation and Akaike’s Criterion.” Journal of the Royal Statistical Society B 39: 44–47.
Tibshirani, R. 1996. “Regression Shrinkage and Selection via the Lasso.” Journal of the Royal Statistical Society, B 58: 267–88.
Wainwright, Martin J. 2009. “Information-Theoretic Limits on Sparsity Recovery in the High-Dimensional and Noisy Setting.” IEEE Transactions on Information Theory 55 (12): 5728–41.
Yang, Y., M. J. Wainwright, and M. I. Jordan. 2016. “On the Computational Complexity of High-Dimensional Bayesian Variable Selection.” The Annals of Statistics 44 (6): 2497–2532.
Zhang, Y., M. J. Wainwright, and M. I. Jordan. 2014. “Lower Bounds on the Performance of Polynomial-Time Algorithms for Sparse Linear Regression.” JMLR: Workshop and Conference Proceedings 35: 1–28.
Zhou, Quan, Jun Yang, Dootika Vats, Gareth O Roberts, and Jeffrey S Rosenthal. 2022. “Dimension-Free Mixing for High-Dimensional Bayesian Variable Selection.” Journal of the Royal Statistical Society B 84 (5): 1751–84.
Zou, H. 2006. “The Adaptive LASSO and Its Oracle Properties.” Journal of the American Statistical Association 101 (476): 1418–29.