Nov 15, 2018

The International Prize in Statistics is awarded to Bradley Efron, Professor of Statistics and Biomedical Data Science at Stanford University, in recognition of the “bootstrap,” a method he developed in 1977 for assessing the uncertainty of scientific results that has had extraordinary and enormous impact across many scientific fields.

With the bootstrap, scientists were able to learn from limited data in a simple way that enabled them to assess the uncertainty of their findings. In essence, it became possible to simulate a potentially infinite number of datasets from an original dataset, and in looking at the differences, measure the uncertainty of the result from the original data analysis.

Made possible by computing, the bootstrap powered a revolution that placed statistics at the center of scientific progress. It helped to propel statistics beyond techniques that relied on complex mathematical calculations or unreliable approximations, and hence it enabled scientists to assess the uncertainty of their results in more realistic and feasible ways.

“Because the bootstrap is easy for a computer to calculate and is applicable in an exceptionally wide range of situations, the method has found use in many fields of science, technology, medicine, and public affairs,” says Sir David Cox, inaugural winner of the International Prize in Statistics. Indeed, Cornell University and EPAM Systems Inc. examined research databases worldwide and found that since 1980, the bootstrap (and multiple variations on the term, such as bootstrapping) has been cited in over 200,000 documents in more than 200 journals worldwide. Citations are found in fields like agricultural research, biochemistry, computer science, engineering, immunology, mathematics, medicine, physics and astronomy, and the social sciences.

“While statistics offers no magic pill for quantitative scientific investigations, the bootstrap is the best statistical pain reliever ever produced,” says Xiao-Li Meng, Whipple V. N. Jones Professor of Statistics at Harvard University [and IMS President]. “It has saved countless scientists and researchers the headache of finding a way to assess uncertainty in complex problems by providing a simple and practical way to do so in many seemingly hopeless situations.”

“The bootstrap was a quantum leap in statistical methodology that has enabled researchers to improve the lives of people everywhere,” says Sally Morton, Dean of the College of Science and Professor of Statistics at Virginia Tech. “Efron is a statistical poet of enormous beauty, applicability and impact.”

Brad Efron will accept the prize next summer at the 2019 World Statistics Congress in Kuala Lumpur.

Efron is Max H. Stein Professor of Humanities and Sciences, Professor of Statistics, and Professor of Biostatistics with the Department of Biomedical Data Science in the School of Medicine; he serves as Co-director of the Mathematical and Computational Sciences Program. He has held visiting faculty appointments at Harvard, UC Berkeley, and Imperial College, London. A recipient of a 2005 National Medal of Science for his contributions to theoretical and applied statistics, especially the bootstrap sampling technique, in 2014 he was awarded the Guy Medal in Gold by the Royal Statistical Society.  He served in 1988–89 as President of IMS, and in 2004 as President of the American Statistical Association.

### Background

Bradley Efron was born in May 1938 to Russian immigrants, and grew up in St. Paul, Minnesota. He credits his salesman father, Miles, for cultivating a love of math and science, in part through baseball and bowling scoring. “He kept track of these things,” says Efron, “so I grew up with a lot of numbers around me and that was very helpful—I was training to be a statistician without realizing it.” He won a national merit scholarship the year it was first introduced and went to Caltech. It was more than an intellectually eye-opening experience. “I’d never seen a mountain or an ocean,” says Efron.[1]

Initially, he thought he was going to become a mathematician, but he realized that abstract mathematics was not where his interests or talents lay. Enrolling in a PhD program at Stanford, he switched to statistics. “I remember when going into statistics that first year I thought ‘this will be pretty easy, I’ve dealt with math and that’s supposed to be hard.’ But statistics was much harder for me at the beginning than any other field. It took years before I felt really comfortable.”[2]

Statistics, by this point (the early 1950s), had been deeply immersed in decision theory—and the formal mathematical principles that explained inference. It was abstract, highly mathematical—and not especially concerned with applied problems. This was about to change under the synergistic influence of data analysis and computing. Statistics suddenly became deeply relevant to scientific research because it could answer questions previously unanswerable.

Statisticians grappled with the problems of outliers in datasets, limited data, and multiple unknowns, developing techniques through computers that were able to overcome the calculative limits of human thought. It was in this milieu that Efron was inspired by and built on the work of John Tukey, David Cox, and Rupert Miller to create the bootstrap.

The name was inspired by the 18th century fictional character Baron Munchausen, and a variation of a story where he pulls himself out of a swamp by his own bootstraps. In a similar vein, the statistician or scientist could now use their own data to assess the uncertainty in their own data.[3] Initially, his paper on the bootstrap was rejected for publication because it didn’t have enough theorems—so he added some at the end.[4]

“The truth is I didn’t think it was anything wonderful when I did it,” says Efron. “But it was one of those lucky ideas that is better than it seems at first view.”[5] It was a tool that many scientists could use, and use easily, especially as personal computing provided the power to do the number crunching. And it worked.

**

Footnotes

[1] ASA interview with Efron, October 2018

[2] Bradley Efron: A Conversation with Good Friends, Susan Holmes, Carl Morris, Rob Tibshirani and Bradley Efron, Statistical Science Vol. 18, No. 2, Silver Anniversary of the Bootstrap (May, 2003), pp. 268-281

[3] A Life in Statistics: Bradley Efron, Julian Champkin, Significance, 18 November 2010

[4] Bradley Efron: A Conversation with Good Friends, Susan P. Holmes et al, Statistical Science, May 2003

[5] ASA interview with Efron, October 2018

### The Bootstrap, simplified

Suppose you want to know the average household income in your city. You can’t afford a complete census so you randomly sample 100 households, record the 100 incomes, and take their average, say $29,308. That sounds very precise, but you would like some estimate of how accurate it really is. A straightforward, but impractical, approach would be to take several more random samples of 100 households, compute the average each time, and see how much the averages differed from each other. The bootstrap lets you approximate this impractical approach using only the original sample’s data. A bootstrap data set is a random sample of size 100 drawn from the original 100 incomes. You can imagine writing each of the original incomes on a slip of paper, putting the slips in a hat, and randomly drawing a slip out. Record the number, put the slip back into the hat, and repeat this process 99 more times. The result would be a bootstrap data set, and we can make as many bootstrap data sets as we wish, each time taking their average. Let’s say we do 250 of them, giving 250 bootstrap averages. The variability of the 250 averages is the bootstrap estimate of accuracy for the original estimate$29,308.

The same idea can be applied to find the accuracy of any statistic, say the median income instead of the average or, perhaps, something much more complicated, which makes the bootstrap ideal for the often elaborate statistical methods of modern scientific practice.

### About the International Prize in Statistics

The International Prize in Statistics recognizes a major achievement of an individual or team in the field of statistics and promotes understanding of the growing importance and diverse ways statistics, data analysis, probability and the understanding of uncertainty advance society, science, technology and human welfare. With a monetary award of \$80,000 USD, it is given every other year by the International Prize in Statistics Foundation, which is comprised of representatives from the American Statistical Association, International Biometric Society, Institute of Mathematical Statistics, International Statistical Institute and Royal Statistical Society. Recipients are chosen from a selection committee comprised of world-renowned academicians and researchers and officially presented with the award at the World Statistics Congress.

## Welcome!

Welcome to the IMS Bulletin website! We are developing the way we communicate news and information more effectively with members. The print Bulletin is still with us (free with IMS membership), and still available as a PDF to download, but in addition, we are placing some of the news, columns and articles on this blog site, which will allow you the opportunity to interact more. We are always keen to hear from IMS members, and encourage you to write articles and reports that other IMS members would find interesting. Contact the IMS Bulletin at bulletin@imstat.org

## What is “Open Forum”?

In the Open Forum, any IMS member can propose a topic for discussion. Email your subject and an opening paragraph (to bulletin@imstat.org) and we'll post it to start off the discussion. Other readers can join in the debate by commenting on the post. Search other Open Forum posts by using the Open Forum category link below. Start a discussion today!