John Tukey
John Tukey | |
---|---|
Foreign Member of the Royal Society (1991) | |
Scientific career | |
Fields | Topology |
Institutions | |
Thesis | On Denumerability in Topology[1] |
Doctoral advisor | Solomon Lefschetz[1] |
Doctoral students |
John Wilder Tukey (/ˈtuːki/; June 16, 1915 – July 26, 2000) was an American mathematician and statistician, best known for the development of the fast Fourier Transform (FFT) algorithm and box plot.[2] The Tukey range test, the Tukey lambda distribution, the Tukey test of additivity, and the Teichmüller–Tukey lemma all bear his name. He is also credited with coining the term bit and the first published use of the word software.
Biography
Tukey was born in
During World War II, Tukey worked at the Fire Control Research Office and collaborated with Samuel Wilks and William Cochran. He is claimed to have helped design the U-2 spy plane. After the war, he returned to Princeton, dividing his time between the university and AT&T Bell Laboratories. In 1962, Tukey was elected to the American Philosophical Society.[7] He became a full professor at 35 and founding chairman of the Princeton statistics department in 1965.[3]
Among many contributions to civil society, Tukey served on a committee of the American Statistical Association that produced a report critiquing the statistical methodology of the Kinsey Report, Statistical Problems of the Kinsey Report on Sexual Behavior in the Human Male, which summarized "A random selection of three people would have been better than a group of 300 chosen by Mr. Kinsey".
From 1960 to 1980, Tukey helped design the NBC television network polls used to predict and analyze elections. He was also a consultant to the Educational Testing Service, the Xerox Corporation, and Merck & Company.
During the 1970s and early 1980s, Tukey played a key role in the design and conduct of the National Assessment of Educational Progress.
He was awarded the National Medal of Science by President Nixon in 1973.[3] He was awarded the IEEE Medal of Honor in 1982 "For his contributions to the spectral analysis of random processes and the fast Fourier transform (FFT) algorithm".
Tukey retired in 1985. He died in New Brunswick, New Jersey, on July 26, 2000.
Scientific contributions
Early in his career Tukey worked on developing
His statistical interests were many and varied. He is particularly remembered for his development with James Cooley of the Cooley–Tukey FFT algorithm. In 1970, he contributed significantly to what is today known as the jackknife—also termed Quenouille–Tukey jackknife. He introduced the box plot in his 1977 book, "Exploratory Data Analysis".
In 1974, he developed, with Jerome H. Friedman, the concept of the projection pursuit.[11]
Data analysis and foundations of data science
John Tukey contributed greatly to statistical practice and data analysis in general. In fact, some regard John Tukey as the father of data science. At the very least, he pioneered many of the key foundations of what came later to be known as data science.[12]
Making sense of data has a long history and has been addressed by statisticians, mathematicians, scientists, and others for many many years. During the 1960s, Tukey challenged the dominance at the time of what he called "confirmatory data analysis", statistical analyses driven by rigid mathematical configurations.[13] Tukey emphasized the importance of having a more flexible attitude towards data analysis and of exploring data carefully to see what structures and information might be contained therein. He called this "exploratory data analysis" (EDA). In many ways, EDA was a precursor to data science.
Tukey also realized the importance of computer science to EDA. Graphics are an integral part of EDA methodology and, while much of Tukey's work focused on static displays (such as box plots) that could be drawn by hand, he realized that computer graphics would be much more effective for studying multivariate data. PRIM-9, the first program for viewing multivariate data, was conceived by him during the early 1970s.[14]
This coupling of data analysis and computer science is what is now called data science.
Tukey articulated the important distinction between
A. D. Gordon offered the following summary of Tukey's principles for statistical practice:[15]
... the usefulness and limitation of mathematical statistics; the importance of having methods of statistical analysis that are robust to violations of the assumptions underlying their use; the need to amass experience of the behaviour of specific methods of analysis in order to provide guidance on their use; the importance of allowing the possibility of data's influencing the choice of method by which they are analysed; the need for statisticians to reject the role of "guardian of proven truth", and to resist attempts to provide once-for-all solutions and tidy over-unifications of the subject; the iterative nature of data analysis; implications of the increasing power, availability, and cheapness of computing facilities; the training of statisticians.
Tukey's lectures were described to be unusual. McCullagh described his lecture given in London in 1977:[15][16]
Tukey ambled to the podium, a great bear of a man dressed in baggy pants and a black knitted shirt. These might once have been a matching pair but the vintage was such that it was hard to tell. ... Carefully and deliberately a list of headings was chalked on the blackboard. The words came too, not many, like overweight parcels, delivered at a slow unfaltering pace. ... When it was complete, Tukey turned to face the audience and the podium ... "Comments, queries, suggestions?" he asked the audience ... As he waited for a response, he clambered onto the podium and manoeuvred until he was sitting cross-legged facing the audience. ... We in the audience sat like spectators at the zoo waiting for the great bear to move or say something. But the great bear appeared to be doing the same thing, and the feeling was not comfortable.
Coining the term bit
While working with John von Neumann on early computer designs, Tukey introduced the word bit as a portmanteau of binary digit.[17] The term bit was first used in an article by Claude Shannon in 1948.
See also
Publications
- Andrews, David F.; Bickel, Peter J.; Hampel, Frank R.; Huber, Peter J.; Rogers, W. H.; Tukey, John Wilder (1972). Robust estimates of location: survey and advances. OCLC 369963.
- ISBN 978-0-486-60507-4.
- .
- Cooley, James W.; Tukey, John W. (1965). "An algorithm for the machine calculation of complex Fourier series". OCLC 8495063.
- Hoaglin, David C.; OCLC 11550398.
- Hoaglin, David C.; OCLC 23180322.
- Morgenthaler, Stephan; Tukey, John Wilder, eds. (1991). Configural polysampling: a route to practical robustness. OCLC 22381036.
- OCLC 3235470.
- Tukey, John Wilder (1940). Convergence and Uniformity in Topology. OCLC 227948615.
- Tukey, John Wilder (1977). Exploratory Data Analysis. OCLC 3058187.
- Tukey, John Wilder; Ross, Ian C.; Bertrand, Verna (1973). Index to statistics and probability. R & D Press. OCLC 745715.
- The collected works of John W Tukey, edited by William S. Cleveland
- Brillinger, David R., ed. (1984). Volume I: Time series, 1949–1964. OCLC 10998116.
- Brillinger, David R., ed. (1985). Volume II: Time series, 1965–1984. OCLC 159731367.
- Jones, Lyle V., ed. (1985). Volume III: Philosophy and principles of data analysis, 1949–1964. OCLC 159731367.
- Jones, Lyle V., ed. (1986). Volume IV: Philosophy and principles of data analysis, 1965–1986. OCLC 165832503.
- Cleveland, William S., ed. (1988). Volume V: Graphics, 1965–1985. OCLC 230023465.
- Mallows, Colin L., ed. (1990). Volume VI: More mathematical, 1938–1984. OCLC 232966724.
- Cox, David R., ed. (1992). Volume VII: Factorial and ANOVA, 1949–1962. OCLC 165366083.
- Braun, Henry I., ed. (1994). Volume VIII: Multiple comparisons, 1949–1983. OCLC 165099761.
- About John Tukey
- O'Connor, John J.; Robertson, Edmund F., "John Tukey", MacTutor History of Mathematics Archive, University of St Andrews
- Interview of John Tukey about his experience at Princeton
References
- ^ a b John Tukey at the Mathematics Genealogy Project
- .
- ^ New York Times. Retrieved 2012-09-24.
- ^ "John Tukey". Mathematics Genealogy Project. Retrieved 2022-07-02.
- ^ Tukey, John W. (1939). On denumerability in topology.
- ^ "John Tukey". IEEE Global History Network. IEEE. Retrieved 2011-07-18.
- ^ "APS Member History". search.amphilsoc.org. Retrieved 2021-01-28.
- hdl:11858/00-001M-0000-002C-4314-2. Archived from the original (PDF) on 1998-07-15., a word suggested by J. W. Tukey.
The choice of a logarithmic base corresponds to the choice of a unit for measuring information. If the base 2 is used the resulting units may be called binary digits, or more briefly bits
- .
- ISBN 0-252-72548-4. Archived from the original(PDF) on 1998-07-15.
- S2CID 7997450.
- ^ David Donoho (2017), 50 Years of Data Science, Journal of Computational and Graphical Statistics, 2017, https://doi.org/10.1080/10618600.2017.1384734
- ^ John W. Tukey (1962) The Future of Data Analysis. Ann. Math. Statist. 33(1): 1-67. DOI: 10.1214/aoms/1177704711.
- ^ Friedman, J. H., & Stuetzle, W. (2002). John W. Tukey’s Work on Interactive Graphics. The Annals of Statistics, 30(6), 1629-1639. http://www.jstor.org/stable/1558733
- ^ a b "John Tukey - Biography". Maths History. Retrieved 2022-02-18.
- ^ P McCullagh, John Wilder Tukey, Biographical Memoirs of Fellows of the Royal Society 49 (2003), 538-559.
- ^ "Bit definition by The Linux Information Project (LINFO)". www.linfo.org.
- JSTOR 2677019.)
{{cite journal}}
: CS1 maint: untitled periodical (link - doi:10.2135/cropsci2000.0015br.)
{{cite journal}}
: CS1 maint: untitled periodical (link - S2CID 26430218.)
{{cite journal}}
: CS1 maint: untitled periodical (link - S2CID 86230606.)
{{cite journal}}
: CS1 maint: untitled periodical (link
External links
- Royal Society obit. by Peter McCullagh
- John W. Tukey: His Life and Professional Contributions published in The Annals of Statistics
- John Wilder Tukey (1915–2000) in Notices of the American Mathematical Society
- Memories of John Tukey
- Short biography by Mary Bittrich
- "John Tukey, 85, Statistician; Coined the Word 'Software'", The New York Times, 2000-07-28
- "Remembering John W. Tukey", special issue of Statistical Science
- John Wilder Tukey at the Mathematics Genealogy Project