Jeffrey T. Leek

American biostatistician From Wikipedia, the free encyclopedia

Jeffrey Tullis Leek is an American biostatistician and data scientist working as a Vice President, Chief Data Officer, and Professor at Fred Hutchinson Cancer Research Center.[1][2] He is an author of the Simply Statistics blog, and runs several online courses through Coursera, as part of their Data Science Specialization.[3][4][5] His most popular course is The Data Scientist's Toolbox,[6] which he instructed along with Roger Peng and Brian Caffo. Leek is best known for his contributions to genomic data analysis and critical view of research and the accuracy of popular statistical methods.

EducationUtah State University (B.S.)
University of Washington (Ph.D., M.S.)
KnownforBiostatistics and Data Science
Quick facts Education, Known for ...
Jeffrey T. Leek
Jeffrey Leek in 2017
EducationUtah State University (B.S.)
University of Washington (Ph.D., M.S.)
Known forBiostatistics and Data Science
AwardsCOPSS Presidents' Award
Scientific career
FieldsBiostatistics
InstitutionsFred Hutchinson Cancer Research Center
Doctoral advisorJohn D. Storey
Doctoral studentsHilary S. Parker
Close

Education

Leek graduated from Utah State University in 2003 with his Bachelors of Science. Then went on to study at the University of Washington achieving a Master's degree in 2005 and completed a PhD in Biostatistics in 2007 with John Storey as his doctoral advisor.[2][7]

Research and career

Leek joined Johns Hopkins University as an assistant professor in Biostatistics in 2009, working at the Bloomberg School of Public Health. In 2014 he became an associate professor in Biostatistics and Oncology.[8]

Leek works in The Center for Computational Biology[9] at Johns Hopkins University creating statistical packages[10][11] for analysis of genomes.

He also co-edits a blog, Simply Statistics[12] with Roger Peng and Rafa Irizarry, which contains a mix of articles on statistics and meta-research.

Leek has conducted several talks at prestigious universities and locations such as a colloquium series at Harvard[13] and a lecture at the New York Genome Center titled “Building a Comprehensive Resource for the Study of Human Gene Expression with Machine Learning and Data Science”[14] as a part of their lecture series.

He is an expert in reproducibility, and his work and opinions have been published in notable scientific and medical journals such as Nature[15][16] and the Proceedings of the National Academy of Sciences. Leek wrote a self-published book, The Elements of Data Analytic Style and is considered an expert on replication.[17][18]

He is currently Vice President and Chief Data Officer at Fred Hutchinson Cancer Center in Seattle, WA.[2]

Recognition

Leek was elected as a Fellow of the American Statistical Association in 2020.[19] In 2021, Leek won the COPSS Presidents' Award.[20] He was named one of the one hundred most influential people in AI in 2025 by Time Magazine.[21]

Selected publications

Leek's highly cited works include

  • "Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis"[22]
  • "Tackling the Widespread and Critical Impact of Batch Effects in High-Throughput Data"[23]

References

Related Articles

Wikiwand AI