Mark Daniel Ward

The Data Mine
Purdue University
1201 West State Street
West Lafayette, IN 47907-2057
[email protected]
[email protected]
phone: (765)496-9563
VRS: (765)248-6858

Current Position

I am a Professor of Statistics and (by courtesy) of Agricultural & Biological Engineering, Computer Science, Mathematics, and Public Health at Purdue University. My research is in probabilistic, combinatorial, and analytic techniques for the analysis of algorithms and data structures. I am also interested in data science, science of information, game theory, and large-scale computation. I currently serve as Director of The Data Mine and Interim Director of the Integrative Data Science Initiative. I am also Associate Director of the Actuarial Science Program.

Visiting Faculty Positions

École Polytechnique, Palaiseau, France
Invited Professor in the Algorithms and Models for Integrative Biology team, November-December 2015

The George Washington University, Washington, DC
(sabbatical) Adjunct Professor in the Department of Statistics, September 2013-May 2014

University of Maryland, College Park, MD
(sabbatical) Visiting Professor in the Department of Mathematics, July 2013-June 2014

Université de Paris 13, Villetaneuse, France
Invited Professor at the Laboratoire Informatique de Paris Nord (LIPN), February-March 2012

University of Pennsylvania
(postdoc) Lecturer in Mathematics, 2005-2007

Education

Purdue University
Ph.D., Mathematics with Specialization in Computational Science, May 2005
Dissertation: Analysis of the Multiplicity Matching Parameter in Suffix Trees
Advisor: Wojciech Szpankowski

University of Wisconsin-Madison
M.S., Applied Mathematical Sciences, May 2003
Thesis: Analysis of a Randomized Selection Algorithm

Denison University
B.S., Mathematics and Computer Science, summa cum laude, May 1999
Senior Honors Project: Mathematical Foundations for Performance Analysis

Contracts and Corporate Partnerships

The Data Mine has 50 Corporate Partners, offering 70 projects to students during the 2022-23 academic year: American First Finance, Bayer Crop Science, Beck’s Hybrids, Caterpillar, CDC Protect, Central Insurance, CliftonLarsenAllen, Co-Alliance, Corteva Agriscience, Cummins, Delta Faucet, DORIS, Elanco, Farfetch, Ford, Gro Master, Halderman, Helmer Scientific, Indiana Family and Social Services Administration, Ingenii, Intelinair, IU Health, Jobvite, John Deere, Lawrence Livermore National Laboratories (Accelerating Therapeutics for Opportunities in Medicine consortium), Lockheed Martin, MākuSafe, Merck, Microsoft Minecraft, MISO, Mursix, Nuvve, Pebblst, PlaneEnglish, Purdue Athletics, Purdue Center for Regional Development, Purdue Co-Rec and Student Life, Purdue Military Family Research Institute, Raytheon, REACH, Renzoe Box, Republic Airways, Sandia National Laboratories, Telemetry Sports, Tesla, TMap / MakeMyMove, UPS, USAA, USDA Forest Service, Viasat, Wabash, and Webee, and data sharing agreements with OneAmerica Financial Partners and the Wabash Heartland Innovation Network.

Cummins: Indiana Digital Crossroads - Optimized Prognostics, Diagnostics, and Repair (2022)

Cummins: Indiana Digital Crossroads 2020 Pilot (2020-2021)

Gro Master: Ultra Sort: Hardware, Software, Data Analysis, and Cloud Computing (2020-2021)

Sandia National Laboratories: Analytic Tools for Experimental Wargames (2020-2023)

Grants

National Science Foundation OAC-#2118329: HDR Institute: Geospatial Understanding through an Integrative Discovery Environment (2021-2026)

National Institute of Food and Agriculture: Data Science and Analytics for Precision Livestock Systems (2021-2026)

National Science Foundation DMS-#2123321: HDR DSC: National Data Mine Network (2021-2024)

Lilly Endowment: Charting the Future for Indiana’s Colleges and Universities: The Indiana Data Mine (2021-2025)

National Institute of Food and Agriculture: Computational Skills Development for Next Generation Agriscience Professionals for Sustaining Data Driven Agriculture (2020-2023)

National Science Foundation OAC-#2005632: Category I: Anvil---A National Composable Advanced Computational Resource for the Future of Science and Engineering (2020-2022)

Inspiring Actuarial Education through Learning Communities and Research Experiences; from the Society of Actuaries (2019-2021)

National Institute of Food and Agriculture: Experiential Learning with Data Tools for Digital Agriscience and FACT (2019-2023)

Foundation for Food and Agriculture Research: An Open Source Framework and Community for Sharing Data and Algorithms (2018-2022)

National Science Foundation DMS-#1600382: Lake Michigan Workshop on Combinatorics and Graph Theory (2016)

National Science Foundation DMS-#1560332: REU Site: Diverse Undergraduate Research Experiences in Statistics (2016-2018)

National Science Foundation DMS-#1246818: MCTP: Sophomore Transitions: Bridges into a Statistics Major and Big Data Research Experiences via Learning Communities (2013-2018)

National Science Foundation CCF-#0939370: Science and Technology Centers: Integrative Partnerships Program: Emerging Frontiers of Science of Information (2010-2023)

National Science Foundation DUE-#1140489/#1140519: Collaborative Research: Science of Information: Bringing Many Disciplines Together (2012-2014)

Army Research Office: Stochastic Control of Multi-scale Networks (2008-2014)

National Science Foundation DMS-#0603821: Asymptotic Enumeration, Reinforcement, and Effective Limit Theory (2006-2009)

Honors

Focus Award, 2019
Mu Sigma Rho William D. Warde Statistics Education Award, 2016
Statistics Advising Award, 2015-2016
Voted as the Most Outstanding Faculty in the Favorite Faculty program, 2015-2016
Fellow of the Purdue University Teaching Academy, 2015-present
College of Science Undergraduate Advising Award, 2015-2016
College of Science Team Award, 2014-2015
Excellence in Research Award (for $1 million or larger external grants) 2011, 2012, 2013, 2015
College of Science Graduate Student Mentoring Award, 2012-2013
College of Science Team Award, 2011-2012
Junior Oberwolfach Fellow at the Mathematisches Forschungsinstitut Oberwolfach (MFO) in Germany, April 2011
College of Science Interdisciplinary Award, 2010-2011
Purdue University Teaching for Tomorrow Award, 2010-2011
Purdue University’s Mortar Board Chapter Citation Award, 2009-2010
College of Science Faculty Award for Outstanding Contributions to Undergraduate Teaching by an Assistant Professor, 2009-2010
College of Science Undergraduate Advising Award, 2009-2010
Department of Statistics Outstanding Assistant Professor Teaching Award, 2008-2009
Top Ten Outstanding Teacher in College of Science, 2007-2008
Good Teaching Award (Penn) in Math 104, Spring 2007
Good Teaching Award (Penn) in Math 104 and Math 580, Fall 2006
Good Teaching Award (Penn) in Math 104, Spring 2006
Good Teaching Award (Penn) in Math 104 and Math 432, Fall 2005
Actuarial Science Program Scholarship (Purdue), Fall 2004
Excellence in Teaching Award (Purdue), Spring 2004
GAANN Fellowship in Computational Science and Engineering (Purdue), 2002-2005
Frederick N. Andrews Fellowship in Mathematics (Purdue), 2001-2002
GAANN Fellowship in Mathematics and Computation in Engineering (Wisconsin), 1999-2001
Phi Beta Kappa, elected in 1999
Sigma Xi
Faculty Scholarship for Achievement (full tuition at Denison), 1995-1999
Anderson Science Scholarship (full tuition at Denison, 1 of 2 selected), 1995-1999

Publications

  1. "Mine" the Gap: Connecting Curriculum, Courses, and Community, by J. W. Manz, M. D. Ward, and E. Gundlach. In J. E. Eidum and L. Lomicka, editors, Faculty Factor: Developing Faculty Engagement with Living Learning Communities, chapter 8. Center for Engaged Learning at Elon University, 2022. Also contains vignette "The Impact of Experiential Learning" by Tim Knight.

  2. Student Experiences within a Data Science Learning Community: A Communities of Practice Perspective, by Aparajita Jaiswal, Alejandra Magana, Joseph A. Lyon, Ellen Gundlach, and Mark D. Ward, Learning Communities Research and Practice, Volume 9, Issue 1 (2021).

  3. Work-in-Progress: Evaluating Student Experiences in a Residential Learning Community: A Situated Learning Perspective, by Aparajita Jaiswal, Joseph A. Lyon, Viranga Perera, Alejandra J. Magana, Ellen Gundlach, Mark D. Ward, accepted for publication in the American Society for Engineering Education (ASEE) Annual Conference (2021).

  4. Characterizing the psychosocial effects of participating on a year-long residential research-oriented learning community, by Alejandra J. Magana, Aparajita Jaiswal, Aasakiran Madamanchi, Loran C. Parker, Ellen Gundlach, Mark D. Ward, accepted for publication in Current Psychology (2021).

  5. The number of distinct adjacent pairs ingeometrically distributed words, by Margaret Archibald, Aubrey Blecher, Charlotte Brennan, Arnold Knopfmacher, Stephan Wagner, Mark Daniel Ward, 18 pages, accepted for publication in Discrete Mathematics and Theoretical Computer Science (2021).

  6. Research Experiences in the Statistics Living Learning Community, by Maggie Betz, Peter Boyd, Emily Damone, Christina DeSantiago, Kent Gauen, Katie Lothrop, Mikaela Meyer, Kristen Mori, Ashley Peterson, Mark Daniel Ward, 12 pages, provisionally accepted pending minor revisions, for forthcoming book on the Future of Undergraduate Research in Math, to be published by the MAA in 2021.

  7. The Data Mine: Enabling Data Science Across the Curriculum, by E. Gundlach and M. D. Ward, Journal of Statistics and Data Science Education, Volume 29 (2021), supplement, S74-S82.

  8. The Periodicity of Nim-Sequences in Two-Element Subtraction Games, by B. Benesh, J. Carter, D. Crabill, D. Coleman, J. Good, M. Smith, J. Travis, and M. D. Ward, INTEGERS: Electronic Journal of Combinatorial Number Theory, Volume 20 (2020), 6 pages (pdf).

  9. The Next Wave: We Will All Be Data Scientists, by M. Betz, E. Gundlach, E. Hillery, J. Rickus, and M. D. Ward, Statistical Analysis and Data Mining, volume 13 (2020), 544-547 (pdf).

  10. Asymptotic Analysis of the kth Subword Complexity, by L. Ahmadi and M. D. Ward, Entropy, Volume 22, Issue 2 (2020), 34 pages (pdf).

  11. Fostering Undergraduate Data Science, by F. Gokalp Yavuz and M. D. Ward, The American Statistician, volume 74 (2020), 8-16 (pdf).

  12. Purdue University: Statistics Living Learning Community, by L. C. Parker and M. D. Ward, Aligning Institutional Support for Student Success: Case Studies of Sophomore-Year Initiatives, National Resource Center for The First-Year Experience & Students in Transition, University of South Carolina, edited by Tracy Skipper, September 2019 (pdf).

  13. Undergraduate Data Science and Diversity at Purdue University, by E. Hillery, M. D. Ward, J. Rickus, A. Younts, P. Smith, and E. Adams, PEARC '19: Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines, July 2019, Article No. 88 (pdf).

  14. The Characterization of Tenable Pólya Urns, by A. Davidson and M. D. Ward, Statistics and Probability Letters, volume 135 (2018), 38-43 (pdf).

  15. Asymptotic Analysis of Sums of Powers of Multinomial Coefficients: A Saddle Point Approach, by G. Louchard and M. D. Ward, INTEGERS: Electronic Journal of Combinatorial Number Theory, volume 17 (2017), paper A47 (27 pages) (pdf).

  16. Building Bridges: The Role of an Undergraduate Mentor, by M. D. Ward, invited submission for The American Statistician, volume 71 (2017), 30-33 (pdf).

  17. On the Variety of Shapes in Digital Trees, by J. Gaither, H. Mahmoud, and M. D. Ward, Journal of Theoretical Probability, volume 30 (2017), 1225-1254 (pdf).

  18. Variance of the Internal Profile in Suffix Trees, by J. Gaither and M. D. Ward, Proceedings of the 27th International Conference on Probabilistic, Combinatorial and Asymptotic Methods for the Analysis of Algorithms, 12 pages (2016) (pdf).

  19. On the Asymptotic Probability of Forbidden Motifs on the Fringe of Recursive Trees, by M. Gopaladesikan, S. Wagner, and M. D. Ward, Experimental Mathematics, volume 25 (2016), 237-245 (pdf).

  20. Data Science in the Statistics Curricula: Preparing Students to "Think with Data," by J. Hardin, R. Hoerl, N. J. Horton, D. Nolan, B. Baumer, O. Hall-Holt, P. Murrell, R. Peng, P. Roback, D. Temple Lang, and M. D. Ward, The American Statistician, volume 69 (2015), 343-353 (pdf).

  21. Learning Communities and the Undergraduate Statistics Curriculum (A Response to "Mere Renovation Is Too Little Too Late" by George Cobb), by M. D. Ward, The American Statistician, volume 69 (2015), online supplement (pdf).

  22. The Truncated Geometric Election Algorithm : Duration of the Election, by G. Louchard and M. D. Ward, Statistics and Probability Letters, volume 101 (2015), 40-48 (pdf).

  23. Asymptotic Properties of Protected Nodes in Random Recursive Trees, by H. Mahmoud and M. D. Ward, Journal of Applied Probability, volume 52 (2015), 290-297 (pdf).

  24. Resolution of T. Ward’s Question and the Israel-Finch Conjecture. Precise Analysis of an Integer Sequence Arising in Dynamics, by J. Gaither, G. Louchard, S. Wagner, and M. D. Ward, Combinatorics, Probability, & Computing, volume 24 (2015), 195-215 (pdf).

  25. On Kotzig’s Nim, by X. L. Tan and M. D. Ward, INTEGERS: The Electronic Journal of Combinatorial Number Theory, volume 14 (2014), paper G6 (27 pages) (pdf).

  26. On a Leader Election Algorithm: Truncated Geometric Case Study, by R. Kalpathy and M. D. Ward, Statistics and Probability Letters, volume 87 (2014), 40-47 (pdf).

  27. Asymptotic Joint Normality of Counts of Uncorrelated Motifs in Recursive Trees by M. Gopaladesikan, H. M. Mahmoud, and M. D. Ward, Methodology and Computing in Applied Probability, volume 16 (2014), 863-884 (pdf).

  28. Building Random Trees from Blocks, by M. Gopaladesikan, H. M. Mahmoud, and M. D. Ward, Probability in the Engineering and Informational Sciences, volume 28 (2014), 67-81 (pdf).

  29. The Variance of the Number of 2-Protected Nodes in a Trie, by J. Gaither and M. D. Ward, The Tenth Workshop on Analytic Algorithmics and Combinatorics (2013), 43-51 (pdf).

  30. Analytic Methods for Select Sets, by J. Gaither and M. D. Ward, Probability in the Engineering and Informational Sciences, volume 26 (2012), 561-568 (pdf).

  31. Asymptotic Distribution of Two-Protected Nodes in Random Binary Search Trees, by H. M. Mahmoud and M. D. Ward, Applied Mathematics Letters, volume 25 (2012), 2218-2222 (pdf).

  32. Partitions with Distinct Multiplicities of Parts: On An "Unsolved Problem" Posed By Herbert Wilf, by J. A. Fill, S. Janson, and M. D. Ward, Electronic Journal of Combinatorics, volume 19(2), article P18, 2012 (pdf).

  33. On the Number of 2-Protected Nodes in Tries and Suffix Trees, by J. Gaither, Y. Homma, M. Sellke, and M. D. Ward, Discrete Mathematics and Theoretical Computer Science, volume AQ (2012), 381-398 (pdf).

  34. Asymptotic Analysis of the Nörlund and Stirling Polynomials, by M. D. Ward, Applicable Analysis and Discrete Mathematics, volume 6 (2012), 95-105 (pdf).

  35. Number of survivors in the presence of a demon, by G. Louchard, H. Prodinger, and M. D. Ward, Periodica Mathematica Hungarica, volume 64 (2012), 101-117 (pdf).

  36. Towards the variance of the profile of suffix trees, by P. Nicodeme and M. D. Ward, Report of the Mini-Workshop on Random Trees, Information and Algorithms, from Mathematisches Forschungsinstitut Oberwolfach, Report 23/2011, pages 1269-1272 (pdf).

  37. Asymptotic properties of a leader election algorithm, by R. Kalpathy, H. M. Mahmoud, and M. D. Ward, Journal of Applied Probability, volume 48 (2011), 569-575 (pdf).

  38. Asymptotic rational approximation to Pi: Solution of an "Unsolved Problem" posed by Herbert Wilf, by M. D. Ward, Discrete Mathematics and Theoretical Computer Science, volume AM (2010), 591-602 (pdf).

  39. Inverse auctions: Injecting unique minima into random sets, by F. T. Bruss, G. Louchard, and M. D. Ward, ACM Transactions on Algorithms, volume 6, Article 21, December 2009, 19 pages (pdf). (See the previous version for full details before we did significant editing/trimming for publication.)

  40. On the shape of the fringe of various types of random trees, by M. Drmota, B. Gittenberger, A. Panholzer, H. Prodinger, and M. D. Ward, Mathematical Methods in the Applied Sciences, volume 32 (2009), 1207-1245 (pdf).

  41. Exploring data compression via binary trees, by M. D. Ward, Resources for Teaching Discrete Mathematics, MAA Notes volume 74 (Mathematical Association of America, 2009), 143-150 (pdf).

  42. Average-case analysis of cousins in m-ary tries, by H. M. Mahmoud and M. D. Ward, Journal of Applied Probability, volume 45 (2008), 888-900 (pdf).

  43. On correlation polynomials and subword complexity, by I. Gheorghiciuc and M. D. Ward, Discrete Mathematics and Theoretical Computer Science, volume AH (2007), 1-18 (pdf).

  44. Error resilient LZ'77 data compression: algorithms, analysis, and experiments, by S. Lonardi, W. Szpankowski, and M. D. Ward, IEEE Transactions on Information Theory, volume 53, May 2007, 1799-1813 (pdf).

  45. The average profile of suffix trees, by M. D. Ward, The Fourth Workshop on Analytic Algorithmics and Combinatorics (2007), 183-193 (pdf).

  46. Exploring the average values of Boolean functions via asymptotics and experimentation, by R. Pemantle and M. D. Ward, The Third Workshop on Analytic Algorithmics and Combinatorics (2006), 253-262 (pdf).

  47. Analysis of the multiplicity matching parameter in suffix trees, by M. D. Ward and W. Szpankowski, Discrete Mathematics and Theoretical Computer Science, volume AD (2005), 307-322 (pdf).

  48. Analysis of the average depth in a suffix tree under a Markov model, by J. Fayolle and M. D. Ward, Discrete Mathematics and Theoretical Computer Science, volume AD (2005), 95-104 (pdf).

  49. The number of distinct values of some multiplicity in sequences of geometrically distributed random variables, by G. Louchard, H. Prodinger, and M. D. Ward, Discrete Mathematics and Theoretical Computer Science, volume AD (2005), 231-256 (pdf).

  50. Error resilient LZ'77 scheme and its analysis, by S. Lonardi, W. Szpankowski, and M. D. Ward, Proceedings of the 2004 IEEE International Symposium on Information Theory (2004), 56 (pdf).

  51. Analysis of a randomized selection algorithm motivated by the LZ'77 scheme, by M. D. Ward and W. Szpankowski, The First Workshop on Analytic Algorithmics and Combinatorics (2004), 153-160 (pdf).

Professional Membership

American Mathematical Society (AMS)

American Statistical Association (ASA), ASA Fellow (also member of the Central Indiana Chapter)

Bernoulli Society (Lifetime Membership)

Institute of Mathematical Statistics (IMS) (Lifetime Membership)

International Statistical Institute (ISI) (Elected Member; Lifetime Membership)

Mathematical Association of America (MAA) (Lifetime Membership)

National Association of Mathematicians (NAM) (Lifetime Membership)

National Association of the Deaf (NAD)

National Black Deaf Advocates (also member of the Indiana Chapter)

Society for Advancement of Chicanos and Native Americans in Science (SACNAS) (Lifetime Membership)