About the MCAP Summer Institute (MCAP-SI)

Our week-long Summer Institute on Longitudinal Data Analysis is designed to meet the needs of 50 participants each year, welcoming individuals from all career stages and backgrounds (ie. graduate students, post-docs, faculty, and industry researchers) — all of whom are eager to enhance their knowledge in longitudinal data analysis. Our Summer Institute will be held at Purdue University in West Lafayette, IN and we will provide meals and lodging in Purdue dorms for those selected to receive travel funding. The Summer Institute is ideal for individuals with a foundational understanding of statistics who seek to learn and apply longitudinal methods in their work. We specifically encourage applicants who are not already experts in longitudinal data analysis but who see the potential for these skills to enhance their research or professional contributions. 

Contemporary large-scale NIH initiatives have led to the emergence of many high-quality publicly available longitudinal datasets that that include complex data of various types, sources, and domains (e.g., biological, social, individual, family, neighborhood, etc.). However, use of these datasets without training can lead to scientific setbacks, including work that is imperfect, misleading, or even incorrect. There is an urgent need for educational programming to train researchers both within and outside of academic careers on the innovative and responsible use of publicly available, large, and complex longitudinal datasets. This R25 grant develops and offers an “Interdisciplinary Summer Institute on the Analysis of Complex, Large-Scale Longitudinal Data”, refining it each year based on evaluation data (aim 1). We will also leverage this program to train graduate students to teach advanced longitudinal methods to participants from multiple disciplines (aim 2). Thus, we will serve two groups: program participants (aim 1), and Purdue graduate student teaching assistants (TAs, aim 2). During an immersive week-long summer institute each year, we will train 50 interdisciplinary participants including students, postdocs and faculty across academic institutions (Y1-Y3), expanding to also include professionals in non-profits, governmental agencies, and industries (Y2, Y3). The course is organized in 10 topics: publicly available longitudinal data sources, introduction to longitudinal data analytic methods, data visualization, missing data, longitudinal categorical data analysis, sampling weights and clustering/ stratification, time varying and time-invariant covariate inclusion, combining multiple data sources, embedded family-based designs, and an intro to sociogenomics—emphasizing cross-cutting themes of data management, visualization and communication, causal inference, measurement and modeling decisions, meaningful effect sizes, and representativeness. Lecture examples and assignments will focus on substance use and associated factors and will use the Adolescent Brain and Cognitive Development study data, although participants will be encouraged to use whatever dataset is most relevant to their own research interests. The summer institute will also feature TAs and additional faculty instructors circulating the room in each session to support students in need of extra assistance in real-time, as well as review and office hour sessions, experience in interdisciplinary environments, networking, and joint practice opportunities to help establish collaborations. We will also train 6 graduate student TAs each year, who will gain supervised experience in content development, instruction (via review sessions), consulting, course evaluation, and leadership within interdisciplinary environments. We have carefully designed recruitment strategies to train a diverse (e.g., under-represented groups, discipline, and career stage and path) workforce, and a multi-pronged evaluation plan. Our program faculty includes 8 faculty experts in longitudinal data analysis and instruction, representing different fields, genders, and career stages.

For Participants

  • CITI Training (Human Research Protection Program)
    • If you have not taken it before or yours is expired: complete the Biomedical Research for Investigators or Social Behavioral Research group, and then the Human Subjects Research – Initial (Basic) course. If you completed training within the past 4 years, you may take a refresher. If your certificate is current, no action is required.
  • Office Hours: Each day from 4:00–5:00pm, we will hold in-person office hours with two dedicated tables:
    • questions on assignments / course content
    • consulting on individual projects
  • These sessions will be staffed by our faculty instructors and teaching assistants. Feel free to drop by whichever table aligns with your needs!
  • Slack:
    • We will be actively monitoring Slack during the course and in the evenings to answer any questions. We encourage participants to post R-related and assignment questions on Slack so as not to interrupt class sessions. You can join our Slack workspace using the link provided in your registration email.
  • GENERAL EVENT INFORMATION
    Campus Parking | Purdue Parking Map
    Northwestern Parking Garage
    This garage has limited Parkmobile spots or attendees can purchase their own A-Permit during their stay.
    Grant Street Garage
    This garage is ticketed parking – attendees pay for parking when exiting the garage. *NOTE – For those staying in the Residence Hall: Parking is located on the top floor of the parking garage in First Street Towers. These spots are free of charge.
    Resident Hall Dining | Earhart Dining Court
    1275 1st Street, West Lafayette, IN 47906
    *NOTE – If you are not staying at the dorms during your visit, you may eat at the dining hall. You will pay at the door for your meal every time you go to Earhart Dining Court.
    Dining Hours
    Breakfast: 7:00 am – 8:30 am
    Lunch: 11:00 am – 1:30 pm
    Dinner: 5:00 pm – 7:00 pm
    Find Information Regarding Dietary Restrictions HERE.
    Classroom Space | Wilmeth Active Learning Center (WALC), Room 1132
    340 Centennial Mall Dr., West Lafayette, IN 47907
    Reception Location | Purdue Memorial Union, West Faculty Lounge
    201 Grant St., West Lafayette, IN 47906
    WIFI   |  AT&T WIFI & Eduroam
    AT&T WIFI – This is a free connection that does not require any credentials to sign into and can be used by anyone on campus
    Eduroam – A secure, world-wide roaming access service developed for education and research communities. The credentials to sign into Eduroam are generally your home Universities account.
    Registration   | Locations & Times
    Sunday, July 13 (11:30-12:30PM) | WALC 1132
    Sunday, July 13 (4:30-5:30PM) | Purdue Memorial Union, West Faculty Lounge
    Monday, Jul 14 (7:45-10AM) | WALC 1132
Explore the Greater Lafayette Area!
Events Calendar – Purdue University Events Calendar
Local Eateries and Activities – Home of Purdue Purdue Memorial Union Dining | there are several options for dining in the PMU. See the campus options HERE.

Featured Faculty ANd Teaching Assistants

  • Dr. Kristine Marceau (MCAP Co-Director): Dr. Marceau is an Associate Professor of Human Development and Family Science who specializes in longitudinal methods emphasizing both developmental change and variability across multiple time-scales using and integrating SEM and multilevel modeling techniques. She frequently uses family-based designs and large datasets to explore developmental and behavioral trajectories. Dr. Marceau regularly teaches multilevel modeling and inferential statistics, and trains students in longitudinal data analysis. 
  • Dr. Trenton D. Mize (MCAP Co-Director): Dr. Mize is the Dean’s Associate Professor of Sociology and Statistics (by courtesy) and a quantitative methodologist with expertise in categorical data analysis, latent variable modeling, and data visualization. His research develops and applies innovative methods for analyzing complex social data, and he regularly teaches graduate courses on categorical data and experimental design.  
  • Dr. James A. McCann (MCAP Co-Director): Dr. McCann is a Professor of Political Science with expertise in longitudinal survey analysis and latent variable modeling. He has led multiple large-N longitudinal studies on political behavior and representation and regularly applies advanced econometric and multilevel techniques in his research. Dr. McCann teaches graduate seminars on research design and quantitative analysis, focusing on panel data and survey methodologies. 
  • Dr. Sharon Christ (MCAP Co-Director): Dr. Christ is an Associate Professor of Human Development and Family Science specializing in emergent statistical models, particularly structural equation modeling (SEM) and complex sample designs. Her expertise in multilevel modeling, SEM, and growth models has been applied across numerous large-scale cohort studies. She has taught graduate-level courses on sample design, inferential statistics, and SEM. 
  • Dr. Robert Duncan: Dr. Duncan is an Associate Professor of Human Development and Family Science at Colorado State University with expertise in advanced longitudinal data analysis, including multilevel modeling, structural equation modeling (SEM), and growth curve modeling. His work focuses on children’s development within multilevel contexts like classrooms.  
  • Dr. Dongjuan Xu: Dr. Xu, an Associate Professor in the School of Nursing, specializes in longitudinal cohort studies that evaluate the quality of care and outcomes for older adults. Her expertise spans applied biostatistics, epidemiological methods, and outcome evaluation. She regularly teaches graduate courses in these areas, incorporating advanced quantitative techniques into her instruction, such as weighting methods and sampling designs. 
  • Dr. Shawn Bauldry: Dr. Bauldry, a Professor of Sociology at Purdue, specializes in quantitative methods and statistics, primarily focusing on the development of structural equation models, a broad class of statistical models with wide applicability in the social sciences.
  • Dr. Robbee Wedow: Dr. Wedow is an Assistant Professor of Sociology and Data Science at Purdue University, with expertise in statistical genetics and sociogenomics. His research applies advanced statistical methods, including gene-environment interaction models, to large-scale genetic datasets to investigate social and health outcomes.  
  • Dr. Katie Thompson: Dr. Thompson is a postdoctoral researcher in the Department of Sociology. Her work intersects psychiatry, genomics, and sociology, using innovative statistical approaches to integrate large-scale longitudinal data to better understand mental health. She has specialized in complex longitudinal designs using structural equation models, multilevel and matrix-based mixed models, and genetic and family data. Dr Thompson has taught on MSc statistics courses and led multiple R intensive workshops focused on family data at King’s College London. She has led on projects using multiple longitudinal cohort studies across the USA and UK and has focused on creating open and reproducible code and analytical pipelines.

  • Mallory Bell (Sociology): Mallory Bell is a dual-title PhD candidate in Sociology and Gerontology at Purdue University. Her research uses longitudinal data analysis to examine how social determinants of health help shape trajectories of well-being in later life.
  • Bing Han (Sociology): Bing Han is a dual-title Ph.D. candidate in Sociology and Gerontology at Purdue University, where she has also earned graduate certificates in Applied Statistics and Advanced Methodology. Her research focuses on health behaviors and lifestyles, stigma, and aging, employing a wide range of methodological approaches, including machine learning, categorical data analysis, longitudinal modeling, latent variable analysis, and experimental design.
  • Susmita Ghosh (Public Health): Susmita Ghosh is a PhD candidate in the Department of Nutrition Science at Purdue University, specializing in nutritional epidemiology with a focus on maternal and infant nutrition and food environments. Her research integrates advanced statistical techniques—including multilevel modeling, longitudinal analysis, and causal inference methods—to evaluate randomized controlled trials and social and behavior change interventions aimed at improving health and nutritional outcomes in low-resource settings.
  • Yi Zhu (Education): Yi Zhu is a fifth-year PhD candidate in Mathematics Education at Purdue University. Her research focuses on early mathematics learning, spatial reasoning, and game-based learning environments, employing both quantitative and qualitative (mixed-methods) approaches to understand how children develop mathematical thinking.
  • Amy Loviska (HDFS): Amy Loviska is a PhD candidate in the Human Development and Family Science at Purdue University. Their research program applies advanced quantitative longitudinal methods alongside community-engaged qualitative work to understand effects of individual biology (i.e., hormones, genetics), proximal environments (i.e., prenatal, parents, peers), and sociocultural macroenvironments on adolescent substance use progression for diverse gender and race-ethnic background youth.
  • Catalina Vega Mendez (Political Science): Catalina Vega Méndez is a Ph.D. Candidate in the Department of Political Science at Purdue University. Her research focuses on comparative political behavior and migration policy, with a regional emphasis on Latin America. She studies public attitudes and policy responses to international migration using a range of methodological tools with expertise in difference-in-differences designs, as well as the analysis of international longitudinal survey and panel data.