The following summaries provide a brief overview of the Regenstrief Center for Healthcare Engineering’s past and current datasets.  

For more information on these or if you are unable to view a section, contact Paul Griffin at

MIMIC Critical Care Database 

MIMIC is an openly available dataset developed by the MIT Lab for Computational Physiology, comprising deidentified health data associated with >40,000 critical care patients. It includes demographics, vital signs, laboratory tests, medications, etc..


Kaggle contains a wide variety of datasets including some healthcare related datasets.

Zika Virus Epidemic Dataset

Kaggle assists in analyzing the ongoing spread of this infectious disease, posted by CDC.

Health Insurance Marketplace Dataset

Kaggle assists in exploring health and dental plans data in the US Health Insurance Marketplace, posted by US Department of Health and Human Services.

Truven Health Analytics

Fully deidentified dataset of medical claim, prescription and eligibility information for Purdue University, available upon IRB approval.

Data Dictionary (requires PUCA credentials to view)

Purdue University, 610 Purdue Mall, West Lafayette, IN 47907, (765) 494-4600

© 2018 Purdue University | An equal access/equal opportunity university | Copyright Complaints | Maintained by Discovery Park

Trouble with this page? Disability-related accessibility issue? Please contact Discovery Park at