Data

The following summaries provide a brief overview of the Regenstrief Center for Healthcare Engineering’s past and current datasets.  

For more information on data resources contact Paul Griffin at paulgriffin@purdue.edu

Purdue Claims Data

Fully deidentified dataset of medical claims, prescription claims, eligibility, biometrics, and Johns Hopkins risk assessment information for Purdue University, starting in 2014. This data set is available to researchers pending IRB approval. See HIPAA for more details on RCHE's data access policy and procedures.

Data Dictionary (requires PUCA credentials to view)

MIMIC Critical Care Database 

MIMIC is an openly available dataset developed by the MIT Lab for Computational Physiology, comprising deidentified health data associated with >40,000 critical care patients. It includes demographics, vital signs, laboratory tests, medications, etc..

Kaggle

Kaggle contains a wide variety of datasets including some healthcare related datasets.

Zika Virus Epidemic Dataset

Kaggle assists in analyzing the ongoing spread of this infectious disease, posted by CDC.

Health Insurance Marketplace Dataset

Kaggle assists in exploring health and dental plans data in the US Health Insurance Marketplace, posted by US Department of Health and Human Services.

VPS Logo

Virtual Pediatric Systems (VPS) Pediatric ICU Data

VPS holds the largest research database in pediatric critical care in the world and encourages research utilizing the data. RCHE has contacts within the association and can help get researchers started on a research request.

Further reading on this data set as well as information on other pediatric critical care research resources can be found here.