Data Sources

Data Sources

The All of Us Research Program aims to engage a community of one million or more volunteers who reflect the diversity of the United States. This includes people and communities who haven’t taken part in or have been left out of health research in the past. We recruit participants from across the country, including many communities that have been historically underrepresented in biomedical research.

Participants contribute data from multiple sources, including surveys and electronic health records. Some participants also contribute biosamples, which we use to generate genomic data, and data from mobile health devices. As the program grows, additional data sources will become available.

The All of Us Research Program’s Data and Research Center curates and validates these data as part of the data collection process. Researchers can explore the participant data, with all direct identifiers removed, through our Data Snapshots, Data Browser, or through the Researcher Workbench.

Find out more about our Study & Data Methodology including our use of the OMOP Common Data Model.

Surveys

All participants are invited to complete surveys on the secure All of Us Participant Portal (learn more about how participants join). Initial surveys include questions on health and lifestyle. Additional surveys on more specific subjects are added regularly. You can view survey questions and learn more about where the questions come from with our Survey Explorer.

Survey responses are curated to protect personally identifiable information (PII). See the All of Us Registered Tier Data Dictionary for more information.

Electronic Health Records

Many participants agree to share their EHRs with the program. All of Us removes and/or generalizes personal identifiers from participants’ EHR data before adding this information to the Research Hub. See the All of Us Registered Tier Data Dictionary for more information.

We use the OMOP CDM to standardize all EHR data. Learn more about OMOP on our Methods page.

Physical Measurements

Some participants will provide physical measurements at one of our All of Us Research Program partner locations. These measurements collected include: height, weight, BMI, waist circumference, hip circumference, blood pressure, and heart rate. Program staff also note participants’ pregnancy status and wheelchair use (if applicable) at the visit.

Explore aggregate data on our participants’ physical measurements in the Data Browser.

Genomics

Some participants contribute blood, urine, and/or saliva samples for analyses, including whole genome sequencing and genotyping. These samples are stored in our biobank located at the Mayo Clinic and then sent to our genome centers for genome sequencing and processing. Aggregate-level genomic data is available to the public in the Data Browser. Whole genome sequencing and array data from genotyping are only available in the Researcher Workbench through the Controlled Tier.

Mobile Health Data (wearable devices)

All of Us participants can contribute information from wearable health tracking devices. Wearables track biometric data like heart rate and blood pressure.