Overview
The Information Commons (IC) provides self-service access to de-identified research data assets, including broad and deep clinical data, linked with other data modalities at the patient and clinical event levels.
IC Data Modalities and Included Health Systems
Information Commons self-serve data assets provide access to broad and deep, fully linkable, multi-modal, de-identified data for more than 8 million patients of UCSF Health, SFDPH, and Fresno CHS, without the need for IRB approval or an intermediary to extract the data. The structured clinical data are available in 2 formats:
-
Format close to Epic Caboodle data warehouse (consistent with the source system our clinical data originates in).
-
Standardized, harmonized format based on the OMOP Common Data Model (which supports external collaborations for larger-scale research).
IC Research Data Assets
The UCSF Information Commons combines multiple research data assets, including data from UCSF Health, the San Francisco Department of Public Health, as well as ZSFG Hospital and Fresno Community Health System.
- UCSF DeID CDW - UCSF Health De-Identified Clinical Data Warehouse
The most comprehensive set of UCSF Health de-identified clinical data. Includes structured electronic health records data from UCSF Epic Caboodle (e.g., patient demographics, diagnoses, encounters, procedures, medications, labs), as well as de-identified UCSF clinical notes, clinical concepts extracted from notes, and UCSF cancer genomic testing data. - SFDPH DeID CDW - SFDPH De-Identified Clinical Data Warehouse
De-identified SFDPH EHR (Epic Caboodle) covering SF Health Network & ZSFG. - UCSF DeID OMOP - UCSF Health De-Identified OMOP Database
OMOP-standardized data (demographics, diagnoses, encounters, procedures, medications, labs) derived from the UCSF DeID CDW. Analyses are more reproducible and portable across OMOP sites. - UCSF–SFDPH DeID OMOP
Combined, de-identified OMOP dataset across UCSF Health and SFDPH, merged at the patient level. - UCSF DeID Notes & Extracts
198M+ de-identified clinical notes plus structured concept extracts (available via UCSF DeID CDW and EMERSE). - UCSF Cancer Genetic Testing Dataset
De-identified UCSF500 and Foundation Medicine results (via UCSF DeID CDW). - Imaging Commons
De-identified radiology images with linked EHR metadata; DICOM headers/pixels viewable (access differs from other assets). - PatientExploreR Database (PEDB)
Structured clinical data backing the PatientExploreR application, featuring UCSF Health and Fresno CHS structured clinical data.
Detailed availability and tool compatibility (ATLAS/RAE/Wynton/AWS/Parquet, etc.) are documented in the Wiki (Accessible only from VPN or the UCSF WPA network).
Request Access and Learn More
Learn how these data offerings fit into the broader IC ecosystem on our overview page.
Detailed Documentation – Accessible only from VPN or the UCSF WPA network