Overview
Access UCSF’s integrated ecosystem of research data, tools, and secure environments to explore ideas, build analyses, and accelerate discovery—now enhanced by AI-powered workflows.
Information Commons
UCSF’s ecosystem for data-driven and AI-enabled research
The Information Commons is a comprehensive ecosystem of data, tools, secure computational and analysis environments, and community & support services that enables the full cycle of data-driven research. It welcomes self-service use of the data and resources while providing support with a talented team of informatics and data science experts and a growing research data science community.
Information Commons aims to minimize the time researchers and data scientists need to spend on preparation for projects and get to answering the research question as soon as possible. Its vision is to enable scientific discovery and its translation to clinical care and precision medicine by lowering barriers to access and linking together data, computational capabilities, and shared knowledge.
From Research Question to Insight—Faster
Many researchers experience the same challenge:
They have an important question. They know the data exists. But getting from that question to a working analysis—finding the right data, setting up a compute environment, and building analysis workflows — can take months.
The Information Commons is designed to change that.
By connecting data, tools, environments, and community into a unified ecosystem, IC enables researchers to move more efficiently from idea to data to analysis toward discovery.
Today, this transformation is being further accelerated by artificial intelligence. Increasingly, AI is becoming the interface between researchers and the IC—enabling natural-language interaction with data and tools, and dramatically reducing the distance between a research question and the analysis needed to answer it.
A Connected Research Ecosystem
Data | Tools | Environments | Community
The Information Commons connects:
- Large-scale multimodal clinical data
- Secure, scalable compute environments
- Specialized research tools and applications
- A collaborative support community
Together, these components allow researchers to move seamlessly from data exploration to analysis to modeling to deployment, supporting everything from observational studies to advanced AI development.
The Four Pillars of the Information Commons
Data
The IC provides access to linked, de-identified large-scale multi-modal clinical and biomedical data, including:
- Structured EHR data (diagnoses, labs, medications, encounters, demographics, etc.)
- Clinical notes and text-derived concepts
- Imaging and radiology data
- Genomic testing datasets
These datasets span several health systems (UCSF Health, SF DPH, Fresno CHS) and decades of clinical activity, enabling both population-scale research and AI model development.
👉 Explore: Information Commons Data
Tools
The IC includes a growing ecosystem of tools and applications that support:
- Cohort discovery and data exploration
- Clinical text search and abstraction
- Knowledge graph and multi-omics exploration
- Code-based analytics (Python, R, Jupyter)
- Reusable analytics patterns and workflows
These tools support both no-code exploration and advanced computational research, meeting researchers where they are.
👉 Explore: Information Commons Tools
Environments
The IC is powered by a set of purpose-built computing environments that support the full research lifecycle:
- Exploration: Secure desktop environments for data access and prototyping
- Development: Dedicated compute for building and testing workflows
- Execution at scale: High-performance GPU clusters for large analyses and AI training
- Cloud expansion: Elastic infrastructure for distributed and high-throughput workloads
Integrated environments such as IC FAC and IC AWS combine data, compute, and tools into a seamless research experience.
Community
The IC is not just infrastructure—it is a community of practice that helps researchers succeed.
Support includes:
- Training programs and learning resources
- Office hours and expert consultations
- Slack forums and peer collaboration
- Shared workflows and best practices
This community helps researchers navigate complexity, accelerate onboarding, and build reproducible, collaborative research.
AI as the Interface Layer
A key evolution of the Information Commons is the emergence of AI as a unifying interface across the ecosystem.
Through copilots, agents, and natural-language tools, researchers can:
- Translate research questions into data queries
- Navigate complex datasets and documentation
- Extract and structure information from clinical notes
- Build and execute analysis workflows
- Automate parts of the research pipeline
This “AI interface layer” is transforming how researchers interact with data—collapsing the distance between question and analysis.
From Idea to Impact
By integrating data, tools, environments, and community—and layering AI across them—the Information Commons enables researchers to:
- Accelerate time from idea to analysis
- Work across large, complex datasets with confidence
- Build scalable and reproducible workflows
- Collaborate across teams and institutions
- Translate research into real-world impact
The result is a platform that supports the full spectrum of research outcomes:
Discoveries → Models → Publications → Clinical Insights
Get Started
Whether you are exploring data for the first time or building advanced AI workflows, the Information Commons provides the foundation to support your work.
-
Join our forums on UCTech Slack workspace: #ucsf-deid-cdw-omop on UCTech.slack.com
-
Email for support and any questions: [email protected]
-
Request access to Self-Serve Research Data Assets
-
Book time with informatics and computational environment experts on our team