The UCSF-SFDPH DeID OMOP database integrates electronic health record (EHR) data from UCSF Health and the San Francisco Department of Public Health (SFDPH). UCSF Health primarily serves insured patients, while SFDPH provides accessible healthcare regardless of insurance or immigration status. Combining the two supports research in population health and health equity by offering a unified patient database with broad demographic and socioeconomic representation. Our team developed detailed methodologies and transparent data mapping processes, including a global patient identifier and extension tables that distinguish data by health system. Documentation includes a data dictionary detailing the mapping logic, an analysis of patient sociodemographics across systems and subpopulations, and guidance for common research workflows. The project shows how combining diverse expertise and building partnerships can advance data integration and documentation, supporting impactful and efficient research with self-service de-identified data assets.
Speaker/Host
Anna Rubinsky is Lead Clinical Informatics Specialist on the Research Data Assets User Support team within UCSF’s Academic Research Services. Trained as a health services researcher, she specializes in using large-scale EHR data for research, with a focus on population health and health equity. Her background in developing robust analytic datasets helps her guide researchers on how to use and interpret complex clinical data. In her current role, Dr. Rubinsky contributes to the advancement of UCSF’s de-identified self-service research data assets, including the integration of EHR data from the San Francisco Department of Public Health and UCSF Fresno. She played a key role in ensuring these data align with research workflows, from optimizing data quality to creating detailed documentation. Beyond her data work, Dr. Rubinsky organizes biweekly UCSF GenAI office hours, a forum where the UCSF community shares experiences incorporating generative artificial intelligence solutions into their work.