OVERVIEW
Our research focuses on the development and application of advanced data integration methods to enhance the diagnosis, treatment, and prevention of complex diseases. A key aspect of our work involves integrating multi-omics data with biological knowledge to translate genomic and phenotypic data from electronic health records (EHR) into actionable clinical products. Our projects span both theoretical and applied research, including:
​
-
Developing novel data integration methods that combine multi-omics using machine learning and deep learning approaches
-
Predicting clinical outcomes by analyzing interactions between multi-omic features
-
Integrating multi-modal imaging with multi-omics data
-
Identifying gene-by-environment (GxE) interactions across various phenotypes and diseases
-
Constructing disease-disease networks using EHR-linked biobank data
-
Developing Immune health scores and foundation models for Immune Health
​
Our ongoing work continues to focus on providing clinical tools that leverage the interactions within and between different layers of genomic data. Our long-term research goal is to develop and evaluate sophisticated data integration methods that simultaneously account for individual variations in genomic (multi-omics) data, imaging data, phenotype data from EHRs, and environmental/lifelog data, all aimed at advancing precision medicine. Additionally, our research leverages artificial intelligence (AI)-driven approaches to further enhance healthcare outcomes, paving the way for more personalized and effective treatments.
Lecture - Translational research, via data integration
Why Data Integration?
A central problem in translational research is that it’s difficult to understand the genetic architecture of a complex disease. Many genome-wide association studies (GWAS) exist—but most of the genetic loci that have been identified have negligible effects: Heritability, in many cases, isn’t found. That’s why we need alternative strategies to find underlying disease causes, said Assistant Professor of Informatics Dokyoon Kim, PhD—and a systems genomics approach, using data integration, can get the job done.
One Math Model to Explain Disease Phenotype
With the traditional approach, you can analyze a beautiful multi-omics data set, Dr. Kim continued—but you must handle the associations one pair at a time. So he asked himself: Can I integrate everything into one mathematical model to predict outcome or explain disease phenotype?