We develop new statistical methods and computational tools for genomics and transcriptomics, focusing on single-cell data analysis. Our research aims to address the challenges of high-dimensional data, sparsity, and noise in single-cell datasets. We leverage statistical modeling, machine learning, and computational techniques to improve the accuracy and interpretability of single-cell analyses. We are developing new methods for analysing developmental single-cell RNA-seq data, which can help in understanding the dynamics of cell differentiation and lineage specification.
A major goal of modern biology is to decipher the mechanistic principles underlying the phenotypic variation between species. The seminal works of Britten & Davidson and King & Wilson, more than 40 years ago, proposed that the phenotypic differences observed in species with high genetic similarity could be due to differences in the way genes are regulated. Deciphering how changes in the DNA sequence or the underlying epigenomic state have shaped the complex phenotype landscape across species remains a major challenge. We are current using cross-species single-cell data to improve our understanding of human diseases and their underlying biological processes. Our current focus is on developing methods for cross-species integrative analysis of single-cell ATAC-seq and single-cell RNA-seq data ('multiome') We aim to deepen our understanding of the non-coding genome by leveraging information across species thereby ultimately enabling us to accurately predict the functional impact of a novel non-coding mutation anywhere in the genome.
We apply statistical modeling and machine learning techniques to address public health challenges, particularly in the context of both communicable and non-communicable diseases. Our current research focuses on developing predictive models for risk analysis of breast cancer, and developing machine learning models for predicting obstructive sleep apnea. We are also interested in understanding the impact of environmental factors on health outcomes, particularly the impact of climate change on disease, birth and mortality patterns in India. These are relatively new areas for our lab, and we are excited to explore the potential of data-driven approaches to improve public health outcomes.
Duration | Project Title | Funding Organization |
---|---|---|
2024–2026 | Development of a Breast Cancer Risk Prediction Tool for the Indian Population | KCDH IITB - GKII JHU Breakthrough Research Grants Program |
2024–2027 | Characterizing non-invasive biomarkers of fatty liver disease using comparative single-cell multiomics | Anusandhan National Research Foundation (ANRF/ECRG/2024/006839/LS) |
2025–2028 | Development of machine learning models for predicting obstructive sleep apnea | TiH (Department of Science and Technology), IIT Bombay |