Research

Gene Regulatory Networks

Integration of high-dimensional genomic and epigenomic datasets, such as copy number aberration, DNA methylation, gene, and micro RNA expression datasets to compute gene regulatory interactions in cancer.

Description:

Patients with cancer frequently experience copy number aberrations, such as chromosomal amplifications and deletions. Regulators like microRNAs (miRNAs), which control downstream target genes involved in the crucial biological processes of carcinogenesis and proliferation, are frequently found in aberrated copy number areas. In this work, Dr. Bose investigated the networks of miRNA-gene interactions based on copy number and developed a computational pipeline, miRDriver based on the idea that cancer patient copy number data can be used to identify cancer driver miRNAs. miRDriver was applied in 18 different cancer types from The Cancer Genome Atlas (TCGA) database and discovered various miRNA-gene interactions that are common and particular to cancer, as well as interactions that are enriched in experimentally verified miRNA-target interactions. The work highlighted a number of oncogenic and tumor suppressor miRNAs that were both frequent in several cancer types and cancer-specific. In choosing significantly known miRNA-gene interactions across a range of cancer types, this strategy significantly outperformed the state-of-the-art strategies. It was discovered that a number of miRNAs and genes were linked to tumor survival and development. Selected target genes were discovered to have a considerable enrichment in Gene Ontology (GO) terms, cancer-related pathways, and cancer hallmark pathways. Additionally, in numerous cancer types, subtype-specific potential gene signatures have been found.

Precision Medicine

Designing the computational methods which integrate multi-omics datasets of patient tumor samples with drugs’ chemical properties and side effects to predict optimal drug response in cancer patients, thus connecting the patients with the most effective and less toxic therapies.

Description:

The functional link between genes and drug response as well as drug response prediction based on genomic and chemical properties have both been investigated using laboratory-cultured cell lines. However, determining how a medicine will affect actual patients is a more crucial and difficult issue. Bose's research combines information from primary tumors and cancer cell lines to uncover connections between cell lines and tumors in order to address this problem. To fill in these gaps, she designed a computational pipeline CTDPathSim, which uses a pathway activity-based method to compare the genetic, genomic, and epigenetic properties of primary tumor samples and cell lines while integrating multi-omics datasets in breast and ovarian cancers. According to her findings, highly comparable sample-cell line pairs respond to pharmaceuticals similarly to lowly similar pairs when using a variety of FDA-approved cancer medications, including Paclitaxel, Vinorelbine, and Mitomycin-c.

Flowchart of the CTDpathSim pipeline

Drug response concordance with sample-cell line similarity scores. A) Conditional density plot of matching (~ grey) and mismatch (~ black) of Vinorelbine’s response between BRCA samples and CCLE cell lines with computed similarity scores by five different methods: A1) CTDPathSim, A2) TSI method, A3) TC analysis, A4) SRCCM, A5) Celligner. The matching percentage of response of drug vinorelbine increases as the similarity score increases in A1, whereas in A2-A5, this drug does not show concordance between response and similarity scores. B) Boxplots of matching and mismatch response percentage with common drugs between TCGA BRCA cohort and CCLE cell lines in a low similarity threshold (≤ -1.5) for all the methods except TSI threshold (≤ -1) (with a Wilcoxson p-value in each box in red: B1) CTDPathSim shows lower median value of matching drug response in low similarity threshold with significant p-value (<0.05) , whereas all other four methods in B2-B5, there was no difference between medians of matching and mismatch responses ( p-values > 0.05). C) Boxplots of matching and mismatch response percentage with common drugs between TCGA BRCA cohort and CCLE cell lines in a high similarity threshold (≥ 1.5 )for the five methods with a Wilcoxson p-value in each box in red: C1) CTDPathSim shows the higher median value of matching drug response in high similarity threshold with significant p-value (<0.05) , whereas for TSI method and TC analysis in C2 and C3, there was no difference between medians of matching and mismatch responses ( p-values > 0.05); C4)SRCCM and C5)Celligner also show significant higher matching response in high similarity threshold.


Elucidating sexual dimorphism in cancer

Exploring the molecular basis of sexual dimorphism within and across cancers at the molecular systems genomics level in tumors and discovering the contribution of germline genetic variation to cancer risk utilizing statistical genetics and machine learning approaches.

Overview:

As part of this project, Dr. Bose is working to identify a group of sex-biased genes that can reveal the disparity between male and female patients in a variety of tumor types using machine learning models. She is also developing a computational tool that applies various models of sex-dependent genetic effects using statistical genetics and genome-wide analyses of complex traits and disease conditions.

Publication(s):

  • Manuscripts are in progress.