Description
/ABSTRACT DNA sequencing holds significant promise for precisely profiling individual tumors, which in turn can guide personalized cancer therapies and improve long-term patient outcomes. A key prognostic factor across tissue types is intra-tumor heterogeneity (ITH), resulting from somatic single-nucleotide variants and copy-number aberrations, that is highly predictive of patient survival and risks of metastasis and relapse. However, accurately estimating ITH and tumor growth from DNA data remains challenging, thereby impeding its broader adoption for clinical decision making. This proposed project will focus on developing DECODE (Deciphering Cancer Origin from DNA Evolution), a novel approach for reconstructing tumor evolution that can predict patient outcomes and facilitate risk stratification and treatment selection in clinical practice (Aim 1). DECODE will integrate the neutral tail in the site frequency spectrum (SFS), comprised of recent mutations that carry signals indicative of a tumor’s expansion mode. DECODE’s enhanced accuracy will be based on mathematical corrections for sample-specific coverage and effects of mutation filtering. This will reduce the risk of overfitting, enhance the detection of overlapping clusters, and render DECODE ideally suited for data generated by current DNA-seq technologies. In Aim 2, DECODE will be used to analyze the tumor heterogeneity and infer tumor evolution history in primary cancers from the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For each sample, the SFS neutral tail will be characterized and the number of subclones and subclonal frequencies will be estimated. The tumor evolution timelines from these samples will then be inferred, using a novel mathematical framework to estimate the age of each tumor’s most recent common ancestor, subclonal arrival times and growth rates from DECODE’s parameters. Finally, the ITH estimates and tumor and subclonal growth rates will be examined for predictive power of patient outcome in specific cancer types, toward finding potential biomarkers. Relapsed and metastatic tumors from the Personalized OncoGenomics (POG) cohort will also be studied (Aim 3). For these samples, a multi-variate DECODE algorithm will be developed to consider mutations with distinct multiplicities occurring on different copy number backgrounds. The multi-variate DECODE will be able to analyze advanced cancers, which often exhibit variable copy numbers due to chromosomal instability. The comparison of ITH estimates and clonal evolutions inferred from DECODE between samples in POG and ICGC/TCGA will enable a deeper understanding into differences in selection forces driving relapses and metastases compared to de novo cancers. In summary, DECODE is a novel approach toward reliable tumor evolution reconstruction, enabled by advances in mathematical modeling and parameter inference. Its accuracy, notably for samples sequenced with currently available technologies, will enable deeper understanding of tumor adaptation, clinical patient stratification and outcome prediction, toward the goal of personalized therapy treatment. Project Number: 1R01CA310281-01 | Fiscal Year: 2026 | NIH Institute/Center: National Cancer Institute (NCI) | Principal Investigator: Khanh Dinh (+1 co-PI) | Institution: COLUMBIA UNIV NEW YORK MORNINGSIDE, NEW YORK, NY | Award Amount: $428,975 | Activity Code: R01 | Study Section: Special Emphasis Panel[ZRG1 BBBT-M (84)] View on NIH RePORTER: https://reporter.nih.gov/project-details/11320338
Interested in this grant?
Start a free 7-day trial to get match scores, save grants, and build your application with AI.
Grant Details
$428,975 - $428,975
March 31, 2031
NEW YORK, NY
View the application link
Start a free 7-day trial to open the original listing and funder website, save this grant, and track its deadline. Cancel anytime.
Start free trialWant to see how well this grant matches your organization?
Get Your Match Score