openHOUSTON, TX

A novel graph approach to resolve challenging and medically relevant regions in the human genome

National Human Genome Research Institute

Description

Despite improved human genome references, like the T2T and Human Pangenome References, limitations of short-read sequencing and inadequate tools preclude routine characterization of medically relevant complex regions like LPA, HLA, and GBA. This includes ‘graph genome’ methods, which have not yet been effectively applied to the characterization of clinically important, complex genes. We will develop a novel user-friendly and intuitive local graph approach that will be applied to resolve these critical genomic regions and yet accessible to non-specialists who wish to study complex regions of the human genome. A graph genome will be built from ~15,000 long-read genome assemblies from existing data from NIH Programs (All of Us, HPRC) and 50 locally generated long-read assemblies from a Cardiovascular Risk study (HeartCare) and then benchmarked and validated against existing references and methods. This graph will allow us to utilize existing short-read data to deepen our insights into the variation in LPA and its complex hypervariable Kringle IV type 2 (KIV-2) region, where additional variants potentially impact cardiovascular disease (CVD) risk. Newly assessed LPA variants will be analyzed for association with deep phenotypic measurements across 30,000 WGS and positive findings will greatly impact the application of LPA testing outside European populations. We will also apply our graph genome methods across larger data sets from TOPMed and All of US to assess around 1000 challenging but medically significant genes, including the American College of Medical Genetics Secondary Findings Gene List (73 genes), 132 HLA genes, and a further 395 clinically relevant genes, providing the most comprehensive annotated variant catalog of its kind. We will work with individual investigators (ARIC, SOL, All of US, GREGOR, TOPMed) to further validate the pathogenicity of newly identified variants, across multiple genetic diseases. We will also work with the TOPMed IRC to include these variants in imputation servers (BRAVO). Overall, this proposal will enable the use of graph genomes at scale and demonstrate their utility, impacting both the assessment of CVD risk across different populations and providing new information for multiple other diseases. Project Number: 1R01HG014006-01A1 | Fiscal Year: 2026 | NIH Institute/Center: National Human Genome Research Institute (NHGRI) | Principal Investigator: Fritz Sedlazeck | Institution: BAYLOR COLLEGE OF MEDICINE, HOUSTON, TX | Award Amount: $719,909 | Activity Code: R01 | Study Section: Genetic Variation and Evolution Study Section[GVE] View on NIH RePORTER: https://reporter.nih.gov/project-details/11223820

Interested in this grant?

Sign up to get match scores, save grants, and start your application with AI-powered tools.

Start Free Trial

Grant Details

Funding Range

$719,909 - $719,909

Deadline

February 28, 2030

Geographic Scope

HOUSTON, TX

Status
open

External Links

View Original Listing

Want to see how well this grant matches your organization?

Get Your Match Score

Get personalized grant matches

Start your free trial to save opportunities, get AI-powered match scores, and manage your applications in one place.

Start Free Trial