CONNECT: Cohort-Oriented Network kNowledgebase for Enhanced CollaboraTion
National Institute of Environmental Health SciencesDescription
/ABSTRACT Cohort studies play a fundamental role in contemporary environmental health sciences, advancing our understanding of how chemical exposures, lifestyle factors, and genetics contribute to human health. The National Institutes of Health (NIH) has always recognized the importance of cohort studies, especially in enhancing collaboration by pulling cohort resources to include participants from various populations and backgrounds. Summarizing and managing the findings and knowledge from cohort studies are indispensable components of cohort development, yet these tasks have often been overlooked. It will inevitably result in a significant loss if the knowledge and findings from cohort studies are not easily available or accessible to the entire environmental health community and public. Therefore, in this project, we propose Cohort-Oriented Network kNowledgebase for Enhanced CollaboraTion (CONNECT), a directed knowledge graph summarizing the knowledge for children’s environmental health, to meet the emerging needs of contemporary environmental health science community. The CONNECT knowledgebase will harmonize existing cohort study knowledge in children’s health and align with future environmental health development directions for managing multi-cohort findings. Indeed, the proposed CONNECT knowledgebase is a unique resource that is distinguished from other existing knowledgebases in the environmental health community. Our aims are: Aim1. Develop an AI-assisted knowledge extraction pipeline with prompt engineering for harmonizing environmental health knowledge from published cohort studies; Aim 2: Develop the CONNECT knowledgebase for children’s environmental health; Aim3: Develop CONNECT knowledgebase webtool and showcase the utility of CONNECT knowledgebase for users. Specifically, we will first develop an AI-assisted pipeline for knowledge extraction, validation, and harmonization from publications. We will leverage ChatGPT to assist in extracting findings/associations from 17 NIEHS funded children’s cohort publications. We will harmonize our environmental health language based on the existing HHEAR Ontology. Secondly, we will further construct the CONNECT knowledgebase based on the extracted knowledge using the proposed pipeline. The CONNECT knowledgebase will consist of knowledge from each cohort, resulting in an overall knowledge graph with harmonized environmental health language. Moreover, we will develop a webtool to facilitate knowledge dissemination to the environmental health community. We also demonstrate the various utilities of the CONNECT knowledgebase, including exposure-outcome evidence mapping, mechanistic insights and hypothesis generation, statistical querying for meta-analysis, among many other applications. Finally, our proposed CONNECT knowledgebase will provide a unique opportunity to train undergraduate students to become the next generation biomedical scientists in both data science and environmental health. Project Number: 1R15ES038712-01 | Fiscal Year: 2026 | NIH Institute/Center: National Institute of Environmental Health Sciences (NIEHS) | Principal Investigator: Yike Shen | Institution: UNIVERSITY OF TEXAS ARLINGTON, ARLINGTON, TX | Award Amount: $605,091 | Activity Code: R15 | Study Section: Analytics and Statistics for Population Research Panel B Study Section[ASPB] View on NIH RePORTER: https://reporter.nih.gov/project-details/11362236
Interested in this grant?
Start a free 7-day trial to get match scores, save grants, and build your application with AI.
Grant Details
$605,091 - $605,091
Not specified
ARLINGTON, TX
View the application link
Start a free 7-day trial to open the original listing and funder website, save this grant, and track its deadline. Cancel anytime.
Start free trialWant to see how well this grant matches your organization?
Get Your Match Score