Random Matrix Theory and Manifold Learning for High-Dimensional Data Integration
National Science FoundationDescription
This project develops new mathematical and computational tools for integrating high-dimensional datasets with partially shared structures, a challenge that arises across various fields, including molecular biology, precision medicine, business analytics, and economics. When data are collected from multiple sources—such as different individuals, experimental conditions, or technologies—joint analysis can reveal complex patterns that would be missed if each dataset were analyzed in isolation. However, existing methods often struggle to distinguish meaningful signals from noise, particularly when the data are high-dimensional and heterogeneous. This project addresses these limitations by creating a principled framework to uncover and align shared low-dimensional structures across datasets, ultimately enabling more accurate, interpretable, and biologically relevant insights. The project will also contribute to the broader community by developing open-source software tools and offering interdisciplinary training opportunities for students at various levels. This project will build new theoretical foundations and methods at the intersection of random matrix theory, manifold learning, and high-dimensional statistics, and it is closely related to artificial intelligence. Key contributions include new results in random matrix theory for composite and kernel matrices formed from multiple datasets, a Procrustes-based framework for aligning low-dimensional structures in high-dimensional noise, and a kernel-spectral approach for joint nonlinear embedding. These tools will enable more accurate and robust data integration, particularly in single-cell biology, where understanding conserved cellular patterns across different conditions or species is a central challenge. Broader applications include the analysis of electronic health records and other large-scale biomedical or economic datasets. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria. NSF Award ID: 2515684 | Program: 01002627DB NSF RESEARCH & RELATED ACTIVIT | Principal Investigator: Rong Ma | Institution: Harvard University, CAMBRIDGE, MA | Award Amount: $175,000 View on NSF Award Search: https://www.nsf.gov/awardsearch/show-award/?AWD_ID=2515684 View on Research.gov: https://www.research.gov/awardapi-service/v1/awards/2515684.html
Interested in this grant?
Sign up to get match scores, save grants, and start your application with AI-powered tools.
Grant Details
$175,000 - $175,000
June 30, 2029
CAMBRIDGE, MA
External Links
View Original ListingWant to see how well this grant matches your organization?
Get Your Match Score