openLOS ANGELES, CA

RTG: Statistics and Data Theory - Engaging the Future of Data Science

National Science Foundation

Description

The growing reliance on data-driven technologies across industry, science, and society demands a workforce with both strong theoretical foundations and the ability to apply rigorous statistical and mathematical reasoning to real-world problems. This Research Training Group, based in UCLA’s Department of Statistics & Data Science, will train the next generation of researchers in the mathematical and statistical foundations of data science, providing them with the tools to tackle emerging challenges in AI, biotechnology, and applied statistics. The Research Training Group will create an integrated ecosystem of research and training, engaging students at all academic levels, including high school, community college, undergraduate, graduate, and postdoctoral levels. Undergraduate students will participate in course-based research experiences and immersive summer research programs, working alongside graduate students and faculty. Graduate students and postdocs will receive comprehensive mentoring through specialized topic courses, working groups, seminars, workshops, and summer schools. A new seminar series will highlight connections between statistics, data theory, and applications across disciplines. These efforts will help build a mathematically trained workforce ready to engage with the demands of modern data science and its applications. The research project focuses on advancing the mathematical and statistical foundations of modern data science, with an emphasis on theory, methodology, and applications. In deep learning, the project will develop theory for large-scale non-convex optimization, algorithmic regularization, generalization in neural networks, and emerging phenomena such as feature learning. Foundational work on generative AI models will address scaling laws, prompt tuning, and statistical principles for in-context learning and diffusion models. The project also develops rigorous statistical methods to support trustworthy and reliable AI, as well as tools for enhancing the accuracy and interpretability of genomics and single-cell technologies, data integration, and biological network inference. Additional efforts focus on statistical approaches for practical causal inference in observational studies, methodologies for studying hidden populations, epidemic modeling from limited data, and advanced change point detection. Across these areas, the project combines theoretical development with use-inspired applications in AI, biology, medicine, and other sciences. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria. NSF Award ID: 2446222 | Program: 01002627DB NSF RESEARCH & RELATED ACTIVIT,01003031DB NSF RESEARCH & RELATED ACTIVIT,01002930DB NSF RESEARCH & RELATED ACTIVIT | Principal Investigator: Guido Montufar Cuartas | Institution: University of California-Los Angeles, LOS ANGELES, CA | Award Amount: $1,263,600 View on NSF Award Search: https://www.nsf.gov/awardsearch/show-award/?AWD_ID=2446222 View on Research.gov: https://www.research.gov/awardapi-service/v1/awards/2446222.html

Interested in this grant?

Sign up to get match scores, save grants, and start your application with AI-powered tools.

Start Free Trial

Grant Details

Funding Range

$1,263,600 - $1,263,600

Deadline

August 31, 2031

Geographic Scope

LOS ANGELES, CA

Status
open

External Links

View Original Listing

Want to see how well this grant matches your organization?

Get Your Match Score

Get personalized grant matches

Start your free trial to save opportunities, get AI-powered match scores, and manage your applications in one place.

Start Free Trial