Dr. Maria-Florina Balcan

ACM Fellows (2023)
ACM Grace Murray Hopper Award (2019)
ACM Names 68 Fellows for Contributions to Computing That Underpin Our Daily Lives
2019 ACM Grace Murray Hopper Award

ACM Fellows

USA - 2023

citation

For contributions to the foundations of machine learning and its applications to algorithmic economics and algorithm design

Press Release

ACM Grace Murray Hopper Award

USA - 2019

citation

For foundational and breakthrough contributions to minimally-supervised learning.

For many real-world learning problems, unlabelled data is plentiful and easily harvested (images, video, clicks) but data with relevant labels for the task at hand (e.g., personalized learning, drug design) is much more expensive. Maria's foundational work developed the first theoretical framework that could capture the intuition behind the different types of learning methods designed to leverage both types of data and provided formal analysis in both qualitative and quantitative terms. At the same time, Maria also developed breakthrough results for active learning, where the learning algorithm intelligently chooses which data points to send for labelling. Most notably, she showed how active learning can always asymptotically improve over passive learning and gave the first general active learning algorithm that could tolerate realistic noise and model imperfections.

Press Release

ACM Names 68 Fellows for Contributions to Computing That Underpin Our Daily Lives

ACM, the Association for Computing Machinery, has named 68 Fellows for transformative contributions to computing science and technology. All the 2023 inductees are longstanding ACM Members who were selected by their peers for groundbreaking innovations that have improved how we live, work, and play.

“The announcement each year that a new class of ACM Fellows has been selected is met with great excitement,” said ACM President Yannis Ioannidis. “ACM is proud to include nearly 110,000 computing professionals in our ranks and ACM Fellows represent just 1% of our entire global membership. This year’s inductees include the inventor of the World Wide Web,the “godfathers of AI, and other colleagues whose contributions have all been important building blocks in forming the digital society that shapes our modern world.

In keeping with ACM’s global reach, the 2023 Fellows represent universities, corporations, and research centers in Canada, China, Germany, India, Israel, Norway, Singapore, the United Kingdom, and the United States. The contributions of the 2023 Fellows run the gamut of the computing field―including algorithm design, computer graphics, cybersecurity, energy-efficient computing, mobile computing, software analytics, and web search, to name a few.

Additional information about the 2023 ACM Fellows, as well as previously named ACM Fellows, is available through the ACM Fellows website.

Read the news release.

2019 ACM Grace Murray Hopper Award

ACM named Maria Florina “Nina” Balcan of Carnegie Mellon University the recipient of the 2019 ACM Grace Murray Hopper Award for foundational and breakthrough contributions to minimally-supervised learning. Balcan’s influential and pioneering work in machine learning has solved longstanding open problems, enabled entire lines of research crucial for modern AI systems, and has set the agenda for the field for years to come.

The ACM Grace Murray Hopper Award is given to the outstanding young computer professional of the year, selected on the basis of a single recent major technical or service contribution. This award is accompanied by a prize of $35,000. The candidate must have been 35 years of age or less at the time the qualifying contribution was made. Financial support for this award is provided by Microsoft.

“Nina Balcan wonderfully meets the criteria for the ACM Grace Murray Hopper Award, as many of her groundbreaking contributions occurred long before she turned 35,” said ACM President Cherri M. Pancake. “Although she is still in the early stages of her career, she has already established herself as the world leader in the theory of how AI systems can learn with limited supervision. More broadly, her work has realigned the foundations of machine learning, and consequently ushered in many new applications that have brought about leapfrog advances in this exciting area of artificial intelligence.”

Select Technical Contributions

Semi-supervised Learning
Semi-supervised learning is an approach to machine learning in which algorithms use large amounts of easily available unlabeled data to augment small amounts of labeled data to improve predictive accuracy. When semi-supervised learning was first explored, early research suggested some promising results. However, prior to Balcan’s work, there were no general principles for designing and providing formal guarantees for algorithms that leverage both labeled and unlabeled data. By introducing the first general theoretical framework, Balcan showed how to achieve provable guarantees on the performance of such techniques with concrete implications for many different types of semi-supervised learning methods. Her foundational principles for learning from limited supervision were instrumental in advancing this important tool in machine learning and supporting the subsequent work of many other researchers in this area.

Active Learning/Noise Tolerant Learning
Balcan also made significant contributions in the related area of active learning. In active learning, the algorithm processes large volumes of data and intelligently chooses the datapoints to be labeled. Balcan established performance guarantees for active learning that hold even in challenging cases when “noise” is present in the data. These guarantees hold under arbitrary forms of noise, that is, anything that distorts or corrupts the data. This can include anything from a blurry photo, a unit of data that is improperly labeled, meaningless information, or data that the algorithm cannot interpret. Building on this work, Balcan and her collaborators also developed algorithms that can learn more efficiently under more specialized forms of “label noise.” Examples of label noise might include a researcher not being given all of the health symptoms when annotating data to make predictions about a disease, or the data being encoded incorrectly. Her work in active learning in the presence of noise was regarded as a breakthrough in the field.

Clustering
Clustering is an unsupervised learning technique in which an algorithm groups datapoints with similar properties. One goal of clustering is to find meaningful structure in data. An early challenge in the field, however, was to establish a theoretical foundation for what constituted a “meaningful structure” in a dataset. In her early work, Balcan proposed a theoretical foundation for understanding the general kinds of structures that can be detected by clustering, as well as characterizing the functionality of specific clustering algorithms. As she developed her theoretical framework further, she also devised novel clustering algorithms that were derived from these theoretical foundations, and showed applications of these algorithms to computational biology and web search.

Background

Maria Florina Balcan is an Associate Professor of Computer Science at Carnegie Mellon University. Her research interests include learning theory, machine learning, theory of computing, artificial intelligence, algorithmic economics and algorithmic game theory, and optimization. Balcan received Bachelor’s and Master’s degrees from the University of Bucharest (Romania) in 2000 and 2002, respectively. In 2008, she earned a PhD in Computer Science from Carnegie Mellon University.

Balcan’s honors include a National Science Foundation Career Award in 2009, a Microsoft Faculty Fellowship in 2011, and a Sloan Research Fellowship in 2014, as well as numerous conference paper awards. Balcan has served as the Program Committee Co-chair for all three of the major machine learning conferences: Conference on Neural Information Processing Systems (NeurIPS), International Conference on Machine Learning (ICML), and Conference on Learning Theory (COLT). Balcan’s publications are among the most cited in the machine learning theory field, and she continues to be a prolific author. Her most recent publications include chapters on “Data-Driven Algorithm Design” and “Noise in Classification,” for the book Beyond the Worst-Case Analysis of Algorithms, which will be published later this year.