Pieter Abbeel

ACM Prize in Computing (2021)
2021 ACM Prize in Computing

ACM Prize in Computing

USA - 2021

citation

For contributions to robot learning, including learning from demonstrations and deep reinforcement learning for robotic control

Pieter Abbeel has pioneered learning in robotics, an area of research and engineering that barely existed when he started his PhD, now almost 20 years ago, but since has become one of the most prevalent research paradigms in robotics, in large part thanks to his foundational contributions.

Abbeel has pioneered how to make robots learn, both through learning from human demonstrations ("apprenticeship learning") and through their own trial and error ("reinforcement learning"), which has formed the foundation for the next generation of robotics.

Apprenticeship Learning: Abbeel's doctoral dissertation showcased that a clever combination of learning and optimal control can enable helicopter control at the level of the best human pilots. Upon his arrival as a faculty member at UC Berkeley, he showed how to generalize these underlying ideas to robotic laundry folding and surgical suturing. More recently, he has pioneered few-shot imitation learning, where a robot is able to learn to perform a task from just one demonstration after having been pre-trained on a large set of demonstrations on related tasks.

Deep Reinforcement Learning: The harnessing of deep neural models in reinforcement learning (RL) has been a hot topic for several years, highlighted by Google DeepMind's Go player beating the human world champion. RL allows artificial intelligence systems to learn from "trial and error" feedback, alleviating the need for demonstrations. Abbeel is considered the leader of his generation in RL, especially as it pertains to robotics. His work on trust-region policy optimization provided the first reliable RL procedure for continuous control, as showcased on simulated robotic environments. Further from there, Abbeel has made several other pioneering contributions to Deep RL for robotics. These contributions include generalized advantage estimation, which enabled the first 3D robot locomotion learning; soft-actor critic, which is one the most popular Deep RL algorithms to-date; domain randomization, which showcases how learning across appropriately randomized simulators can generalize surprisingly well to the real world; and hindsight experience replay, which has been instrumental for Deep RL in sparse-reward/goal-oriented environments.

Abbeel's work has been widely recognized by his peers. His doctoral dissertation received the 2008 Dick Volz Best U.S. Ph.D. Thesis in Robotics & Automation Award. He received a Sloan Foundation Fellowship in 2011, an NSF Early Career Development Program Award (NSF-CAREER) in 2014, and a Presidential Early Career Award for Scientists and Engineers in 2016. He was elected an IEEE Fellow in 2018. His work is widely cited, and he has received numerous best-paper awards at top robotics and machine-learning conferences.

Abbeel's groundbreaking research has helped shape contemporary robotics and continues to drive the future of the field.

Press Release

2021 ACM Prize in Computing

ACM named Pieter Abbeel the recipient of the 2021 ACM Prize in Computing for contributions to robot learning, including learning from demonstrations and deep reinforcement learning for robotic control. Abbeel pioneered teaching robots to learn from human demonstrations (“apprenticeship learning”) and through their own trial and error (“reinforcement learning”), which have formed the foundation for the next generation of robotics. Abbeel is a Professor at the University of California, Berkeley and the Co-Founder, President and Chief Scientist at Covariant, an AI robotics company.

Early in his career, Abbeel developed new apprenticeship learning techniques to significantly improve robotic manipulation. As the field matured, researchers were able to program robots to perceive and manipulate rigid objects such as wooden blocks or spoons. However, programming robots to manipulate deformable objects, such as cloth, proved difficult because the way soft materials move when touched is unpredictable. Abbeel introduced new methods to enhance robot visual perception, physics-based tracking, control, and learning from demonstration. By combining these new methods, Abbeel developed a robot that was able to fold clothes such as towels and shirts ─ an improvement over existing technology that was considered an important milestone at the time.

Abbeel’s contributions also include developing robots that can perform surgical suturing, detect objects, and plan their trajectories in uncertain situations. More recently, he has pioneered “few-shot imitation learning,” where a robot is able to learn to perform a task from just one demonstration after having been pre-trained with a large set of demonstrations on related tasks.

Another especially promising area where Abbeel has made important contributions is in deep reinforcement learning for robotics. Reinforcement learning is an area of machine learning where an agent (e.g., a computer program) seeks to progress towards a reward (e.g., winning a game). While early reinforcement learning programs were effective, they could only perform simple tasks. The innovation of combining reinforcement learning with deep neural networks ushered in the new field of deep reinforcement learning, which can solve far more complex problems than computer programs developed with reinforcement learning alone.

Abbeel’s key breakthrough contribution in this area was developing a deep reinforcement learning method called Trust Region Policy Optimization. This method stabilizes the reinforcement learning process, enabling robots to learn a range of simulated control skills. By sharing his results, posting video tutorials, and releasing open-source code from his lab, Abbeel helped build a community of researchers that has since pushed deep learning for robotics even further ─ with robots performing ever more complicated tasks.

Abbeel has also made several other pioneering contributions including: generalized advantage estimation, which enabled the first 3D robot locomotion learning; soft-actor critic, which is one of the most popular deep reinforcement learning algorithms to-date; domain randomization, which showcases how learning across appropriately randomized simulators can generalize surprisingly well to the real world; and hindsight experience replay, which has been instrumental for deep reinforcement learning in sparse-reward/goal-oriented environments.

“Teaching robots to learn could spur major advances across many industries ─ from surgery and manufacturing to shipping and automated driving,” said ACM President Gabriele Kotsis. “Pieter Abbeel is a recognized leader among a new generation of researchers who are harnessing the latest machine learning techniques to revolutionize this field. Abbeel has made leapfrog research contributions, while also generously sharing his knowledge to build a community of colleagues working to take robots to an exciting new level of ability. His work exemplifies the intent of the ACM Prize in Computing to recognize outstanding work with ‘depth, impact, and broad implications.’”

“Infosys is proud of our longstanding collaboration with ACM, and we are honored to recognize Pieter Abbeel for the 2021 ACM Prize in Computing,” said Salil Parekh, Chief Executive Officer, Infosys. “The robotics field is poised for even greater advances, as innovative new ways are emerging to combine robotics with AI, and we believe researchers like Abbeel will be instrumental in creating the next great advances in this field.”

Abbeel will be formally presented with the ACM Prize in Computing at the annual ACM Awards Banquet, which will be held this year on Saturday, June 11 at the Palace Hotel in San Francisco.

Background

Pieter Abbeel is a Professor of Computer Science and Electrical Engineering at the University of California, Berkeley and the Co-Founder, President and Chief Scientist at Covariant, an AI robotics company. Abbeel earned a B.S. in Electrical Engineering from Katholieke Universiteit Leuven, as well as M.S. and Ph.D. degrees in Computer Science from Stanford University.

Abbeel’s honors include a Presidential Early Career Award for Scientists and Engineers, a National Science Foundation Early Career Development Program Award, and a Diane McEntyre Award for Excellence in Teaching. Additionally, Abbeel was named a Top Young Innovator Under 35 by the MIT Technology Review and received the Dick Volz Best U.S. Ph.D. Thesis in Robotics and Automation Award. He is a Fellow of IEEE.