Research Blog: February 2015

From Pixels to Actions: Human-level control through Deep Reinforcement Learning

Wednesday, February 25, 2015

Posted by Dharshan Kumaran and Demis Hassabis, Google DeepMind, London

Video courtesy of Atari Inc. and Mnih et al. “Human-level control through deep reinforcement learning”

Comparison of the DQN agent with the best reinforcement learning methods in the literature. The performance of DQN is normalized with respect to a professional human games tester (100% level) and random play (0% level). The normalized performance of DQN, expressed as a percentage, is calculated as: 100 × (DQN score - random play score) / (human score - random play score). Error bars indicate s.d. across the 30 evaluation episodes, starting with different initial conditions. Figure courtesy of Mnih et al. “Human-level control through deep reinforcement learning”, Nature 26 Feb. 2015.
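
As a rough illustration of the normalization described in the caption, the formula can be sketched in a few lines of Python. The function name `normalized_score` and the example scores are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
def normalized_score(agent_score: float, random_score: float, human_score: float) -> float:
    """Return the agent's score as a percentage, where random play maps to 0
    and the human tester maps to 100 (the formula quoted in the caption)."""
    if human_score == random_score:
        # The normalization is undefined when the two baselines coincide.
        raise ValueError("human_score and random_score must differ")
    return 100.0 * (agent_score - random_score) / (human_score - random_score)


# Hypothetical scores, for illustration only (not results from the paper).
print(normalized_score(agent_score=50.0, random_score=10.0, human_score=90.0))  # 50.0
```

A value above 100 would mean the agent outscored the human tester on that game, and a value below 0 would mean it did worse than random play.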

Google Faculty Research Awards: Winter 2015

Thursday, February 19, 2015

Posted by Maggie Johnson, Director of Education and University Relations

Google Science Fair 2015: what will you try?

Wednesday, February 18, 2015

Posted by Miriam Schneider, Google for Education team

(Cross-posted from the Google for Education Blog)

Announcing the 2015 North American Google PhD Fellows

Wednesday, February 18, 2015

Posted by Michael Rennaker, Google University Relations

Justin Meza, Google US/Canada Fellowship in Systems Reliability (Carnegie Mellon University)

Waleed Ammar, Google US/Canada Fellowship in Natural Language Processing (Carnegie Mellon University)

Aaron Parks, Google US/Canada Fellowship in Mobile Networking (University of Washington)

Kyle Rector, Google US/Canada Fellowship in Human Computer Interaction (University of Washington)

Nick Arnosti, Google US/Canada Fellowship in Market Algorithms (Stanford University)

Osbert Bastani, Google US/Canada Fellowship in Programming Languages (Stanford University)

Carl Vondrick, Google US/Canada Fellowship in Machine Perception (Massachusetts Institute of Technology)

Wojciech Zaremba, Google US/Canada Fellowship in Machine Learning (New York University)

Xiaolan Wang, Google US/Canada Fellowship in Structured Data (University of Massachusetts Amherst)

Muhammad Naveed, Google US/Canada Fellowship in Security (University of Illinois at Urbana-Champaign)

Masoud Moshref Javadi, Google US/Canada Fellowship in Computer Networking (University of Southern California)

Riley Spahn, Google US/Canada Fellowship in Privacy (Columbia University)

Saurabh Gupta, Google US/Canada Fellowship in Computer Vision (University of California, Berkeley)

Yun Teng, Google US/Canada Fellowship in Computer Graphics (University of California, Santa Barbara)

Tan Zhang, Google US/Canada Fellowship in Mobile Systems (University of Wisconsin-Madison)
