Categories
  Encyclosphere.org ENCYCLOREADER
  supported by EncyclosphereKSF

David Silver (programmer)

From HandWiki - Reading time: 3 min

Professor David Silver (dob c.1976) leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo.

He graduated from Cambridge University in 1997 with the Addison-Wesley award, and befriended Demis Hassabis whilst there[1]. Subsequently, David co-founded the video games company Elixir Studios, where he was CTO and lead programmer, receiving several awards for technology and innovation.[2][3]

David returned to academia in 2004 at the University of Alberta to study for a PhD on reinforcement learning, where he co-introduced the algorithms used in the first master-level 9x9 Go programs.[4][5] His version of program MoGo (co-authored with Sylvain Gelly) was one of the strongest Go programs as of 2009.[6]

David was awarded a Royal Society University Research Fellowship in 2011, and subsequently became a lecturer at University College London, where he is now a professor.[7] His lectures on Reinforcement Learning are available on YouTube[8]. David consulted for DeepMind from its inception, joining full-time in 2013.

His recent work has focused on combining reinforcement learning with deep learning, including a program that learns to play Atari games directly from pixels.[9] David led the AlphaGo project, culminating in the first program to defeat a top professional player in the full-size game of Go.[10] AlphaGo subsequently received an honorary 9 Dan Professional Certification; and won the Cannes Lion award for innovation.[11]. He then led development of AlphaZero, which used the same AI to learn to play Go from scratch (learning only by playing itself and not from human games) before learning to play chess and shogi in the same way, to higher levels than any other computer program[12].

David is among the most published members of staff at DeepMind and has an h-index of 40.[13]

References

  1. http://uk.businessinsider.com/david-silver-the-unsung-hero-at-google-deepmind-2016-3
  2. "David Silver: The unsung hero and intellectual powerhouse at Google DeepMind" (in en). Business Insider. http://uk.businessinsider.com/david-silver-the-unsung-hero-at-google-deepmind-2016-3. 
  3. "What the AI Behind AlphaGo Can Teach Us About Being Human". https://www.wired.com/2016/05/google-alpha-go-ai/. Retrieved 17 May 2016. 
  4. David, Silver,. "Reinforcement Learning and Simulation-Based Search in Computer Go" (in en). ERA. doi:10.7939/R39D8T. https://era.library.ualberta.ca/files/cf95jb59d#.WSkP7BPyvBI. 
  5. Sylvain Gelly, David Silver (2008). "Achieving Master Level Play in 9 × 9 Computer Go". Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence. 
  6. Stuart J. Russell, Peter Norvig (2009). A Modern Approach (3rd ed.). Prentice Hall. 
  7. "CSML | David Silver" (in en-US). http://www.csml.ucl.ac.uk/people/silver/. 
  8. https://www.youtube.com/watch?v=2pWv7GOvuf0
  9. Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin et al. (2015-02-26). "Human-level control through deep reinforcement learning" (in en). Nature 518 (7540): 529–533. doi:10.1038/nature14236. ISSN 0028-0836. https://www.nature.com/nature/journal/v518/n7540/full/nature14236.html. 
  10. Silver, David; Huang, Aja; Maddison, Chris J.; Guez, Arthur; Sifre, Laurent; Driessche, George van den; Schrittwieser, Julian; Antonoglou, Ioannis et al. (28 January 2016). "Mastering the game of Go with deep neural networks and tree search". Nature 529 (7587): 484–489. doi:10.1038/nature16961. ISSN 0028-0836. PMID 26819042. Bibcode2016Natur.529..484S. https://www.nature.com/nature/journal/v529/n7587/full/nature16961.html. Retrieved 10 December 2017. closed access
  11. "Google DeepMind AlphaGo in U.K. Wins Innovation Grand Prix" (in en). http://adage.com/article/special-report-cannes-lions/google-deepmind-alphago-wins-cannes-innovation-grand-prix/304644/. 
  12. https://www.theguardian.com/technology/2017/dec/07/alphazero-google-deepmind-ai-beats-champion-program-teaching-itself-to-play-four-hours
  13. "David Silver – Google Scholar Citations". https://scholar.google.com/citations?user=-8DNE4UAAAAJ&hl=en. Retrieved 30 August 2018. 





Licensed under CC BY-SA 3.0 | Source: https://handwiki.org/wiki/Biography:David_Silver_(programmer)
5 views | Status: cached on August 01 2024 01:04:18
↧ Download this article as ZWI file
Encyclosphere.org EncycloReader is supported by the EncyclosphereKSF