514:
400:
183:
1796:
1776:
1870:
1670:
367:
305:
games directly from pixels. Silver led the AlphaGo project, culminating in the first program to defeat a top professional player in the full-size game of Go.
803:
918:
1840:
415:; Igor Babuschkin; Wojciech M Czarnecki; et al. (30 October 2019). "Grandmaster level in StarCraft II using multi-agent reinforcement learning".
310:
1512:
1860:
317:, which used the same AI to learn to play Go from scratch (learning only by playing itself and not from human games) before learning to play
275:
83:
593:
1028:
826:"ACM Prize in Computing Awarded to AlphaGo Developer: David Silver Recognized for Breakthrough Advances in Computer Game-Playing"
911:
1875:
1855:
1701:
750:; Chris J. Maddison; et al. (27 January 2016). "Mastering the game of Go with deep neural networks and tree search".
668:
1802:
1353:
1090:
875:
829:
481:
638:
1241:
1048:
904:
118:
1569:
1850:
1756:
1696:
1294:
328:
Silver is among the most published members of staff at Google DeepMind, with over 200,000 citations and has an
232:
1845:
1289:
978:
527:
509:
564:
1731:
1128:
1085:
1038:
1033:
1782:
1078:
1004:
355:
195:
29:
1406:
1341:
942:
850:
1807:
1665:
1304:
1135:
958:
283:
204:
136:
1865:
1706:
963:
1751:
1736:
1389:
1384:
1284:
1152:
933:
106:
49:
271:, where he was CTO and lead programmer, receiving several awards for technology and innovation.
1835:
1711:
1471:
1190:
1185:
411:
348:
294:
248:
208:
114:
88:
1741:
1726:
1691:
1379:
1279:
1147:
825:
690:; et al. (25 February 2015). "Human-level control through deep reinforcement learning".
240:
54:
1609:
1830:
1761:
1716:
1162:
1107:
953:
948:
220:
73:
399:
8:
1336:
1314:
1063:
1058:
1016:
968:
685:
513:
391:
182:
286:. His lectures on Reinforcement Learning are available on YouTube. Silver consulted for
1721:
1299:
1787:
1775:
1579:
1231:
1102:
1095:
854:
777:
769:
717:
709:
585:
545:
442:
434:
613:
1532:
1522:
1329:
1123:
1073:
1068:
1011:
999:
785:
761:
752:
725:
701:
692:
537:
450:
426:
417:
110:
259:(co-authored with Sylvain Gelly) was one of the strongest Go programs as of 2009.
1645:
1589:
1411:
1053:
973:
742:
359:
287:
200:
132:
309:
subsequently received an honorary 9 Dan
Professional Certification; and won the
1619:
1584:
1574:
1399:
1157:
983:
669:"RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning"
395:
336:
268:
236:
140:
482:"David Silver: The unsung hero and intellectual powerhouse at Google DeepMind"
430:
1824:
1564:
1544:
1461:
1140:
773:
713:
599:
549:
505:
438:
412:
298:
267:
After graduating from university, Silver co-founded the video games company
1650:
1481:
896:
781:
721:
646:
589:
446:
251:, where he co-introduced the algorithms used in the first master-level 9Ă—9
789:
729:
572:
Proceedings of the Twenty-Third AAAI Conference on
Artificial Intelligence
454:
235:, graduating in 1997 with the Addison-Wesley award, and having befriended
1871:
Fellows of the
Association for the Advancement of Artificial Intelligence
1746:
1517:
1426:
1421:
1043:
1021:
122:
765:
705:
1640:
1599:
1594:
1507:
1416:
1324:
1236:
1216:
1635:
1604:
1502:
1346:
1309:
1246:
1200:
1195:
1180:
747:
314:
252:
216:
69:
153:
1537:
1369:
541:
325:
in the same way, to higher levels than any other computer program.
279:
851:"Royal Society elects outstanding new Fellows and Foreign Members"
1660:
1497:
1451:
1374:
1274:
1269:
1221:
672:
529:
Reinforcement
Learning and Simulation-Based Search in Computer Go
363:
329:
306:
212:
154:
Reinforcement learning and simulation-based search in computer Go
65:
239:
whilst at
Cambridge. Silver returned to academia in 2004 at the
1675:
1655:
1527:
1319:
147:
173:
1476:
1456:
1446:
1441:
1436:
1431:
1394:
1226:
584:
322:
318:
302:
804:"Google DeepMind AlphaGo in U.K. Wins Innovation Grand Prix"
606:
1466:
614:"What the AI Behind AlphaGo Can Teach Us About Being Human"
467:
The
Cambridge University List of Members up to 31 July 1998
368:
Association for the
Advancement of Artificial Intelligence
562:
244:
255:
programs and graduated in 2009. His version of program
351:
for breakthrough advances in computer game-playing.
565:"Achieving Master Level Play in 9 Ă— 9 Computer Go"
199:(born 1976) is a principal research scientist at
1822:
290:from its inception, joining full-time in 2013.
912:
475:
473:
926:
276:Royal Society University Research Fellowship
84:Royal Society University Research Fellowship
313:for innovation. He then led development of
919:
905:
679:
595:Artificial Intelligence: A Modern Approach
512:
470:
398:
301:, including a program that learns to play
181:
293:His recent work has focused on combining
499:
1861:Academics of University College London
1823:
525:
262:
1841:Alumni of Christ's College, Cambridge
900:
536:(PhD thesis). University of Alberta.
342:
1757:Generative adversarial network (GAN)
563:Sylvain Gelly; David Silver (2008).
405:
387:
385:
383:
686:Volodymyr Mnih; Koray Kavukcuoglu;
278:in 2011, and subsequently became a
13:
823:
736:
14:
1887:
479:
380:
366:. He was elected a Fellow of the
16:Computer scientist and researcher
1795:
1794:
1774:
868:
843:
817:
796:
661:
631:
358:(FRS) for his contributions to
1707:Recurrent neural network (RNN)
1697:Differentiable neural computer
578:
556:
519:
461:
1:
1752:Variational autoencoder (VAE)
1712:Long short-term memory (LSTM)
979:Computational learning theory
510:Mathematics Genealogy Project
373:
1876:Fellows of the Royal Society
1856:University of Alberta alumni
1732:Convolutional neural network
354:In 2021, Silver was elected
347:Silver was awarded the 2019
226:
7:
1727:Multilayer perceptron (MLP)
356:Fellow of the Royal Society
233:Christ's College, Cambridge
41:1976 (age 47–48)
10:
1892:
1803:Artificial neural networks
1717:Gated recurrent unit (GRU)
943:Differentiable programming
671:. 13 May 2015 – via
1770:
1684:
1628:
1557:
1490:
1362:
1262:
1255:
1209:
1173:
1136:Artificial neural network
1116:
992:
959:Automatic differentiation
932:
431:10.1038/S41586-019-1724-Z
284:University College London
207:. He has led research on
205:University College London
168:
164:
146:
137:University College London
128:
102:
95:
79:
61:
45:
37:
23:
964:Neuromorphic engineering
927:Differentiable computing
394:publications indexed by
1737:Residual neural network
1153:Artificial Intelligence
107:Artificial intelligence
50:University of Cambridge
876:"Elected AAAI Fellows"
526:Silver, David (2009).
349:ACM Prize in Computing
295:reinforcement learning
249:reinforcement learning
209:reinforcement learning
115:Reinforcement learning
89:ACM Prize in Computing
1851:Go (game) researchers
1692:Neural Turing machine
1280:Human image synthesis
639:"CSML | David Silver"
274:Silver was awarded a
241:University of Alberta
55:University of Alberta
1846:Computer programmers
1783:Computer programming
1762:Graph neural network
1337:Text-to-video models
1315:Text-to-image models
1163:Large language model
1148:Scientific computing
954:Statistical manifold
949:Information geometry
1129:In-context learning
969:Pattern recognition
766:10.1038/NATURE16961
706:10.1038/NATURE14236
486:businessinsider.com
335:of 93 according to
263:Career and research
203:and a professor at
1722:Echo state network
1610:JĂĽrgen Schmidhuber
1305:Facial recognition
1300:Speech recognition
1210:Software libraries
343:Awards and honours
1818:
1817:
1580:Stephen Grossberg
1553:
1552:
760:(7587): 484–489.
700:(7540): 529–533.
586:Stuart J. Russell
425:(7782): 350–354.
311:Cannes Lion award
189:
188:
97:Scientific career
1883:
1808:Machine learning
1798:
1797:
1778:
1533:Action selection
1523:Self-driving car
1330:Stable Diffusion
1295:Speech synthesis
1260:
1259:
1124:Machine learning
1000:Gradient descent
921:
914:
907:
898:
897:
891:
890:
888:
886:
872:
866:
865:
863:
861:
855:royalsociety.org
847:
841:
840:
838:
836:
821:
815:
814:
812:
810:
800:
794:
793:
740:
734:
733:
683:
677:
676:
665:
659:
658:
656:
654:
649:on 24 April 2021
645:. Archived from
635:
629:
628:
626:
624:
610:
604:
603:
598:(3rd ed.).
582:
576:
575:
569:
560:
554:
553:
523:
517:
516:
503:
497:
496:
494:
492:
477:
468:
465:
459:
458:
409:
403:
402:
389:
198:
185:
180:
177:
175:
160:
111:Machine learning
32:
21:
20:
1891:
1890:
1886:
1885:
1884:
1882:
1881:
1880:
1866:Google DeepMind
1821:
1820:
1819:
1814:
1766:
1680:
1646:Google DeepMind
1624:
1590:Geoffrey Hinton
1549:
1486:
1412:Project Debater
1358:
1256:Implementations
1251:
1205:
1169:
1112:
1054:Backpropagation
988:
974:Tensor calculus
928:
925:
895:
894:
884:
882:
874:
873:
869:
859:
857:
849:
848:
844:
834:
832:
822:
818:
808:
806:
802:
801:
797:
741:
737:
684:
680:
667:
666:
662:
652:
650:
637:
636:
632:
622:
620:
612:
611:
607:
583:
579:
567:
561:
557:
524:
520:
504:
500:
490:
488:
478:
471:
466:
462:
410:
406:
390:
381:
376:
360:Deep Q-Networks
345:
288:Google DeepMind
265:
243:to study for a
229:
219:and co-lead on
201:Google DeepMind
194:
172:
158:
139:
135:
133:Google Deepmind
121:
117:
113:
109:
87:
72:
68:
53:
46:Alma mater
33:
28:
26:
17:
12:
11:
5:
1889:
1879:
1878:
1873:
1868:
1863:
1858:
1853:
1848:
1843:
1838:
1833:
1816:
1815:
1813:
1812:
1811:
1810:
1805:
1792:
1791:
1790:
1785:
1771:
1768:
1767:
1765:
1764:
1759:
1754:
1749:
1744:
1739:
1734:
1729:
1724:
1719:
1714:
1709:
1704:
1699:
1694:
1688:
1686:
1682:
1681:
1679:
1678:
1673:
1668:
1663:
1658:
1653:
1648:
1643:
1638:
1632:
1630:
1626:
1625:
1623:
1622:
1620:Ilya Sutskever
1617:
1612:
1607:
1602:
1597:
1592:
1587:
1585:Demis Hassabis
1582:
1577:
1575:Ian Goodfellow
1572:
1567:
1561:
1559:
1555:
1554:
1551:
1550:
1548:
1547:
1542:
1541:
1540:
1530:
1525:
1520:
1515:
1510:
1505:
1500:
1494:
1492:
1488:
1487:
1485:
1484:
1479:
1474:
1469:
1464:
1459:
1454:
1449:
1444:
1439:
1434:
1429:
1424:
1419:
1414:
1409:
1404:
1403:
1402:
1392:
1387:
1382:
1377:
1372:
1366:
1364:
1360:
1359:
1357:
1356:
1351:
1350:
1349:
1344:
1334:
1333:
1332:
1327:
1322:
1312:
1307:
1302:
1297:
1292:
1287:
1282:
1277:
1272:
1266:
1264:
1257:
1253:
1252:
1250:
1249:
1244:
1239:
1234:
1229:
1224:
1219:
1213:
1211:
1207:
1206:
1204:
1203:
1198:
1193:
1188:
1183:
1177:
1175:
1171:
1170:
1168:
1167:
1166:
1165:
1158:Language model
1155:
1150:
1145:
1144:
1143:
1133:
1132:
1131:
1120:
1118:
1114:
1113:
1111:
1110:
1108:Autoregression
1105:
1100:
1099:
1098:
1088:
1086:Regularization
1083:
1082:
1081:
1076:
1071:
1061:
1056:
1051:
1049:Loss functions
1046:
1041:
1036:
1031:
1026:
1025:
1024:
1014:
1009:
1008:
1007:
996:
994:
990:
989:
987:
986:
984:Inductive bias
981:
976:
971:
966:
961:
956:
951:
946:
938:
936:
930:
929:
924:
923:
916:
909:
901:
893:
892:
867:
842:
816:
795:
735:
678:
660:
630:
605:
577:
555:
542:10.7939/R39D8T
518:
498:
469:
460:
404:
396:Google Scholar
378:
377:
375:
372:
344:
341:
337:Google scholar
269:Elixir Studios
264:
261:
237:Demis Hassabis
231:He studied at
228:
225:
187:
186:
170:
166:
165:
162:
161:
150:
144:
143:
141:Elixir Studios
130:
126:
125:
123:Computer Games
104:
100:
99:
93:
92:
81:
77:
76:
63:
62:Known for
59:
58:
47:
43:
42:
39:
35:
34:
27:
24:
15:
9:
6:
4:
3:
2:
1888:
1877:
1874:
1872:
1869:
1867:
1864:
1862:
1859:
1857:
1854:
1852:
1849:
1847:
1844:
1842:
1839:
1837:
1836:Living people
1834:
1832:
1829:
1828:
1826:
1809:
1806:
1804:
1801:
1800:
1793:
1789:
1786:
1784:
1781:
1780:
1777:
1773:
1772:
1769:
1763:
1760:
1758:
1755:
1753:
1750:
1748:
1745:
1743:
1740:
1738:
1735:
1733:
1730:
1728:
1725:
1723:
1720:
1718:
1715:
1713:
1710:
1708:
1705:
1703:
1700:
1698:
1695:
1693:
1690:
1689:
1687:
1685:Architectures
1683:
1677:
1674:
1672:
1669:
1667:
1664:
1662:
1659:
1657:
1654:
1652:
1649:
1647:
1644:
1642:
1639:
1637:
1634:
1633:
1631:
1629:Organizations
1627:
1621:
1618:
1616:
1613:
1611:
1608:
1606:
1603:
1601:
1598:
1596:
1593:
1591:
1588:
1586:
1583:
1581:
1578:
1576:
1573:
1571:
1568:
1566:
1565:Yoshua Bengio
1563:
1562:
1560:
1556:
1546:
1545:Robot control
1543:
1539:
1536:
1535:
1534:
1531:
1529:
1526:
1524:
1521:
1519:
1516:
1514:
1511:
1509:
1506:
1504:
1501:
1499:
1496:
1495:
1493:
1489:
1483:
1480:
1478:
1475:
1473:
1470:
1468:
1465:
1463:
1462:Chinchilla AI
1460:
1458:
1455:
1453:
1450:
1448:
1445:
1443:
1440:
1438:
1435:
1433:
1430:
1428:
1425:
1423:
1420:
1418:
1415:
1413:
1410:
1408:
1405:
1401:
1398:
1397:
1396:
1393:
1391:
1388:
1386:
1383:
1381:
1378:
1376:
1373:
1371:
1368:
1367:
1365:
1361:
1355:
1352:
1348:
1345:
1343:
1340:
1339:
1338:
1335:
1331:
1328:
1326:
1323:
1321:
1318:
1317:
1316:
1313:
1311:
1308:
1306:
1303:
1301:
1298:
1296:
1293:
1291:
1288:
1286:
1283:
1281:
1278:
1276:
1273:
1271:
1268:
1267:
1265:
1261:
1258:
1254:
1248:
1245:
1243:
1240:
1238:
1235:
1233:
1230:
1228:
1225:
1223:
1220:
1218:
1215:
1214:
1212:
1208:
1202:
1199:
1197:
1194:
1192:
1189:
1187:
1184:
1182:
1179:
1178:
1176:
1172:
1164:
1161:
1160:
1159:
1156:
1154:
1151:
1149:
1146:
1142:
1141:Deep learning
1139:
1138:
1137:
1134:
1130:
1127:
1126:
1125:
1122:
1121:
1119:
1115:
1109:
1106:
1104:
1101:
1097:
1094:
1093:
1092:
1089:
1087:
1084:
1080:
1077:
1075:
1072:
1070:
1067:
1066:
1065:
1062:
1060:
1057:
1055:
1052:
1050:
1047:
1045:
1042:
1040:
1037:
1035:
1032:
1030:
1029:Hallucination
1027:
1023:
1020:
1019:
1018:
1015:
1013:
1010:
1006:
1003:
1002:
1001:
998:
997:
995:
991:
985:
982:
980:
977:
975:
972:
970:
967:
965:
962:
960:
957:
955:
952:
950:
947:
945:
944:
940:
939:
937:
935:
931:
922:
917:
915:
910:
908:
903:
902:
899:
881:
877:
871:
856:
852:
846:
831:
827:
824:Ormond, Jim.
820:
805:
799:
791:
787:
783:
779:
775:
771:
767:
763:
759:
755:
754:
749:
745:
739:
731:
727:
723:
719:
715:
711:
707:
703:
699:
695:
694:
689:
682:
674:
670:
664:
648:
644:
640:
634:
619:
615:
609:
601:
600:Prentice Hall
597:
596:
591:
587:
581:
573:
566:
559:
551:
547:
543:
539:
535:
531:
530:
522:
515:
511:
507:
502:
487:
483:
476:
474:
464:
456:
452:
448:
444:
440:
436:
432:
428:
424:
420:
419:
414:
413:Oriol Vinyals
408:
401:
397:
393:
388:
386:
384:
379:
371:
369:
365:
361:
357:
352:
350:
340:
338:
334:
332:
326:
324:
320:
316:
312:
308:
304:
300:
299:deep learning
296:
291:
289:
285:
281:
277:
272:
270:
260:
258:
254:
250:
246:
242:
238:
234:
224:
222:
218:
214:
210:
206:
202:
197:
193:
184:
179:
171:
167:
163:
156:
155:
151:
149:
145:
142:
138:
134:
131:
127:
124:
120:
116:
112:
108:
105:
101:
98:
94:
90:
85:
82:
78:
75:
71:
67:
64:
60:
56:
51:
48:
44:
40:
36:
31:
22:
19:
1651:Hugging Face
1615:David Silver
1614:
1263:Audio–visual
1117:Applications
1096:Augmentation
941:
883:. Retrieved
879:
870:
858:. Retrieved
845:
833:. Retrieved
819:
807:. Retrieved
798:
757:
751:
744:David Silver
743:
738:
697:
691:
688:David Silver
687:
681:
663:
651:. Retrieved
647:the original
642:
633:
621:. Retrieved
617:
608:
594:
590:Peter Norvig
580:
571:
558:
533:
528:
521:
506:David Silver
501:
491:26 September
489:. Retrieved
485:
480:Shead, Sam.
463:
422:
416:
407:
392:David Silver
353:
346:
330:
327:
292:
273:
266:
256:
230:
192:David Silver
191:
190:
176:.davidsilver
152:
129:Institutions
96:
25:David Silver
18:
1831:1976 births
1799:Categories
1747:Autoencoder
1702:Transformer
1570:Alex Graves
1518:OpenAI Five
1422:IBM Watsonx
1044:Convolution
1022:Overfitting
534:ualberta.ca
1825:Categories
1788:Technology
1641:EleutherAI
1600:Fei-Fei Li
1595:Yann LeCun
1508:Q-learning
1491:Decisional
1417:IBM Watson
1325:Midjourney
1217:TensorFlow
1064:Activation
1017:Regression
1012:Clustering
374:References
1671:MIT CSAIL
1636:Anthropic
1605:Andrew Ng
1503:AlphaZero
1347:VideoPoet
1310:AlphaFold
1247:MindSpore
1201:SpiNNaker
1196:Memristor
1103:Diffusion
1079:Rectifier
1059:Batchnorm
1039:Attention
1034:Adversary
885:3 January
790:Q28005460
774:1476-4687
748:Aja Huang
730:Q27907579
714:1476-4687
643:ucl.ac.uk
618:Wired.com
550:575410609
455:Q72988805
439:1476-4687
370:in 2022.
315:AlphaZero
227:Education
221:AlphaStar
217:AlphaZero
74:AlphaStar
70:AlphaZero
1779:Portals
1538:Auto-GPT
1370:Word2vec
1174:Hardware
1091:Datasets
993:Concepts
786:Wikidata
782:26819042
726:Wikidata
722:25719670
592:(2009).
451:Wikidata
447:31666705
280:lecturer
119:Planning
1661:Meta AI
1498:AlphaGo
1482:PanGu-ÎŁ
1452:ChatGPT
1427:Granite
1375:Seq2seq
1354:Whisper
1275:WaveNet
1270:AlexNet
1242:Flux.jl
1222:PyTorch
1074:Sigmoid
1069:Softmax
934:General
835:2 April
830:acm.org
673:YouTube
508:at the
364:AlphaGo
307:AlphaGo
213:AlphaGo
169:Website
66:AlphaGo
1676:Huawei
1656:OpenAI
1558:People
1528:MuZero
1390:Gemini
1385:Claude
1320:DALL-E
1232:Theano
860:8 June
809:27 May
788:
780:
772:
753:Nature
728:
720:
712:
693:Nature
653:27 May
623:17 May
548:
453:
445:
437:
418:Nature
333:-index
159:(2009)
157:
148:Thesis
103:Fields
91:(2019)
86:(2011)
80:Awards
1742:Mamba
1513:SARSA
1477:LLaMA
1472:BLOOM
1457:GPT-J
1447:GPT-4
1442:GPT-3
1437:GPT-2
1432:GPT-1
1395:LaMDA
1227:Keras
568:(PDF)
323:shogi
319:chess
303:Atari
297:with
211:with
57:(PhD)
1666:Mila
1467:PaLM
1400:Bard
1380:BERT
1363:Text
1342:Sora
887:2024
880:AAAI
862:2021
837:2020
811:2017
778:PMID
770:ISSN
718:PMID
710:ISSN
655:2017
625:2016
546:OCLC
493:2020
443:PMID
435:ISSN
362:and
321:and
257:MoGo
52:(BA)
38:Born
1407:NMT
1290:OCR
1285:HWR
1237:JAX
1191:VPU
1186:TPU
1181:IPU
1005:SGD
762:doi
758:529
702:doi
698:518
538:doi
427:doi
423:575
282:at
247:on
245:PhD
196:FRS
178:.uk
174:www
30:FRS
1827::
878:.
853:.
828:.
784:.
776:.
768:.
756:.
746:;
724:.
716:.
708:.
696:.
641:.
616:.
588:;
570:.
544:.
532:.
484:.
472:^
449:.
441:.
433:.
421:.
382:^
339:.
253:Go
223:.
215:,
920:e
913:t
906:v
889:.
864:.
839:.
813:.
792:.
764::
732:.
704::
675:.
657:.
627:.
602:.
574:.
552:.
540::
495:.
457:.
429::
331:h
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.