-
Notifications
You must be signed in to change notification settings - Fork 33
/
Copy pathindex.html
345 lines (326 loc) · 42.6 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
<meta charset="utf-8">
<link href="https://fonts.googleapis.com/css?family=Muli|Open+Sans|Poppins:300" rel="stylesheet">
<link href="https://cdnjs.cloudflare.com/ajax/libs/twitter-bootstrap/3.3.7/css/bootstrap.min.css" rel="stylesheet">
<link href="guide.css" rel="stylesheet"><link href="https://fonts.googleapis.com/css?family=Muli" rel="stylesheet">
<meta name="viewport" content="width=500, initial-scale=1">
<div class="container">
<h1 class="title">A Guide to Deep Learning by
<a href="http://yerevann.com" class="logo">
<svg class="logo">
<g stroke="none" fill="none" fill-rule="evenodd">
<path d="M13.6348948,40.6781387 C12.7574088,39.215662 12.0789032,38.3099782 11.6692836,38.0078838 C11.2940956,37.7311826 10.766715,37.5844116 10.0667718,37.5844116 L9.3167718,37.5844116 L9.3167718,33.940155 L23.9527581,33.940155 L23.9527581,37.5844116 L23.2027581,37.5844116 C22.1454281,37.5844116 21.8278192,37.7547655 21.8278192,37.9741516 C21.8278192,38.2242265 22.0499603,38.7714522 22.5120358,39.5378214 L26.1530954,45.5851616 L29.4604072,40.0730524 C29.8040371,39.5146538 29.967315,39.0095129 29.967315,38.5536804 C29.967315,37.8969839 29.5072368,37.5844116 28.1867059,37.5844116 L27.4367059,37.5844116 L27.4367059,33.940155 L37.9194024,33.940155 L37.9194024,37.5844116 L37.1694024,37.5844116 C36.5264626,37.5844116 35.9576138,37.8187871 35.4302579,38.3103033 C34.8300233,38.8697453 33.9100482,40.1358804 32.7076071,42.077321 C32.7076071,42.077321 27.8549188,49.8544006 27.8549188,49.9219666 C27.8549188,51.7926025 27.8549188,55.5338745 27.8549188,55.5338745 C27.8549188,56.568522 28.0185187,57.2410994 28.2812161,57.5454948 C28.5089424,57.8093681 29.1569105,57.9906006 30.2343743,57.9906006 L30.6400445,57.9906006 L31.3900445,57.9906006 L31.3900445,61.6348572 L15.8461297,61.6348572 L15.8461297,57.9906006 L16.4220129,57.9906006 L16.5961297,57.9906006 L17.0211175,57.9906006 C18.1299306,57.9906006 18.7857247,57.8049985 18.998855,57.5429531 C19.2604454,57.2213255 19.4198907,56.5495642 19.4198907,55.5338745 L19.4198907,50.1442566 L13.6348948,40.6781387 Z"
id="Y" fill="#FFFFFF"></path>
<path d="M48.9671858,54.1816406 C48.9671858,55.9045905 49.2618872,56.9482122 49.7345545,57.3211423 C50.2961663,57.7642488 51.0304935,57.9906006 51.9648054,57.9906006 L52.7148054,57.9906006 L52.7148054,61.6348572 L41.3821333,61.6348572 L41.3821333,57.9906006 L42.1321333,57.9906006 C42.8275254,57.9906006 43.4118835,57.8277043 43.9035328,57.5050594 C44.3859684,57.1884611 44.6664669,56.8527825 44.777694,56.4977114 C44.9173975,56.0517348 44.9945296,55.277448 44.9945296,54.2009583 L44.9945296,40.8331604 C44.9945296,38.5710454 44.0698583,37.5844116 41.9969099,37.5844116 L41.2469099,37.5844116 L41.2469099,33.940155 L51.179605,33.940155 L51.4031908,34.1899211 L64.4473799,48.7725677 L64.4473799,41.1229248 C64.4473799,38.6523438 63.5478154,37.5844116 61.5849837,37.5844116 L60.8349837,37.5844116 L60.8349837,33.940155 L71.742668,33.940155 L71.742668,37.5844116 L70.992668,37.5844116 C70.4783595,37.5844116 70.0016732,37.724173 69.5472554,38.0100989 C69.1099202,38.2852761 68.8298108,38.6163144 68.6830136,39.0154195 C68.5128523,39.4780454 68.4200362,40.2118732 68.4200362,41.2001953 L68.4200362,62.059845 L64.8598742,62.059845 L64.6360431,61.8072392 L48.9646713,44.1212314 L48.9671858,54.1816406 Z"
id="N" fill="#FFFFFF"></path>
<path d="M76.1732357,31.9476095 L75.8838924,31.9476095 L75.8734318,31.6584554 C75.8368836,30.6481861 75.9652465,29.7062177 76.2592917,28.8341479 C76.5544004,27.9589242 76.9983825,27.1918867 77.5905149,26.5354702 C78.1857925,25.8755671 78.9331492,25.3623121 79.8292393,24.9962976 C80.7223902,24.6314837 81.7438566,24.45 82.892102,24.45 C83.7690175,24.45 84.606442,24.5764928 85.4033236,24.8296823 C86.2037378,25.0839942 86.9101422,25.4498503 87.52085,25.9272998 C88.1370003,26.4090042 88.62776,27.003308 88.9913993,27.7077901 C89.357841,28.4177013 89.5406371,29.2123405 89.5406371,30.087726 C89.5406371,30.9919851 89.381723,31.7757843 89.0608212,32.4367161 C88.7457696,33.0855989 88.328371,33.6624877 87.809561,34.1661301 C87.3001759,34.6606229 86.7233705,35.1083974 86.079501,35.509477 C85.4521361,35.9002756 84.8203669,36.286995 84.1841747,36.6696466 C83.5570533,37.0468425 82.9473565,37.4521242 82.3550441,37.8855209 C81.7776316,38.3080153 81.2499847,38.8789215 81.2499847,38.8789215 L89.646446,38.8789215 L89.646446,42.4524493 L75.45,42.4524493 L75.45,42.1524493 C75.45,41.1231976 75.6113539,40.2226698 75.9365232,39.4522385 C76.2584507,38.6894882 76.6959425,38.0063288 77.2483025,37.4040754 C77.7933009,36.8098487 78.4324039,36.2627409 79.1651572,35.7626213 C79.882183,35.273236 80.6388439,34.7758255 81.4340517,34.2710751 C81.8431846,34.0088306 82.2787141,33.7426633 82.7406696,33.4725409 C83.1839049,33.2133648 83.5880532,32.9272174 83.9533485,32.6141156 C84.3098749,32.3085299 84.6074653,31.9657788 84.8469755,31.5852864 C85.0718912,31.2279793 85.1844205,30.8203905 85.1844205,30.355831 C85.1844205,29.608916 84.9574645,29.0453606 84.5046103,28.6422436 C84.0417895,28.2302546 83.4474302,28.0235278 82.7069364,28.0235278 C82.2135584,28.0235278 81.8053765,28.127403 81.476536,28.3317484 C81.1378182,28.5422316 80.8675189,28.8165527 80.6627972,29.15754 C80.4510619,29.5102094 80.3000819,29.9012312 80.209855,30.3320237 C80.1170581,30.7750865 80.0707389,31.2135133 80.0707389,31.6476095 L80.0707389,31.9476095 L79.7707389,31.9476095 L76.1732357,31.9476095 Z"
id="2" fill="#FFFFFF"></path>
</g>
</svg>
</a>
</h1>
<section class="top">
<div class="left">
Deep learning is a fast-changing field at the intersection of computer science and mathematics. It is a relatively new branch of a wider field called machine learning. The goal of machine learning is to teach computers to perform various tasks based on the given data. This guide is for those who know some math, know some programming language and now want to dive deep into deep learning.
</div>
<div class="right">
This is <i>not</i> a guide to:<br/>
• general machine learning<br/>
• big data processing<br/>
• data science<br/>
• <a href="http://karpathy.github.io/2016/05/31/rl/">deep reinforcement learning</a>
</div>
</section>
<div class="stripes">
<div class="c1">
<div class="stripe"></div>
</div>
<div class="c2">
<div class="stripe"></div>
</div>
<div class="c3">
<div class="stripe"></div>
</div>
<div class="c4">
<div class="stripe"></div>
</div>
<div class="theano">
<div class="stripe"></div>
</div>
<div class="torch">
<div class="stripe"></div>
</div>
<div class="tensorflow">
<div class="stripe"></div>
</div>
</div>
</div>
<div class="prerequisites container-fluid">
<div class="container">
<h2>Prerequisites</h2>
<div>
<div class="left">
<p>You must know standard university-level math. You can review those concepts in the first chapters of the book <a href="http://www.deeplearningbook.org/">Deep learning</a>:</p>
<ul>
<li><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/linear_algebra.html">Deep learning, chapter 2: Linear algebra</a></li>
<li><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/prob.html">Deep learning, chapter 3: Probability and information theory</a></li>
<li><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/numerical.html">Deep learning, chapter 4: Numerical computation</a></li>
</ul>
</div>
<div class="right">
<p>You must know programming to develop and test deep learning models. We suggest using Python for machine learning. NumPy/SciPy libraries for scientific computing are required.</p>
<ul>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://cs231n.github.io/python-numpy-tutorial/">Justin Johnson's Python / NumPy / SciPy / Matplotlib tutorial for Stanford's CS231n</a> ★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://www.scipy-lectures.org/">Scipy lecture notes</a> - cover commonly used libraries in more details and introduce more advanced topics ★★</li>
</ul>
</div>
</div>
</div>
</div>
<div class="container">
<div class="well head">
<div class="content">
<p>When you are comfortable with the prerequisites, we suggest four options for studying deep learning. Choose any of them or any combination of them. The number of stars indicates the difficulty.</p>
<ul>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/playlist?list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH">Hugo Larochelle's video course</a> on YouTube. The videos were recorded in 2013 but most of the content is still fresh. The mathematics behind neural networks is explained in detail. <a href="http://info.usherbrooke.ca/hlarochelle/neural_networks/content.html">Slides and related materials</a> are available. ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="http://cs231n.stanford.edu/">Stanford's CS231n</a> (Convolutional Neural Networks for Visual Recognition) by Fei-Fei Li, Andrej Karpathy and Justin Johnson. The course is focused on image processing, but covers most of the important concepts in deep learning. <a href="https://www.youtube.com/playlist?list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA">Videos (2016)</a> and <a href="http://cs231n.github.io/">lecture notes</a> are available. ★★</li>
<li class="c3"><span class="glyphicon glyphicon-book"></span> Michael Nielsen's online book <a href="http://neuralnetworksanddeeplearning.com/">Neural networks and deep learning</a> is the easiest way to study neural networks. It doesn't cover all important topics, but contains intuitive explanations and code for the basic concepts. ★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/">Deep learning</a>, a book by Ian Goodfellow, Yoshua Bengio and Aaron Courville, is the most comprehensive resource for studying deep learning. It covers a lot more than all the other courses combined. ★★★</li>
</div>
<div class="right-content">
<p>There are many software frameworks that provide necessary functions, classes and modules for machine learning and for deep learning in particular. We suggest you <i>not</i> use these frameworks at the early stages of studying, instead we suggest you implement the basic algorithms from scratch. Most of the courses describe the maths behind the algorithms in enough detail, so they can be easily implemented.</p>
<ul>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="https://jupyter-notebook-beginner-guide.readthedocs.io/en/latest/">Jupyter notebooks</a> are a convenient way to play with Python code. They are nicely integrated with matplotlib, a popular tool for visualizations. We suggest you implement algorithms in such environments. ★</li>
</ul>
</div>
</div>
<div class="well">
<h2>Machine learning basics</h2>
<div class="content">
<p>Machine learning is the art and science of teaching computers based on data. It is a relatively established field at the intersection of computer science and mathematics, while deep learning is just a small subfield of it. The concepts and tools of machine learning are important for understanding deep learning.</p>
<ul>
<li><span class="glyphicon glyphicon-eye-open"></span> <a href="http://www.r2d3.us/visual-intro-to-machine-learning-part-1/">Visual introduction to machine learning</a> - decision trees ★</li>
<li><span class="glyphicon glyphicon-film"></span> <a href="https://www.coursera.org/learn/machine-learning">Andrew Ng's course on machine learning</a>, the most popular course on Coursera ★★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> Larochelle's course doesn't have separate introductory lectures for general machine learning, but all required concepts are defined and explained whenever needed.</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=g-PvXUjD6qg&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA">1. Training and testing the models (kNN) ★★</a></li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=hAeos2TocJ8&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=2">2. Linear classification (SVM) ★★</a></li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=WjY57K9xX4s&index=3&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA">3. Optimization (stochastic gradient descent)</a> ★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/ml.html">5. Machine learning basics</a> ★★★</li>
<li><span class="glyphicon glyphicon-eye-open"></span> <a href="http://setosa.io/ev/principal-component-analysis/">Principal Component Analysis explained visually</a> ★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://distill.pub/2016/misread-tsne/">How to Use t-SNE Effectively</a> ★★</li>
</ul>
</div>
<div class="right-content">
<p>Most of the popular machine learning algorithms are implemented in the <a href="http://scikit-learn.org/stable/tutorial/basic/tutorial.html">Scikit-learn</a> Python library. Implementing some of them from scratch helps with understanding how machine learning works.</p>
<ul>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="https://pythonprogramming.net/machine-learning-tutorial-python-introduction/">Practical Machine Learning Tutorial with Python</a> covers linear regression, k-nearest-neighbors and support vector machines. First it shows how to use them from scikit-learn, then implements the algorithms from scratch. ★</li>
<li><span class="glyphicon glyphicon-film"></span> Andrew Ng's course on Coursera has many assignments in Octave language. The same algorithms can be implemented in Python. ★★</li>
</ul>
</div>
</div>
<div class="well">
<h2>Neural networks basics</h2>
<div class="content">
<p>Neural networks are powerful machine learning algorithms. They form the basis of deep learning.</p>
<ul>
<li><span class="glyphicon glyphicon-eye-open"></span> <a href="http://jalammar.github.io/visual-interactive-guide-basics-neural-networks/">A Visual and Interactive Guide to the Basics of Neural Networks</a> - shows how simple neural networks can do linear regression ★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/playlist?list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH">1. Feedforward neural network</a> ★★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=5adNQvSlF50&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=7">2. Training neural networks (up to 2.7)</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=GZTvxoSHZIo&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=4">4. Backpropagation</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=GUtlrDbHhJM&index=5&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA">5. Architecture of neural networks</a> ★★</li>
<li class="c3"><span class="glyphicon glyphicon-book"></span> <a href="http://neuralnetworksanddeeplearning.com/chap1.html">1. Using neural nets to recognize handwritten digits</a> ★</li>
<li class="c3"><span class="glyphicon glyphicon-book"></span> <a href="http://neuralnetworksanddeeplearning.com/chap2.html">2. How the backpropagation algorithm works</a> ★</li>
<li class="c3"><span class="glyphicon glyphicon-book"></span> <a href="http://neuralnetworksanddeeplearning.com/chap4.html">4. A visual proof that neural nets can compute any function</a> ★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/mlp.html">6. Deep feedforward networks</a> ★★★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="https://medium.com/@karpathy/yes-you-should-understand-backprop-e2f06eab496b">Yes you should understand backprop</a> explains why it is important to implement backpropagation once from scratch ★★
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://colah.github.io/posts/2015-08-Backprop/">Calculus on computational graphs: backpropagation</a> ★★
</li>
<li><span class="glyphicon glyphicon-eye-open"></span> <a href="http://playground.tensorflow.org/">Play with neural networks!</a> ★</li>
</ul>
</div>
<div class="right-content">
<p>Try to implement a single layer neural network from scratch, including the training procedure.</p>
<ul>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://cs231n.github.io/neural-networks-case-study/">Implementing softmax classifier and a simple neural network in pure Python/NumPy</a> - Jupyter notebook available ★</li>
<li><span class="glyphicon glyphicon-pencil"></span> Andrej Karpathy implements backpropagation in Javascript in his <a href="http://karpathy.github.io/neuralnets/">Hacker's guide to Neural Networks</a>. ★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://www.wildml.com/2015/09/implementing-a-neural-network-from-scratch/">Implementing a neural network from scratch</a> in Python ★</li>
</ul>
</div>
</div>
<div class="well">
<h2>Improving the way neural networks learn</h2>
<div class="content">
<p>It's not very easy to train neural networks. Sometimes they don't learn at all (underfitting), sometimes they learn exactly what you give them and their "knowledge" does not generalize to new, unseen data (overfitting). There are many ways to handle these problems.</p>
<ul>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=JfkbyODyujw&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=14">2.8-2.11. Regularization, parameter initialization etc.</a> ★★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=UcKPdAM8cnI&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=55">7.5. Dropout</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=KaR4lIdI1MQ&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=6">6 (first half). Setting up the data and loss</a> ★★</li>
<li class="c3"><span class="glyphicon glyphicon-book"></span> <a href="http://neuralnetworksanddeeplearning.com/chap3.html">3. Improving the way neural networks learn</a> ★</li>
<li class="c3"><span class="glyphicon glyphicon-book"></span> <a href="http://neuralnetworksanddeeplearning.com/chap5.html">5. Why are deep neural networks hard to train?</a> ★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/regularization.html">7. Regularization for deep learning</a> ★★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/optimization.html">8. Optimization for training deep models</a> ★★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/guidelines.html">11. Practical methodology</a> ★★★</li>
<li><span class="glyphicon glyphicon-eye-open"></span> <a href="http://cs.stanford.edu/people/karpathy/convnetjs/demo/trainers.html">ConvNetJS Trainer demo on MNIST</a> - visualizes the performance of different optimization algorithms ★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://sebastianruder.com/optimizing-gradient-descent/">An overview of gradient descent optimization algorithms</a> ★★★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://colah.github.io/posts/2014-03-NN-Manifolds-Topology/">Neural Networks, Manifolds, and Topology</a> ★★★</li>
</ul>
</div>
<div class="right-content">
<p>There are many frameworks that provide the standard algorithms and are optimised for good performance on modern hardware. Most of these frameworks have interfaces for Python with the notable exception of Torch, which requires Lua. Once you know how basic learning algorithms are implemented under the hood, it's time to choose a framework to build on.</p>
<ul>
<li class="theano"><a href="http://deeplearning.net/software/theano/">Theano</a> provides low-level primitives for constructing all kinds of neural networks. It is maintained by <a href="https://mila.umontreal.ca/en/">a machine learning group at University of Montreal</a>. See also: <span class="glyphicon glyphicon-pencil"></span> <a href="http://www.wildml.com/2015/09/speeding-up-your-neural-network-with-theano-and-the-gpu/">Speeding up your neural network with Theano and the GPU</a> - Jupyter notebook available ★</li>
<li class="tensorflow"><a href="https://www.tensorflow.org/">TensorFlow</a> is another low-level framework. Its architecture is similar to Theano. It is maintained by the Google Brain team.</li>
<li class="torch"><a href="http://torch.ch/">Torch</a> is a popular framework that uses Lua language. The main disadvantage is that Lua's community is not as large as Python's. Torch is mostly maintained by Facebook and Twitter.</li>
</ul>
<p>There are also higher-level frameworks that run on top of these:</p>
<ul>
<li class="theano"><a href="http://lasagne.readthedocs.io/en/latest/user/tutorial.html">Lasagne</a> is a higher level framework built on top of Theano. It provides simple functions to create large networks with few lines of code.</li>
<li class="keras"><a href="https://keras.io/">Keras</a> is a higher level framework that works on top of either Theano or TensorFlow.</li>
<li><span class="glyphicon glyphicon-film"></span> If you need more guidance on which framework is right for you, see <a href="https://www.youtube.com/watch?v=XgFlBsl0Lq4&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=12">Lecture 12 of Stanford's CS231n</a>. ★★</li>
</ul>
</div>
</div>
<div class="well">
<h2>Convolutional neural networks</h2>
<div class="content">
<p>Convolutional networks ("CNNs") are a special kind of neural nets that use several clever tricks to learn faster and better. ConvNets essentially revolutionized computer vision and are heavily used in speech recognition and text classification as well.</p>
<ul>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=rxKrCa4bg1I&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=69">9. Computer vision (up to 9.9)</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=KaR4lIdI1MQ&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=6">6 (second half). Intro to ConvNets</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=V8JDMkARdfU&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=7">7. Convolutional neural networks</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=2xtx-gk3PqY&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=8">8. Localization and detection</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=N--YsFUyYnE&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=9">9. Visualization, Deep dream, Neural style, Adversarial examples</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=UFnO-ADC-k0&index=13&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA">13. Image segmentation (up to 38:00)</a> includes upconvolutions ★★</li>
<li class="c3"><span class="glyphicon glyphicon-book"></span> <a href="http://neuralnetworksanddeeplearning.com/chap6.html">6. Deep learning</a> ★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/convnets.html">9. Convolutional networks</a> ★★★</li>
<li><span class="glyphicon glyphicon-eye-open"></span> <a href="http://setosa.io/ev/image-kernels/">Image Kernels explained visually</a> - shows how convolutional filters (also known as image kernels) transform the image ★</li>
<li><span class="glyphicon glyphicon-eye-open"></span> <a href="http://cs.stanford.edu/people/karpathy/convnetjs/demo/mnist.html">ConvNetJS MNIST demo</a> - live visualization of a convolutional network right in the browser ★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://colah.github.io/posts/2014-07-Conv-Nets-Modular/">Conv Nets: A Modular Perspective</a> ★★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://colah.github.io/posts/2014-07-Understanding-Convolutions/">Understanding Convolutions</a> ★★★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://www.wildml.com/2015/11/understanding-convolutional-neural-networks-for-nlp/">Understanding Convolutional neural networks for NLP</a> ★★</li>
</ul>
</div>
<div class="right-content">
<p>Convolutional networks are implemented in every major framework. It is usually easier to understand the code that is written using higher level libraries.</p>
<ul>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://deeplearning.net/tutorial/lenet.html">Theano: Convolutional Neural Networks (LeNet)</a> ★★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://luizgh.github.io/libraries/2015/12/08/getting-started-with-lasagne/">Using Lasagne for training Deep Neural Networks</a> ★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://jeffreydf.github.io/diabetic-retinopathy-detection/">Detecting diabetic retinopathy in eye images</a> - a blog post by one of the best performers of Diabetic retinopathy detection contest in Kaggle. Includes a good example of data augmentation. ★★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="https://deepsense.io/deep-learning-right-whale-recognition-kaggle/">Face recognition for right whales using deep learning</a> - the authors used different ConvNets for localization and classification. <a href="https://www.dropbox.com/s/rohrc1btslxwxzr/deepsense-whales.zip?dl=0">Code and models are available</a>. ★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="https://www.tensorflow.org/tutorials/deep_cnn/">Tensorflow: Convolutional neural networks</a> for image classification on CIFAR-10 dataset ★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="http://www.wildml.com/2015/12/implementing-a-cnn-for-text-classification-in-tensorflow/">Implementing a CNN for text classification in Tensorflow</a> ★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="https://nbviewer.jupyter.org/github/tensorflow/tensorflow/blob/master/tensorflow/examples/tutorials/deepdream/deepdream.ipynb">DeepDream implementation in TensorFlow</a> ★★★</li>
<li class="torch"><span class="glyphicon glyphicon-pencil"></span> <a href="http://torch.ch/blog/2015/07/30/cifar.html">92.45% on CIFAR-10 in Torch</a> - implements famous VGGNet network with batch normalization layers in Torch ★</li>
<li class="torch"><span class="glyphicon glyphicon-pencil"></span> <a href="http://torch.ch/blog/2016/02/04/resnets.html">Training and investigating Residual Nets</a> - Residual networks perform very well on image classification tasks. Two researchers from Facebook and CornellTech implemented these networks in Torch ★★★</li>
<li><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=ue4RJdI8yRA&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=11">ConvNets in practice</a> - lots of practical tips on using convolutional networks including data augmentation, transfer learning, fast implementations of convolution operation ★★</li>
</ul>
</div>
</div>
<div class="well">
<h2>Recurrent neural networks</h2>
<div class="content">
<p>Recurrent networks ("RNNs") are designed to work with sequences. Usually they are used for sentence classification (e.g. sentiment analysis) and speech recognition, but also for text generation and even image generation.</p>
<ul>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://karpathy.github.io/2015/05/21/rnn-effectiveness/">The Unreasonable Effectiveness of Recurrent Neural Networks</a> - describes how RNNs can generate text, math papers and C++ code ★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> Hugo Larochelle's course doesn't cover recurrent neural networks (although it covers many topics that RNNs are used for). We suggest watching <a href="https://www.youtube.com/watch?v=56TYLaQN4N8">Recurrent Neural Nets and LSTMs</a> by Nando de Freitas to fill the gap ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=cO0a0QYmFm8&list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&index=10">10. Recurrent Neural Networks, Image Captioning, LSTM</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://youtu.be/UFnO-ADC-k0?list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&t=2280">13. Soft attention (starting at 38:00)</a> ★★</li>
<li class="c3"><span class="glyphicon glyphicon-book"></span> Michael Nielsen's book stops at convolutional networks. In the <a href="http://neuralnetworksanddeeplearning.com/chap6.html#other_approaches_to_deep_neural_nets">Other approaches to deep neural nets</a> section there is just a brief review of simple recurrent networks and LSTMs. ★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/rnn.html">10. Sequence Modeling: Recurrent and Recursive Nets</a> ★★★</li>
<li><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=nwcJuGuG-0s&index=8&list=PLmImxx8Char9Ig0ZHSyTqGsdhb9weEGam">Recurrent neural networks</a> from Stanford's CS224d (2016) by Richard Socher ★★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://colah.github.io/posts/2015-08-Understanding-LSTMs/">Understanding LSTM Networks</a> ★★</li>
</ul>
</div>
<div class="right-content">
<p>Recurrent neural networks are also implemented in every modern framework.</p>
<ul>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://deeplearning.net/tutorial/rnnslu.html">Theano: Recurrent Neural Networks with Word Embeddings</a> ★★★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://deeplearning.net/tutorial/lstm.html">Theano: LSTM Networks for Sentiment Analysis</a> ★★★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://www.wildml.com/2015/09/recurrent-neural-networks-tutorial-part-2-implementing-a-language-model-rnn-with-python-numpy-and-theano/">Implementing a RNN with Python, Numpy and Theano</a> ★★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="https://github.com/Lasagne/Recipes/blob/master/examples/lstm_text_generation.py">Lasagne implementation of Karpathy's char-rnn</a> ★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="https://yerevann.github.io/2016/06/26/combining-cnn-and-rnn-for-spoken-language-identification/">Combining CNN and RNN for spoken language identification</a> in Lasagne ★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://yerevann.github.io/2016/09/09/automatic-transliteration-with-lstm/">Automatic transliteration with LSTM</a> using Lasagne ★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="https://www.tensorflow.org/tutorials/recurrent/">Tensorflow: Recurrent Neural Networks</a> for language modeling ★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="http://r2rt.com/recurrent-neural-networks-in-tensorflow-i.html">Recurrent Neural Networks in Tensorflow</a> ★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="http://blog.evjang.com/2016/06/understanding-and-implementing.html">Understanding and Implementing Deepmind's DRAW Model</a> ★★★</li>
<li class="torch"><span class="glyphicon glyphicon-pencil"></span> <a href="http://apaszke.github.io/lstm-explained.html">LSTM implementation explained</a> ★★</li>
<li class="torch"><span class="glyphicon glyphicon-pencil"></span> <a href="https://github.com/jcjohnson/torch-rnn">Torch implementation of Karpathy's char-rnn</a> ★★★</li>
</ul>
</div>
</div>
<div class="well">
<h2>Autoencoders</h2>
<div class="content">
<p>Autoencoders are neural networks designed for unsupervised learning, i.e. when the data is not labeled. They can be used for dimension reduction, pretraining of other neural networks, for data generation etc. Here we also include resources about an interesting hybrid of autoencoders and graphical models called variational autoencoders, although their mathematical basis is not introduced until the next section.</p>
<ul>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=FzS3tMl4Nsc&t=2s&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=44">6. Autoencoder</a> ★★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=z5ZYm_wJ37c&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=56">7.6. Deep autoencoder</a> ★★</li>
<li class="c2"><span class="glyphicon glyphicon-film"></span> <a href="https://youtu.be/I-i1KBuShCc?list=PLlJy-eBtNFt6EuMxFYRiNRS07MCWN5UIA&t=1949">14. Videos and unsupervised learning (from 32:29)</a> - this video also touches an exciting topic of generative adversarial networks. ★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/autoencoders.html">14. Autoencoders</a> ★★★</li>
<li><span class="glyphicon glyphicon-eye-open"></span> <a href="http://cs.stanford.edu/people/karpathy/convnetjs/demo/autoencoder.html">ConvNetJS Denoising Autoencoder demo</a> ★</li>
<li><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=P78QYjWh5sM&index=3&list=PLE6Wd9FR--EfW8dtjAuPoTuPcqmOV53Fu">Karol Gregor on Variational Autoencoders and Image Generation</a> ★★</li>
</ul>
</div>
<div class="right-content">
<p>Most autoencoders are pretty easy to implement. We suggest you try to implement one before looking at complete examples.</p>
<ul>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://deeplearning.net/tutorial/dA.html">Theano: Denoising autoencoders</a> ★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="http://cmgreen.io/2016/01/04/tensorflow_deep_autoencoder.html">Diving Into TensorFlow With Stacked Autoencoders</a> ★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="https://jmetzen.github.io/2015-11-27/vae.html">Variational Autoencoder in TensorFlow</a> ★★</li>
<li class="torch"><span class="glyphicon glyphicon-pencil"></span> <a href="https://siavashk.github.io/2016/02/22/autoencoder-imagenet/">Training Autoencoders on ImageNet Using Torch 7</a> ★★</li>
<li class="keras"><span class="glyphicon glyphicon-pencil"></span> <a href="https://blog.keras.io/building-autoencoders-in-keras.html">Building autoencoders in Keras</a> ★</li>
</ul>
</div>
</div>
<div class="well">
<h2>Probabilistic graphical models</h2>
<div class="content">
<p>Probabilistic graphical models (“PGMs”) form a separate subfield at the intersection of statistics and machine learning. There are many books and courses on PGMs in general. Here we present how these models are applied in the context of deep learning. Hugo Larochelle's course describes a few famous models, while the book Deep Learning devotes four chapters (16-19) to the theory and describes more than a dozen models in the last chapter. These topics require a lot of mathematics.</p>
<ul>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=GF3iSJkgPbA&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=18">3. Conditional Random Fields</a> ★★★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=6dpGB60Q1Ts&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=28">4. Training CRFs</a> ★★★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=p4Vh_zMw-HQ&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=36">5. Restricted Boltzman machine</a> ★★★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=vkb6AWYXZ5I&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=57">7.7-7.9. Deep Belief Networks</a> ★★★</li>
<li class="c1"><span class="glyphicon glyphicon-film"></span> <a href="https://www.youtube.com/watch?v=y0SISi_T6s8&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=78">9.10. Convolutional RBM</a> ★★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/linear_factors.html">13. Linear Factor Models</a> - first steps towards probabilistic models ★★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/graphical_models.html">16. Structured Probabilistic Models for Deep Learning</a> ★★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/monte_carlo.html">17. Monte Carlo Methods</a> ★★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/partition.html">18. Confronting the Partition Function</a> ★★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/inference.html">19. Approximate Inference</a> ★★★</li>
<li class="c4"><span class="glyphicon glyphicon-book"></span> <a href="http://www.deeplearningbook.org/contents/generative_models.html">20. Deep Generative Models</a> - includes Boltzmann machines (RBM, DBN, ...), variational autoencoders, generative adversarial networks, autoregressive models etc. ★★★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="https://openai.com/blog/generative-models/">Generative models</a> - a blog post on variational autoencoders, generative adversarial networks and their improvements by OpenAI. ★★★</li>
<li><span class="glyphicon glyphicon-pencil"></span> <a href="http://www.asimovinstitute.org/neural-network-zoo/">The Neural Network Zoo</a> attempts to organize lots of architectures using a single scheme. ★★</li>
</ul>
</div>
<div class="right-content">
<p>Higher level frameworks (Lasagne, Keras) do not implement graphical models. But there is a lot of code for Theano, Tensorflow and Torch.</p>
<ul>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://deeplearning.net/tutorial/rbm.html">Restricted Boltzmann Machines in Theano</a> ★★★</li>
<li class="theano"><span class="glyphicon glyphicon-pencil"></span> <a href="http://deeplearning.net/tutorial/DBN.html">Deep Belief Networks in Theano</a> ★★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="http://blog.otoro.net/2016/04/01/generating-large-images-from-latent-vectors/">Generating Large Images from Latent Vectors</a> - uses a combination of variational autoencoders and generative adversarial networks. ★★★</li>
<li class="tensorflow"><span class="glyphicon glyphicon-pencil"></span> <a href="https://bamos.github.io/2016/08/09/deep-completion/">Image Completion with Deep Learning in TensorFlow</a> - another application of generative adversarial networks. ★★★</li>
<li class="torch"><span class="glyphicon glyphicon-pencil"></span> <a href="http://torch.ch/blog/2015/11/13/gan.html">Generating Faces with Torch</a> - Torch implementation of Generative Adversarial Networks ★★</li>
</ul>
</div>
</div>
</div>
<div class="footer">
<div class="container">
<h2>The state of the art</h2>
<div class="left">
<p>Deep learning is a very active area of scientific research. To follow the state of the art one has to read new papers and follow important conferences. Usually every new idea is announced in a preprint paper on arxiv.org. Then some of them are submitted to conferences and are peer reviewed. The best of them are presented in the conferences and are published in journals. If the authors do not release code for their models, many people attempt to implement them and put them on GitHub. It takes a year or two before high quality blog posts, tutorials and videos appear on the web that properly explain the ideas and implementations.</p>
<ul>
<li><span class="glyphicon glyphicon-th-list"></span> <a href="https://github.com/songrotek/Deep-Learning-Papers-Reading-Roadmap">Deep learning papers reading roadmap</a> contains a long list of important papers.</li>
<li><span class="glyphicon glyphicon-th-list"></span> <a href="http://www.arxiv-sanity.com/">Arxiv Sanity Preserver</a> is a nice UI for browsing papers from arXiv.</li>
<li><span class="glyphicon glyphicon-film"></span> <a href="http://videolectures.net/">Videolectures.net</a> contains lots of videos on advanced topics.</li>
<li><span class="glyphicon glyphicon-th-list"></span> <a href="https://www.reddit.com/r/MachineLearning/">/r/MachineLearning</a> is a very active subreddit. All major new papers are discussed there.</li>
</ul>
</div>
<div class="right">
<p>We are going to keep this guide up to date. </p>
<ul>
<li><span class="glyphicon glyphicon-flag"></span> If you find broken links or any other problems, please report an <a href="https://github.com/YerevaNN/A-Guide-to-Deep-Learning/issues">issue on GitHub</a>.</li>
<li><span class="glyphicon glyphicon-time"></span> Last updated on December 26, 2016</li>
</ul>
</div>
</div>
</div>