Update posts

nipunsadvilkar · nipunsadvilkar · commit 2c55effa67ea · 2019-02-28T21:59:02.000+05:30
diff --git a/_posts/blog/2018-09-04-neural-networks-and-deep-learning-book-chap1-ex1-part1-solution.markdown b/_posts/blog/2018-09-04-neural-networks-and-deep-learning-book-chap1-ex1-part1-solution.markdown
@@ -0,0 +1,136 @@
+---
+layout: post
+title:  "Neural Networks And Deep Learning Book Chapter 1 Exercise 1.1 Solution"
+date:   2018-09-04 21:00:00
+comments: true
+categories: blog
+description: Solutions of "Neural Networks and Deep Learning by Michael Nielsen" Exercises Chapter 1 Part I
+---
+
+
+I must say [Neural Networks and Deep Learning by Michael Nielsen](http://neuralnetworksanddeeplearning.com/) is best deep learning book I have came across. It has perfect combination of theory plus code. It delves into deep mathematics as much as code. Methodology of "from-scratch" implementation of neural networks and exercises in between really encourages you to think carefully about what's actually happening under the hood of neural networks.
+
+Following is my attempt to those exercises:
+
+
+<h1 style="font-size: 40px;">Exercise 1</h1>
+<hr>
+
+<h1 style="font-size: 30px;">Sigmoid neurons simulating perceptrons, part I</h1>
+
+Suppose we take all the weights and biases in a network of perceptrons, and multiply them by a positive constant, $$c>0$$. Show that the behaviour of the network doesn't change.
+<hr>
+**Solution 1:**
+
+we know perceptron rule can be written as:
+
+$$
+\begin{eqnarray}
+  \mbox{z} = \left\{
+    \begin{array}{ll}
+      0 & \mbox{if } w\cdot x + b \leq 0 \\
+      1 & \mbox{if } w\cdot x + b > 0
+    \end{array}
+  \right.
+\tag{1}\end{eqnarray}
+$$
+
+<br>
+where $$z, w, b$$ represents *output*, *weights* and *bias* repectively.
+
+We are asked to see perceptron's behaviour after multiplying weights and biases in a network of perceptrons by $c>0$ which happens to positive constant.
+
+Let's multiply the above equation by $$c$$,
+
+$$
+\begin{eqnarray}
+  \mbox{z} = \left\{
+    \begin{array}{ll}
+      0 & \mbox{if } cw\cdot x + cb \leq 0 \\
+      1 & \mbox{if } cw\cdot x + cb > 0
+    \end{array}
+  \right.
+\end{eqnarray}
+$$
+
+Focusing on condition part:
+
+In both conditions,
+<br>
+
+$$
+cw \cdot x + cb = 0
+$$
+
+And
+
+$$
+cw \cdot x + cb > 0
+$$
+
+<br>
+Using basic algebra $$a(b + c) = ab + ac$$, we can take out common factor constant $$c$$.
+
+
+$$
+c [w \cdot x + b] = 0
+$$
+
+$$
+c [w \cdot x + b] > 0
+$$
+
+
+Multipling both sides by positive constant $$c$$, doesn't change sign of the equation, it only changes magnitudes and the equation is unchanged.
+
+
+If you find dot product confusing you can put equation $$1$$ in basic algebraic form.
+<hr>
+**Solution 1 in basic precise algebraic form:**
+
+$$
+\begin{eqnarray}
+  \mbox{output} & = & \left\{ \begin{array}{ll}
+      0 & \mbox{if } \sum_j w_j x_j + b\leq \mbox{0} \\
+      1 & \mbox{if } \sum_j w_j x_j +b > \mbox{0}
+      \end{array} \right.
+\tag{2}\end{eqnarray}
+$$
+
+<br>
+
+Note that above dot product representation is same as current algebraic form:
+
+$$
+w \cdot x + b \equiv \sum_j w_j x_j + b
+$$
+
+
+Multiplying by $$c$$,
+
+$$
+\begin{eqnarray}
+  \mbox{output} & = & \left\{ \begin{array}{ll}
+      0 & \mbox{if } \sum_j cw_j x_j + cb\leq \mbox{0} \\
+      1 & \mbox{if } \sum_j cw_j x_j + cb > \mbox{0}
+      \end{array} \right.
+\end{eqnarray}
+$$
+
+Factoring out common term $$c$$,
+
+$$
+\begin{eqnarray}
+  \mbox{output} & = & \left\{ \begin{array}{ll}
+      0 & \mbox{if } c[\sum_j w_j x_j + b]\leq \mbox{0} \\
+      1 & \mbox{if } c[\sum_j w_j x_j + b] > \mbox{0}
+      \end{array} \right.
+\end{eqnarray}
+$$
+
+<br>
+As you can see, dividing each side by the constant $$c$$, we will have the same behavior of the perceptron as represented in equation $$2$$. That means perceptrons is unaffected by multiplying its weights and bias by a postive constant $$c$$ and ultimately the behavior of the entire network of perceptrons doesn't change.
+
+Will post rest solutions soon.
+
+Hope you found it helpful. If you have any doubts or found any mistakes please comment below.
diff --git a/_posts/blog/2018-09-05-neural-networks-and-deep-learning-book-chap1-ex1-part2-solution.markdown b/_posts/blog/2018-09-05-neural-networks-and-deep-learning-book-chap1-ex1-part2-solution.markdown
@@ -0,0 +1,66 @@
+---
+layout: post
+title:  "Neural Networks And Deep Learning Book Chapter 1 Exercise 1.2 Solution"
+date:   2018-09-05 21:00:00
+comments: true
+categories: blog
+description: Solutions of "Neural Networks and Deep Learning by Michael Nielsen" Exercises Chapter 1 Part II
+---
+
+I have been solving exercises of [Neural Networks and Deep Learning Book by Michael Nielsen](http://neuralnetworksanddeeplearning.com/). If you are following along my solutions, that's great. Thank you so much! If not, here is link to Chapter 1 Exercise 1.1 Solution about [Sigmoid neurons simulating perceptrons, part I](https://nipunsadvilkar.github.io/blog/2017/09/04/neural-networks-and-deep-learning-book-chap1-ex1-part1-solution.html)
+
+Following is my attempt to second exercise:
+
+
+<h1 style="font-size: 40px;">Exercise 1.2</h1>
+<hr>
+
+<h1 style="font-size: 30px;">Sigmoid neurons simulating perceptrons, part II</h1>
+
+Suppose we have the same setup as the last problem - a network of perceptrons. Suppose also that the overall input to the network of perceptrons has been chosen. We won't need the actual input value, we just need the input to have been fixed. Suppose the weights and biases are such that $$w \cdot x + b \neq 0$$ for the input $$x$$ to any particular perceptron in the network. Now replace all the perceptrons in the network by sigmoid neurons, and multiply the weights and biases by a positive constant $$c > 0$$. Show that in the limit as $$c \rightarrow \infty$$ the behaviour of this network of sigmoid neurons is exactly the same as the network of perceptrons. How can this fail when $$w \cdot x + b = 0$$ for one of the perceptrons?
+<hr>
+**Solution:**
+
+
+We are asked to keep the same setup as in [Exercise 1.1](https://nipunsadvilkar.github.io/blog/2017/09/04/neural-networks-and-deep-learning-book-chap1-ex1-part1-solution.html).
+Refering to that,
+
+$$
+\begin{eqnarray}
+  \mbox{z} = \left\{
+    \begin{array}{ll}
+      0 & \mbox{if } w\cdot x + b \leq 0 \\
+      1 & \mbox{if } w\cdot x + b > 0
+    \end{array}
+  \right.
+\tag{1}\end{eqnarray}
+$$
+
+<br>
+we saw for perceptrons, $$(w \cdot x + b)$$ multiplied by positive constant $$c > 0$$ does not affect the result of $$cw \cdot x + cb = 0$$ or $$cw \cdot x + cb > 0$$. Similarly, squashing function *sigmoid* doesn't have any effect on behaviour of network given output $$z$$ have values at **extremities**.
+
+To explain that extremities point further,
+
+As we have seen in the first chapter, Sigmoid function looks like this:
+
+<p align="center">
+  <img src="{{ site.url }}/assets/img/sigmoid.png" alt="sigmoid" border="5">
+</p>
+
+Algebraically, Sigmoid function is represented as:
+
+$$
+\begin{eqnarray}
+  \sigma(z) \equiv \frac{1}{1+e^{-z}}.
+\tag{2}\end{eqnarray}
+$$
+
+<br>
+When $$z \equiv w \cdot x + b$$ is a large positive number. Then $$e^{-z} \approx 0$$ and so $$\sigma(z) \approx 1$$ (asymptotically). To put it in the words, when $$z = w \cdot x+b$$ is large and positive, the output from the sigmoid neuron is approximately $$1$$, just as it would have been for a perceptron. By referring to above diagram, we can say $$\sigma(z) > 0.5$$. On the other hand that $$z = w \cdot x+b$$ is very negative. Then $$e^{-z} \rightarrow \infty$$, and $$\sigma(z) \approx 0$$ (asymptotically). So when $$z = w \cdot x +b$$ is very negative, the behaviour of a sigmoid neuron also closely approximates a perceptron. Again by referring to above diagram, we can say $$\sigma(z) < 0.5$$. Here, the determination of 0.5 has not much effect. However, when $$w \cdot x+b = 0$$ i.e., $$\sigma(z) = 0.5$$ catergory of the result cannot be judged, so the binary classification is difficult to perform and this is where behaviour of sigmoid neurons deviates from the perceptron model.
+
+If above paragraph seems difficult to grasp, always refer to diagram after each sentence, it would help to understand the theory more concretely.
+
+
+Thanks for reading! Hope you found it helpful. If you have any doubts or found any mistakes please comment below.
+
+Will post rest solutions soon.
diff --git a/_posts/blog/2018-10-08-code-walkthrough-tablib-short-circuits-in-python.markdown b/_posts/blog/2018-10-08-code-walkthrough-tablib-short-circuits-in-python.markdown
@@ -0,0 +1,73 @@
+---
+layout: post
+title:  "Code Walkthrough: Tablib, a Python Module for Tabular Datasets"
+date:   2018-10-08 21:00:00
+comments: true
+categories: blog
+image: /assets/img/source_code.png
+description: Reading Great Code and it's benefits. Code walkthrough of tablib python module by Nipun Sadvilkar
+---
+<hr>
+
+<h1 style="font-size: 30px;">Motivation</h1>
+
+Oftentimes, I like to dive into open source projects to learn best practices and design patterns programming pundits use to do things correctly and optimally. In addition, [Peter Norvig](https://en.wikipedia.org/wiki/Peter_Norvig) has also said in his famous blog post [Teach Yourself Programming in Ten Years](http://norvig.com/21-days.html)
+
+> *Talk with other programmers; read other programs. This is more important than any book or training course.*
+
+I am big advocate of it. This blog post is to emphasize - how reading open source code helps you identify and understand efficient patterns and coding constructs.
+
+<h1 style="font-size: 30px;">Tablib</h1>
+
+I admire [Kenneth Reitz](https://github.com/kennethreitz) very much. Do read and follow his [The Hitchhiker’s Guide to Python!](https://docs.python-guide.org) to be a a great Python programmer. Lesson from this book - [Reading Great Code](https://docs.python-guide.org/writing/reading/?highlight=tablib#reading-great-code) - is the main reason why I decided to give a go at reading source code of [Tablib](https://github.com/kennethreitz/tablib). Reading source code is initially daunting because of certain constructs which are obscure or you may not be familiar with it, and which is natural. Despite such hurdles, if you keep con oncentrating you will find lot of "Aha!" moments by identifying useful patterns. Here is my experience, I came across a very simple yet useful code snippet which is very important and widely used task in data cleaning i.e., removing duplicates.
+
+[Source code: tablib removing_duplicates menthod:](http://docs.python-tablib.org/en/master/_modules/tablib/core/#Dataset.remove_duplicates)
+
+```python
+def remove_duplicates(self):
+    """Removes all duplicate rows from the :class:`Dataset` object
+    while maintaining the original order."""
+    seen = set()
+    self._data[:] = [row for row in self._data if not (tuple(row) in seen or seen.add(tuple(row)))]
+```
+
+Check the `if ` statement followed by [_generator expression_](https://dbader.org/blog/python-generator-expressions). If you look closely inside generator expression the technique used to check for duplicate rows is called [_short circuit technique_](https://www.geeksforgeeks.org/short-circuiting-techniques-python/) implemented in python.
+
+
+[Short circuit explained by official docs](https://docs.python.org/2/library/stdtypes.html#boolean-operations-and-or-not):
+
+|Operation|Result|Notes|
+|---|---|---|
+|`x or y` |if x is false, then y, else x| Only evaluates the second argument(`y`) if the first one is `False`.|
+|`x and y`|if x is false, then x, else y| Only evaluates the second argument(`y`) if the first one(`x`) is `True`.|
+|`not x`|if x is false, then True, else False|`not` has a lower priority than non-Boolean operators|
+
+<br>
+`remove_duplicates` method uses 1st and 3rd Operation from above table.
+
+Key thing to remember is:
+
+**The evaluation of expression takes place from left to right.**
+
+Explained with toy example:
+```python
+>>> _data = [[1, 2, 3], [4, 5, 6], [1, 2, 3]]
+>>> seen = set()
+>>> data_deduplicated = [row for row in _data if not (tuple(row) in seen or seen.add(tuple(row)))]
+
+>>> print(data_deduplicated)
+# [[1, 2, 3], [4, 5, 6]]
+```
+
+To put it into words, within list comprehension - iterate over data row by row and check if given row is present within `seen` _set_. If it's not present, meaning
+```python
+tuple(row) in seen
+```
+evaluates to `False` and as per 1st operation from the table, evaluate second argument which is to add given row in `seen` _set_. Furthermore, `if not ()` condition gets satisfied and given row is added to outer list. Subsequently, if the same row occurs then we know it's already in `seen` _set_ and hence that row will not be added to outer list. In overall, resulting into removing of duplicate rows.
+
+If you are more of a visual learning person, following demonstartion using [Python tutor tool](http://pythontutor.com/) built by an outstanding academic and prolific blogger - [Philip Guo](http://pgbovine.net) - would help*:
+> *If below IFrame is not visible then please enable **"load unsecure script"** of your browser. Don't worry! it's saying unsecure because of http protocol used by [Python tutor](http://pythontutor.com/) and not **https**.
+
+<iframe width="820" height="650" frameborder="1.5" src="http://pythontutor.com/iframe-embed.html#code=_data%20%3D%20%5B%5B1,2,3%5D,%20%5B4,5,6%5D,%20%5B1,2,3%5D%5D%0Aseen%20%3D%20set%28%29%0Adata_deduplicated%20%3D%20%5Brow%20for%20row%20in%20_data%20if%20not%20%28tuple%28row%29%20in%20seen%20or%20seen.add%28tuple%28row%29%29%29%5D&codeDivHeight=400&codeDivWidth=350&cumulative=false&curInstr=6&heapPrimitives=nevernest&origin=opt-frontend.js&py=2&rawInputLstJSON=%5B%5D&textReferences=false"> </iframe>
+
+I hope by now you have understood [_short circuit technique_](https://www.geeksforgeeks.org/short-circuiting-techniques-python/) and importance of reading open source code. So keep exploring and do share your experience with me. Thank you! :)