From 0e40e9fbe6496916a7659e0e14582f339bb7c7ba Mon Sep 17 00:00:00 2001 From: Tanner Gilbert <36239763+TannerGilbert@users.noreply.github.com> Date: Mon, 19 Oct 2020 18:55:57 +0200 Subject: [PATCH] Fixing tanh formula and typo in 01_intro (#276) * fixing error in tanh formula #273 * fixes typo described in #241 --- 01_intro.ipynb | 4 ++-- 12_nlp_dive.ipynb | 4 ++-- 2 files changed, 4 insertions(+), 4 deletions(-) diff --git a/01_intro.ipynb b/01_intro.ipynb index e9cd461..07a9f31 100644 --- a/01_intro.ipynb +++ b/01_intro.ipynb @@ -2282,7 +2282,7 @@ "cell_type": "markdown", "metadata": {}, "source": [ - "A brief one-line explanation is provided by `doc`. The \"Show in docs\" link take you to the full documentation, where you'll find all the details and lots of examples. Also, most of fastai's methods are just a handful of lines, so you can click the \"source\" link to see exactly what's going on behind the scenes.\n", + "A brief one-line explanation is provided by `doc`. The \"Show in docs\" link takes you to the full documentation, where you'll find all the details and lots of examples. Also, most of fastai's methods are just a handful of lines, so you can click the \"source\" link to see exactly what's going on behind the scenes.\n", "\n", "Let's move on to something much less sexy, but perhaps significantly more widely commercially useful: building models from plain *tabular* data." ] @@ -2963,4 +2963,4 @@ }, "nbformat": 4, "nbformat_minor": 4 -} +} \ No newline at end of file diff --git a/12_nlp_dive.ipynb b/12_nlp_dive.ipynb index 3738fa2..8ebfd2b 100644 --- a/12_nlp_dive.ipynb +++ b/12_nlp_dive.ipynb @@ -1582,7 +1582,7 @@ "source": [ "In this picture, our input $x_{t}$ enters on the left with the previous hidden state ($h_{t-1}$) and cell state ($c_{t-1}$). The four orange boxes represent four layers (our neural nets) with the activation being either sigmoid ($\\sigma$) or tanh. tanh is just a sigmoid function rescaled to the range -1 to 1. Its mathematical expression can be written like this:\n", "\n", - "$$\\tanh(x) = \\frac{e^{x} + e^{-x}}{e^{x}-e^{-x}} = 2 \\sigma(2x) - 1$$\n", + "$$\\tanh(x) = \\frac{e^{x} - e^{-x}}{e^{x}+e^{-x}} = 2 \\sigma(2x) - 1$$\n", "\n", "where $\\sigma$ is the sigmoid function. The green circles are elementwise operations. What goes out on the right is the new hidden state ($h_{t}$) and new cell state ($c_{t}$), ready for our next input. The new hidden state is also used as output, which is why the arrow splits to go up.\n", "\n", @@ -2362,4 +2362,4 @@ }, "nbformat": 4, "nbformat_minor": 2 -} +} \ No newline at end of file