Deep Learning for Natural Language Processing

Online Courses

About the Instructor

Jon Krohn is the Chief Data Scientist at untapt, a machine learning startup in New York. He leads a Deep Learning Study Group and, having obtained his doctorate in neuroscience from Oxford University, continues to publish academic papers. Jon is the author and presenter of Deep Learning with TensorFlow LiveLessons.

Skill Level


Learn How To

  • Preprocess natural language data for use in machine learning applications
  • Transform natural language into numerical representations with word2vec
  • Make predictions with Deep Learning models trained on natural language
  • Apply state-of-the-art NLP approaches with Keras, the high-level TensorFlow API
  • Improve Deep Learning model performance by tuning hyperparameters

Who Should Take This Course

These LiveLessons are perfectly-suited to software engineers, data scientists, analysts, and statisticians with an interest in applying Deep Learning to natural language data. Code examples are provided in Python, so familiarity with it or another object-oriented programming language would be helpful.

Course Requirements

The author’s earlier Deep Learning with TensorFlow LiveLessons, or equivalent foundational Deep Learning knowledge, are a prerequisite.

Table of Contents

Lesson 1: The Power and Elegance of Deep Learning for Natural Language Processing

This lesson starts off by examining Natural Language Processing and how it has been revolutionized in recent years by Deep Learning approaches. It continues with a little linguistics section that introduces the elements of natural language and breaks down how these elements are represented both by Deep Learning and by traditional machine learning approaches. This is followed up with a tantalizing overview of the broad natural language applications in which Deep Learning has emerged as state-of-the-art. The lesson then reviews how to run the code in these LiveLessons on your own machine, as well as the foundational Deep Learning theory that is essential for building an NLP specialization upon. The lesson wraps up by taking a sneak peek at the capabilities you’ll develop over the course of all five lessons.

Lesson 2 Word Vectors

The lesson begins by illustrating what word vectors are as well as how the beautiful word2vec algorithm creates them. Subsequently, the lesson arms you with a rich set of natural language data sets that you can train powerful Deep Learning models, and then swiftly moves along to leveraging those data to generate word vectors of your own.

Lesson 3 Modeling Natural Language Data

In the previous lesson, you learned about vector-space embeddings and created word vectors with word2vec. In that process, we identified shortcomings of our natural language data, so the current lesson begins with coverage of Natural Language Processing best practices. Next, on the whiteboard, the author works through how to calculate a concise and broadly useful summary metric called the Area Under the Curve of the Receiver Operator Characteristic. We immediately calculate that summary metric in practice by building and evaluating a dense neural network for classifying documents. The lesson then goes a step further by showing you how to add convolutional layers into your deep neural network as well.

Lesson 4 Recurrent Neural Networks

This lesson kicks off by delving into the essential theory of Recurrent Neural Networks, a Deep Learning family that’s ideally suited to handling data that occur in a sequence like languages. You immediately apply this theory by incorporating an RNN into your document classification model. The author then briefly returns to the whiteboard to provide a high-level theoretical overview of especially powerful RNN variants‚Äìthe Long Short-Term Memory Unit and the Gated Recurrent Unit (4.3)‚Äìbefore incorporating these into your Deep Learning models as well.

Lesson 5 Advanced Models

This lesson expands our natural language modeling capabilities further by examining special cases of the LSTM, namely the Bi-Directional and Stacked varieties. We then take a mind-bending journey into the world of non-sequential network architectures‚Äìwhere instead of only stacking neural layers on top of each other as we’ve always done‚Äìwe run layers side-by-side in parallel as well. To wrap up these LiveLessons, the author summarizes the hyperparameters that we can consider tuning to optimize model performance.


Course Detail
Classes Start

Query Form

Ducat: 70-70-90-50-90

Noida - Delhi - GR.Noida - Ghaziabad Faridabad - Gurugram

Events Gallery


Ducat provides the best available programs which helps in enhancing the technical skills which seems to be beneficial for all the applicants.

© Copyright 1999-2019 Ducat Creative, All rights reserved.