Solutions of the course assignment.
- Understanding of the effective modern methods for deep learning.
- Basics first, then key methods used in NLP: Recurrent networks, attention, etc.
- Big picture understanding of human languages and the difficulties in understanding and producing them.
- Understanding of and ability to build systems (in PyTorch) for some of the major problems in NLP:
- Word meaning, dependency parsing, machine translation, question answering.
- Implementing co-occurence matrix.
- Utilizing the SVD concept to reduce embedding dimentionality.
- Analysing similarity between words.
- Implementing the negative sampling loss and gradient.
- Implementing the skip-gram model.
This assignment is to implement a Neural Transition-Based Dependency Parsing model.
- Implementing a parsing step given a transition type.
- Implementing a minibatch parsing given a batch of sentences.
- Implementing a neural model using PyTorch to predict a transition.
- Implementing the encoder layer to generate the representation of sentences for the decoder.
- Implementing the decoder layer which is a a one step model repeated for every step in the input.
This layer computes the attention scores over the inputs for each step in the decoding process.
This assignmnet is to experience pretraining and how finetuning a language model gives access to its hidden knowledge.
- Walking through the minGPT demo provided as most of the code is inherited from it.
- Implementing the
fintuning
setup to setup the training configurations and save the model. - Implementing the naiive approach by predicting London for every example in
london_baseline.py
. - Implementing the Span Corruption Function to introduce noise to the text by randomly selecting the span to be masked.
- Implementing the Synthesizer Attention which eliminates the quadratic regular attention.
The chosen final project is IID SQuAD in which the Bidirectional Attention Flow model in used as the baseline.
The requirements is to enhance the model via any additions whether from another research or comming up with a novel idea.
The project will be submitted in a separate repository later.
- Course website
- Instructors