Characted-Identification-in-Multi-Party-Dialogues

Main project repository for the NLP course, 2018.
The goal of this project was to assign each mention in a dialogue to the entity it refers to. This is also known as the coreference resolution problem.

Requirements

Data is taken from https://github.com/emorynlp/character-mining

For pretrained word embeddings, GloVe is used.

Create a directory pretrained_embeds/ in the same directory as this notebook. Download the Twitter embeddings from http://nlp.stanford.edu/data/glove.twitter.27B.zip, unzip the archive, and place the file glove.twitter.27B.25d.txt in the pretrained_embeds/ directory.
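Once the file is in place, the vectors can be read into memory. The following is a minimal sketch, not the code used in feature_generation.ipynb: the function name `load_glove` is hypothetical, and it assumes the usual GloVe text layout of one token per line followed by space-separated floats (25 of them for glove.twitter.27B.25d.txt).

```python
def load_glove(path, dim=25):
    """Parse a GloVe-style text file into a {token: list of floats} dict.

    Hypothetical helper for illustration; assumes each line is
    '<token> <v1> <v2> ... <v_dim>' separated by single spaces.
    """
    embeddings = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            parts = line.rstrip("\n").split(" ")
            if len(parts) < dim + 1:
                continue  # skip blank or malformed lines
            # The last `dim` fields are the vector; the rest is the token.
            token = " ".join(parts[:-dim])
            try:
                vector = [float(value) for value in parts[-dim:]]
            except ValueError:
                continue  # skip lines whose vector fields are not numeric
            embeddings[token] = vector
    return embeddings
```

A call such as `load_glove("pretrained_embeds/glove.twitter.27B.25d.txt")` would then return a dictionary keyed by token. The full vocabulary is large, so loading it takes noticeable time and memory.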

Create an empty directory data/ in the same directory as this notebook; all processed data will be saved there.
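The directory setup described above can also be scripted. This is a small sketch using only the standard library; the helper name `prepare_directories` is an assumption for illustration and is not part of the repository.

```python
from pathlib import Path


def prepare_directories(base_dir="."):
    """Create pretrained_embeds/ and data/ under base_dir.

    Hypothetical helper. Returns True if the GloVe file named in the
    requirements is already present, False otherwise.
    """
    base = Path(base_dir)
    embeds_dir = base / "pretrained_embeds"
    data_dir = base / "data"
    # exist_ok=True makes the call safe to repeat.
    embeds_dir.mkdir(parents=True, exist_ok=True)
    data_dir.mkdir(parents=True, exist_ok=True)
    glove_file = embeds_dir / "glove.twitter.27B.25d.txt"
    return glove_file.is_file()
```

Calling `prepare_directories()` from the notebook's directory creates both folders and reports whether the embeddings still need to be downloaded.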

Folders/Files description

json_data/ - Folder contains data downloaded from https://github.com/emorynlp/character-mining

data/ - Folder contains the processed data that has been converted to features for training.

models/ - trained models are saved here.

feature_generation.ipynb - Generates the data fed to the neural network (word-embedding features).

model.py - Contains the classes that define the neural network model.

train.py - Takes the features and the model, trains the model, and saves the trained models in the models/ folder.

evaluate.py - Calculates accuracy on the test data using the saved models.
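For reference, accuracy here means the fraction of mentions assigned to the correct entity. The sketch below illustrates that metric only; the function name `accuracy` and the label values are assumptions, and it does not reproduce how evaluate.py loads models or data.

```python
def accuracy(predicted, gold):
    """Fraction of mentions whose predicted entity matches the gold entity.

    Hypothetical helper for illustration; both arguments are equal-length
    sequences of entity labels, one per mention.
    """
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0  # no mentions to score
    correct = sum(1 for p, g in zip(predicted, gold) if p == g)
    return correct / len(gold)
```

With illustrative labels, `accuracy(["Ross", "Rachel", "Monica"], ["Ross", "Rachel", "Joey"])` scores two of three mentions as correct.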
