TARS

Top-notch Automatic speech Recognition System (TARS)

Installation

Follow the instructions as the original wavenet implementation.

However, instead of installing scikits.audiolab, use soundfile instead, which supports Python 3.x

Also, when compiling Libsndfile from source / installing from brew, make sure to enable Flac support

MacOS: (Courtesy of this person)

brew install libsndfile --with-lame --with-flac --with-libvorbis
brew link --overwrite libsndfile

Also, installed ffmpeg

Important files

preprocess.py - written by someone else,used to create lmfcc from the sound files and store those the labels in asset/data/preprocess folder Conf.py - Parameters and alphabet mapping

train.py - Main file for training, run this to get the log files which you can plot in tensorboard model.py - Creates the graph for the model dataloder.py - The dataloading pipeline, loads data efficiently. Does the batching too! test_ctc.py - Some experiments, can be ignored

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
Misc		Misc
saved		saved
wavenet		wavenet
.gitignore		.gitignore
MFCC		MFCC
README.md		README.md
Untitled Diagram.xml		Untitled Diagram.xml
conf.py		conf.py
dataloader.py		dataloader.py
model.py		model.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
speechSynthesisGtts.py		speechSynthesisGtts.py
speechSynthesisWavenet.py		speechSynthesisWavenet.py
test_ctc.py		test_ctc.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TARS

Installation

Important files

About

Releases

Packages

Contributors 2

Languages

yonkshi/TARS

Folders and files

Latest commit

History

Repository files navigation

TARS

Installation

Important files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages