WER calculator

WER calculator is a Heroku Flask app that records you saying a sentence that is difficult for speech recognition software to detect and calculates the Word Error Rate (WER) between the reference sentence and Google Web To Speech API's hypothesis.

How to use it

Go to the Heroku app page and follow the instructions on the page.

Example output

How it works

I implemented the Levenshtein algorithm and generation of a sequence of edit steps from scratch in Python based on the description in Juraskfy and Martin (2008, pp. 74-77). You can see the Levenshtein matrix generated by clicking the "show Levenshtein matrix" link displayed after recording.

The reference sentence and audio data are sent via a POST request to the Flask app, which sends back information about the Word Error Rate as an HTML string.

Credits

The Python SpeechRecognition library is used to interface with the Google Web Speech API.
Matt Diamond's Recorder.js plugin is used to record and export audio.
Example sentences come from Will Styler's article Doing Terrible things to Speech Recognition Software.

References

Jurafsky, D. and Martin J. H. (2008), Speech and language processing. 2nd edn. New Jersey: Prentice Hall.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
readme-img		readme-img
wer_calculator		wer_calculator
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WER calculator

How to use it

Example output

How it works

Credits

References

About

Releases

Packages

Languages

ljdyer/wer-calculator

Folders and files

Latest commit

History

Repository files navigation

WER calculator

How to use it

Example output

How it works

Credits

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages