SpeechRecPipeLine

Pipeline for speech recognition, covering everything from sound source localisation to natural language processing

Disclaimer

This speech recognition pipeline is in development and is currently not recommended for use. Requirements will most likely change in the near future: we will move away from Jack audio as the sound framework to a more dedicated framework called esiaf (see https://github.com/Slothologist/esiaf_ros), which is currently being worked on.

Principles

Audio is transmitted between most stages via Jack audio (TODO: find a way to reliably match SSL results to recognized speech)
Additional information is transmitted using ROS
Highly modular design to ensure maximum reusability and flexibility

Stages

Recording
- no special software available yet; recording can be done by jackaudio itself
- (will in the future be done by a dedicated esiaf node)
Sound Source Localisation, Separation, Filtering
- will most likely be based on ODAS, but there is no implementation yet
Segmentation
- AudioSegmenter
  - Has its own repo over here: https://github.com/Slothologist/AudioSegmenter
Speech recognition
- DeepSpeech4Ros
  - Has its own repo over here: https://github.com/Slothologist/DeepSpeech4Ros
Natural Language Preprocessing
- no software available yet

Dependencies

Required

Jackaudio
- sudo apt install jackd2
ROS
- see http://wiki.ros.org/ROS/Installation to install ROS. I develop on Melodic, but Kinetic should be fine as well
A somewhat recent gcc
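
To quickly see whether the required tools are installed, something like the following sketch could be used. It only checks that an executable is on the PATH. The executable names (`jackd`, `roscore`, `gcc`) are assumptions about a typical installation, and the helper `find_missing` is hypothetical and not part of this repository. It does not check versions.

```python
import shutil

# Assumed executable names for the required dependencies listed above;
# adjust them if your installation uses different names.
REQUIRED_TOOLS = {
    "Jackaudio": "jackd",
    "ROS": "roscore",
    "gcc": "gcc",
}


def find_missing(tools, which=shutil.which):
    """Return the names of dependencies whose executable is not on PATH."""
    return [name for name, exe in tools.items() if which(exe) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_TOOLS)
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All required dependencies found.")
```

Note that `roscore` is usually only found after the ROS setup script has been sourced in the current shell.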

Optional

Certain parts of the pipeline can have different or additional requirements. Be sure to check their respective repositories as well!

