voice-js is a JavaScript library for converting text to speech and speech to text, with customizable options for language, volume, rate, and pitch. It also provides a utility to fetch available voices for enhanced text-to-speech experiences.
npm install @webbro-software/voice-js
- Text-to-Speech: Convert text to spoken audio with customizable settings.
- Speech-to-Text: Convert spoken words to text using the Web Speech API.
- Voice Management: Retrieve available voices for text-to-speech, with error handling for unsupported environments.
import {
  speechToText,
  textToSpeech,
  getVoices,
} from "@webbro-software/voice-js";
The textToSpeech function converts the provided text into spoken audio, with options for language, volume, rate, pitch, and voice.
textToSpeech("Hello, world!", "en-US", 1, 1, 1, 5);
- text (string): Text to be spoken.
- lang (string): Language code for the voice (default: "en-US").
- volume (number): Volume of the speech (range: 0 to 1, default: 1).
- rate (number): Speed of the speech (range: 0.1 to 10, default: 1).
- pitch (number): Pitch of the speech (range: 0 to 2, default: 1).
- voice (number): Index of the voice to use from the available voices list (default: 5).
Note: Voice availability and the exact number of voices vary across browsers.
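The documented ranges can be enforced before calling textToSpeech. The following is a minimal sketch, not part of voice-js: clamp and normalizeSpeechOptions are hypothetical helper names, and whether the library validates these values itself is not stated here.

```javascript
// Hypothetical helpers (not part of voice-js): keep values inside the
// ranges documented for volume, rate, and pitch.
function clamp(value, min, max) {
  return Math.min(max, Math.max(min, value));
}

function normalizeSpeechOptions({ volume = 1, rate = 1, pitch = 1 } = {}) {
  return {
    volume: clamp(volume, 0, 1), // documented range: 0 to 1
    rate: clamp(rate, 0.1, 10), // documented range: 0.1 to 10
    pitch: clamp(pitch, 0, 2), // documented range: 0 to 2
  };
}

// Browser usage (assumes textToSpeech has been imported as shown earlier):
// const { volume, rate, pitch } = normalizeSpeechOptions({ volume: 1.5, rate: 0.05 });
// textToSpeech("Hello, world!", "en-US", volume, rate, pitch, 5);
```

Clamping in one place keeps out-of-range input, for example from a settings slider, from reaching the speech engine.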
The speechToText function listens to spoken input and returns it as a text string.
const recognition = speechToText();
recognition
  .getTranscript()
  .then((transcript) => console.log("Recognized Text:", transcript))
  .catch((error) => console.error("Error:", error));

// To stop listening
recognition.stopSpeech();
- getTranscript(): Starts listening and returns a promise with the recognized text. Includes error handling for issues such as unsupported environments or recognition failures.
- stopSpeech(): Stops the current speech recognition session manually.
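Because getTranscript() returns a promise and stopSpeech() ends the session manually, the two can be combined to stop listening after a fixed time. The sketch below assumes nothing about the library beyond those two methods. withTimeout is a hypothetical helper and the 10-second limit is an arbitrary example value.

```javascript
// Hypothetical helper (not part of voice-js): rejects if `promise` has not
// settled within `ms` milliseconds, calling `onTimeout` first.
function withTimeout(promise, ms, onTimeout = () => {}) {
  let timer;
  const timeout = new Promise((_, reject) => {
    timer = setTimeout(() => {
      onTimeout();
      reject(new Error(`Timed out after ${ms} ms`));
    }, ms);
  });
  // Whichever settles first wins; the timer is always cleared afterwards.
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer));
}

// Browser usage (assumes speechToText has been imported as shown earlier):
// const recognition = speechToText();
// withTimeout(recognition.getTranscript(), 10000, () => recognition.stopSpeech())
//   .then((transcript) => console.log("Recognized Text:", transcript))
//   .catch((error) => console.error("Error:", error));
```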
The getVoices function retrieves all available voices for text-to-speech, useful for selecting specific voices.
getVoices();
The getVoices function logs the list of available voices to the console when window.speechSynthesis.onvoiceschanged fires, which occurs when new voices are loaded. This allows you to inspect and select voices for textToSpeech.
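Since textToSpeech takes a voice index, it can help to look that index up by language rather than hard-coding it. The sketch below uses a hypothetical helper, findVoiceIndex, over an array of objects with a lang property, which is the shape returned by the browser's window.speechSynthesis.getVoices(). It is an assumption, not something this README confirms, that the library's voice index refers to that same list.

```javascript
// Hypothetical helper (not part of voice-js): returns the index of the first
// voice matching `lang` exactly, else the first sharing its primary language
// subtag (the "en" in "en-US"), else -1.
function findVoiceIndex(voices, lang) {
  // Some platforms report tags with underscores, e.g. "en_US".
  const normalize = (tag) => tag.toLowerCase().replace(/_/g, "-");
  const wanted = normalize(lang);
  const exact = voices.findIndex((voice) => normalize(voice.lang) === wanted);
  if (exact !== -1) return exact;
  const primary = wanted.split("-")[0];
  return voices.findIndex((voice) => normalize(voice.lang).split("-")[0] === primary);
}

// Browser usage (assumption: the index applies to this same list):
// const index = findVoiceIndex(window.speechSynthesis.getVoices(), "en-GB");
// if (index !== -1) textToSpeech("Hello, world!", "en-GB", 1, 1, 1, index);
```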
- speechToText: Throws errors if the Web Speech API isn’t supported. Provides meaningful error messages if recognition fails, such as "Speech recognition error: network" or "Speech recognition ended without capturing results."
- textToSpeech and getVoices: Both functions check for browser compatibility and handle unsupported environments gracefully.
This library uses the Web Speech API, supported by most modern browsers. For speech recognition, ensure microphone access is enabled in the browser.
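An application can check for Web Speech API support up front, for example to hide a microphone button where recognition is unavailable. This is a minimal sketch with a hypothetical helper, detectSpeechSupport. It relies only on the standard browser globals speechSynthesis, SpeechSynthesisUtterance, SpeechRecognition, and the prefixed webkitSpeechRecognition, and it does not reflect how voice-js performs its own checks.

```javascript
// Hypothetical helper (not part of voice-js): reports which Web Speech API
// features the given global scope exposes.
function detectSpeechSupport(scope = globalThis) {
  return {
    synthesis:
      "speechSynthesis" in scope &&
      typeof scope.SpeechSynthesisUtterance === "function",
    // Some browsers only expose the prefixed constructor.
    recognition:
      typeof (scope.SpeechRecognition || scope.webkitSpeechRecognition) === "function",
  };
}

// Browser usage:
// const support = detectSpeechSupport(window);
// if (!support.recognition) console.warn("Speech recognition is not available.");
```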
MIT License. See LICENSE for more details.