Add Support for OpenAI Whisper API as Alternative Transcription Service (Issue #137) #150

agentmarketbot · 2025-01-26T16:16:07Z

Pull Request Description

Background

This pull request addresses issue #137, titled "support additional whisper services (focus on openai whisper api)", which can be viewed here. The objective of this enhancement is to integrate the OpenAI Whisper API as an additional transcription backend in the existing Telegram bot. This functionality will complement the current AWS Transcribe service, providing users with more options and flexibility regarding transcription performance, cost-effectiveness, and personalization based on individual needs.

Summary of Changes

This PR implements significant updates to the transcription capabilities of the Telegram bot. Specifically, the changes include:

Introduction of New Transcription Services:
- Added support for the OpenAI Whisper API by creating a new transcription service class called OpenAITranscriptionService.
- Refactored the existing AWS Transcribe functionality into a dedicated class named AWSTranscriptionService.
- Established an abstract base class TranscriptionService to standardize the interface across various transcription services.
Service Architecture Update:
- Modified the bot handlers to support the updated transcription service architecture.
- Integrated environment variable options that allow users to select their preferred transcription service.
Environment Configuration:
- Users can now set the TRANSCRIPTION_SERVICE environment variable to either 'aws' or 'openai'. By default, the system will utilize AWS Transcribe if no preference is indicated.
- Required authentication details:
  - For AWS: Users must provide AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY.
  - For OpenAI: Users must provide OPENAI_API_KEY.
Documentation Updates:
- The README.md file has been thoroughly updated to include the new features, prerequisites, and detailed configuration instructions for utilizing both transcription services.
Dependency Management:
- Introduced the necessary OpenAI package as a required dependency of the project.

Next Steps

To utilize this new functionality, please ensure that the environment variables are accurately configured to reflect the desired transcription service. Once set up, you can run the bot to test and confirm that both AWS Transcribe and OpenAI transcription services function as intended.

Feel free to reach out if any questions or further assistance are needed. Thank you for considering this enhancement!

Implement alternative transcription service using OpenAI Whisper API: - Create abstract TranscriptionService base class - Add AWSTranscriptionService and OpenAITranscriptionService implementations - Update configuration to support service selection via TRANSCRIPTION_SERVICE env var - Add OpenAI dependencies and configuration requirements - Update documentation with new service options and setup instructions - Enhance logging to include transcription service type The change allows users to choose between AWS Transcribe for enterprise-grade transcription or OpenAI Whisper API for high-accuracy multilingual support.

vadanrod14 closed this Jan 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Support for OpenAI Whisper API as Alternative Transcription Service (Issue #137) #150

Add Support for OpenAI Whisper API as Alternative Transcription Service (Issue #137) #150

agentmarketbot commented Jan 26, 2025

Add Support for OpenAI Whisper API as Alternative Transcription Service (Issue #137) #150

Add Support for OpenAI Whisper API as Alternative Transcription Service (Issue #137) #150

Conversation

agentmarketbot commented Jan 26, 2025

Pull Request Description

Background

Summary of Changes

Next Steps