Babagaboosh

Simple app that lets you have a verbal conversation with OpenAi's GPT 4. Originally written by DougDoug, his explanation of the code .

SETUP:

This was written in Python 3.9.2. Install page here: https://www.python.org/downloads/release/python-392/
Run pip install -r requirements.txt to install all modules.
This uses the Microsoft Azure TTS, llamaindex and OpenAi services. You'll need to set up an account with these services and generate an API key from them. Then add these keys to .env file with:

AZURE_TTS_KEY = "some_key"
AZURE_TTS_REGION = "norwayeast"
OPENAI_API_KEY = "some_key"

This app uses the GPT-4 model from OpenAi. As of this writing (Jan 13 2024), you need to pay $1 to OpenAi in order to get access to the GPT-4 model API. So after setting up your account with OpenAi, you will need to pay for at least $1 in credits so that your account is given the permission to use the GPT-4 model when running my app. See here. Microsoft Azure is the service for AI voices.
Add your data to a folder named data and get context from existing data by running get_context.py.
Can estimate cost of running a query to chatgpt by running estimate_cost.py script on the data

Run `chatgpt_character.py'
Once it's running, press F4 to start the conversation, and Azure Speech-to-text will listen to your microphone and transcribe it into text.
Once you're done talking, press P. Then the code will send all of the recorded text to the Ai. Note that you should wait a second or two after you're done talking before pressing P so that Azure has enough time to process all of the audio.
Wait a few seconds for OpenAi to generate a response and for Elevenlabs to turn that response into audio. Once it's done playing the response, you can press F4 to start the loop again and continue the conversation.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.gitignore		.gitignore
ChatHistoryBackup.txt		ChatHistoryBackup.txt
LICENSE		LICENSE
Pajama Sam 1.png		Pajama Sam 1.png
README.md		README.md
TestAudio_MP3.mp3		TestAudio_MP3.mp3
TestAudio_WAV.wav		TestAudio_WAV.wav
audio_player.py		audio_player.py
azure_speech_to_text.py		azure_speech_to_text.py
azure_text_to_speech.py		azure_text_to_speech.py
chatgpt_character.py		chatgpt_character.py
create_context.py		create_context.py
eleven_labs.py		eleven_labs.py
estimate_cost.py		estimate_cost.py
generate-enviroment.sh		generate-enviroment.sh
generate-eviroment.bat		generate-eviroment.bat
obs_websockets.py		obs_websockets.py
openai_chat.py		openai_chat.py
requirements.txt		requirements.txt
websockets_auth.py		websockets_auth.py