Skip to content

k1vd/Python-ChatGPT-Voice-Assistant

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ChatGPT Voice Assistant with role selection

A simple ChatGPT voice assistant using OpenAI API and Gradio to provide a user interface where users can record their queries, choose a role for ChatGPT to take and receive spoken responses tailored to that role.

Requirements

  • Python 3.6 or higher
  • OpenAI
  • Gradio
  • gTTS
  • Soundfile
  • FFmpeg

Installation

1. Clone the repository:

git clone https://github.com/your-github-username/chatgpt-voice-assistant.git

2. Navigate to the directory:

cd chatgpt-api-whisper-api-voice-assistant

3. Create a virtual environment:

python3 -m venv venv

4. Activate the virtual environment:

  • MacOS:

    source venv/bin/activate
  • Windows:

    .\venv\Scripts\activate

5. Install the dependencies:

pip install -r requirements.txt

6. Add OpenAI API key to config.py:

If not done already, please create a config.py file in the project folder and add the OpenAI API key you obtained from the OpenAI Dashboard

OPENAI_API_KEY = "your_openai_api_key_here"

Replace your_openai_api_key_here with your actual API key.

Installing FFmpeg

  • For Windows:

Download the FFmpeg binaries from the official website: FFmpeg Download-Seite Extract the downloaded files and add the extracted folder containing the FFmpeg exe files (usually bin) to the PATH environment variable. You can follow instructions on how to do that here: Install FFmpeg on Windows

  • For MacOS:

brew install ffmpeg
  • For Linux:

sudo apt-get update
sudo apt-get install ffmpeg

Execution

To start the ChatGPT voice assistant, run the main script:

python assistant.py

Or start your IDE Python environment with the assistant.py script.

After that, open your web browser and navigate to the displayed URL (usually http://localhost:7860/), to use the voice assistant interface.

Author

Author: Martin Haferanke - Original version forked by: Hacking the Markets, Part Time Larry

About

A ChatGPT voice assistant using speech input and output, leveraging Gradio, OpenAI API, and Whisper ASR, to perform versatile roles in a user-friendly interface.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages