ChatGPTVoice

Based on Whisper and PyQt (PySide6), a real-time voice chat tool for GPT that supports conversation history. Enjoy voice chats with GPT without relying on ChatGPT Plus. 🐔🐔

Requirement

python >= 3.10
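
A minimal sketch of how the version requirement could be checked before launching. The helper name `python_is_supported` is hypothetical and not part of this repository.

```python
import sys

MIN_PYTHON = (3, 10)  # minimum version stated in this README


def python_is_supported(version=None):
    """Return True if the given (or running) interpreter meets MIN_PYTHON."""
    version = sys.version_info if version is None else version
    # Compare only (major, minor) so that 3.10.0 counts as supported.
    return tuple(version[:2]) >= MIN_PYTHON


# Report on the interpreter that is running this snippet.
print("supported" if python_is_supported() else "Python 3.10 or newer is required")
```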

OS

Windows 10+, Linux (tested only on Ubuntu, where it works), macOS (should work in theory by following the Linux steps, but untested).

GPU

The Whisper base model needs less than 1 GB of available memory. Its results are passable: roughly 90% accuracy when the speech is clear and there is no background noise. The Whisper large model needs over 8 GB of available memory, but it performs excellently. Even my poor spoken English is recognized fairly accurately, and it handles long speech segments and interruptions quite well.

In summary, the base model is easier to run, but the large model is recommended if your hardware allows it. If the recognition is wrong, you can edit the recognized text directly in the GUI.
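
The rule of thumb above can be sketched as a small helper. This is only an illustration: the function name `pick_whisper_model` is hypothetical, the thresholds are the rough figures quoted in this section, and the returned strings are just labels for the two models discussed here.

```python
def pick_whisper_model(free_memory_gb: float) -> str:
    """Suggest a Whisper model from the available memory in GB.

    Thresholds follow this README: the large model needs over 8 GB,
    the base model needs less than 1 GB.
    """
    if free_memory_gb > 8:
        return "large"
    if free_memory_gb >= 1:
        return "base"
    # Below 1 GB the README gives no guarantee that even base will fit.
    raise ValueError("less than 1 GB available; the base model may not fit")


print(pick_whisper_model(12))  # a host with 12 GB can use the large model
print(pick_whisper_model(4))   # a smaller GPU falls back to base
```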

Install

Clone repo

git clone https://github.com/QureL/ChatGPTVoice.git
cd ChatGPTVoice

Create and activate a virtual environment. The commands below are for PowerShell; in Bash, activate the environment with its activate script instead (for example, source venv/bin/activate).

mkdir venv
python -m venv .\venv\
.\venv\Scripts\Activate.ps1

Install dependencies.

pip install -r requirements.txt

On Linux, also run the following commands to install the required system dependencies.

apt install portaudio19-dev python3-pyaudio
apt install espeak
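
After installing, a quick check that the main packages are importable can save a failed launch. The sketch below is an assumption-laden example: `missing_modules` is a hypothetical helper, and the import names in `CANDIDATES` are guesses based on the packages mentioned in this README (requirements.txt is the authoritative list).

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed import names for PySide6, Whisper and PyAudio.
CANDIDATES = ["PySide6", "whisper", "pyaudio"]

missing = missing_modules(CANDIDATES)
print("missing:", ", ".join(missing) if missing else "none")
```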

Run

Run the program from within the virtual environment.

python ./main.py

Running Whisper remotely

I have a Linux host with 12 GB of GPU memory and a laptop with a weak 1650 GPU. To run the Whisper large model in a setup like this, host Whisper on the Linux machine and let the client talk to it over WebSocket.

Linux:

python script/whisper_server.py --model large-v2

Client:

python .\main.py --whisper_mode remote --whisper_address ws://{Your Linux IP}:3001
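
To avoid typos in the address, the client command can be assembled programmatically. This is a sketch only: `whisper_ws_address` and `client_command` are hypothetical helpers, port 3001 is taken from the example above (it may not be the server's only option), and the IP address is a placeholder.

```python
from urllib.parse import urlsplit


def whisper_ws_address(host: str, port: int = 3001) -> str:
    """Build the WebSocket address passed to --whisper_address."""
    if not host or any(ch.isspace() for ch in host):
        raise ValueError(f"invalid host: {host!r}")
    if not 0 < port < 65536:
        raise ValueError(f"invalid port: {port}")
    return f"ws://{host}:{port}"


def client_command(host: str, port: int = 3001) -> list[str]:
    """Return the argument list for starting the client in remote mode."""
    return [
        "python", "main.py",
        "--whisper_mode", "remote",
        "--whisper_address", whisper_ws_address(host, port),
    ]


# Placeholder IP; replace it with the address of your Linux host.
address = whisper_ws_address("192.168.1.20")
parts = urlsplit(address)
print(parts.scheme, parts.hostname, parts.port)
print(" ".join(client_command("192.168.1.20")))
```

The resulting list can be passed to `subprocess.run` from the repository root if you prefer launching the client from a script.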

Proxy for OpenAI

python .\main.py --proxy http://127.0.0.1:10809

After enabling the proxy, all OpenAI GPT requests and model downloads will pass through the proxy node.
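
For readers unfamiliar with HTTP proxies, the sketch below shows the general idea using only the Python standard library. It is not necessarily how ChatGPTVoice applies --proxy internally; `proxy_opener` is a hypothetical helper, and no request is actually sent.

```python
import urllib.request


def proxy_opener(proxy_url: str) -> urllib.request.OpenerDirector:
    """Build a urllib opener that routes HTTP and HTTPS through proxy_url."""
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)


# Same proxy address as in the command above.
opener = proxy_opener("http://127.0.0.1:10809")
# Calling opener.open(url) would now go through the proxy node.
```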

Thanks

PyQt-Fluent-Widgets: a fluent design widgets library based on PyQt5
