A Chrome extension for WhatsApp Web that captures voice notes, identifies mentioned names, and automatically tags those people in the chat.
Check the assets folder.
- Records and processes voice notes sent through WhatsApp Web
- Uses speech recognition to transcribe audio
- Identifies the addressee/recipient mentioned in the voice note
- Automatically tags the identified person in the chat
- When you record a voice note in WhatsApp Web, the extension captures the audio in real-time
- When you send the voice note, the audio is processed to:
- Transcribe the speech to text
- Identify who is being addressed in the message
- The extension then automatically sends a follow-up message tagging the identified person
- Download or clone this repository
- Open Chrome and go to
chrome://extensions/
- Enable "Developer mode" (toggle in the top-right corner)
- Click "Load unpacked" and select the extension directory
- The extension is now installed and active
The extension requires a server component to process the audio. You can: Run the server locally:
python main.py
And set the Server URL in the extension popup to http://localhost:5000
- Open WhatsApp Web at https://web.whatsapp.com/
- Start a chat with someone
- Record a voice note saying something like "Hey John, can you check this document?"
- Send the voice note
- The extension will automatically tag John in a follow-up message
Click the extension icon in Chrome's toolbar to access settings:
- Enable/Disable: Toggle the extension on or off
- Server URL: Set the URL of the processing server
- Content Script: Monitors WhatsApp Web for voice recordings and captures audio
- Background Script: Handles communication between content script and server
- Server: Processes audio using AI models for transcription and name entity recognition
- Popup: Provides user interface for configuration
- All audio processing happens on the server
- No recordings are stored permanently
- Only text transcriptions are used for identifying addressees