A powerful web-based tool that uses Google's Gemini AI to analyze your images, providing detailed descriptions, insights, and suggestions - all through a simple user interface.
- Web UI: Easy-to-use interface with image preview
- Multiple Image Upload: Process multiple images at once with the same style/prompt
- Multiple Analysis Types: Choose from 10+ pre-defined analysis options
- Custom Prompts: Ask specific questions about your images
- Real-time Feedback: Get detailed AI-powered image analysis
- Simple Setup: Works on Windows, Mac, and Linux
- Python 3.7 or higher
- Google Gemini API key (get one free at Google AI Studio)
- Internet connection
git clone https://github.com/yourusername/gemini-image-editor.git
cd gemini-image-editor
Windows:
# Create virtual environment
python -m venv venv
# Activate virtual environment
venv\Scripts\activate
macOS/Linux:
# Create virtual environment
python3 -m venv venv
# Activate virtual environment
source venv/bin/activate
Install the dependencies:
pip install -r requirements.txt
- Copy the example environment file:
cp .env.example .env
- Edit `.env` and replace `your_gemini_api_key_here` with your actual Gemini API key.
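A common setup mistake is leaving the placeholder in `.env`. The sketch below shows one way to catch that before launching. It is not taken from this project's code: it assumes `.env` holds plain `KEY=VALUE` lines, and the helper names `read_env_file` and `unconfigured_keys` are hypothetical.

```python
from pathlib import Path

# Placeholder text named in the setup step above
PLACEHOLDER = "your_gemini_api_key_here"


def read_env_file(path):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value
        values[key.strip()] = value.strip().strip("\"'")
    return values


def unconfigured_keys(values):
    """Return the names whose value is empty or still the placeholder."""
    return sorted(k for k, v in values.items() if v in ("", PLACEHOLDER))
```

Calling `unconfigured_keys(read_env_file(".env"))` returns an empty list once every entry has a real value.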
# Ensure your virtual environment is activated
python gemini_webui.py
This will start a local web server at http://localhost:7860. Open this URL in your browser to access the interface.
- Upload Images: Click the gallery area or drag and drop one or multiple images (max 10MB per image)
- Choose Analysis Type: Select from the dropdown menu or pick "none" to use only a custom prompt
- Enter a Custom Prompt (optional): For example, "What objects are in this image?"
- Click "Analyze Images": Wait a few seconds for Gemini to analyze your images
- View Results: Read Gemini's detailed analysis of your images
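The upload step above caps each image at 10MB, and the troubleshooting tips recommend JPG or PNG. As an illustration only, a pre-upload check along those lines might look like the sketch below. The function name `check_upload` is hypothetical, and reading "10MB" as 10 × 1024 × 1024 bytes is an assumption; the application's real limit check may differ.

```python
import os

MAX_BYTES = 10 * 1024 * 1024  # assumption: "10MB" interpreted as 10 MiB
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png"}  # JPG and PNG only


def check_upload(filename, size_bytes):
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    ext = os.path.splitext(filename)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or 'no extension'}")
    if size_bytes > MAX_BYTES:
        problems.append(f"file too large: {size_bytes} bytes")
    return problems
```

For a file on disk, `check_upload(path, os.path.getsize(path))` gives the list of problems to show the user.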
| Type | Description |
|---|---|
| none | Custom prompt only |
| describe | Detailed description of the image |
| analyze | Composition and element analysis |
| style | Artistic style and technique identification |
| objects | List of all visible objects |
| colors | Color palette and mood analysis |
| text | Text extraction and reading |
| quality | Technical quality assessment |
| suggest | Improvement suggestions |
| context | Context and story interpretation |
| similar | Description of similar images |
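To show how an analysis type and a custom prompt could combine into one request, here is a minimal sketch. Only the type names come from the table above. The prompt wording, the `ANALYSIS_PROMPTS` mapping, and the `build_prompt` helper are hypothetical and may not match what the application actually sends.

```python
# Hypothetical prompt wording; only the keys mirror the table of analysis types.
ANALYSIS_PROMPTS = {
    "none": "",
    "describe": "Describe this image in detail.",
    "analyze": "Analyze the composition and elements of this image.",
    "style": "Identify the artistic style and techniques used.",
    "objects": "List all visible objects in this image.",
    "colors": "Describe the color palette and the mood it creates.",
    "text": "Extract and read any text in this image.",
    "quality": "Assess the technical quality of this image.",
    "suggest": "Suggest improvements for this image.",
    "context": "Interpret the context and story of this image.",
    "similar": "Describe images similar to this one.",
}


def build_prompt(analysis_type, custom_prompt=""):
    """Join the preset prompt (if any) with the user's custom prompt."""
    if analysis_type not in ANALYSIS_PROMPTS:
        raise ValueError(f"unknown analysis type: {analysis_type!r}")
    parts = [ANALYSIS_PROMPTS[analysis_type], custom_prompt.strip()]
    prompt = " ".join(p for p in parts if p)
    if not prompt:
        # "none" with no custom prompt leaves nothing to ask
        raise ValueError("select an analysis type or enter a custom prompt")
    return prompt
```

With `"none"` selected, the result is just the custom prompt; with any other type, the custom prompt is appended to the preset.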
- "What emotions does this image convey?"
- "Is this image suitable for a professional presentation?"
- "What photography techniques were used?"
- "Identify any potential copyright issues"
- "What time of day was this photo taken?"
- "Suggest ways to improve the composition"
- "What story does this image tell?"
- "Identify the location or type of environment"
- Double-check your API key in the `.env` file
- Ensure you have the free Gemini API tier activated
- Make sure you're using Python 3.7+
- Verify all packages are installed with `pip list`
- If gradio installation fails, try: `pip install --upgrade pip` then reinstall
- Ensure images are in common formats (JPG, PNG)
- Try with smaller images if you encounter timeouts
- Check your internet connection
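The Python version and package checks above can also be scripted. The sketch below is one possible way to do that with the standard library. `gradio` is named in the tips above, while the module name `google.generativeai` is an assumption about what `requirements.txt` installs; adjust the list to match your copy.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 7)
# "gradio" appears in the tips above; "google.generativeai" is an assumption.
REQUIRED_MODULES = ("gradio", "google.generativeai")


def module_available(name):
    """True if the module can be located without fully importing it."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # Raised when a parent package (for example "google") is missing
        return False


def check_environment(modules=REQUIRED_MODULES):
    """Report whether Python is new enough and which modules are missing."""
    return {
        "python_ok": sys.version_info >= MIN_PYTHON,
        "missing": [m for m in modules if not module_available(m)],
    }
```

Running `print(check_environment())` inside the activated virtual environment shows at a glance what still needs installing.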
- Uses the `gemini-1.5-flash` model for image analysis
- Note: This tool analyzes images but does not generate or edit them
- Free tier has usage limitations (check Google AI Studio for details)
- Analysis quality varies based on image clarity and prompt specificity
- Maximum file size: 10MB per image
- Images are processed through Google's Gemini API
- Review Google's privacy policy for more information
- No images are permanently stored by this application
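For readers curious what a single analysis request might look like, here is a rough sketch. It assumes the `google-generativeai` and Pillow packages are installed and is not copied from `gemini_webui.py`; the `analyze_image` function name is hypothetical. The image is sent to Google's API at the `generate_content` call, which is the point the privacy notes above refer to.

```python
MODEL_NAME = "gemini-1.5-flash"  # model named in the notes above


def analyze_image(image_path, prompt, api_key):
    """Send one image and one prompt to Gemini and return the text reply."""
    if not api_key:
        raise ValueError("a Gemini API key is required")

    # Imported here so this module loads even when the packages are absent.
    # Assumption: the google-generativeai and Pillow packages are in use.
    import google.generativeai as genai
    from PIL import Image

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel(MODEL_NAME)
    with Image.open(image_path) as img:
        # The image is uploaded to Google's API during this call
        response = model.generate_content([prompt, img])
    return response.text
```

A call such as `analyze_image("photo.jpg", "What objects are in this image?", key)` returns the model's reply as a string, subject to the free-tier limits mentioned above.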
MIT License
Crafted with ❤️ for creative image enthusiasts