Skip to content

An AI cursor for desktop using Gemini 2.0 Flash (Experimental)

License

MIT, Apache-2.0 licenses found

Licenses found

MIT
LICENSE
Apache-2.0
LICENSE_GOOGLE
Notifications You must be signed in to change notification settings

13point5/gemini-cursor

Repository files navigation

Gemini Cursor ✨

A second AI cursor 🖱️ for your desktop that can see your screen, hear you speak, and talk to you.

Demo

Powered by Google's Gemini 2.0 Flash (Experimental) model, the Multimodal Live API, Pointing, and Function calling capabilities.

Created by @13point5.

Features

  • 🖱️ Second AI cursor on your desktop
  • 🚀 Multimodality: The model can see 📸, hear 🎤, and speak 🔊
  • ⚡️ Real-time with low latency

Use Cases

  • 📚 Understanding complex diagrams in Research papers, Architecture diagrams, etc
  • 🌐 Navigating complex websites to perform a task like adding a payment method on Amazon
  • 📝 Real time AI tutor with whiteboards

Tech Stack

  • Frontend: Electron, React, TypeScript, Vite
  • AI: Google Gemini API

Acknowledgements

Prerequisites

Installation

  1. Clone the repository
git clone https://github.com/13point5/gemini-cursor.git
cd gemini-cursor
  1. Install dependencies
npm install
  1. Run the app
npm run start
  1. Enter the Gemini API key in the app

  2. Click the Play button and the Share Screen button

  3. Minimize the app and enjoy!

About

An AI cursor for desktop using Gemini 2.0 Flash (Experimental)

Resources

License

MIT, Apache-2.0 licenses found

Licenses found

MIT
LICENSE
Apache-2.0
LICENSE_GOOGLE

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published