Classify real time desktop and speech

Overview

Team DeepThings (Mez Gebre and I) won the Best Product Category at the Deep Learning Hackathon in San Francisco. In three days we developed a real-time system that identifies objects and speaks what it sees. We built it as a tool for visually impaired people, since it could make navigation easier. The proof of concept runs on a laptop; the final model runs on Android.

This is only the first prototype for Windows.

The goals and steps of this project are the following:

  • Get the webcam feed without bottlenecks.
  • Recognize images using Inception v3.
  • Convert text to speech with the Google TTS API.
  • Build a functional model.
  • Tune the parameters.
  • Display the results visually.
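
One common way to keep a webcam feed from becoming a bottleneck is to read frames on a background thread and let the classifier always pick up the newest frame. The sketch below illustrates that pattern, plus a simple rule for deciding when a label is worth speaking. It is not taken from classify_real_time_v2.py: the names LatestFrameGrabber, SpeechGate, read_frame, min_score and cooldown_s are hypothetical, and the threshold values are placeholders to tune. In a real setup, read_frame would wrap the camera read and the label and score would come from the Inception v3 output.

```python
import threading


class LatestFrameGrabber:
    """Read frames on a background thread and keep only the newest one."""

    def __init__(self, read_frame):
        # read_frame is a hypothetical zero-argument callable that returns
        # a frame, or None when the source has no more frames.
        self._read_frame = read_frame
        self._lock = threading.Lock()
        self._latest = None
        self._stopped = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def start(self):
        self._thread.start()
        return self

    def _run(self):
        # Overwrite the stored frame on every read, so a slow consumer
        # never builds up a backlog of stale frames.
        while not self._stopped.is_set():
            frame = self._read_frame()
            if frame is None:
                break
            with self._lock:
                self._latest = frame

    def latest(self):
        """Return the most recent frame, or None if nothing was read yet."""
        with self._lock:
            return self._latest

    def stop(self):
        self._stopped.set()
        self._thread.join(timeout=1.0)


class SpeechGate:
    """Decide whether a prediction should be spoken (assumed policy)."""

    def __init__(self, min_score=0.5, cooldown_s=2.0):
        self.min_score = min_score      # ignore low-confidence predictions
        self.cooldown_s = cooldown_s    # do not repeat a label too soon
        self._last_label = None
        self._last_time = 0.0

    def should_speak(self, label, score, now):
        if score < self.min_score:
            return False
        repeated = label == self._last_label
        if repeated and now - self._last_time < self.cooldown_s:
            return False
        self._last_label = label
        self._last_time = now
        return True
```

In the main loop, the classifier would call latest() once per iteration, run the model on that frame, and pass the top label and score to should_speak() before sending any text to the speech service. Raising min_score or cooldown_s makes the system quieter; lowering them makes it more talkative.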

Dependencies

This module requires:

Usage

Just run: python classify_real_time_v2.py

The output should look like this:

[Image: screenshot of the program output]

More details

For more information, see my Medium post here

Licence

This project is Copyright © 2016-2017 Lucas Gago. It is free software, and may be redistributed under the terms specified in the MIT Licence.