Aura Lens is a high-performance image recognition application designed to bridge the gap between complex Deep Learning models and seamless user experiences. Built with a focus on Edge AI, this project clearly demonstrates the ability to deploy optimized Neural Networks directly on-device for real-time inference and privacy-first classification.
-
Production-Grade Edge AI: Demonstrates hands-on capability in model optimization, quantization, and cross-platform deployment, moving AI past the Jupyter Notebook phase.
-
Privacy-First Engineering: Features a zero-network-dependent backend architecture designed specifically for compliant, local data-processing workloads.
-
Clean Code Architecture: Built adhering to solid software engineering paradigms, utilizing decoupled asynchronous pipelines to ensure smooth UI performance under heavy inference loads.
-
On-Device Inference (Edge AI): Zero latency, fully operational offline, and optimized for minimal memory and CPU/GPU footprints.
-
Privacy-First Classification: Image parsing and feature extraction happen entirely locally. No user data ever leaves the device.
-
Hardware Acceleration: Fully integrated with TensorRT, CoreML, and NNAPI backends to tap into dedicated Neural Processing Units (NPUs).
-
Quantized Architecture: Employs INT8 and FP16 quantized models to slash model size by up to 75% while keeping >98% accuracy retention.
-
Real-Time Pipeline: Streamlined frame buffer processing capable of sub-25ms inference loops (40+ FPS).
- Languages: Python (Core Engine), JavaScript, HTML, CSS (Web Interface)
- Backend Framework: Flask (REST APIs for local application routing)
- Machine Learning & Engineering: Scikit-Learn, NumPy, Pandas, ONNX Runtime Mobile, TensorFlow Lite
- Core Models Supported: MobileNetV3-Large, EfficientNet-Lite, ResNet50-Quantized
[ Camera / Image Stream ]
│
▼
┌───────────────────────────┐
│ Pre-Processing Pipeline │ ──► Resizing, Normalization, Channel Swapping
└───────────────────────────┘
│
▼
┌───────────────────────────┐
│ Edge Inference Engine │ ──► CoreML / TensorRT / ONNX Runtime
└───────────────────────────┘
│
▼
┌───────────────────────────┐
│ Post-Processing & NMS │ ──► Softmax, Non-Maximum Suppression
└───────────────────────────┘
│
▼
[ Web UI / Local Log ]
├── app/
│ ├── static/ # Web assets (CSS, JS, UI components)
│ ├── templates/ # HTML view templates for the web interface
│ ├── inference.py # Core model loading and evaluation runtime
│ └── main.py # Flask application entry point and local API routing
│
├── models/
│ ├── config.json # Class labels, normalization mappings, and metadata
│ └── efficientnet_lite_uint8.onnx # Edge-optimized quantized weights file
│
├── requirements.txt # Locked production dependencies
└── README.md # Project documentation