Skip to content

doeunyy/jeans-ai

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

👖 Jeans: AI-Based Photo Editing and Sharing Service for Seniors

This repository contains the AI component of Jeans, an AI-powered photo editing and sharing service designed to increase digital accessibility and social participation among seniors. The project was awarded the Excellence Prize at the SK Telecom FLY AI Challenger Program, one of South Korea’s leading national AI innovation competitions. I served as the sole AI engineer, responsible for designing and implementing the entire machine learning pipeline.

Table of Contents

  1. Overview
  2. Key Features
  3. System Architecture
  4. Implementation Details
  5. Model Training & Deployment
  6. Impact
  7. Team

Overview

열정4팀_최종발표-01

Jeans is a senior-friendly photo editing and sharing application powered by multimodal AI. The service enables seniors to edit photos using voice commands, automatically detect and enhance faces, and share images easily with family and friends.

The goal of the project was to help older adults overcome technological barriers and participate more actively in digital communication.

I led all AI development, including data preparation, model fine-tuning, API design, and deployment using AWS.

Key Features

열정4팀_최종발표-08
  • Voice-driven photo editing using Whisper fine-tuned on Korean regional dialect speech
  • Face detection & enhancement using YOLO and FaceNet
  • Automated photo captioning & tagging for accessibility
  • FastAPI backend with scalable REST APIs
  • Real-time inference server deployed on AWS EC2
  • CI/CD automation using GitHub Actions
  • User-centered design based on interviews and usability testing with seniors

System Architecture

열정4팀_최종발표-28
User Voice Input
        ↓
Whisper (Fine-tuned) — Korean Dialect ASR
        ↓
Command Parsing & Image Processing Logic
        ↓
YOLO / FaceNet — Face Detection, Cropping, Enhancement
        ↓
FastAPI Inference Server
        ↓
AWS EC2 Deployment + CI/CD
        ↓
Mobile Frontend (Photo Editing / Sharing)

This end-to-end flow enables seniors to issue simple voice commands and receive enhanced images instantly.

Implementation Details

Dataset Construction

  • Collected and cleaned Korean dialect audio dataset
  • Labeled command-specific phrases to enable action-triggering
  • Applied audio preprocessing (normalization, trimming, silence removal)

Models

  • Whisper fine-tuning

    • Trained on senior & dialectal Korean speech
    • Improved recognition accuracy for age-affected pronunciation
  • YOLO + FaceNet integration

    • YOLO for coarse face detection
    • FaceNet for identity-level refinement or alignment
  • Pipeline logic

    • Map recognized text → edit operations (e.g., crop, brighten, sharpen)

Backend (FastAPI)

  • Designed REST endpoints for image upload, transformation, and retrieval
  • Handled model loading, GPU utilization, and async batching
  • Built a modular router structure for future expansion

Model Training & Deployment

  • Training

    • PyTorch-based training pipelines
    • Custom DataLoader for speech–command pairs
    • Evaluation pipeline for WER and command classification accuracy
  • Deployment

    • AWS EC2 GPU instance
    • FastAPI inference server
    • Nginx + SSL for production deployment
  • CI/CD

    • GitHub Actions for automated testing, formatting, and deployment
    • Containerized build process for reproducibility

Impact

  • Conducted interviews and usability sessions with seniors to validate real-world needs
  • Improved accessibility by tailoring voice commands to speech patterns common among older adults
  • Provided a fully functional prototype with measurable improvements in user satisfaction
  • Strengthened expertise in AI deployment, REST API design, multimodal integration, and end-to-end MLOps

Team

  • Doeun Kim: PM, AI Lead - Sole AI Engineer (model development, dataset construction, API design, deployment)
  • Jihye Yoo: Frontend Lead
  • Junyong Lee: Frontend Engineer
  • Soyeon Cha: Backend Lead
  • Boseok Park: Backend Engineer

About

Jeans: AI-Based Photo Editing and Sharing Service for Seniors

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.9%
  • Python 0.1%