Video Transcript Analysis System with Gemini API Integration #590

Konikz · 2025-03-28T12:08:28Z

This PR if my contribution for GSoC which adds a comprehensive Video Transcript Analysis System that leverages Google's Gemini API to process video transcripts and answer complex questions. The system is designed with production-readiness in mind, featuring robust error handling, caching, and comprehensive test coverage.

Key Features

Transcript Processing: Intelligent chunking with semantic awareness and timestamp parsing
Question Optimization: Analyzes question dependencies and optimizes processing order
Gemini API Integration: Robust integration with retry logic and rate limiting
Hybrid Caching: Multi-layer caching system with semantic search capabilities
FastAPI Endpoints: RESTful API for transcript analysis and cache management
Production Ready: Includes monitoring, logging, and comprehensive error handling
Docker Support: Multi-stage build for optimized container images
CI/CD Pipeline: GitHub Actions workflow for testing and deployment

Usage Example

from video_transcript_analysis.core.orchestrator import TranscriptAnalyzer

# Initialize the analyzer
analyzer = TranscriptAnalyzer()

# Analyze a transcript
result = await analyzer.analyze_transcript(
    video_id="video123",
    transcript="[00:00] Welcome to the video...",
    questions=[
        "What is the main topic?",
        "Who are the key speakers?",
        "What are the three main points discussed?"
    ]
)

# Access the results
for question, answer in zip(result.questions, result.answers):
    print(f"Q: {question}")
    print(f"A: {answer}\n")

API Endpoints

POST /analyze: Process a video transcript and answer questions
DELETE /cache/{video_id}: Clear cache for a specific video
GET /health: Health check endpoint

Testing

The PR includes comprehensive test coverage for all major components:

Transcript processing and chunking
Question dependency analysis and optimization
Caching system
API integration

To run tests:

pytest tests/ --cov=src

…n, question optimization, caching, and CI/CD

google-cla · 2025-03-28T12:08:32Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

…eadme docs: Add comprehensive README.md for Video Q&A System (Resolves #1)

shari759 · 2025-04-12T09:45:58Z

Konikz:feature/video-transcript-analysis

feat: Add Video Transcript Analysis System with Gemini API integratio…

1e4b0a2

…n, question optimization, caching, and CI/CD

Konikz added 6 commits March 28, 2025 17:59

docs: Add comprehensive README.md for Video Q&A System (Resolves #1)

a77dc29

Merge pull request #2 from Konikz/feature/video-transcript-analysis-r…

f8be9ba

…eadme docs: Add comprehensive README.md for Video Q&A System (Resolves #1)

Update README.md

3ec0c3b

Merge branch 'main' into feature/video-transcript-analysis

823ff88

Merge branch 'googleapis:main' into feature/video-transcript-analysis

9ff1bfb

Merge branch 'googleapis:main' into feature/video-transcript-analysis

9df44b6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Video Transcript Analysis System with Gemini API Integration #590

Video Transcript Analysis System with Gemini API Integration #590

Konikz commented Mar 28, 2025

google-cla bot commented Mar 28, 2025

shari759 commented Apr 12, 2025

Video Transcript Analysis System with Gemini API Integration #590

Are you sure you want to change the base?

Video Transcript Analysis System with Gemini API Integration #590

Conversation

Konikz commented Mar 28, 2025

Key Features

Usage Example

API Endpoints

Testing

google-cla bot commented Mar 28, 2025

shari759 commented Apr 12, 2025