AI-Driven Android App Crawler

Overview

An automated Android app testing tool powered by pluggable AI model adapters (Gemini, Ollama, OpenRouter). Intelligently explores applications by analyzing visual layouts and structural information to discover new states and interactions.

Available Interfaces:

CLI Controller - Command-line interface for automation and scripting. See docs/cli-user-guide.md.
UI Controller - Graphical user interface for interactive use. See docs/gui-user-guide.md.

Features

AI-Powered Exploration - Multiple provider support (Gemini, Ollama, OpenRouter)
Intelligent State Management - Visual and structural hashing for unique screen identification
Loop Detection - Prevents repetitive patterns
Traffic Capture - Optional network monitoring via PCAPdroid during crawl (saves .pcap files)
Video Recording - Optional screen recording of entire crawl session (saves .mp4 files)
MobSF Integration - Optional automatic static security analysis after crawl completion
Focus Areas - Customizable privacy-focused testing targets
Comprehensive Reporting - PDF reports with crawl analysis

AI Model Support

Supported Providers

Google Gemini - Cloud-based multimodal model with excellent image understanding
Ollama - Local models (supports vision-capable variants like llama3.2-vision)
OpenRouter - Cloud router to top models via OpenAI-compatible API

Configuration Example

{
  "AI_PROVIDER": "ollama",
  "DEFAULT_MODEL_TYPE": "llama3.2-vision",
  "OLLAMA_BASE_URL": "http://localhost:11434"
}

Vision-capable Ollama models: llama3.2-vision, llava, bakllava

Appium Integration

The system uses Appium-Python-Client for direct mobile device interaction. No external server is required beyond the standard Appium server.

Configuration

{
  "APPIUM_SERVER_URL": "http://127.0.0.1:4723"
}

Note: Ensure Appium server is running on the configured port (default: 4723).

Architecture

Core Components

run_cli.py - CLI entry point
run_ui.py - GUI entry point
cli/main.py - CLI command orchestration
core/crawler.py - Main crawling logic and state transitions
domain/agent_assistant.py - AI-driven action orchestration
domain/model_adapters.py - Unified AI provider integration
domain/agent_tools.py - Device interaction tools
infrastructure/appium_helper.py - Core Appium session management
infrastructure/device_detection.py - Device/emulator detection
infrastructure/capability_builder.py - W3C capability building
domain/screen_state_manager.py - State tracking and transitions

Agent-Based Workflow

Observe - Capture screenshot and XML representation
Reason - Analyze screen elements and available actions
Plan - Determine optimal next action
Act - Execute action via agent tools
Observe Again - Receive feedback and adapt

CLI Usage

Detailed CLI usage, command reference and examples have been moved to the dedicated CLI user guide:

docs/cli-user-guide.md

For GUI usage and interactive workflows see the GUI user guide:

docs/gui-user-guide.md

Configuration Management

Simplified two-layer configuration system:

Secrets (API keys): Environment variables only (never stored in SQLite)
Everything else: SQLite only (int, str, bool, float values)

On first launch, simple type defaults are automatically populated into SQLite from module constants. Complex types (dict, list) are excluded and remain in code only.

Environment Variables (.env):

GEMINI_API_KEY=your_gemini_key
OPENROUTER_API_KEY=your_openrouter_key
OLLAMA_BASE_URL=http://localhost:11434
MOBSF_API_KEY=your_mobsf_key
PCAPDROID_API_KEY=your_pcapdroid_key

Note: All non-secret configuration values are stored in SQLite (config.db). Secrets are read from environment variables only and never persisted to disk.

System Variables:

ANDROID_HOME=C:/Users/youruser/AppData/Local/Android/Sdk

Output Structure

Session-based output per device/app run:

output_data/<device_id>_<app_package>_<timestamp>/
├── screenshots/
├── annotated_screenshots/
├── database/<app_package>_crawl_data.db
├── traffic_captures/        # PCAP files (if traffic capture enabled)
├── video/                   # Video recordings (if video recording enabled)
├── logs/
├── reports/
├── mobsf_scan_results/      # MobSF analysis results (if MobSF analysis enabled)
└── extracted_apk/

App info caches (stable, reusable):

output_data/app_info/<device_id>/
├── device_<device_id>_all_apps.json
└── device_<device_id>_filtered_health_apps.json

Prerequisites

Required

Python 3.8+
Node.js & npm (for Appium)
Android SDK with ADB

Optional

MobSF (Docker or native)
PCAPdroid (for traffic capture)
Ollama (for local AI models)

Device Setup

Enable Developer options (tap Build number 7 times)
Enable USB debugging
Connect via USB and authorize ADB

MobSF Integration

MobSF (Mobile Security Framework) must be installed and running before enabling MobSF analysis. For installation instructions, see the official MobSF documentation.

Docker Setup (Recommended)

# Basic (ephemeral)
docker run -d --name mobsf -p 8000:8000 opensecurity/mobile-security-framework-mobsf:latest

# With persistent storage (Windows)
mkdir C:\mobsf\uploads, C:\mobsf\signatures
docker run -d --name mobsf -p 8000:8000 `
  -v "C:\mobsf\uploads:/home/mobsf/Mobile-Security-Framework-MobSF/uploads" `
  -v "C:\mobsf\signatures:/home/mobsf/Mobile-Security-Framework-MobSF/signatures" `
  opensecurity/mobile-security-framework-mobsf:latest

Note: For native installation or other setup methods, refer to the official MobSF installation guide.

Configuration

{
  "ENABLE_MOBSF_ANALYSIS": true,
  "MOBSF_API_URL": "http://localhost:8000/api/v1",
  "MOBSF_API_KEY": "YOUR_API_KEY_HERE"
}

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
.github		.github
.idea		.idea
.vscode		.vscode
cli		cli
config		config
core		core
docs		docs
domain		domain
infrastructure		infrastructure
interfaces		interfaces
tools		tools
ui		ui
utils		utils
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
crawler.db		crawler.db
crawler.db-journal		crawler.db-journal
crawler_logo.ico		crawler_logo.ico
crawler_logo.png		crawler_logo.png
create_desktop_shortcut.bat		create_desktop_shortcut.bat
create_desktop_shortcut.ps1		create_desktop_shortcut.ps1
done-soundeffect.mp3		done-soundeffect.mp3
error-soundeffect.mp3		error-soundeffect.mp3
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
run_cli.py		run_cli.py
run_ui.py		run_ui.py
run_ui.vbs		run_ui.vbs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI-Driven Android App Crawler

Overview

Features

AI Model Support

Supported Providers

Configuration Example

Appium Integration

Configuration

Architecture

Core Components

Agent-Based Workflow

CLI Usage

Configuration Management

Output Structure

Prerequisites

Required

Optional

Device Setup

MobSF Integration

Docker Setup (Recommended)

Configuration

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

ganainy/ai-mobile-ui-crawler

Folders and files

Latest commit

History

Repository files navigation

AI-Driven Android App Crawler

Overview

Features

AI Model Support

Supported Providers

Configuration Example

Appium Integration

Configuration

Architecture

Core Components

Agent-Based Workflow

CLI Usage

Configuration Management

Output Structure

Prerequisites

Required

Optional

Device Setup

MobSF Integration

Docker Setup (Recommended)

Configuration

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages