DanielSiebert-dev/InstaScraper

Instagram Scraper

A modular, scalable Instagram scraper built with Instaloader.

Setup

  1. Install dependencies:

  2. Update config/config.json with your Instagram credentials, SMTP settings, and proxies (e.g., from free-proxy-list.net).

  3. Run the scraper:

  • For profiles: python main.py --profile username
  • For hashtags: python main.py --hashtag hashtag
  • For scheduling: python main.py --schedule
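The three invocations above could be parsed roughly as follows. This is a minimal sketch, not the repository's actual main.py: the flag names come from the commands listed, while `build_parser`, `describe`, and the choice to make the three modes mutually exclusive are assumptions made for illustration.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the three modes listed in Setup (hypothetical layout)."""
    parser = argparse.ArgumentParser(prog="main.py", description="Instagram scraper")
    # Assumption: exactly one mode is chosen per run.
    mode = parser.add_mutually_exclusive_group(required=True)
    mode.add_argument("--profile", metavar="USERNAME", help="scrape one profile")
    mode.add_argument("--hashtag", metavar="HASHTAG", help="scrape one hashtag")
    mode.add_argument("--schedule", action="store_true", help="run scheduled scraping")
    return parser


def describe(argv: list[str]) -> str:
    """Return a short label for the selected mode, e.g. 'profile:nasa'."""
    args = build_parser().parse_args(argv)
    if args.profile:
        return f"profile:{args.profile}"
    if args.hashtag:
        return f"hashtag:{args.hashtag}"
    return "schedule"
```

With this layout, passing two modes at once (for example --profile and --schedule together) makes argparse exit with a usage error instead of silently picking one.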

Features

  • Multi-threading for faster scraping
  • SQLite storage for efficient metadata management
  • Proxy rotation to avoid IP bans
  • Rotating file logs to manage disk space
  • Error handling with email notifications
  • Scheduling with logged execution times and rate limit mitigation
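Two of the features above, SQLite storage and proxy rotation, can be illustrated with the standard library alone. The sketch below is an assumption about how such pieces might look, not the project's code: the `posts` table, its columns, and the helper names `open_store`, `save_post`, and `proxy_rotation` are all hypothetical. Deduplication here relies on a primary key plus `INSERT OR IGNORE`.

```python
import itertools
import sqlite3
from typing import Iterator, Sequence


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the metadata database with a hypothetical schema."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS posts ("
        "shortcode TEXT PRIMARY KEY, owner TEXT NOT NULL, caption TEXT)"
    )
    return conn


def save_post(conn: sqlite3.Connection, shortcode: str, owner: str, caption: str) -> bool:
    """Insert a post; return False if the shortcode was already stored."""
    cur = conn.execute(
        "INSERT OR IGNORE INTO posts (shortcode, owner, caption) VALUES (?, ?, ?)",
        (shortcode, owner, caption),
    )
    conn.commit()
    # rowcount is 1 for a new row and 0 when the insert was ignored.
    return cur.rowcount == 1


def proxy_rotation(proxies: Sequence[str]) -> Iterator[str]:
    """Yield proxies round-robin, forever."""
    if not proxies:
        raise ValueError("at least one proxy is required")
    return itertools.cycle(proxies)
```

Calling `save_post` twice with the same shortcode stores one row and reports the second call as a duplicate, so a re-run over the same profile does not inflate the table.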

Troubleshooting

  • 401 Unauthorized: Wait 15–30 minutes, use proxies, or increase the delays in config/config.json.
  • IDE Errors: Invalidate caches in PyCharm (File > Invalidate Caches / Restart).
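  • Dependencies: On macOS, pip may refuse to install into an externally managed Python (PEP 668). Prefer a virtual environment (python3 -m venv .venv); otherwise pass --break-system-packages to pip.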
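The "wait and adjust delays" advice for 401 responses amounts to retrying with growing pauses. The following is a minimal sketch of exponential backoff under stated assumptions: `retry_with_backoff` and its parameters are hypothetical names, the delays are illustrative rather than the values in config.json, and the exception type to retry on is left to the caller.

```python
import time
from typing import Callable, Type, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    task: Callable[[], T],
    retry_on: Type[BaseException] = Exception,
    attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run task, doubling the pause after each failure; re-raise on the last attempt."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return task()
        except retry_on:
            if attempt == attempts - 1:
                raise
            # Pauses grow as base_delay, 2 * base_delay, 4 * base_delay, ...
            sleep(base_delay * 2 ** attempt)
    raise AssertionError("unreachable")
```

The `sleep` parameter is injectable so the pause schedule can be checked without actually waiting; in real use the default `time.sleep` applies.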

Compliance

Respect Instagram’s terms of service, including rate limits and data usage policies.

About

A modular, production-ready Instagram scraping system with Oxylabs proxy integration, automated scheduling, and real-time data usage tracking. The scraper collects images, videos, and metadata via Instaloader, with deduplication, SQLite persistence, and email notifications for reliability. For educational purposes only.
