📈 OpenInsider Data Scraper

A robust Python scraper for collecting insider trading data from openinsider.com.

✨ Features

Multi-threaded data collection for high performance
Intelligent caching system to minimize server load
Configurable filters for transaction types and values
Flexible data export in CSV and Parquet formats
Comprehensive logging and error handling
Automatic retry mechanism for failed requests
Progress tracking with a progress bar
Docker support for easy deployment

🚀 Installation

Clone the repository:

git clone git@github.com:sd3v/openinsiderData.git
cd openinsiderData

Create and activate a virtual environment:

python -m venv venv
source venv/bin/activate  # Linux/Mac
# or
venv\Scripts\activate  # Windows

Install dependencies:

pip install -r requirements.txt

⚙️ Configuration

All settings are managed through config.yaml:

📁 Output Settings

output:
  directory: data       # Output directory for scraped data
  filename: insider     # Base filename for output files
  format: csv          # Output format (csv or parquet)
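
As an illustration of how these three keys could combine into an output file, here is a minimal standard-library sketch. The helper names `build_output_path` and `write_csv` and the row fields are hypothetical and not taken from openinsider_scraper.py. Parquet output would need a third-party library and is left out.

```python
import csv
from pathlib import Path


def build_output_path(output_cfg):
    """Combine directory, filename and format into e.g. data/insider.csv."""
    directory = Path(output_cfg["directory"])
    return directory / f"{output_cfg['filename']}.{output_cfg['format']}"


def write_csv(rows, path):
    """Write a list of row dicts to `path`, creating the directory if needed."""
    path.parent.mkdir(parents=True, exist_ok=True)
    if not rows:
        path.write_text("", encoding="utf-8")
        return
    with path.open("w", newline="", encoding="utf-8") as handle:
        # Column order follows the keys of the first row (an assumption).
        writer = csv.DictWriter(handle, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
```

With the values shown above, `build_output_path` would yield `data/insider.csv`.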

🔄 Scraping Settings

scraping:
  start_year: 2024           # Start year
  start_month: 3             # Start month
  max_workers: 10            # Number of parallel downloads
  retry_attempts: 3          # Number of retry attempts
  timeout: 30               # Request timeout in seconds
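
To show how max_workers, retry_attempts and timeout might interact, the sketch below runs downloads in a thread pool and retries each one on failure. It is not the scraper's actual code: `fetch_with_retry`, `fetch_all`, the `backoff` parameter and the injected `fetch(url, timeout)` callable are hypothetical, and retry_attempts is assumed here to mean the total number of tries per URL.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fetch_with_retry(fetch, url, retry_attempts=3, timeout=30, backoff=0.0):
    """Call fetch(url, timeout) up to `retry_attempts` times before giving up."""
    if retry_attempts < 1:
        raise ValueError("retry_attempts must be at least 1")
    last_error = None
    for attempt in range(1, retry_attempts + 1):
        try:
            return fetch(url, timeout)
        except Exception as exc:  # a real scraper would catch narrower errors
            last_error = exc
            if attempt < retry_attempts:
                # Wait a little longer after each failed try (assumed policy).
                time.sleep(backoff * attempt)
    raise last_error


def fetch_all(fetch, urls, max_workers=10, retry_attempts=3, timeout=30):
    """Download all URLs in parallel; results come back in input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(
            lambda url: fetch_with_retry(fetch, url, retry_attempts, timeout),
            urls,
        ))
```

Passing the download function in as `fetch` keeps the sketch independent of any particular HTTP library.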

🔎 Filter Settings

filters:
  min_transaction_value: 50000  # Minimum transaction value in USD
  transaction_types:            # Transaction types to include
    - P - Purchase
    - S - Sale
    - F - Tax
  exclude_companies: []         # Companies to exclude (by ticker)
  min_shares_traded: 100        # Minimum number of shares
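
The filter semantics above can be sketched as a single predicate over one trade record. The record field names (`value`, `shares`, `ticker`, `transaction_type`) and the function name `passes_filters` are assumptions for illustration. Absolute values are used on the further assumption that sales may carry negative amounts.

```python
def passes_filters(trade, filters):
    """Return True if a trade record satisfies every configured filter."""
    if abs(trade["value"]) < filters["min_transaction_value"]:
        return False
    if trade["transaction_type"] not in filters["transaction_types"]:
        return False
    if trade["ticker"] in filters["exclude_companies"]:
        return False
    if abs(trade["shares"]) < filters["min_shares_traded"]:
        return False
    return True


# Mirrors the YAML above as a plain dict.
EXAMPLE_FILTERS = {
    "min_transaction_value": 50000,
    "transaction_types": ["P - Purchase", "S - Sale", "F - Tax"],
    "exclude_companies": [],
    "min_shares_traded": 100,
}
```

A list of records could then be reduced with `[t for t in trades if passes_filters(t, EXAMPLE_FILTERS)]`.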

📝 Logging Settings

logging:
  level: INFO          # Logging level (DEBUG, INFO, WARNING, ERROR)
  file: scraper.log    # Log file name
  rotate_logs: true    # Enable log rotation
  max_log_size: 10     # Max log size in MB
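
One plausible way to turn these settings into a logger uses the standard library's `RotatingFileHandler`. This is a sketch, not the scraper's own setup: the function name `build_logger`, the logger name, the log line format and the `backupCount` of 3 are assumptions.

```python
import logging
from logging.handlers import RotatingFileHandler


def build_logger(log_cfg, name="openinsider"):
    """Create a file logger from the `logging` section of the config."""
    logger = logging.getLogger(name)
    logger.setLevel(getattr(logging, log_cfg["level"]))
    if logger.handlers:
        # Already configured; avoid attaching duplicate handlers.
        return logger
    if log_cfg["rotate_logs"]:
        handler = RotatingFileHandler(
            log_cfg["file"],
            maxBytes=log_cfg["max_log_size"] * 1024 * 1024,  # MB to bytes
            backupCount=3,  # assumption: rollover needs a non-zero count
        )
    else:
        handler = logging.FileHandler(log_cfg["file"])
    handler.setFormatter(
        logging.Formatter("%(asctime)s %(levelname)s %(message)s")
    )
    logger.addHandler(handler)
    return logger
```

With max_log_size set to 10, the handler would roll over once scraper.log reaches roughly 10 MB.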

💾 Cache Settings

cache:
  enabled: true        # Enable caching
  directory: .cache    # Cache directory
  max_age: 24         # Cache max age in hours
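
A file-based cache honouring these three settings could look like the sketch below. The scraper's real cache layout is not documented here, so the hashed file names and the helpers `cache_path`, `read_cache` and `write_cache` are hypothetical. Staleness is judged from the file's modification time.

```python
import hashlib
import time
from pathlib import Path


def cache_path(cache_dir, url):
    """Map a URL to a stable file name inside the cache directory."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return Path(cache_dir) / f"{digest}.html"


def read_cache(cache_cfg, url, now=None):
    """Return cached text, or None if caching is off, missing, or stale."""
    if not cache_cfg["enabled"]:
        return None
    path = cache_path(cache_cfg["directory"], url)
    if not path.exists():
        return None
    now = time.time() if now is None else now
    age_hours = (now - path.stat().st_mtime) / 3600
    if age_hours > cache_cfg["max_age"]:
        return None
    return path.read_text(encoding="utf-8")


def write_cache(cache_cfg, url, text):
    """Store a response body so later runs can skip the request."""
    if not cache_cfg["enabled"]:
        return
    path = cache_path(cache_cfg["directory"], url)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(text, encoding="utf-8")
```

A caller would try `read_cache` first and only hit the network, followed by `write_cache`, when it returns None.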

🔧 Usage

Run the scraper:

python openinsider_scraper.py

🐳 Docker Support

Build the container:

docker build -t openinsider-scraper .

Run the container:

docker run -v "$(pwd)/data:/app/data" openinsider-scraper

💼 Transaction Types

Available transaction types:

P - Purchase
S - Sale
F - Tax
D - Disposition
G - Gift
X - Exercise
M - Options Exercise
C - Conversion
W - Will/Inheritance
H - Holdings
O - Other
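
Since transaction_types in config.yaml are written as labels such as "P - Purchase", a small check against the list above can catch typos before a run. This is an illustrative sketch: `TRANSACTION_TYPES`, `transaction_code` and `unknown_transaction_types` are hypothetical names, and the scraper itself may validate differently or not at all.

```python
# Codes and descriptions copied from the list above.
TRANSACTION_TYPES = {
    "P": "Purchase",
    "S": "Sale",
    "F": "Tax",
    "D": "Disposition",
    "G": "Gift",
    "X": "Exercise",
    "M": "Options Exercise",
    "C": "Conversion",
    "W": "Will/Inheritance",
    "H": "Holdings",
    "O": "Other",
}


def transaction_code(label):
    """Extract the one-letter code from a label such as 'P - Purchase'."""
    return label.split("-", 1)[0].strip().upper()


def unknown_transaction_types(configured):
    """Return the configured labels whose code is not in the list above."""
    return [
        label for label in configured
        if transaction_code(label) not in TRANSACTION_TYPES
    ]
```

An empty result means every configured label starts with a known code.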

👥 Contributing

Fork the repository
Create your feature branch
Commit your changes
Push to the branch
Create a Pull Request

🔍 Troubleshooting

If you encounter rate limiting, lower the max_workers setting in config.yaml to reduce the number of parallel requests
For memory issues with large datasets, try setting the output format to parquet
Check the log file (scraper.log by default) for detailed error messages

⚠️ Disclaimer

This tool is for educational purposes only. Ensure you comply with the website's terms of service and local regulations when scraping data.
