GeoGuesser Project

Overview

The GeoGuesser project is a machine learning-based geolocation prediction tool. This project leverages deep learning models to predict geographical locations from images, specifically using grid-based classification and regression techniques. The project involves scraping images, preprocessing them, training a grid classifier to predict grid cells, and a location regressor to predict precise latitudes and longitudes.

Features

Image Scraping: Scrape images from Google Street View based on specified geographical bounds.
Image Preprocessing: Resize, normalize, and prepare images for model training.
Grid Classification: Classify images into grid cells based on their geographical locations.
Location Regression: Predict precise latitudes and longitudes within the classified grid cells.
Custom Loss Functions: Use custom loss functions like haversine distance for training the models.
Model Evaluation: Evaluate the models using accuracy, mean distance error, and median distance error.

Installation

To install the GeoGuesser project, follow these steps:

Clone the repository:

git clone https://github.com/munimdev/git
cd geoguesser

Install poetry with the dotenv plugin:

 curl -sSL https://install.python-poetry.org | python3 -
 poetry self add poetry-dotenv-plugin

 poetry env use python3.12
 poetry install

Set up the environment variables:

Create a .env file in the root directory and add the following environment variables:
```
GOOGLE_MAPS_API_KEY=<YOUR_GOOGLE_MAPS_API_KEY>
```
Run the project:
```
 poetry run python -m geoguesser.main
```

Usage

Scraping Images

To scrape images from Google Street View:

from scrapers.maps_scraper import scrape_images

scrape_images(
    grid_size=10,
    images_per_grid=3,
    image_shape=(640, 640),
    bounding_box=None,
    location_name="London",
    keep_current_images=True
)

Preprocessing Images

To preprocess images for model training:

from preprocessor.image_preprocessor import preprocess_images
from pathlib import Path

metadata_file = Path("data/scraped_images/metadata.json")
train_loader, validation_loader, test_loader, train_lat_lng_labels, validation_lat_lng_labels, test_lat_lng_labels = preprocess_images(
    metadata_file,
    output_shape=(224, 224),
    grid_size=10
)

Training

To train the grid classification and location regression models:

from models.cnn_geoguesser import train_geoguesser, create_location_regressor

# Train grid classifier
grid_classifier = train_geoguesser(
    train_loader,
    validation_loader,
    num_classes=100,
    input_shape=(224, 224, 3),
    grid_classifier_epochs=20
)

# Prepare data for location regressor
train_grid_predictions = grid_classifier.predict(train_loader)
validation_grid_predictions = grid_classifier.predict(validation_loader)

# Train location regressor
location_regressor = create_location_regressor(grid_size=100, learning_rate=0.001)
location_regressor.regressor.fit(
    train_grid_predictions,
    train_lat_lng_labels,
    epochs=50,
    validation_data=(validation_grid_predictions, validation_lat_lng_labels)
)

Evaluation

To evaluate the grid classification and location regression models:

from evaluation.evaluator import evaluate_geoguesser

evaluate_geoguesser(
    grid_classifier,
    location_regressor,
    test_loader,
    test_lat_lng_labels,
    grid_size=100
)

Project Structure

The project structure is as follows:

geoguesser/
├── data/
│   ├── scraped_images/
│   ├── preprocessed_images/
│   └── models/
├── geoguesser/
│   ├── evaluation/
│   ├── models/
│   ├── preprocessor/
│   └── scrapers/
├── .env
├── main.py
├── README.md
└── pyproject.toml

Model Training

The GeoGuesser project uses a grid-based classification and regression approach to predict geographical locations from images. The project involves training two models:

Grid Classifier: A convolutional neural network (CNN) that classifies images into grid cells based on their geographical locations. The grid classifier is trained using a custom loss function that combines cross-entropy loss and haversine distance loss.
Location Regressor: A fully connected neural network that predicts precise latitudes and longitudes within the classified grid cells. The location regressor is trained using mean squared error loss.

Evaluation Criteria

The GeoGuesser project evaluates the grid classification and location regression models using the following metrics:

Accuracy: The percentage of correctly classified grid cells.
Mean Distance Error: The mean distance between the predicted and actual latitudes and longitudes.
Median Distance Error: The median distance between the predicted and actual latitudes and longitudes.

Contributing

Contributions to the GeoGuesser project are welcome! To contribute, follow these steps:

Fork the repository.
Create a new branch:
```
git checkout -b feature/my-feature
```
Make your changes and commit them:
```
 git commit -am "Add new feature"
```
Push your branch:
```
git push origin feature/my-feature
```
Submit a pull request.

License

The GeoGuesser project is licensed under the MIT License. See the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.github/workflows		.github/workflows
.vscode		.vscode
data/scraped_images		data/scraped_images
geoguesser		geoguesser
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

GeoGuesser Project

Overview

Table of Contents

Features

Installation

Usage

Scraping Images

Preprocessing Images

Training

Evaluation

Project Structure

Model Training

Evaluation Criteria

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

munimdev/GeoGuesser

Folders and files

Latest commit

History

Repository files navigation

GeoGuesser Project

Overview

Table of Contents

Features

Installation

Usage

Scraping Images

Preprocessing Images

Training

Evaluation

Project Structure

Model Training

Evaluation Criteria

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages