NavidTavakoli/README.md

Hi, I'm Navid

Data Engineer | Environmental Data Specialist

[email protected]

LinkedIn Portfolio

👨‍💻 About Me

Welcome to my GitHub! I'm a Data Engineer and Data Analyst with strong expertise in Python, MATLAB, and geospatial data analysis. I'm passionate about computer vision, remote sensing, and using spatial data to build environmental and urban solutions.

  • 🔭 Currently freelancing on data engineering projects with a focus on geospatial data analysis, deep learning for satellite imagery, and remote sensing applications

  • 🌱 Continuously learning and improving in Deep Learning, Data Engineering, and Big Data technologies

  • 💡 Skilled in data wrangling, visualization, and creating automated pipelines for large-scale geospatial datasets

🛠️ Skills

  • Programming: Python, R, MATLAB, SQL
  • Data Engineering & Cloud: ETL pipelines, Apache Airflow, Docker, AWS, PostgreSQL, MariaDB, PostGIS
  • Big Data & Streaming: Apache Spark, Apache Flink, Apache Kafka, ClickHouse, Elasticsearch
  • Geospatial Tools: GIS (QGIS), GDAL, Remote Sensing
  • Computer Vision & Machine Learning: OpenCV, TensorFlow, PyTorch, scikit-learn
  • Data Analysis & Visualization: pandas, numpy, Tableau, matplotlib, seaborn
  • Version Control & CI/CD: Git, GitHub Actions
  • Other Tools: SGeMS

🔭 Selected Projects

Real-Time Fraud Detection & Analytics Pipeline 🔗 GitHub Repo

  • Designed and implemented a scalable, containerized data pipeline that processes high-velocity retail transactions. Change Data Capture (CDC) via Debezium streams row-level changes from sharded PostgreSQL and MySQL databases into Apache Kafka. A Spark Structured Streaming job consumes these streams, performs real-time ETL and stateful aggregations to detect sales anomalies, and writes the final metrics to Redis for low-latency access and visualization.
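The stateful-aggregation idea can be illustrated with a minimal sketch in plain Python rather than Spark. The project description does not say which detection rule is used, so the z-score rule, the `threshold` and `min_history` parameters, and the `(store_id, amount)` event shape below are all assumptions made for illustration.

```python
import math
from collections import defaultdict


class RunningStats:
    """Running mean and sample standard deviation (Welford's algorithm)."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0  # sum of squared deviations from the mean

    def update(self, x):
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    @property
    def std(self):
        if self.count < 2:
            return 0.0
        return math.sqrt(self.m2 / (self.count - 1))


def detect_anomalies(events, threshold=3.0, min_history=5):
    """Flag (key, amount) events whose amount deviates strongly from that key's history.

    Hypothetical rule: an event is anomalous when its z-score against the
    per-key running statistics exceeds `threshold`.
    """
    state = defaultdict(RunningStats)  # per-key state, kept across events
    flagged = []
    for key, amount in events:
        stats = state[key]
        # Only score once enough history exists and the history has some spread.
        if stats.count >= min_history and stats.std > 0:
            z = abs(amount - stats.mean) / stats.std
            if z > threshold:
                flagged.append((key, amount))
        stats.update(amount)
    return flagged
```

In a streaming engine the per-key state would live in the engine's state store instead of a Python dictionary; the sketch only shows the shape of the computation.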

TravelPulse: Real-Time Tourism Analytics Platform 🔗 GitHub Repo

  • TravelPulse is a real-time data analytics platform built with Kafka, Spark Structured Streaming, Prometheus, and Grafana that simulates and monitors tourism activity across Italy. It collects live data from flight, hotel booking, and weather sources, processes it in Spark to compute KPIs such as flight delays, booking trends, and tourism season scores, and visualizes the results in Grafana dashboards. By turning streaming data into actionable insights, TravelPulse helps city planners, airlines, and hotels make faster, data-driven decisions.

TOP – Turin Open Platform 🔗 GitHub Repo

  • A near-real-time open data platform for Turin that integrates weather, air quality, traffic, and social sentiment data into actionable KPIs for urban planning and decision-making.

GDPR-Aware Reddit Data ETL Workflow Implemented with Apache Airflow and PostgreSQL 🔗 GitHub Repo

  • An automated Apache Airflow pipeline that extracts data from Reddit, stores it as CSV files, transforms and combines the data, and loads it into PostgreSQL. The workflow is designed to be GDPR-aware in how it handles and archives raw and processed data.
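As a rough illustration of the transform-and-combine step, here is a minimal standard-library sketch. The column names (`id`, `author`, `title`), the salted-hash pseudonymization, and the function names are hypothetical; the description above does not specify how the repository handles personal data, and the Airflow and PostgreSQL parts are left out.

```python
import csv
import hashlib
import io


def pseudonymize(author, salt):
    """Replace a username with a salted SHA-256 digest (truncated for readability)."""
    digest = hashlib.sha256((salt + author).encode("utf-8")).hexdigest()
    return digest[:16]


def combine_csv(csv_texts, salt):
    """Merge several CSV extracts, drop duplicate ids, and strip raw usernames.

    Assumes each CSV has `id`, `author`, and `title` columns (hypothetical schema).
    """
    seen_ids = set()
    rows = []
    for text in csv_texts:
        for row in csv.DictReader(io.StringIO(text)):
            if row["id"] in seen_ids:
                continue  # same post extracted in more than one run
            seen_ids.add(row["id"])
            rows.append(
                {
                    "id": row["id"],
                    "author_hash": pseudonymize(row["author"], salt),
                    "title": row["title"].strip(),
                }
            )
    return rows
```

The returned dictionaries could then be handed to whatever load step writes to the database; only the hashed author value leaves the transform.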

Computer Vision Deep Learning DeepLabV3 ResNet Backbone for Image Segmentation 🔗 GitHub Repo

  • Applied deep learning models such as DeepLabV3 with a ResNet50 backbone to extract building footprints from satellite imagery for urban planning.

Geospatial Data Preparation for Deep Learning 🔗 GitHub Repo

  • A collection of Python scripts for preprocessing and postprocessing geospatial imagery, designed to prepare satellite and aerial data for deep learning models. Includes tools for raster clipping, merging, tiling, CRS adjustment, format conversion, and vectorization of model outputs, bridging Remote Sensing and Computer Vision workflows.
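To give a sense of what the tiling step involves, the sketch below computes pixel windows that cover a raster of a given size. It is an assumption-laden illustration, not the repository's code: the function name `tile_windows`, the `overlap` parameter, and the `(col_off, row_off, width, height)` tuple layout are chosen here for clarity, and no raster library is used.

```python
def _offsets(length, tile_size, stride):
    """Start offsets along one axis, stopping once a tile reaches the edge."""
    offsets = []
    pos = 0
    while True:
        offsets.append(pos)
        if pos + tile_size >= length:
            break
        pos += stride
    return offsets


def tile_windows(width, height, tile_size, overlap=0):
    """Yield (col_off, row_off, w, h) windows covering a width x height raster.

    Edge tiles are clipped to the raster bounds, so they may be smaller
    than tile_size.
    """
    if width <= 0 or height <= 0 or tile_size <= 0:
        raise ValueError("width, height, and tile_size must be positive")
    if not 0 <= overlap < tile_size:
        raise ValueError("overlap must satisfy 0 <= overlap < tile_size")
    stride = tile_size - overlap
    for row_off in _offsets(height, tile_size, stride):
        for col_off in _offsets(width, tile_size, stride):
            w = min(tile_size, width - col_off)
            h = min(tile_size, height - row_off)
            yield (col_off, row_off, w, h)
```

Each window can be passed to a windowed read in the raster library of choice; for model training, the clipped edge tiles are often padded back to `tile_size` instead of being dropped.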

Geostatistical Modeling and Environmental Data Analysis 🔗 GitHub Repo

  • Studied O3 (ozone) density in 5 European countries using EEA data. The analysis, done in RStudio, covered distance calculations, variogram modeling (linear, spherical, Gaussian, exponential), and model comparison via cross-validation to select the best-fitting model. Kriging maps were produced in SGeMS.
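For readers unfamiliar with the four variogram model families named above, here is a small Python sketch of their standard forms. The project itself was done in R, so this is only an illustration; the parameter names (`nugget`, `psill`, `rng`, `slope`) and the practical-range convention (factor 3 in the exponential and Gaussian models) are assumptions, and other software may parameterize these models differently.

```python
import math


def linear(h, nugget, slope):
    """Linear model: semivariance grows without bound as lag h increases."""
    return 0.0 if h == 0 else nugget + slope * h


def spherical(h, nugget, psill, rng):
    """Spherical model: reaches the sill (nugget + psill) exactly at h = rng."""
    if h == 0:
        return 0.0
    if h >= rng:
        return nugget + psill
    r = h / rng
    return nugget + psill * (1.5 * r - 0.5 * r ** 3)


def exponential(h, nugget, psill, rng):
    """Exponential model: approaches the sill asymptotically (about 95% at h = rng)."""
    if h == 0:
        return 0.0
    return nugget + psill * (1.0 - math.exp(-3.0 * h / rng))


def gaussian(h, nugget, psill, rng):
    """Gaussian model: parabolic near the origin, about 95% of the sill at h = rng."""
    if h == 0:
        return 0.0
    return nugget + psill * (1.0 - math.exp(-3.0 * (h / rng) ** 2))
```

Fitting means choosing the parameters so a model tracks the empirical semivariances; the candidates can then be compared by how well kriging with each one predicts held-out observations.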

Movie Library Desktop Application 🔗 GitHub Repo

  • Movie Library is a simple desktop app to organize and track your movies. Easily add, edit, and categorize films you’ve watched or want to watch. Built with Python and includes an easy Windows installer for quick setup.

COVID-19 Data Web Scraping and Analysis 🔗 GitHub Repo

  • Scrapes a global COVID-19 dataset from a public Wikipedia page, then runs a series of data analysis tasks on the collected data.

Popular repositories

  1. TOP-turin-open-data-platform Public

    Turin Open Data Platform (TOP) is a prototype smart city dashboard that collects, processes, and visualizes real-time urban data to provide actionable insights for Turin, Italy.

    5

  2. Geospatial-Data-Preparation-for-Deep-Learning Public

    A collection of Python scripts for preprocessing and postprocessing geospatial imagery, designed to prepare satellite and aerial data for deep learning models. Includes tools for raster clipping, m…

    Python 1

  3. TravelPulse-spark-stream-tourism-analytics Public

    Real-time tourism analytics platform for Italy, built with Apache Kafka, Spark, Prometheus & Grafana. It processes live streams from flights, weather, and bookings to generate KPIs and visualize It…

    Python 1

  4. Movie-Library-Desktop-Application Public

    Movie Library is a simple desktop app to organize and track your movies. Easily add, edit, and categorize films you’ve watched or want to watch. Built with Python and includes an easy Windows insta…

    Python

  5. Geostatistical-Modeling-and-Environmental-Data-Analysis Public

    Conducted thorough O3 density study in 5 European countries using EEA data. Analyzed with R Studio: distance calculations, variogram modeling (linear, spherical, Gaussian, exponential), model compa…

    R

  6. COVID19-WebScraper-in-R Public

    Perform web scraping to extract a global COVID-19 dataset from a public Wikipedia page, followed by comprehensive data analysis tasks on the collected data.

    R