mjoshua97241/README.md



👨‍💻 About Me

I'm a Data Scientist and Machine Learning Engineer passionate about merging AI, analytics, and user interfaces to create impactful end-to-end products.
From building scalable fraud detection systems to NLP-powered insight engines, I love solving problems where data meets design.

  • 🔭 Current: Lead Data Scientist @ AR Data Technologies, building IoT & BIM ML pipelines.
  • 🧠 Previously: Eskwelabs Fellow, building socio-economic prediction models (92.56% accuracy).
  • 🌱 Learning: Generative AI & agentic systems for automated insight extraction.
  • 💬 Ask me about: ML pipelines, NLP, Marimo dashboards.
  • ⚡ Fun fact: I used to design buildings as an architect; now I design data systems!

🧠 Tech Stack

Core Skills:

  • ML & AI: XGBoost, TensorFlow, PyTorch, Deep Learning, NLP, LLM Concepts
  • Data: Pandas, NumPy, SQL, Power BI, Seaborn, Matplotlib
  • Web: React, Tailwind CSS, Flask, Streamlit, Cytoscape.js
  • Other: Docker (learning), AWS MLOps (in progress)

🚀 Featured Projects

Cloud-native financial data platform designed to ingest hourly stock market data, compute technical trading signals, monitor portfolio-level risk metrics, and track historical signal performance.
The platform simulates backend infrastructure that could power financial analytics products used by active traders, quantitative analysts, and small investment firms.
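
As a rough illustration of what "technical trading signals" and "risk metrics" can mean in practice, here is a minimal sketch in plain Python. It is not the platform's code: the moving-average crossover, the max-drawdown metric, the window sizes, and the sample prices are all assumptions chosen for the example.

```python
from statistics import mean


def sma(prices, window):
    """Simple moving average; None until `window` prices are available."""
    return [
        mean(prices[i - window + 1 : i + 1]) if i + 1 >= window else None
        for i in range(len(prices))
    ]


def crossover_signals(prices, fast=3, slow=5):
    """Return 'buy'/'sell'/'hold' per bar from a fast/slow SMA crossover."""
    fast_ma, slow_ma = sma(prices, fast), sma(prices, slow)
    signals = []
    for i in range(len(prices)):
        # Both averages must exist on this bar and the previous one.
        if i == 0 or slow_ma[i] is None or slow_ma[i - 1] is None:
            signals.append("hold")
            continue
        was_above = fast_ma[i - 1] > slow_ma[i - 1]
        is_above = fast_ma[i] > slow_ma[i]
        if is_above and not was_above:
            signals.append("buy")
        elif was_above and not is_above:
            signals.append("sell")
        else:
            signals.append("hold")
    return signals


def max_drawdown(values):
    """Largest peak-to-trough decline, as a fraction of the peak."""
    peak, worst = values[0], 0.0
    for v in values:
        peak = max(peak, v)
        worst = max(worst, (peak - v) / peak)
    return worst


# Hypothetical hourly closing prices, invented for illustration.
hourly_closes = [10, 9, 8, 7, 6, 7, 9, 12, 11, 8, 6, 5]
signals = crossover_signals(hourly_closes)
drawdown = max_drawdown(hourly_closes)
```

Tracking historical signal performance would then amount to storing each emitted signal alongside the subsequent price path.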

AI-powered building code compliance for AEC. This project helps architects and designers check early space planning (rooms, doors, corridors) against building codes and internal standards, without relying on full BIM. It combines Vision LLMs (blueprint extraction), RAG (building code Q&A with citations), and deterministic compliance checking in a single proof-of-concept aimed at future CAD Add-In integration (AutoCAD/Revit).
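
The deterministic compliance-checking step could look roughly like the sketch below. This is an assumption-laden illustration, not the project's implementation: the `Element` type, the `check_compliance` function, and the minimum widths are hypothetical, and the thresholds are placeholders rather than values from any real building code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    """A plan element as it might arrive from blueprint extraction."""
    name: str
    kind: str          # e.g. "door", "corridor", "room"
    width_mm: float


# Hypothetical minimum clear widths in millimetres (placeholders only).
MIN_WIDTH_MM = {"door": 900, "corridor": 1200}


def check_compliance(elements, rules=MIN_WIDTH_MM):
    """Return one finding per element whose kind has a width rule."""
    findings = []
    for el in elements:
        required = rules.get(el.kind)
        if required is None:
            continue  # no rule defined for this kind of element
        findings.append({
            "element": el.name,
            "required_mm": required,
            "actual_mm": el.width_mm,
            "passed": el.width_mm >= required,
        })
    return findings


# Hypothetical extracted plan, invented for illustration.
plan = [
    Element("D1", "door", 850),
    Element("C1", "corridor", 1500),
    Element("R1", "room", 4000),
]
findings = check_compliance(plan)
failures = [f["element"] for f in findings if not f["passed"]]
```

Keeping this step rule-based means the same plan always yields the same findings, while the LLM components handle extraction and code Q&A.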

🎥 Video Walkthrough

🧑‍🏫 Canva Slides

🕵️‍♂️ Fraud Detection & Network Mapping (94% Precision)

End-to-end scalable ML pipeline that cuts manual fraud review by 4×.
Built with XGBoost, Cytoscape.js, and React, featuring visualization of fraud clusters with 20+ nodes.
🔗 Live Demo
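
To give a feel for the network-mapping side, the sketch below groups linked accounts into clusters and emits them in the `{"data": {...}}` element shape that Cytoscape.js accepts. It is an illustration under assumptions, not the project's code: the union-find grouping, the function names, and the account links are all hypothetical.

```python
from collections import defaultdict


def find_clusters(edges):
    """Group nodes into connected components using union-find."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in edges:
        parent[find(a)] = find(b)

    clusters = defaultdict(set)
    for node in parent:
        clusters[find(node)].add(node)
    # Largest clusters first, since those are the ones worth reviewing.
    return sorted(clusters.values(), key=len, reverse=True)


def to_cytoscape_elements(edges):
    """Build a flat list of node and edge elements for Cytoscape.js."""
    nodes = sorted({n for edge in edges for n in edge})
    node_elements = [{"data": {"id": n}} for n in nodes]
    edge_elements = [
        {"data": {"id": f"{a}-{b}", "source": a, "target": b}}
        for a, b in edges
    ]
    return node_elements + edge_elements


# Hypothetical links between accounts (e.g. shared device or address).
links = [("acct_1", "acct_2"), ("acct_2", "acct_3"), ("acct_4", "acct_5")]
clusters = find_clusters(links)
elements = to_cytoscape_elements(links)
```

In a setup like this, a classifier such as XGBoost would score individual transactions, and the cluster view would show reviewers how flagged accounts relate to each other.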

NLP pipeline analyzing AI perception in Philippine media (2020–2025).
Used spaCy, Selenium, and BeautifulSoup to extract sentiment and trends.

Deep Learning model (TensorFlow) predicting user repurchase behavior with 80% accuracy.

Achieved 97.5% accuracy using a neural network with two hidden layers, an exercise in core deep learning fundamentals.


📈 GitHub Analytics


🧩 Current Focus

  • 🌐 Building AI-driven dashboards with Streamlit and React
  • 🧮 Experimenting with Agentic AI for automated analytics pipelines
  • 📊 Developing visual storytelling with Power BI + Python

🏗️ Experience Snapshot

AR Data Technologies, Lead Data Scientist (2025–Present)
→ Designed a data pipeline for IoT & geospatial ML systems.
→ Architected an early-stage MLOps dashboard and a rule-based prototype.

Eskwelabs, Data Science Fellow (2025)
→ Built a Gradient Boosting model (92.56% accuracy) and a skill-network analysis using centrality metrics.

VAA Philippines, Amazon PPC Specialist (2023–2025)
→ Automated 40+ performance reports, boosting ad ROI by 20–30%.


🔗 Connect With Me

Pinned

  1. finstream-market-risk-data-platform (Python)

  2. bank-fraud-detection-project (Jupyter Notebook)

  3. space-code-copilot (Python)

  4. nlp-ai-perception-ph

  5. audiobook-customer-repurchase-prediction