Skip to content
View kowshik24's full-sized avatar
πŸ”
Focusing
πŸ”
Focusing

Highlights

  • Pro

Block or report kowshik24

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
kowshik24/README.md

Hi, I'm Koshik πŸ‘‹

πŸ“ Bangladesh ↔ Remote | 🧠 GenAI Researcher | πŸ› οΈ Full Stack ML Engineer

Python PyTorch TensorFlow FastAPI React TypeScript Docker OpenAI PostgreSQL

Bridging the gap between complex reasoning research and production-grade AI. Building novel multi-modal models and shipping open-source tools.

πŸ”­ Portfolio – My research, projects, and journey.

Current Projects

  • βš™οΈ ConfigSync - A CLI that treats configuration as code, with versioning, diffing, and team-wide enforcement.
  • 🏠 HomeOps - An iOS app that is essentially a "Digital Glovebox" for the home.
  • ⚑ FOMI - Frequency-Optimized Manifold Indexing. High-performance library for image vector similarity search & clustering.
  • πŸ”„ Atomic Sync - Real-time, end-to-end encrypted sync for Obsidian using Supabase.
  • πŸ“„ PaperToCode - Converts research papers into executable code implementations using GenAI.
  • πŸ“Ή Loom Clone - Cloud-native video messaging platform with direct Google Drive storage integration.
  • πŸ•΅οΈ SynthDetect Ultra - Forensic analysis tool distinguishing authentic photos from AI-generated media (GANs/Diffusion).
  • 🧠 Obsidian AI Plugin - Context-aware AI assistant with multi-file context and streaming, built for Obsidian users.
  • πŸ’Ή Binance AI Agent - Multi-agent system providing crypto investment recommendations via OpenAI/Gemini APIs.
  • πŸ‡§πŸ‡© Bengali Semantic Retrieval - Optimizing retrieval for low-resource languages using Matryoshka Representation Learning.
  • πŸ“‰ Stock Forecast BD - LSTM models forecasting stock prices for Bangladeshi and global markets.
  • 🧩 PineconeUtils - Authored two Python libraries to simplify data handling for RAG systems.

Open Source Contributions

  • πŸ› οΈ OpenLLMetry - Fixed serialization bugs for Python dataclasses and TypeErrors in OpenAI embeddings metrics (PR #2800, #1836).
  • 🌲 Pinecone Canopy - Contributed to the Retrieval-Augmented Generation (RAG) framework.

Selected Publications

  • [Mathematical Biosciences] Bayesian Physics-Informed Neural Networks for Parameter Inference in Wound Healing (Under Review, 2025)
  • [TALLIP] Optimizing Semantic Retrieval for Bengali: A Comparative Analysis (Under Review)
  • [ICCIT 2023] An Attention-Based Deep Learning Approach to Knee Injury Classification from MRI Images
  • [ECCE 2025] Advancing Low-Resource NLP: Contextual Question Answering for Bengali Language Using Llama
  • [NCIM 2025] Distinguishing Human-Written and AI-Generated Text: A Comprehensive Study Using XAI

Achievements

  • πŸ† Hackathon Champion at Machine Hack (Global Rank 539/8,861)
  • πŸ₯ˆ Top 7 in Data Science Student Championship (1,000+ participants)
  • πŸ₯‰ 3rd Place Rental Bikes Volume Prediction Hackathon
  • πŸŽ“ Analytics Olympiad 2022 - Ranked 82nd out of 1,029 participants

GitHub Activity

GitHub Contribution Graph

Connect

Gmail LinkedIn Google Scholar Twitter Kaggle


Random Facts
  • 🧠 Research focus: Explainable AI (XAI), Low-Resource NLP, Large Languge Models(LLMs) and ML/DL.
  • πŸ’» Expert in Python, but currently building decentralized governance platforms with Solidity & React.
  • πŸ“š Researcher at Young Learners' Research Lab (YLRL).
  • ⚑ Focused on optimizing LLMs via Matryoshka Representation Learning.

kowshik24

Pinned Loading

  1. traceloop/openllmetry traceloop/openllmetry Public

    Open-source observability for your GenAI or LLM application, based on OpenTelemetry

    Python 6.9k 888

  2. embeddings-benchmark/mteb embeddings-benchmark/mteb Public

    MTEB: Massive Text Embedding Benchmark

    Python 3.1k 564

  3. CaptionCraft CaptionCraft Public

    🌟 CaptionCraft: Your AI-Powered Social Media Companion πŸ“ΈπŸŒˆ Seamless Integration Across Platforms: Effortlessly generate platform-specific captions for Facebook πŸ“˜, Instagram πŸ“Έ, LinkedIn πŸ”—, Twitter 🐦,…

    Python 2 1

  4. PineconePDFExtractor PineconePDFExtractor Public

    PineconePDFExtractor is a Python library for extracting text from PDF files for pinecone.

    Python 2

  5. convolution-visualizer convolution-visualizer Public

    Interactive Convolution Visualizer

    JavaScript 2

  6. PineconeUtils PineconeUtils Public

    PineconeUtils is a Python module designed to handle and process data for embedding and indexing using Pinecone, Cohere, and OpenAI services. This utility module makes it easy to load, chunk, prepar…

    Python 1