pari1jay/README.md

Hi, I'm Pari! 👋

Data Engineer | Systems Analyst | Cloud Enthusiast

🔗 Portfolio | 🔗 LinkedIn


About Me

I'm a Data Analyst with experience in data engineering, system integration, and cloud-based solutions. I specialize in designing and implementing scalable data pipelines, optimizing ETL processes, and leveraging cloud platforms to deliver data-driven insights. I have a Master of Science degree in Applied Data Science from Indiana University, and I am passionate about data analytics, machine learning, and cloud computing. I'm actively seeking opportunities to contribute to impactful projects as a Data Analyst or Data Engineer.

When I'm not working with data, I love learning new technologies, spending time with animals, and sneaking in a chapter or two of a novel during breaks.


Technical Skills 💻

Data Analysis & Engineering

  • Languages: Python, R, SQL, Java, C
  • Big Data Tools: Apache Spark, Hadoop, Hive, Kafka, Airflow
  • ETL Tools: Talend, SSIS, Apache NiFi
  • Databases: PostgreSQL, MySQL, MongoDB, Snowflake, MS Access
  • Data Visualization: Tableau, Power BI, Plotly, Excel

Cloud Platforms

  • AWS: S3, Redshift, Lambda, EMR
  • GCP: BigQuery, Dataflow
  • Microsoft Azure: Data Factory, Synapse Analytics

Project Management & Collaboration

  • Tools: Jira, Confluence, Lucidchart, MS Project
  • Methodologies: Agile, Scrum, Waterfall

Certifications

  • Career Essentials in Data Analysis by Microsoft
  • Microsoft Azure Data Fundamentals
  • Data Analytics with Microsoft Fabric
  • HackerRank SQL (Intermediate)
  • Atlassian Agile Project Management Professional

Projects 🚀

1. Consumer Complaints Prediction

  • Tools: Python, NLP, Data Visualization
  • Description: Applied NLP techniques to analyze customer feedback and classify sentiment as positive, negative, or neutral. Achieved 79% accuracy using machine learning models (Naive Bayes, Decision Tree, KNN).

2. Real Estate Sales Prediction Web Application

  • Tools: Python, Machine Learning, Streamlit
  • Description: Developed a web app to predict real estate sales using Linear Regression, Random Forest, and Gradient Boosting. Enabled city-specific and overall sales predictions with user input.

3. ETL and Data Pipelines with Shell, Airflow and Kafka

  • Tools: Shell, Airflow and Kafka
  • Description: Designed and implemented ETL pipelines to integrate data from multiple sources into a centralized data warehouse, improving data quality by 25%.
  • Coursera: Link

Experience 💼

Data Engineer | Netcube Technologies | Bangalore, India | Jan 2019 – Feb 2022

  • Led data analysis and visualization projects for small-scale businesses, managing 100+ product datasets to deliver actionable insights.
  • Designed and implemented ETL processes using Talend and SSIS, reducing integration time by 30%.
  • Optimized SQL queries and database performance, reducing query execution time by 25%.
  • Automated testing and deployment processes using CI/CD pipelines with Selenium and Python.

Associate Software Engineer | Tech Mahindra | Bangalore, India | Aug 2016 – Oct 2018

  • Transitioned 5+ releases from manual to automated processes, reducing manual testing time by 20 hours per week.
  • Led end-to-end functionality and automation testing for a sub-product of the British Technologies project.
  • Won the team's Bravo Award for attention to detail and resolving critical issues.

Education 🎓

  • Master of Science in Applied Data Science | Indiana University | Jan 2023 – May 2024

    • Coursework: Data Analytics using Python and R, Data Visualization, Deep Learning, Cloud Computing, DBMS, Statistics
    • Dean's Scholarship Recipient
  • Bachelor of Engineering in Aeronautical Engineering | Mangalore Institute of Technology and Engineering, VTU | Aug 2012 – Aug 2016


Let's Connect!

I'm always open to collaborating on interesting projects or discussing new opportunities. Feel free to reach out!


Pinned

  1. Sales-Prediction-using-ML Public

    This project develops a sales prediction web app using the Texas housing dataset ('txhousing'). The goal is to provide insights into real estate sales trends using this dataset. I have used …

    Jupyter Notebook 1 1

  2. Crop-row-detection Public

    Developed a deep learning model in Python to detect crop rows from input images, utilizing U-Net architecture with TensorFlow for image segmentation. Evaluated model performance using the Intersect…

    Jupyter Notebook 1

  3. Customer-sentiment-Analysis Public

    This project focuses on analyzing customer sentiment based on textual data, such as product reviews, feedback, or social media posts. The goal is to classify customer feedback into different sentim…

    Jupyter Notebook 1

  4. Spotify-classification-R Public

    Exploring Audio Features and Genre Classification for Spotify data

    1

  5. Data-Visualization-projects Public

    The purpose of this interactive storyboard is to visually explore the transition to clean fuel technology in the BRICS nations (Brazil, Russia, India, China, and South Africa). By analyzing data on…

  6. InfoAssignment Public

    Jupyter Notebook