Pritam Gajbhiye

Data Scientist

Chicago, Illinois, United States · 3 yrs 7 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Shipped forecasting models with over 90% accuracy.
  • Developed scalable ETL pipelines processing 1M+ records.
  • Implemented RAG systems for intelligent document retrieval.
Stackforce AI infers this person is a Data Scientist specializing in ML and GenAI systems for government and tech sectors.

Skills

Core Skills

Data Analytics · Machine Learning · Natural Language Processing (NLP) · Data Science · A/B Testing · Statistical Data Analysis · Data Visualization

Other Skills

Snowflake Cloud · Python (Programming Language) · Retrieval-Augmented Generation (RAG) · Time Series Analysis · Spatial Analysis · Python · R · Amazon Web Services (AWS) · ETL · AWS · NLP · Literature Reviews · Social Media Analytics · Community Detection · Network Metrics

About

I build ML and GenAI systems that go from messy, unstructured data to production. Over 4+ years across a DOE national lab, UChicago Booth, enterprise, and county government, I've shipped forecasting models with 90%+ accuracy, RAG systems over thousands of unstructured documents, and ETL pipelines processing 500GB+ daily. What connects all of it: I care as much about the deployment and the decision it enables as I do about the model.

What I work on:

  • GenAI & NLP: Production RAG architectures (LangChain, Qdrant, GPT-4o), LLM-powered document intelligence, embeddings, information retrieval
  • Predictive Modeling: Time-series forecasting (LSTM, N-BEATS, Prophet), causal inference, A/B testing, demand optimization
  • Data Platforms: Snowflake, AWS (S3, Glue, SageMaker, EMR), PySpark, Airflow, ETL at scale
  • Stack: Python · SQL · R · TensorFlow · scikit-learn · LangChain · Docker · MLflow · Power BI · Streamlit

Currently exploring roles in Data Science, AI/ML Engineering, and Applied AI. Let's connect: pgajbhiye2405@gmail.com

Experience

Total Experience: 3 yrs 7 mos
Average Tenure: 11 mos
Current Experience: 9 mos

Cook County Government

Data/Policy Analyst

Sep 2025 - Present · 8 mos · Chicago, IL

Data Analytics · Data Visualization · Machine Learning · Snowflake Cloud · Python (Programming Language)

Applied Data Fellowship

Applied Data Fellow

Aug 2025 - Present · 9 mos · Chicago, IL · Hybrid

University of Chicago

3 roles

Teaching Assistant

Mar 2025 - May 2025 · 2 mos

  • Course: MACS 30113 1 Principles of Computing 3: Big Data and High-Performance Computing for Social Scientists
  • Assisted the instructor during lectures by offering technical support and helping students troubleshoot code and understand key computational concepts.
  • Held weekly office hours to provide one-on-one help with course material, programming assignments, and questions related to big data tools and parallel computing.
  • Led and managed weekly lab sections for over 30 students, guiding them through hands-on exercises involving big data analysis, parallel computing, and high-performance environments.
  • Evaluated assignments and final projects for a class of 30+ students, ensuring consistent grading and delivering constructive feedback.
  • Supported students in configuring and using cloud-based and university HPC environments (e.g., AWS, Slurm clusters), and resolved related technical issues.

Teaching Assistant

Aug 2024 - Sep 2024 · 1 mo · Chicago, Illinois, United States

  • Course: MACS 30120 - Computing Fundamentals Boot Camp
  • Held regular office hours to assist students with Python programming concepts, data structures, and algorithms.
  • Debugged and troubleshot students' code during lab sessions.
  • Graded weekly programming assignments using established rubrics and provided constructive feedback on code quality, efficiency, and style.

Research Assistant

Jun 2024 - Aug 2024 · 2 mos · Chicago, Illinois, United States

  • Collaborated with a professor on coursework development to integrate advanced social network analysis (SNA) techniques into the curriculum, facilitating hands-on learning experiences.
  • Developed a comprehensive document on SNA methods tailored to social media and an R Markdown tutorial on identifying and quantifying ego networks and polarization, incorporating practical examples and code implementation.
  • Designed a capstone project that involved exploring Twitter data, visualizing networks, and generating SNA measures, providing students with hands-on experience.

The University of Chicago Booth School of Business

Research Assistant - Data Scientist

Aug 2024 - Jun 2025 · 10 mos · Chicago, Illinois, United States

  • Led end-to-end analysis of Chicago 311 service request data to evaluate the impact of housing policies on landlord behavior, leveraging Python for large-scale data processing and modeling.
  • Applied time-series and spatial analytics in R to identify neighborhood-level trends in housing quality, urban stratification, and policy outcomes.
  • Designed and implemented a Retrieval-Augmented Generation (RAG) system by embedding property tax PDF documents into a vector database (Qdrant), enabling natural language querying via the GPT-4o API.
  • Built an interactive chatbot to query and summarize property tax records, translating complex, unstructured PDF data into actionable insights for researchers and policymakers.
  • Engineered feature extraction pipelines from semi-structured PDF data (e.g., property attributes, assessments, temporal indicators) to support downstream analytics and intelligent retrieval.
Data Analytics · Natural Language Processing (NLP) · Retrieval-Augmented Generation (RAG) · Time Series Analysis · Spatial Analysis

Labhanya Infotech

Data Scientist

Feb 2022 - Jul 2023 · 1 yr 5 mos · Nagpur, Maharashtra, India · On-site

  • Built scalable ETL pipelines processing 1M+ records, improving data processing efficiency by 40% and enabling downstream ML workflows.
  • Developed and deployed ML-driven NLP chatbot systems, including Transformer-based multilingual models, improving response quality and reducing translation errors by 25%.
  • Designed and automated A/B testing frameworks to optimize recommendation strategies and support data-driven product decisions.
  • Deployed ML pipelines on AWS (EC2, S3, SageMaker, Lambda) using CI/CD, reducing production downtime by 30%.
Data Science · Statistical Data Analysis · Data Visualization · Amazon Web Services (AWS) · A/B Testing

Illinois Institute of Technology

Research Assistant

Aug 2021 - Dec 2021 · 4 mos · Chicago, Illinois, United States

  • Conducted social media network analysis focusing on political polarization.
  • Applied NetworkX and Gephi for network visualization and statistical analysis to compute and assess diverse network metrics such as degree distribution, closeness centrality, and betweenness centrality.
Literature Reviews · Social Media Analytics · Community Detection · Network Metrics

Pacific Northwest National Laboratory

Data Scientist Co-op

May 2021 - Dec 2021 · 7 mos · Richland, Washington, United States

  • Developed graph-based deep learning (GCNN + LSTM) models to forecast curbside parking availability, achieving 90%+ prediction accuracy.
  • Benchmarked advanced ML models (N-BEATS, LSTM) against classical approaches, improving performance by 10% over best baselines.
  • Designed end-to-end pipelines for time-series prediction, anomaly detection, and mobility demand forecasting.
Statistical Data Analysis · Machine Learning · Deep Learning · Data Visualization · Literature Reviews

Department of Resource Economics, UMass Amherst

Research Assistant

May 2020 - Jul 2020 · 2 mos · Amherst, Massachusetts, United States · Remote

  • Used the tidycensus package in R to extract socioeconomic and demographic data for two metropolitan areas (Boston and Phoenix) from US Census datasets, then performed EDA and plotted geographic heatmaps to look for correlations with eBird checklist data.
  • Preprocessed, wrangled, and visualized datasets including IHDS and NCRB to support hypothesis formulation.
  • Explored correlations between crime rates and factors such as unemployment rates, education levels, and women's empowerment.
Statistical Data Analysis · Data Visualization · EDA · R

Education

University of Chicago

Master of Arts - MA — Computational Social Science

Aug 2023 - Jun 2025

Illinois Institute of Technology

Master's degree — Computer Science with concentration in Data Analytics and Computational Intelligence

Jan 2020 - Dec 2021

Massachusetts Institute of Technology

MicroMasters Program — Statistics and Data Science

May 2019 - May 2021

Dr. Babasaheb Ambedkar Technological University

Bachelor of Technology - BTech — Information Technology

Aug 2014 - May 2018
