Shashank Gupta

Data Engineer

New York City6 yrs 1 mo experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in designing healthcare ML solutions.
  • Proven track record in predictive analytics.
  • Strong background in data-driven problem solving.
Stackforce AI infers this person is a Data Science expert in Healthcare and Oil & Gas industries.

Contact

Skills

Core Skills

Machine LearningData EngineeringChemometricsData ScienceGenerative Neural NetworksPredictive AnalyticsProduct Development

Other Skills

A/B TestingAWS LambdaAmazon DynamodbArtificial Intelligence (AI)Azure Data FactoryAzure Data LakeAzure DatabricksComputer ScienceComputer VisionData AnalysisData MiningData MonitoringData VisualizationData WarehousingDecision Sciences

About

Shashank Gupta | Data Scientist | Data-driven Problem Solver | Tech Enthusiast I am a seasoned Data Scientist with a strong track record in designing and developing robust data-driven solutions and Machine Learning applications. Portfolio Links: 1. GitHub Profile Link - https://github.com/Sha661nk 2. Kaggle Profile Link - https://www.kaggle.com/wrecked22 3. Medium Profile Link - https://medium.com/@shashank.and.gupta. I write technical blogs on Data Science & Machine Learning on Medium & EnjoyAlgorithms platform.

Experience

6 yrs 1 mo
Total Experience
2 yrs 1 mo
Average Tenure
1 yr 11 mos
Current Experience

Pediatric associates family of companies

Data Scientist & Data Engineer III

Jul 2024Present · 1 yr 11 mos · New York, New York, United States · Hybrid

  • Designing and deploying healthcare ML solutions and scalable Medallion-architecture pipelines on Azure Databricks, leveraging RAG, CI/CD, and MLflow to improve predictive analytics, compliance, and RCM performance KPIs.
Python (Programming Language)Machine LearningArtificial Intelligence (AI)Data EngineeringSQL

Sanofi

2 roles

Data Scientist II

Jan 2024Jul 2024 · 6 mos · Framingham, Massachusetts, United States · On-site

  • As part of Sanofi's R&D team, I developed an AutoML Streamlit application for bio-process manufacturing, reducing RMSE by 10% and cutting manual modeling time by 16x. Implemented Kalman Filters to scale bio-processes from development scale (2L) to commercial scale (10,000L) bio-reactors.
ChemometricsPython (Programming Language)Machine Learning

Data Science Co-Op

Jun 2023Dec 2023 · 6 mos · Framingham, Massachusetts, United States · On-site

  • Developed AutoML pipeline for Raman Spectroscopy in pharmaceutical downstream, reducing modeling time and improving amino acid prediction accuracy by 25% while ensuring resilience to sensor noise and variability.
ChemometricsPython (Programming Language)Machine LearningData Science

Rutgers university

Research Assistant

Mar 2023Jan 2024 · 10 mos · New Brunswick, New Jersey, United States · Hybrid

  • As a Research Assistant at Rutgers University, I contributed to collaborative research by testing Generative AI (GenAI) models for synthetic data generation while retaining the missing data distribution.
Python (Programming Language)Generative Neural Networks

Enjoyalgorithms

Product Owner

Jul 2021Aug 2022 · 1 yr 1 mo · Pune, Maharashtra, India

  • Responsible for designing an industry-centric data science curriculum for beginners and working professionals, sharing ideas to increase the reach of the content, and managing the product performance.
Nonprofit OrganizationsSearch Engine Optimization (SEO)Product Development

Larsen & toubro infotech ltd (lti)

Senior Data Scientist

Jul 2019Sep 2022 · 3 yrs 2 mos · Mumbai Metropolitan Region · On-site

  • Designed and deployed a Logistic Regression-based predictive maintenance model for steam generators, driving a 40% reduction in maintenance costs and a 20% increase in Mean Time Between Failures (MTBF). Developed a scalable real-time pump health monitoring system, leveraging R for RUL modeling and Power BI for intuitive visualization. Automated equipment name extraction from industrial CAD drawings using PyTorch, pytesseract OCR, and CV2, transforming manual workflows into efficient, fully automated processes.
Python (Programming Language)Predictive AnalyticsOil & Gas IndustryData AnalysisA/B TestingData Visualization+3

Ideas revenue solutions

Data Science Intern

May 2019Jul 2019 · 2 mos · Pune/Pimpri-Chinchwad Area · On-site

  • As a Data Science Intern at IDeaS, I utilized Machine Learning Demand Forecasting models to accurately forecast hotel prices and occupancy rates, resulting in a 15% increase in revenue by optimizing room rates based on predicted demand patterns and market trends.
Computer ScienceR (Programming Language)Exploratory Data AnalysisData Science

Education

Rutgers Business School

Master's degree — Information Technology & Analytics

Sep 2022Dec 2023

Indian Institute of Technology, Kanpur

Bachelor of Technology — Electrical and Electronics Engineering

Jan 2015Jan 2019

Stackforce found 100+ more professionals with Machine Learning & Data Engineering

Explore similar profiles based on matching skills and experience