Shubham .

Lead ML Engineer

Bengaluru, Karnataka, India5 yrs 8 mos experience
Highly Stable

Key Highlights

  • Over 5 years of experience in data science.
  • Expertise in NLP and predictive analytics.
  • Proven track record in optimizing industrial processes.
Stackforce AI infers this person is a Data Scientist with expertise in Fintech and Maritime industries.

Contact

Skills

Core Skills

Machine LearningLarge Language Models (llm)Natural Language Processing (nlp)

Other Skills

PostgreSQLTensorFlowPredictive ModelingPythonApache SparkAmazon Web Services (AWS)Deep LearningLangChainXGBoostCNNPySparkJanusGraphKerasMicrosoft Azure Machine LearningTime Series Analysis

About

Hello there! I'm a data scientist with over 5 years of hands-on experience in the field. My journey has been filled with exciting opportunities to address real-world business challenges across diverse industries, including finance, chemical and petroleum, and maritime. My expertise spans various domains of data science, such as time series forecasting, natural language processing (NLP), and predictive analytics. I take immense pride in my contributions to improving Tenancy document processing through NLP and computer vision by reducing processing time and reduced errors, enabling businesses to make quicker and smarter decisions. Additionally, I've played a vital role in optimizing industrial processes by harnessing the power of data science. Throughout my career, I've demonstrated my prowess in building end-to-end machine learning pipelines and implementing robust monitoring systems on cloud platforms such as Microsoft Azure and IBM Cloud. These systems have not only streamlined workflows but have also ensured that the models perform effectively in real-world scenarios. With a relentless learning attitude, I thrive on staying ahead of the curve in this dynamic industry. Each project presents an opportunity for me to broaden my skill set and explore novel solutions to intricate problems.

Experience

5 yrs 8 mos
Total Experience
2 yrs 7 mos
Average Tenure
5 mos
Current Experience

Paypal

Senior Machine Learning Engineer

Dec 2025Present · 5 mos · Bengaluru, Karnataka, India · Hybrid

S&p global

Data Scientist

Oct 2023Dec 2025 · 2 yrs 2 mos · Bengaluru, Karnataka, India · Hybrid

  • Dual Usage Goods Classification:
  • Led end-to-end development of a Dual Usage Goods classification system using Retrieval-Augmented Generation (RAG) and LLMs with Python, PostgreSQL (pgvector), and Langchain.
  • Designed and deployed scalable data pipelines to extract export control regulations from PDF documents across 25+ countries and enriched the text using complementary internal data sources, significantly enhancing classification accuracy.
  • Delivered a 25% improvement in solution accuracy and a 30% reduction in SME manual effort by replacing a legacy system with an advanced AI-driven solution.
  • GIA Research Assistant:
  • Designed and deployed a domain-specific RAG-based chatbot for Macroeconomics and Geopolitical Risk, automating 80%+ of SME queries reducing SME efforts and time using LLMs, LangChain, and PostgreSQL (pgvector).
  • Built scalable multi-modal data pipelines (text/audio) and enabled geo-coordinate-based-queries, applied LLMs for speech-to-text and contextual chat handling, enhancing real-time system intelligence and user experience.
  • Vessel ETA and Route Prediction:
  • Developed and deployed vessel ETA prediction models using XGBoost and route prediction using CNN model, achieving a 15% performance improvement over the existing rule-based system.
  • Designed a scalable pipeline in Python and PySpark to process large-scale maritime data and identified optimal shipping paths using graph-based algorithms.
Machine LearningPostgreSQLTensorFlowPredictive ModelingPythonApache Spark+3

Ibm

2 roles

Data Scientist

Sep 2020Oct 2023 · 3 yrs 1 mo · Bengaluru, Karnataka, India · Hybrid

  • Work Experience in Industries:
  • 1. Chemical and Petroleum
  • 2. Transportation and Logistics
  • 3. Finance
  • Some projects I've worked on:
  • Petroleum Coke Quality Prediction/Optimization
  • Predict Petroleum Coke Quality parameters and forecast 10+ crudes and 13 different properties time series using models such as XGBoost, Facebook Prophet, and Gradient Boost
  • Optimized refinery process to achieve higher and sustained coke quality using Model Explainability (SHAP)
  • Developed a robust Model Monitoring module that detected and prevented performance drifts; reduced model downtime by around 80% and possible a loss of around $0.6-1M/year
  • Created end-to-end machine learning pipelines on Azure ML
  • Cargo Vessel Trajectory and ETA Prediction
  • Designed a network of Neural Network models using TensorFlow to predict the Vessel Trajectory and ETA. The network consists of 1000+ nodes and each node in the network consists of 2-3 different Neural Network models
  • Created new features from online sourced data which improved model performances by 4-5% on average
  • Conceptualized Solution performance criteria and implemented Model monitoring functionality
  • Property Finance Content Intelligence
  • Developed Bi-LSTM based Neural network using PyTorch to perform content intelligence on 14+ Tabular Data Formats extracted from PDF
  • Incorporated LSH model and fuzzy matching to find the closest matching entity names; reduced search time by around 10 times
  • The solution reduced the time to execute property finance evaluations by over 80%
  • HSN Code Classification
  • Developed BERT-based hierarchical classification Neural Network model using transfer learning; improved performance by 10%
  • Extracted entities and relationships from product descriptions which are analyzed for auditing
JanusGraphMachine LearningKerasMicrosoft Azure Machine LearningTime Series AnalysisAnalytical Techniques+11

Data Scientist

Jan 2020Jun 2020 · 5 mos · Greater Bengaluru Area

  • During my internship at IBM, I worked on the following assignment:
  • Graph Analytics Platform:
  • Implementing graph algorithms.
  • Enhancing and improving the implementations for computationally efficient system.
  • Integrating with Spark for cluster-computing.
  • Created a modern, responsive and user-friendly web-platform.
  • Technologies: JanusGraph, Gremlin, Spark, NodeJs, Python, jQuery, JavaScript.

Niit limited

Web Developer

Jan 2019Mar 2019 · 2 mos · Rajasthan, India

  • During my internship at NIIT Limited, I worked on the following assignment
  • Interaction Portal for Prospective students of the University:
  • Login and Signup using OAuth.
  • Developing features for effective interaction such as: Personal chat, Posts, Discussion Forums, User Profiles etc.
  • Creating a modern, responsive and user-friendly web-platform.
  • Technologies: PHP, Laravel, jQuery, JavaScript, HTML, CSS.

W3dev private limited

Full Stack Web Developer

Dec 2017Apr 2018 · 4 mos · IIIT Delhi, New Delhi

  • During my internship at NIIT Limited, I worked on the following assignment:
  • SellBuyBook: An online platform to sell and buy books
  • Created a modern, responsive and user-friendly web -platform.
  • Login and Signup using OAuth process.
  • Integrated payment gateways such as Paytm, RazorPay etc.
  • Backend Development.
  • Technologies: PHP, Laravel, jQuery, JavaScript, AWS Services (EC2, S3, Cloud9 IDE), HTML, CSS.
  • KissanX: Crops and Agricultural tools Management Platform
  • Created a modern, responsive and user-friendly web -platform.
  • Backend Development.
  • Technologies: PHP, Laravel, jQuery, JavaScript, AWS Services (EC2, S3, Cloud9 IDE), HTML, CSS.
  • MetGiMet: Doctor Appointment Platform
  • Created a modern, responsive and user-friendly web -platform.
  • Login and Signup using OAuth process.
  • Integrated payment gateways such as Paytm, RazorPay etc.
  • Backend Development.
  • Technologies: PHP, Laravel, jQuery, JavaScript, AWS Services (EC2, S3, Cloud9 IDE), HTML, CSS.

Niit limited

Web Designer

Jun 2016Nov 2016 · 5 mos · Rajasthan, India

  • During my internship at NIIT Limited, I worked on the following assignment:
  • Data Visualizer Platform
  • Understanding and implementing different visualization techniques.
  • Choosing suitable visualization for effective data representation.
  • Creating a modern, user-friendly web-platform.
  • Technologies: d3.js, jQuery, JavaScript, HTML, CSS.

Education

Indian Institute of Technology Hyderabad

Master of Technology - MTech — Data Science

Aug 2024Sep 2027

NIIT University

Bachelor of Technology - BTech — Computer Science

Jan 2016Jan 2020

Kendriya Vidyalaya

Stackforce found 100+ more professionals with Machine Learning & Large Language Models (llm)

Explore similar profiles based on matching skills and experience