Soumyadip Ghorai

Data Scientist

Bangalore, Karnataka, India3 yrs 9 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in machine learning and predictive analytics.
  • Proven track record in developing AI-driven solutions.
  • Strong mentor with experience guiding over 200 students.
Stackforce AI infers this person is a Data Science and Machine Learning expert in the SaaS industry.

Contact

Skills

Core Skills

Machine LearningGenerative AiData Science

Other Skills

AnalyticsBeautiful SoupBusiness AnalyticsClassificationComputer ScienceDashboardData AnalysisData AnalyticsData MiningData StructuresData VisualizationDeep LearningDescriptive AnalysisDjangoDocumentGPT

About

🌟 Data Scientist | Machine Learning Engineer | AI Engineer | Dreamer 🌟 Driven by a passion for innovation and a deep commitment to data-driven solutions, I specialize in machine learning (ML), predictive analytics, natural language processing (NLP), and Generative AI to tackle complex, real-world challenges. With a focus on delivering impactful results, I bring a unique combination of technical expertise, analytical thinking, and problem-solving skills to every project. πŸ“Œ Key Strengths - Machine Learning: Proven ability to design, train, and deploy scalable ML models to address critical business needs and optimize processes. - Data Analysis: Skilled in analyzing diverse datasets, uncovering actionable insights, and driving informed, strategic decision-making. - Python Expertise: Proficient in using the Python ecosystem for developing data pipelines, implementing machine learning algorithms, and automating workflows. - Problem-Solving: Expertise in solving complex challenges through statistical modeling, simulations, and optimization techniques. πŸ’» Core Technical Skills - Machine Learning & AI: Regression, classification, clustering, time series forecasting, deep learning, and LLMs (GPT-based models). - Programming: Advanced proficiency in Python (pandas, NumPy, scikit-learn, PyTorch), SQL, FastAPI, Flask, LangChain, and Streamlit. - Data Visualization: Experienced in creating impactful dashboards with Tableau, matplotlib, seaborn, and other visualization tools to track KPIs and metrics. - Optimization & Simulation: Hands-on experience with Monte Carlo simulations, inventory optimization, and decision-support systems. 🎯 Future Aspirations As a lifelong learner, I am eager to contribute to cutting-edge advancements in Generative AI, deep learning, and data engineering. I aim to collaborate with innovative teams to solve industry challenges in areas like NLP, business intelligence, supply chain analytics, and decision science. πŸ’‘ What I Bring - A commitment to continuous learning and delivering impactful, data-driven solutions. - Recognition as a problem solver and innovation driver, with a track record of leveraging AI/ML technologies for tangible results. - A strong foundation in data science, coupled with a collaborative mindset and an enthusiasm for working with like-minded innovators. πŸ” Let’s connect and explore opportunities to build transformative solutions together! 🌟

Experience

3 yrs 9 mos
Total Experience
11 mos
Average Tenure
1 yr
Current Experience

Ge vernova

Data Scientist

Jun 2025 – Present Β· 1 yr Β· Bengaluru, Karnataka, India Β· Hybrid

Kpmg

2 roles

Data Scientist

Promoted

Jan 2024 – Jun 2025 Β· 1 yr 5 mos Β· Bengaluru, Karnataka, India Β· Hybrid

  • I have collaborated with one of the largest pharmaceutical company, leveraging advanced AI/ML and statistical techniques to optimize inventory and supply chain operations. My expertise extends to developing multiple POCs and platforms utilizing Large Language Models (LLMs) and GenAi.
  • Implemented an end-to-end automation solution for time series forecasting. Starting from context extraction from user queries using LLM, dynamically generating and executing SQL queries, and performing forecasting on the filtered dataset
  • Developed multiple customized AI/ML tools tailored to diverse use cases, empowering custom AI agents to generate efficient responses.
  • Integrated open-source Language Models such as Llama-3 and Mistral seamlessly into the existing architecture powered by GPT3.5 turbo.
  • Implemented functional calling mechanisms to efficiently leverage the capabilities of these LLMs, resulting in significant reductions in operating costs while enhancing overall system performance and versatility.
  • Leveraged DocumentGPT's advanced capabilities to enhance text and table extraction from PDF documents.
  • Enhanced the accuracy of the RAG method by implementing a custom distance-based document chunking algorithm.
  • Developed and implemented a custom algorithm to replicate PDF content into Markdown format, enabling seamless integration with Language Model (LLM) for improved text processing and retrieval accuracy.
  • Optimized inventory management parameters through the implementation of Monte Carlo Simulation, successfully achieving the desired service level.
Data MiningLarge Language Models (LLM)Python (Programming Language)LangChainData StructuresInventory Optimization+15

Data Scientist

Feb 2023 – May 2023 Β· 3 mos Β· Bengaluru, Karnataka, India Β· Hybrid

  • Successfully completed training in Python, SQL, machine learning, Linear Programming, and other consulting skills.
  • Won 2 consecutive coding competitions hosted by KPMG.
Data MiningMachine LearningPython (Programming Language)Data StructuresOperations ResearchSQL+5

Upgrad

Data Science Mentor

Jan 2023 – Dec 2023 Β· 11 mos

  • Guided and mentored over 200+ students in Data Science, Machine Learning, Python, SQL, Statistics, and Analytics.
  • Discussed real-world data science project development using Python and how to deploy them using Flask.
  • Conducted interactive doubt-clearing sessions on core data science concepts.
Machine LearningPython (Programming Language)Data StructuresSQLStatisticsData Analysis+6

Tweek labs

Data Scientist

Mar 2022 – Aug 2022 Β· 5 mos Β· Bangalore Urban, Karnataka, India

  • Implemented new features like max shoulder-speed of a fast bowler from labeled sensor data.
  • Implemented methods like moving avg, selective scaling to remove fluctuations in sensor data.
  • Setup a separate notebook of interactive charts to check for anomalies in various parameters of athletes using plotly.
  • Developed aggregated scoring methods to rank players according to their stats.
  • Setup automatic pipeline to store data in using google API, made interactive dashboard using Meta Base to track KPIs
  • Migrated old data pipeline from c# to python and backend code from pandas to numpy.
  • Most exciting! Applied Machine learning to predict ground contact with an avg accuracy of 11 milliseconds.
  • Team : Motion Data Analyst
  • language : python
Data MiningMachine LearningPython (Programming Language)MetaBaseData StructuresComputer Science+11

Ericsson

Data Scientist

Sep 2021 – Oct 2021 Β· 1 mo Β· India

  • Project : Predict the root cause and recommend possible resolutions from the error messages from ENM upgradation logs. To build the resolution recommendation module JIRA database was used as a historical training data source
  • Team : Log Analytics
  • My task was to write a generalized parser in python to convert the JIRA xml tickets into json files. Which I have done within time and uploaded the files on elastic search and the visualization was done on kibana.
  • Tech : Python extensively
Python (Programming Language)Data StructuresComputer ScienceAnalyticsMatplotlibData Science

Education

Indian Institute of Technology, Madras

Bachelor of Science - BS β€” Data Science

Jan 2021 – Jan 2025

Christ University, Bangalore

Master of Science - MS β€” Data Science

Jan 2021 – Jan 2023

University of Calcutta

Bachelor of Science β€” statistics

Jan 2018 – Jan 2021

Stackforce found 100+ more professionals with Machine Learning & Generative Ai

Explore similar profiles based on matching skills and experience