Apoorv Singh N.

Data Scientist

Gurugram, Haryana, India0 mo experience

Key Highlights

  • Expert in Machine Learning and Data Science.
  • Proven track record in NLP and automation.
  • Strong background in embedded systems development.
Stackforce AI infers this person is a Data Science and Embedded Systems specialist with a focus on automation and machine learning.

Contact

Skills

Core Skills

Machine LearningData ScienceNatural Language Processing (nlp)MlopsSqlPython (programming Language)Embedded Systems

Other Skills

C++Data StructuresLarge Language Models (LLM)STM32

Experience

Capgemini

2 roles

Data Scientist

Promoted

Aug 2022Present · 3 yrs 7 mos · Bengaluru, Karnataka, India

  • Chatbot and Excel Assistant-
  • Developed a scalable RAG pipeline using Parquet-based storage, cutting data load time by 60%.
  • Integrated a Pinecone vector Database with Cohere Reranker, Metadata filtering and BGE embeddings to build semantic indexes, achieving 85% retrieval accuracy and saving 50K USD annually in triage costs.
  • Fine-tuned Llama 3.1 8B using QLoRA on 150K+ customer support tickets, achieving 94% classification accuracy and reducing response time by 70%
  • Built a smart Excel plugin enabling users to Rephrase, Summarize, and Highlight data (cell/row/sheet), seamlessly integrated with Excel UI using LLMs.
  • Achieved a 45% reduction in OpenAI token usage per query and 30% faster response time by minimizing data scope through intelligent filtering
  • Incident Automation
  • Collaborated with cross-functional teams to develop an NLP based Automated Incident Resolution System, processing over 1M incidents across 61 global service lines saving 100K USD annually.
  • Implemented a One-vs-Rest classifier to categorize newly clustered incidents, optimizing DBSCAN epsilon value to 0.7, achieving 82% accuracy.- Automated workflows using Apache Airflow, reducing manual intervention by 40%.
  • Cloud Infrastructure Services
  • Automated ETL workflows and optimized MSSQL databases for 200+ batch processes, reducing incident response time.
  • Improved system reliability by integrating Control-M for batch job scheduling and ensuring 90% server uptime.
Python (Programming Language)Natural Language Processing (NLP)Machine LearningData ScienceLarge Language Models (LLM)SQL+1

Senior Analyst

Mar 2022May 2022 · 2 mos · Bengaluru, Karnataka, India

SQLPython (Programming Language)

Defence research and development organisation (drdo)

Embedded systems

Sep 2021Nov 2021 · 2 mos

  • Worked in development of communication protocols like I2C, SPI, UART on STM32 based microcontroller
STM32Embedded Systems

Carnegie mellon university

International Fellow@ RSS'21 Robotics: Science and Systems

Jul 2021Jul 2021 · 0 mo · Philadelphia, Pennsylvania, United States

Embedded Systems

Solvex systems

System Design Consultant

Jun 2021Jul 2021 · 1 mo

STM32Embedded Systems

Ntpc limited

Industrial Trainee

Jun 2021Jun 2021 · 0 mo

Education

The LNM Institute of Information Technology

Bachelor of Technology - BTech

Jan 2018Jan 2022

Stackforce found 100+ more professionals with Machine Learning & Data Science

Explore similar profiles based on matching skills and experience