Siddharth Bajpai

Data Scientist

Kanpur, Uttar Pradesh, India1 yr 9 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in architecting AI-driven solutions.
  • Proven track record in reducing development cycles significantly.
  • Strong foundation in both software engineering and data science.
Stackforce AI infers this person is a Data Scientist with expertise in SaaS and Fintech industries.

Contact

Skills

Core Skills

Software DevelopmentData Science

Other Skills

A/B TestingAWS EKSAgile MethodologiesAmazon RedshiftAmazon Web Services (AWS)Analytical SkillsAngularBack-End Web DevelopmentBusiness Data ManagementBusiness Intelligence ToolsCascading Style Sheets (CSS)ChromaDBCommunicationConversational AICredit Risk Management

About

A graduate of Harcourt Butler Technical University with a B.Tech in Mechanical Engineering, I am currently a Data Scientist at Great Learning. With prior experience as a Software Engineer at gen Z Solutions and Data Scientist at Calance, I have contributed to projects that integrate advanced AI technologies like LangChain, RAG, and GPT for innovative solutions, including the creation of TestMate AI and document intelligence systems. My core competencies include communication, marketing, and understanding user requirements. Passionate about leveraging generative AI to simplify complex processes and optimize workflows, I aim to develop tools and systems that foster efficiency and innovation across industries.

Experience

1 yr 9 mos
Total Experience
10 mos
Average Tenure
1 yr
Current Experience

Great learning

Data Scientist

Jun 2025 – Present Ā· 1 yr Ā· Gurugram, Haryana, India Ā· Hybrid

Gen z solutions

Software Engineer - GenAI šŸ‘Øā€šŸ’»

Sep 2024 – Jun 2025 Ā· 9 mos Ā· Pune, Maharashtra, India Ā· Hybrid

  • Architected and deployed TESTMATE AI (testmateai.com), an autonomous test generation platform utilizing agentic workflows to transform Jira tickets into executable BDD test suites, achieving 67% reduction in QA development cycles
  • Engineered sophisticated Multi-Agent RAG Pipeline with LangChain, LangGraph, and ReAct agents for context-aware code synthesis, integrating dense vector retrieval and hybrid search using pgvector and BM25 for optimal semantic pattern matching
  • Built production-grade microservices architecture with FastAPI, async/await patterns, and event-driven design, implementing CQRS patterns and distributed caching with Redis for high-throughput Jira API integrations
  • Deployed containerized AI services using Docker, Kubernetes, and AWS EKS with auto-scaling capabilities, implementing GitOps workflows with ArgoCD and comprehensive monitoring using Prometheus and Grafana
  • Developed intelligent API Virtualization System using Spring Boot and reactive programming, creating dynamic mock services with ML-driven payload generation and dependency injection for enhanced testing isolation
LangChainRAGFastAPIRedisDockerKubernetes+5

Calance

Data Scientist

Jun 2024 – Sep 2024 Ā· 3 mos Ā· Gurugram, Haryana, India Ā· Hybrid

  • Architected production-ready Document Intelligence System integrating GPT-4 Turbo with advanced ChromaDB vector operations, implementing hierarchical clustering and semantic chunking for enterprise HR document retrieval
  • Optimized inference pipeline through asyncio parallelization, batch processing, and intelligent caching strategies, achieving 85% latency reduction (13.3s → 2.0s) while maintaining 95% retrieval precision
  • Developed responsive Chainlit interface with WebSocket streaming, server-sent events, and persistent conversation state management using Redis for seamless user experience
GPT-4 TurboChromaDBasyncioRedisData Science

Dmi finance private limited

Data Science & Analytics

Mar 2024 – May 2024 Ā· 2 mos Ā· New Delhi, Delhi, India Ā· On-site

  • Engineered intelligent SQL Code Generation System using GPT 4 Turbo with advanced prompt engineering, in-context learning, and schema-aware reasoning, reducing development cycles by 90%
  • Fine-tuned SQLcoder-7B-2 using LoRA/QLoRA techniques on domain-specific financial corpus, achieving 97.4% query accuracy with Redshift dialect optimization and syntax validation
  • Built comprehensive ETL automation suite with intelligent parsing, performance profiling, and anomaly detection, enhancing analytics pipeline efficiency by 65% through ML-driven optimizations.
GPT-4 TurboSQLETLML-driven optimizationsData Science

Education

Harcourt Butler Technical University (HBTU), Kanpur

Bachelor of Technology - B.Tech — Mechanical Engineering

Nov 2020 – May 2024

Kendriya Vidyalaya

12th Class (Higher Secondary School)

Apr 2019 – Mar 2020

Stackforce found 100+ more professionals with Software Development & Data Science

Explore similar profiles based on matching skills and experience