Soham K. — Data Scientist
I’m a Data Scientist with 2+ years of industry experience building Explainable AI Systems and scalable, intelligent systems across Conversational AI, Generative AI, Agentic AI, Voice AI, RAG, NLP, and Computer Vision. Currently at CoffeeBeans, I work on next-gen AI products that improve customer interactions and automate decision-making. Key contributions include: • Legal Chatbot (MVP Product) – Developed a chatbot that matches users with lawyers based on query and location using Python, GraphQL, Deepval, and LLMs. • Payment Collection Agent – Built a voice-based AI agent that auto-calls users for payment reminders, understands responses, and logs them using LangChain, LangGraph, and Ngrok. • Audio Analytics Platform – Built a system to upload and summarize service calls, analyze fluency using Librosa, and derive customer interaction scores. • Dental Disease Detection – Fine-tuned YOLOv8 using LoRA and QLoRA for X-ray classification. Integrated a RAG system to answer doctors' queries using domain documents. • LLM-Based Fraud Detection – Fine-tuned LLMs on transaction + ticket data to detect fraud and generate explainable flags, integrated into client infrastructure. • Client Conversation Insight Tool – Extracted structured insights from PM-client calls for automated report generation. • AI Interviewer Platform – Developed an end-to-end autonomous interview system that scores answers, assigns feedback, and generates reports. Previously at InCruiter, I led AI efforts for their flagship product InCBot, an AI-powered interview assistant. Notable achievements include: • Face verification and real-time proctoring (YOLOv5 + MediaPipe + DeepFace) • AI talking avatar using Wav2Lip, Whisper, and fluency scoring with HuBERT • Resume parsing via OCR + Regex in <500ms • Feedback generation with LangChain + LLaMA (RAG pipeline) • Compiler-integrated DSA feedback and JD–CV matching using LLM embeddings • Dynamic job description generation using prompt engineering + few-shot examples With a B.S. in Data Science from IIT Madras, I specialize in LLMs, NLP, audio processing, and computer vision with strong hands-on experience in Python, SQL, Flask, LangChain, TensorFlow, Scikit-learn, HuggingFace, and Docker. I’m driven by solving complex real-world problems and creating impact with production-grade AI solutions. Let’s connect if you're working on anything AI, RAG, or LLM-related!
Stackforce AI infers this person is a Data Scientist specializing in AI-driven solutions across various industries.
Location: Bengaluru, Karnataka, India
Experience: 3 yrs 2 mos
Skills
- Artificial Intelligence (ai)
- Data Science
- Audio Analysis
- Computer Vision
- Machine Learning
- Natural Language Processing
Career Highlights
- Expert in building Explainable AI systems.
- Proficient in Conversational and Generative AI technologies.
- Strong background in Computer Vision and NLP applications.
Work Experience
Tredence Inc.
Senior Data Scientist (9 mos)
CoffeeBeans
Software Engineer AI (11 mos)
InCruiter
Data Scientist (9 mos)
Associate Data Scientist (2 mos)
Tata Consultancy Services
Training (5 mos)
Indian Institute of Information Technology Nagpur
Deep Learning Research Associate (6 mos)
Suven Consultants and Technology Pvt.Ltd.
Internship (1 mo)
Education
Bachelor of Science - BS at Indian Institute of Technology, Madras
Bachelor of Engineering - BE at Government College of Engineering and Research Avasari Pune