Karan Taware

Software Engineer

Pune City, Maharashtra, India5 yrs 1 mo experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Expert in building scalable ML systems on AWS.
  • Achieved significant cost and performance optimizations.
  • Proven track record in financial compliance solutions.
Stackforce AI infers this person is a Fintech professional specializing in Applied AI and Cloud Infrastructure.

Contact

Skills

Core Skills

Applied Machine LearningCloud ComputingDatabase Management

Other Skills

Python (Programming Language)MCP (Multi Agent Protocol)Faster-WhisperPyAnnoteNVIDIA GPU accelerationGPU batchingpipeline optimizationApplied AIArtificial Intelligence (AI)Software DocumentationProject ManagementMoralis SDKERC721/ERC1155 Smart ContractsDocument ManagementGenerative AI Development

About

I am an Applied AI / ML Engineer with over almost 3 years of experience building and scaling production machine learning systems on AWS. My engineering philosophy is simple: I don't just build models in a research vacuum; I own systems end-to-end from high-level architecture and implementation to deployment and rigorous production operations. Currently, I leverage my expertise in Python, NLP, Information Retrieval, and distributed system design to tackle complex problems within the highly regulated Financial Compliance domain. I design resilient, distributed pipelines that combine Automatic Speech Recognition (ASR), NLP, and retrieval-augmented reasoning (RAG) to handle massive, multilingual datasets securely. My core engineering focus is solving ambiguous production bottlenecks by ruthlessly balancing ML quality, latency, and infrastructure costs. Some of my recent architectural wins include: Cost & Scale Optimization. High-Throughput Infrastructure. Information Retrieval. I thrive at the intersection of applied machine learning and cloud-native infrastructure, ensuring that experimental AI concepts translate into resilient, stateless services with safe rollouts, idempotent processing, and metrics-driven observability. Technical Snapshot: Core: Python, System Design, Applied Machine Learning, Full SDLC. AI/ML: NLP, ASR, RAG Pipelines, PyTorch, Hugging Face. Cloud & Infra: AWS (EKS, EC2, S3, IAM), Kubernetes, Distributed Microservices, CI/CD. Data & Search: Elasticsearch, Lucene, CJK Tokenization, PostgreSQL.

Experience

5 yrs 1 mo
Total Experience
1 yr 8 mos
Average Tenure
2 yrs 10 mos
Current Experience

Nice actimize

2 roles

Software Engineer (AI)

Promoted

Jul 2025Present · 11 mos · Pune District, Maharashtra, India · Hybrid

  • Reduced compute cost to $0.064 per hour of audio via GPU batching, parallel chunk inference, and pipeline optimization.
  • ​Improved multilingual WER by 20% on customer datasets (English/French/Latin) and 12% for Mandarin/Japanese.
  • ​Maintained p95 inference latency under 2.8s per 30s chunk in production while processing 150+ hours/day of multilingual audio.
  • ​Architected cloud-native systems on AWS EKS with queue-based ingestion, autoscaling workers, and SLO-driven monitoring.
  • Architected and deployed a production-grade transcription and speaker diarization pipeline leveraging Faster-
  • Whisper, PyAnnote, and NVIDIA GPU acceleration, improving accuracy and reducing latency.
  • Implemented dynamic speaker diarization using PyAnnote, enabling accurate multi-speaker separation across long audio
  • files.
  • Replaced Silero VAD with ONNX-optimized GPU-based VAD, enabling real-time audio splitting and scaling efficiently
  • for enterprise workloads.
  • Designed parallelized, chunked inference pipelines with Multi-Processing and GPU memory management, boosting
  • throughput by 3x.
  • Integrated retrieval-augmented transcription workflows (RAG-style) to support multilingual-to-English translation us-
  • ing Hugging Face, NLLB and Helsinki-Opus Mul to En model.
  • Developed a robust audit and monitoring framework for transcription jobs, ensuring complete success/failure/skip track-
  • ing across pipeline stages.
  • Led offline-first model packaging initiative: centralized preloading of Faster-Whisper, PyAnnote into Docker/EFS for
  • fully disconnected deployments.
  • Improved Recall to 0.86 and Accuracy to 0.79 with Reduction Inference Time by using Parallel
  • KV Cache Compute due to which the performance gain was from 1 request per 15 seconds to 5
  • requests per 18 seconds without drop in accuracy.
  • Established LLMOps and Observability Metrics for the GenAI Applications
Python (Programming Language)MCP (Multi Agent Protocol)Applied Machine LearningCloud Computing

Associate Software Engineer

Jul 2023Jun 2025 · 1 yr 11 mos · Pune District, Maharashtra, India · Hybrid

  • Playing a pivotal role in replacing the existing transcription service with a new, optimized solution.
  • Leading the transition to a high-
  • Migrated SQL-compliant products to PostgreSQL, enhancing database efficiency.
  • Implemented monitoring metrics logging, improving observability by 30%.
  • Received a Spot Award for exceptional contributions to the project.
Applied AIArtificial Intelligence (AI)Applied Machine LearningDatabase Management

Cloudegic inc

2 roles

Project Team Lead Intern

Jun 2022Nov 2022 · 5 mos · Pune District, Maharashtra, India · On-site

Software DocumentationProject Management

Software Development Engineer Intern

Jun 2022Nov 2022 · 5 mos · Pune District, Maharashtra, India · On-site

  • Worked on Blockchain Dapps Project in the Ethereum Mainnet with Moralis SDK, ERC721/ERC1155 Smart Contracts with Database Designing and Developing formulas for Stats Page for the Website
Software DocumentationProject Management

Computer society of india - viit chapter

2 roles

Documentation Head

Jul 2021Jul 2022 · 1 yr · Pune District, Maharashtra, India

  • I have helped in Decision-making and curating reports and content for events also Did FEEDBACK ANALYSIS OF THE EVENTS, curating interview drafts
Software DocumentationProject Management

Joint Documentation Head

Aug 2020Jul 2021 · 11 mos · Pune District, Maharashtra, India

  • I helped in curating reports, making interview draftss and in creating appreciation letters.
Software DocumentationDocument Management

Education

Vishwakarma Institute of Information Technology

Bachelor of Technology - BTech — Computer Engineering

Aug 2019May 2023

Modern Education Society's Nowrosjee Wadia College Arts, Science, Pune 01

12th — Science

Jan 2017Jan 2019

Sinhgad City School

10th — 10th

Jan 2009Jan 2017

Zero To Mastery Academy

Stackforce found 100+ more professionals with Applied Machine Learning & Cloud Computing

Explore similar profiles based on matching skills and experience