C

Chinmay Bapat

Director of Engineering

Seattle, Washington, United States10 yrs 10 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Led engineering for scalable generative AI infrastructure.
  • Spearheaded cost-optimization initiatives driving down customer TCO.
  • Transformed operational culture maintaining 99.99% availability.
Stackforce AI infers this person is a leader in AI infrastructure and mobile software engineering.

Contact

Skills

Core Skills

Gen Ai InfrastructureTeam ManagementCost OptimizationOperational ExcellenceMachine LearningTechnical LeadershipPerformance EngineeringSystem DesignMobile EngineeringFeature Delivery

Other Skills

Amazon ForecastAmazon TrainiumAndroid DevelopmentAutomationAutopilotCC++Core JavaEnglishFleet ManagementInferentiaJavaLLM serving optimizationLinuxLoRA support

About

Chinmay is an Engineering Manager in the Amazon SageMaker AI Inference team at AWS, where he leads engineering efforts focused on building scalable infrastructure for generative AI inference. His work enables customers to deploy and serve large language models and other AI models efficiently at scale.

Experience

10 yrs 10 mos
Total Experience
3 yrs 7 mos
Average Tenure
6 yrs 5 mos
Current Experience

Amazon web services (aws)

3 roles

Software Development Manager

Promoted

Oct 2023Present · 2 yrs 8 mos

  • Gen AI Infrastructure Leadership:
  • Direct the delivery of critical Gen AI capabilities, including LoRA support and Voice Agents. Collaborate with LLM serving optimization teams to unblock new revenue streams by enabling customers to deploy thousands of custom models in Bedrock and SageMaker AI with minimal TTFT while only paying for compute actually used. Architected robust detection and recovery mechanisms for critical workloads across the latest NVIDIA GPUs as well as Amazon Trainium / Inferentia accelerators. Engineered the platform to ensure high reliability for models served using vLLM, DJL, and Triton.
  • Cost & Efficiency Strategy:
  • Spearheaded cost-optimization initiatives, architecting multi-model serving and smart routing strategies that maximize KV cache reuse and hardware utilization (NVIDIA/Trainium) to drive down customer TCO.
  • Team Building & Talent Development:
  • Manage a high-performing organization of 15 engineers. Directly hired 6 key members and mentored 4 engineers to promotion (Junior to Mid/Senior levels), fostering a culture of continuous growth and high talent density.
  • Strategic Customer Impact:
  • Partner with Product, Science, and Technical Account Managers to define the product roadmap. Directly manage relationships with strategic enterprise customers representing $10M+ in annual revenue, resolving critical escalations and driving platform enhancements that prevented churn and secured long-term adoption.
  • Operational Excellence & Culture:
  • Transformed the team’s operational culture by instituting rigorous COE (Correction of Error) processes and leading weekly operations reviews. Established mechanisms to identify repeated failures and drive long-term architectural fixes, maintaining 99.99% availability for a platform serving 3 million+ requests/second.
Gen AI InfrastructureLoRA supportVoice AgentsLLM serving optimizationNVIDIA GPUsAmazon Trainium+9

Senior Software Engineer

Promoted

Oct 2021Oct 2023 · 2 yrs

  • Technical Leadership: Led a team of 8 engineers to build core time-series forecasting functionality within SageMaker Canvas and Autopilot, democratizing ML for low-code/no-code users.
  • Performance Engineering: Re-architected data processing pipelines using Spark and Pandas to remove blocking dependencies, delivering 50% faster training and 50% faster inference than existing solutions with zero loss in accuracy.
  • Service Ownership: Led the Amazon Forecast service, enabling customers to automate heavy lifting tasks like data featurization, hyper-parameter tuning, and back-testing without requiring ML expertise.
  • Customer Impact: Enabled major enterprise customers like Foxconn to improve forecasting accuracy and successfully handle supply chain disruptions driven by demand fluctuations during the Covid pandemic.
time-series forecastingSageMaker CanvasAutopilotSparkPandasAmazon Forecast+5

SDE 2

Jan 2020Jan 2022 · 2 yrs

Amazon

SDE 2

Aug 2017Dec 2019 · 2 yrs 4 mos · Hyderabad, Telangana, India

  • System Design & Scale: Designed and implemented a Fleet Management system to track vehicles for Amazon’s delivery partners, improving visibility across the logistics network.
  • Platform Re-architecture: Redesigned the Logistics Portal platform, a critical internal tool used by more than 20 downstream applications. Resolved scalability bottlenecks in the legacy design to support growing transaction volumes.
  • Operational Excellence: Owned operations for Amazon Logistics' common stacks. Improved system stability by automating manual on-call processes and resolving multiple high-severity incidents with minimal customer impact.
System DesignFleet ManagementLogistics PortalOperational Excellence

Microsoft

2 roles

Software Engineer

Jul 2015Aug 2017 · 2 yrs 1 mo · Hyderabad Area, India

  • Mobile Engineering (Office on Android): Core member of the FileIO team for Word, Excel, and PowerPoint on Android. Responsible for critical file operations (Create, Open, Save) across local and cloud endpoints.
  • Performance Optimization: Investigated and resolved performance bottlenecks using telemetry and profiling. Achieved an 8% improvement in general file open times and reduced open times for large Excel files on OneDrive by 4 seconds.
  • Feature Delivery: Designed and implemented the "Quick Reply to Outlook" feature, which scaled to support over 150,000 monthly active users.
  • Automation: Built automated performance measurement tools that saved the team 80 engineering man-hours per month.
Mobile EngineeringPerformance OptimizationFeature DeliveryAutomation

Intern

Apr 2013Jun 2013 · 2 mos

Simversity

Intern

Apr 2012May 2012 · 1 mo · Pune

Education

Indian Institute of Technology, Madras

Master of Technology (M.Tech.) — Computer Science and Engineering

Jan 2014Jan 2015

Indian Institute of Technology, Madras

Bachelor of Technology (B.Tech.) — Computer Science and Engineering

Jan 2010Jan 2015

Stackforce found 100+ more professionals with Gen Ai Infrastructure & Team Management

Explore similar profiles based on matching skills and experience