M

Madhukar Mishra

CTO

Delhi, India11 yrs 3 mos experience
Highly Stable

Key Highlights

  • Expert in Kubernetes and cloud infrastructure management.
  • Proven track record in optimizing costs for AI workloads.
  • Strong leadership in engineering and technical project execution.
Stackforce AI infers this person is a SaaS and E-commerce expert with strong DevOps and engineering capabilities.

Contact

Skills

Core Skills

Platform ArchitectureDevopsMlopsEngineeringTechnical LeadershipSoftware Development

Other Skills

Go (Programming Language)KubernetesVirtualizationAmazon Web Services (AWS)PrometheusGrafanaLokiClickhouseAWS LambdaGoogle Cloud Platform (GCP)Python (Programming Language)OpenTelemetryGPUAutoscalingComputer Science

About

Software Engineer with a penchant for Distributed Systems, DevOps, and Developer experience with my head in the clouds and eyes on the ground.

Experience

11 yrs 3 mos
Total Experience
4 yrs
Average Tenure
3 yrs 2 mos
Current Experience

Darzee app

CTO (Fractional)

Apr 2025Present · 1 yr 2 mos · Remote

Storyteller.ai

Software Engineer

Dec 2023Dec 2024 · 1 yr · Remote

  • Set up observability for an AI production using Opentelemtry and Grafana with custom alert sinks for Rust and Python applications
  • Set up a kubernetes based hybrid cloud deployment that generates 10 days of audiovisual data daily using about 1200 GPU-hours/day, and serves and hosts 200 terabytes of user generated content and models.
  • Wrote performant backend AI inference workloads for generative audio and video that maximizes GPU utilization
  • Autoscaling for GPU workloads to save 30% costs ~ 15k USD/mo
Platform ArchitectureAmazon Web Services (AWS)Google Cloud Platform (GCP)KubernetesMLOpsPython (Programming Language)

Cloudraft

Technical Lead

Apr 2023Present · 3 yrs 2 mos · Remote

  • Built a custom vertical pod autoscaling strategy for an MLOPS product based on Kubernetes.
  • Built AI PaaS which involved architecture, design, and implementation on the Kubernetes platform on bare metal including setting up telemetry data using Prometheus Grafana and Loki
  • Set up log analytics for B2B e-commerce to ingest logs from 100s of AWS Lambda functions into Clickhouse for analysis
Platform ArchitectureDevOpsGo (Programming Language)KubernetesVirtualizationAmazon Web Services (AWS)

Lummo

Infrastructure & Platform Technical Lead

Aug 2020Mar 2023 · 2 yrs 7 mos

  • Migrated infrastructure from Heroku to GKE, Manage cloud infrastructure (GCP)
  • Set up observability practices (Grafana, Prometheus, Datadog, Sentry), Define and monitor SLOs/SLIs
  • Set up CI/CD and release practices (gitops with ArgoCD, Argo Rollouts, GitHub actions, Jenkins)
  • Set up incident response and management practices (Pagerduty)
  • Led capacity planning and load testing
  • Infrastructure cost estimation and forecasting, cost optimization
  • Making build/buy decisions on infrastructure and other supporting tools
  • Extend Kubernetes using controllers to fill in the gaps from off-the-shelf components for our operational needs
  • Maintain libraries and tooling for the Dev team to own operations - enable “you build it, you run it” via the Internal Developer Platform
  • Document and train developers on platform capabilities and enable self-service
  • Collaborate on team OKRs and long-term backlog with relevant stakeholders
  • Led a team of 4 engineers (+4 external contractors), Coach the team on Agile methodologies and Scrum
EngineeringGoogle Cloud Platform (GCP)Computer ScienceTechnical LeadershipBuild ToolsSoftware Development+10

Blinkit

2 roles

Platform Engineer

Jul 2018Aug 2020 · 2 yrs 1 mo

  • Led various Release engineering initiatives for the microservices ecosystem, decoupling the backend from app releases and allowing the backend to be released multiple times a day with confidence instead of a few times a month.
  • Decoupled backend and front-end releases by developing proof of concepts and coaching the Dev team on feature toggles and contract testing
  • Enabled canary releases using flagger on Kubernetes
  • Brought together Dev and QA to work on test automation initiatives and enable integration tests in fungible environments
  • Scaled the system to ~5000 runs a day at 5 minutes per build.
  • Maintained Python libraries used by developers to manage microservices on Kubernetes in fungible environments, staging, and production
  • Maintained CI/CD infrastructure and libraries
Agile MethodologiesEngineeringAmazon EKSAnsibleComputer ScienceBuild Tools+10

Software Engineer - Full stack

Feb 2015Jul 2018 · 3 yrs 5 mos

  • Led the development of the Marketplace Platform in a 3-person team that allowed merchants to self-onboard and manage their stores, inventory, and catalog - onboarded about 12k stores from SMEs to big chains like DMart, Sangeetha mobiles, and Reliance Fresh
  • Maintained the catalog and inventory management system used to manage products, prices, and stock by internal teams of about 50+ employees in category and content teams.
  • Maintained the inventory service used to manage warehouse inventory and integrations with vendors, onboarded chains like DMart, Sangeetha Mobiles, and Prestige
  • Maintained the Promotions and Merchandising system that allowed the Marketing teams to run scheduled and duration-bound banner promotions and boost products in the search and catalog APIs opening up new revenue streams.
  • Maintained search and catalog API used for full text and attribute search for products
  • Contributed features to the Cart service to enable schedules for store serviceability and delivery and validate the cart’s coupon, prices, and inventory availability before checkout.
  • Developed the RBAC system as team size grew and more roles opened and need for Quality Assurance, Quality Check, approvals, and scoping access by role were needed, extended for use by the marketplace ecosystem for merchants to manage their stores and employees
EngineeringDjangoComputer ScienceSoftware DevelopmentFront-End DevelopmentPython (Programming Language)+9

Healthindya

Software Engineering Intern

May 2014Jul 2014 · 2 mos · Faridabad Area, India

  • Was involved in developing the ReSTful web service, and admin dashboard for a prototype of smart card based public health record system for my summer internship.
  • Stack: DB: MySQL, Server: PHP (Laravel MVC framework), Front end:HTML (with Laravel's templating engine), CSS, JavaScript (JQuery)
EngineeringComputer ScienceSoftware DevelopmentDatabasesCommunication

Education

Jaypee University of Information Technology

Bachelor of Technology - BTech — Computer Science

Jan 2011Jan 2015

Stackforce found 100+ more professionals with Platform Architecture & Devops

Explore similar profiles based on matching skills and experience