Vipul Malkoti

DevOps Engineer

Bengaluru, Karnataka, India11 yrs 5 mos experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Led AWS migration achieving $15M annual cost reduction.
  • Implemented observability for AI models enhancing reliability.
  • Built and managed SRE teams across multiple regions.
Stackforce AI infers this person is a SaaS Infrastructure and Reliability Engineering expert with strong leadership capabilities.

Contact

Skills

Core Skills

Site Reliability EngineeringCloud ComputingReliability Engineering

Other Skills

AIAWSAnsibleApache KafkaAutomationBashC++Chaos EngineeringContainerizationContinuous DeliveryContinuous IntegrationContinuous Integration and Continuous Delivery (CI/CD)Cost ReductionDevOpsDocker

About

I am a Infrastructure and Reliability engineering leader, with an experience in building complex infrastructure and automation across the cloud. Currently leading/managing an SRE team which is building and maintaining a critical production setup, owning observability and creating automation tools for improved developer productivity, and yes I work with AI pipelines, montoring and building internal AI tools. I love talking about the cloud, kubernetes, observability and anything crazy happening across the world. Feel free to reach out!

Experience

Indeed.com

Engineering Manager - Infrastructure and SRE

Aug 2022Present · 3 yrs 7 mos · Bengaluru, Karnataka, India

  • Building scalable, reliable and resilient systems which help people get jobs.
  • As the only Infra and SRE EM in India, I lead multiple teams for the reliability and ML platform efforts for Indeed's critical business components across India and Japan.
  • Created a new team from scratch in the APAC region.
  • Led the complete migration from datacenter to aws cloud.
  • Achieved $15m (annual) in cost reduction for our cloud cost.
  • Implemented observability and alerting for our AI agents and critical GNN models (LLM)
  • Achieved 4 nine resilience for our revenue critical features from earlier 3 nines
  • spearheaded MLOps best practices for model features, deployment and monitoring
InfrastructureOperational ExcellenceSite Reliability EngineeringCloud ComputingReliability EngineeringTeam Management+2

Harness

2 roles

Manager - Product Reliability Engineering

Feb 2021Jul 2022 · 1 yr 5 mos

Lead Software Engineer

Jun 2020Jan 2021 · 7 mos

Palo alto networks

Staff Site Reliability Engineer

Feb 2019Jun 2020 · 1 yr 4 mos · Bengaluru, Karnataka, India

Red hat

Software Engineer - DevOps

Sep 2016Feb 2019 · 2 yrs 5 mos · Bengaluru Area, India

  • Working as a part of the DevOps team. A contributor to release automation, configuration management and building the delivery pipeline. Working with the latest technology and tools to create the the fastest, most efficient and error free releases. Deploying rpms, containers, cloud images (qcow2, ami, azure) in the best way possible for customers. Implementing Devops principles among all collaborating teams and continuously improving and optimizing processes to stay ahead in the industry.

Atos

Software Engineer

Jun 2014Jul 2016 · 2 yrs 1 mo · Pune Area, India

  • Helped develop the PoS software for card transactions.

Education

Bharati Vidyapeeth

Bachelor of Technology (BTech) — Computer Engineering

Somerville school, New Delhi

High school — Non-Medical Science

Stackforce found 100+ more professionals with Site Reliability Engineering & Cloud Computing

Explore similar profiles based on matching skills and experience