Akhil M.

DevOps Engineer

Bengaluru, Karnataka, India · 16 yrs 11 mos experience
Highly Stable

Key Highlights

  • 15 years of experience in Internet Scale systems.
  • Expert in Observability and Reliability Engineering.
  • Proficient in Go, Python, and Java programming.
Stackforce AI infers this person is a Backend-heavy Infrastructure Engineer specializing in Observability and Reliability for large-scale systems.

Skills

Core Skills

Reliability Engineering, Observability, Distributed Systems, Automation

Other Skills

AIX, Agile Methodologies, Amazon Web Services (AWS), Ansible, Apache 2, Architecture, Cloud Computing, Coaching & Mentoring, Communication, Computer Science, Django, Docker Products, Flask, Go (Programming Language), Google Cloud Platform (GCP)

About

Over 15 years of experience in scaling, breaking and fixing at Internet Scale. Expert in Observability, Distributed Systems, Reliability, Resilience and Infrastructure. Speak Go, Python and Java.

Experience

Coupang

AI Infra

Aug 2025 - Present · 7 mos · Bengaluru, Karnataka, India · Hybrid

Flipkart

Sr Architect/Sr Staff Engineer - Reliability Engineering

Jan 2022 - Jul 2025 · 3 yrs 6 mos · Bengaluru, Karnataka, India · Hybrid

  • Building reliability in engineering culture
  • Observability
  • Resilience
  • Reliability Engineering
  • DevOps
Observability, Computer Science, Google Cloud Platform (GCP), Python, Kubernetes, Cloud Computing, +22 more

Atlassian

Principal Site Reliability Engineer

Jun 2019 - Dec 2021 · 2 yrs 6 mos · Bengaluru, Karnataka, India

  • Distributed Systems
  • Observability
  • Resilience Engineering
  • Developer Productivity and Effectiveness
  • Go, Kotlin, Java, Python
  • AWS, OpenTelemetry
Observability, Computer Science, Python, Cloud Computing, Troubleshooting, Resiliency, +19 more

LinkedIn

3 roles

Staff Site Reliability Engineer

Promoted

Oct 2017 - May 2019 · 1 yr 7 mos

  • Speak Python and Java. Automate anything that needs to be done a third time, because the second time was just annoying. Specialize in Frankenstein projects.
Observability, Computer Science, Python, Cloud Computing, Troubleshooting, Resiliency, +14 more

Sr Site Reliability Engineer

Promoted

Apr 2016 - Oct 2017 · 1 yr 6 mos

Observability, Computer Science, Python, Cloud Computing, Troubleshooting, +10 more

Site Reliability Engineer

Jun 2014 - Mar 2016 · 1 yr 9 mos

Observability, Computer Science, Python, Cloud Computing, Troubleshooting, +4 more

Intuit

DevOps Engineer

Nov 2012 - Jun 2014 · 1 yr 7 mos · Bengaluru, Karnataka, India

  • DevOps Engineer working on a variety of technologies, tools and platforms such as Tomcat, JBoss, GlassFish, Hudson, Jenkins, Puppet, Capistrano, AWS, Splunk, Keynote, New Relic, Perforce.
  • Developing solutions for CI and configuration management automation and creating and evolving monitoring platforms and frameworks.
  • Working closely with Dev and QA teams in an agile development environment with very short and fully automated delivery pipelines for both application and infrastructure.
Computer Science, Python, Cloud Computing, Troubleshooting, Observability, Amazon Web Services (AWS), +4 more

IPsoft

Applications Engineer

Jul 2011 - Oct 2012 · 1 yr 3 mos · Bengaluru, Karnataka, India

  • SME for IBM WebSphere stack of products including WebSphere Application Server.
  • Responsible for maintenance and support of various Java-based middleware technologies such as WebSphere, JBoss, Tomcat, WebLogic, and ActiveMQ.
  • Working across multiple business verticals and clients, including BFS, FMCG, Hi-Tech, and Healthcare.
  • Project experience in installing and upgrading enterprise infrastructure, including WebSphere Application Server, IBM HTTP Server, Apache Web Server, and JBoss Application Server.
  • Handling routine tasks such as application troubleshooting, application deployments, and Apache configuration.
  • Responsible for setting up monitoring using IPsoft’s proprietary monitoring/automation tools.
  • Mentoring new engineers on technical and client processes.
Computer Science, Troubleshooting, Observability, Linux, Unix

Tata Consultancy Services

Systems Engineer

Nov 2008 - Jul 2011 · 2 yrs 8 mos

  • Performance Engineering & Enterprise Architecture Solutions
  • Passport Seva Project
  • Experience in setting up and managing the entire infrastructure based on the IBM stack, including WebSphere Application Server, IBM HTTP Server, IBM MQ, IBM DB2 and custom JVMs; running performance tests using Rational Performance Tester; and setting up monitoring with Nagios and custom scripts.
  • Responsible for setting up monitoring infrastructure for the entire architecture, including real-time performance data spread across multiple layers of components.
  • Responsible for the initial performance test of the whole application and the tuning it required. Tuning areas included application code, application server, and DB query tuning.
  • Basic DB2 administration
  • Responsible for creating a daily functional and performance reporting tool.
  • Experience in client audits of the application at the functional and performance levels.
Computer Science, Python (Programming Language), Troubleshooting, Linux, Unix

Education

The LNM Institute of Information Technology

B.Tech — Communication and Computers

Jan 2004 - Jan 2008
