Vimal Krishnamoorthy

SRE (Site Reliability Engineer)

Bengaluru, Karnataka, India · 13 yrs 4 mos experience
Highly Stable

Key Highlights

  • Over 13 years of experience in cloud engineering.
  • Expert in Infrastructure as Code and CI/CD practices.
  • Proven track record in cloud cost optimization.
Stackforce AI infers this person is a Cloud Infrastructure and DevOps expert specializing in scalable solutions.

Skills

Core Skills

Cloud Infrastructure, DevOps, Cloud Migration, System Reliability, Cloud Operations, Infrastructure Management, Incident Response, Cloud Support, System Administration

Other Skills

AWS CodeDeploy, Amazon Web Services (AWS), Apache, Artifactory, Azure, Azure DevOps, Azure DevOps Services, Azure Kubernetes Service (AKS), Bash, BIND, CI/CD, CentOS, Chef, Cloud Computing, DNS Administration

About

I’m a Cloud Engineering specialist with over 13 years of hands-on experience delivering robust, scalable, and secure cloud solutions across Azure and AWS platforms. My core strengths lie in building and managing infrastructure through Infrastructure as Code (IaC), streamlining deployments with CI/CD pipelines, and ensuring system reliability through strong monitoring and automation practices.

Throughout my career, I’ve worked closely with cross-functional teams to modernize cloud environments, reduce operational costs, and improve deployment speed. I've led efforts around Terraform automation, Kubernetes deployments, and production-grade incident management using tools like xMatters, PagerDuty, ServiceNow, OpsGenie, Splunk, and Jira Service Management.

Technical Skills & Tools:

  • ☁️ Cloud Platforms: Azure, AWS, GCP, OpenStack, OneOps
  • ⚙️ IaC & Configuration Management: Terraform, Helm Charts, ARM templates, Chef, Puppet
  • 🚀 CI/CD & GitOps: Azure DevOps, Jenkins, GitHub, Bitbucket, Git, SVN, FluxCD, GitOps, Hygieia
  • 📦 Containers & Orchestration: Docker, Kubernetes, Azure Kubernetes Service (AKS), Istio
  • 📈 Monitoring & Logging: Grafana, Prometheus.io, Sensu, Nagios, Splunk
  • 🔐 Security & Governance: SSL, Cloud cost optimization, RBAC, Cloud Custodian
  • 🧪 Scripting & Automation: Python, Bash
  • 🗂️ Documentation & ITSM: Confluence (documentation, runbooks), Jira (ticketing), ServiceNow (incidents/tasks)
  • 📊 Architecture & Diagrams: Lucidchart (cloud solution design & infrastructure diagrams)

Experience

PhonePe

Lead SRE

Jul 2025 - Present · 8 mos · Bengaluru, Karnataka, India · On-site

Walmart Global Tech

2 roles

Lead DevOps Engineer

Promoted

Nov 2020 - Jun 2025 · 4 yrs 7 mos

  • Cloud Infrastructure & DevOps Architect with expertise across Azure, Kubernetes, and automation.
  • Owned and managed Azure environments using Terraform with reusable, scalable modules.
  • Served as an on-call engineer supporting 24/7 production systems with rapid incident resolution.
  • Led incident response via xMatters and coordinated with Microsoft for RCA and mitigation.
  • Guided developers on secure, cost-effective, and high-performing cloud architecture.
  • Provisioned and upgraded AKS clusters with LTS and automated lifecycle management.
  • Developed Helm charts and implemented GitOps with FluxCD for deployment consistency.
  • Migrated AKS from service principals to managed identities with secret rotation policies.
  • Built secure CI/CD pipelines in Azure DevOps with integrated compliance checks.
  • Optimized AKS node pools by tuning SKUs to maintain performance and reduce costs.
  • Led migration of deprecated Kubernetes APIs and tuned resources to reduce evictions.
  • Enforced Azure governance with tagging, compliance monitoring, and auto-cleanup rules.
  • Drove cloud cost optimization through orphaned disk cleanup, pricing tier reviews, and usage policies.
  • Automated SQL DB scaling and Cosmos DB autoscale to eliminate overprovisioning.
  • Controlled Log Analytics costs with data ingestion caps and refined retention settings.
  • Decommissioned idle AKS clusters through phased cleanup and autoscaler activation.
  • Conducted annual DR drills with defined RTO/RPO and cross-team coordination.
  • Managed SSL cert lifecycle to ensure uninterrupted and secure Azure operations.
  • Hardened IAM by using Cloud Custodian to auto-remove vulnerable RBAC assignments.
  • Authored runbooks, led support onboarding, and centralized CI/CD for security and compliance.
Azure, Kubernetes, Terraform, Python, GCP, Prometheus.io, +16 more

Senior DevOps Engineer

Jan 2018 - Oct 2020 · 2 yrs 9 mos

Altisource

Senior DevOps Engineer

Nov 2016 - Jan 2018 · 1 yr 2 mos · Bengaluru, Karnataka, India

  • Migrated all on-premise systems to AWS as part of the core DevOps transformation team.
  • Worked on AWS disaster recovery architecture for business continuity.
  • Deployed and managed Chef frontend/backend servers with PostgreSQL, RabbitMQ, and Elasticsearch.
  • Migrated Chef infrastructure to secure VPCs to meet compliance and security standards.
  • Upgraded GitLab and Chef clusters; migrated 300+ repositories from SVN to GitLab.
  • Built CI/CD pipelines using Jenkins, Git/SVN, Artifactory, Chef, and AWS CodeDeploy.
  • Developed real-time AWS infrastructure inventory tool using Lambda and Python.
  • Implemented Hygieia dashboard for monitoring Jenkins builds and Git commits.
  • Automated LDAP password expiry alerts using Python scripting.
  • Created custom Sensu checks and plugins using shell scripting for proactive monitoring.
Amazon Web Services (AWS), Kubernetes, Sensu, Chef, Hygieia, SVN, +4 more

Yahoo

Service Reliability Engineer

Jul 2015 - Nov 2016 · 1 yr 4 mos · Greater Bengaluru Area

  • Worked as a System SRE, maintaining DNS clusters, cloud platforms, monitoring tools, and storage systems.
  • Provided 24/5 on-call support, engaging quickly to identify and resolve critical production issues.
  • Managed Anycast/Unicast BIND DNS across enterprise-grade production and corporate networks.
  • Administered OpenStack private cloud services including Nova, Keystone, Glance, and Horizon.
  • Owned internal monitoring tools, alerting systems, and operational dashboards for system health.
OpenStack, DNS Administration, Monitoring, System Reliability, Cloud Operations

InMobi

Engineer - Production & Infrastructure Engineering

Feb 2014 - Jul 2015 · 1 yr 5 mos · Bengaluru, Karnataka, India

  • Monitored and maintained a large global server fleet across five data centers.
  • Served in a 24/7 incident response team, handling infrastructure alerts and escalations.
  • Performed OS installation and bootstrapping for bare-metal servers and KVM virtual machines.
  • Managed Nagios monitoring systems, tuned alerts, and triaged incidents effectively.
  • Supported Dell/HP hardware, RAID setups, LVM, and maintained detailed incident runbooks.
Data Center Management, System Monitoring, System Administration, Amazon Web Services (AWS), Nagios, Infrastructure Management, +1 more

Velan Info Services

System Engineer

Sep 2012 - Feb 2014 · 1 yr 5 mos · Greater Coimbatore Area

  • Provided remote cloud and data center support for global clients across various time zones.
  • Migrated client applications from on-premise infrastructure to AWS cloud platforms.
  • Supported cloud and virtualization technologies including AWS, Rackspace, Linode, OpenVZ, KVM, Proxmox, and SolusVM.
  • Set up and managed PowerMTA for high-volume bulk mailing operations.
  • Administered Zimbra mail servers and managed hosting environments via cPanel, Plesk, Kloxo, ZPanel, and Virtualmin.
System Administration, Web Hosting, Cloud Computing, Amazon Web Services (AWS), Rackspace, Linode, +5 more

Education

Anna University Chennai

Bachelor of Engineering (BE) — Electronics and Communications Engineering

Jan 2007 - Jan 2010
