Nikhil Singh

DevOps Engineer

Dehradun, Uttarakhand, India · 5 yrs 6 mos experience
Highly Stable

Key Highlights

  • Expert in scaling monitoring infrastructure for high-growth environments.
  • Proven track record in Kubernetes governance and compliance.
  • Strong background in DevOps practices and infrastructure automation.
Stackforce AI infers this person is a DevOps Engineer specializing in cloud-native applications and infrastructure automation.

Skills

Core Skills

Monitoring Infrastructure · Observability · Kubernetes · DevOps · Infrastructure Automation · CI/CD

Other Skills

APIs · AWS · Algorithms · Ansible · Bash · C++ · Continuous Integration and Continuous Delivery (CI/CD) · Creative Problem Solving · Data Structures · Debugging · Elastic Stack (ELK) · GitOps · Grafana · Java · Jenkins

About

Accidentally fell into DevOps and stayed for the chaos. Key skills: AWS, Kubernetes, Terraform, Linux, scripting (Python and Bash), and observability (EFK, Grafana, Prometheus).

Experience

Total Experience: 5 yrs 6 mos
Average Tenure: 4 yrs 8 mos
Current Experience: 10 mos

Nielsen

Member of Technical Staff 3 (DevOps)

Aug 2025 - Present · 10 mos · Bengaluru, Karnataka, India · Hybrid

Rapido

Senior Product Engineer (DevOps)

Apr 2025 - Jul 2025 · 3 mos · Bengaluru, Karnataka, India · Hybrid

  • Scaling Monitoring Infrastructure: Led scaling of the Thanos/Prometheus/Grafana monitoring stack in a high-growth environment. Improved observability and alerting under increasing cardinality and ingestion, scaling from 60M to 140M active time series.
Prometheus · Grafana · Thanos · Monitoring Infrastructure · Observability

Bigbasket.com

3 roles

Senior DevOps Engineer

Promoted

Apr 2024 - Apr 2025 · 1 yr · Bengaluru, Karnataka, India

  • Kubernetes Cluster Upgrade: Led the upgrade and migration of Kubernetes clusters from version 1.24 to 1.30.
  • Enhanced Kubernetes Governance: Implemented validating and mutating webhooks using Kyverno to enforce configuration policies and compliance across Kubernetes clusters. Implemented Policy Reporter to generate compliance reports.
  • Event-Driven Autoscaling: Implemented KEDA to facilitate event-driven autoscaling for critical Kubernetes workloads.
  • Automated Alert Management: Streamlined management of Grafana alerts via Terraform for enhanced automation and scalability.
  • Node Group Migration: Planned and executed the migration of Kubernetes node groups from Spotinst to cast.ai.
Kubernetes · Kyverno · KEDA · Terraform · Grafana · Prometheus · +1

DevOps Engineer-2

Promoted

Apr 2022 - Mar 2024 · 1 yr 11 mos · Bengaluru, Karnataka, India

  • Self-Service Access Management: Led the design and implementation of a self-service access management system. Leveraged APIs from multiple tools, ensuring accountability through database records and JIRA tickets. Resulted in a 90% reduction in time and effort spent on access management tasks.
  • EKS Cluster Setup: Orchestrated the setup of an EKS cluster using Terraform, including Spotinst-managed node groups, RBAC for microservices, and deployment of various DevOps-managed Kubernetes services. Integrated the Terraform code with a centralized MySQL database for RBAC management.
  • Zero-Downtime Kubernetes Migration: Contributed to the plan and execution of migrating 100+ microservices from a self-managed Kubernetes cluster (v1.16) to EKS (v1.24).
  • Ingress Controller Setup & Migration: Evaluated various Kubernetes ingress controllers (e.g., NGINX, F5 NGINX, ALB, Traefik) and successfully migrated all the microservices from NodePort-based routing to the Traefik ingress controller.
  • Observability Setup and Optimization: Set up the Prometheus and Grafana monitoring stack for comprehensive observability of the Kubernetes environments. Optimized Prometheus resource utilization and performance, achieving a roughly 60% reduction in memory consumption.
  • Release Process Efficiency: Automated and streamlined release stages in collaboration with the release team, improving deployment efficiency. Also implemented nightly deployment of microservices.
  • Incident Management: Actively participated in on-call rotations, ensuring prompt resolution of production incidents to maintain service uptime.
AWS · Kubernetes · Terraform · Jenkins · Ansible · Bash · +2

DevOps Engineer-1

Jun 2020 - Mar 2022 · 1 yr 9 mos · Bengaluru, Karnataka, India

  • Microservices Infrastructure Setup: Facilitated infrastructure setup for new microservices, encompassing AWS, Kubernetes, and Jenkins configurations.
  • Production Release Management: Independently managed production releases for a period of 3 months and documented the complete process. Automated manual steps and provided knowledge transfer to an external vendor to handle the daily releases.
  • Jenkins Ownership and Upgrades: Led seamless upgrades of multiple Jenkins servers hosting 100+ Jenkins jobs.
  • Terraform & Ansible Code Refactoring: Refactored the Terraform and Ansible codebase of a self-managed Kubernetes cluster, adopting a module-based approach. Enhanced codebase modularity, reusability, and maintainability.
  • Credential Rotation: Executed database credential rotations across 80+ microservices and monolith servers, coordinating with application teams to ensure smooth transitions.
  • Data Refresh Automation: Developed Bash scripts to automate production data refreshes for lower environments, cutting execution time by 80%. Supported various data sources such as RDS, Aerospike, and Solr.
  • CI/CD Tool Evaluation: Collaborated with an external vendor to evaluate Spinnaker as a potential CI/CD solution.
  • Kubernetes Cost Tracking: Implemented team-wise infra cost tracking for Kubernetes resources.
AWS · Kubernetes · Jenkins · Terraform · Ansible · Bash · +2

Education

CHANDIGARH UNIVERSITY

Bachelor of Engineering - BE — Computer Science

Jan 2016 - Jan 2020

Kendriya Vidyalaya

High School
