Nikhil Kumar

SRE (Site Reliability Engineer)

Bengaluru, Karnataka, India · 6 yrs 2 mos experience

Key Highlights

  • Expert in Site Reliability Engineering and Cloud Infrastructure.
  • Proficient in Kubernetes and CI/CD automation.
  • Strong background in network optimization and security.
Stackforce AI infers this person is a Site Reliability Engineer specializing in cloud infrastructure and DevOps automation.

Skills

Core Skills

Site Reliability Engineering, Cloud & Infrastructure as Code (IaC), DevOps & Automation, Linux System Administration

Other Skills

Agile Methodology, Akamai, Amazon Web Services (AWS), Automation, Azure DevOps Server, Border Gateway Protocol (BGP), Bug Tracking, CI, CI/CD, Cloud Infrastructure, Cloud Platforms, Cloud-Native Applications, Cloudera, Collaborative Problem Solving, Communication

About

As a Site Reliability Engineer, I specialise in building and maintaining highly scalable, resilient, and secure infrastructure. With a strong background in platform operations, DevOps, and cloud computing, I thrive in optimising system reliability, performance, and automation.

With expertise in CDN networks, Kubernetes, Jenkins, Git, Ansible, Terraform, and containerization technologies like Docker, I work on enhancing infrastructure as code (IaC), continuous integration/continuous deployment (CI/CD), and cloud solutions across Microsoft Azure, Google Cloud Platform (GCP), and Linode.

🔹 Key Areas of Expertise:

✅ Site Reliability Engineering (SRE): Ensuring high availability, observability, and incident response for large-scale distributed systems.
✅ DevOps & Automation: Streamlining deployment pipelines, improving CI/CD workflows, and managing version control with Git and Jenkins.
✅ Cloud & Infrastructure as Code (IaC): Architecting, deploying, and managing infrastructure using Terraform, Ansible, and Kubernetes.
✅ CDN & Network Optimization: Enhancing content delivery, reducing latency, and ensuring seamless user experience.
✅ Linux System Administration: Strong foundation in system performance tuning, security hardening, and automation.

💡 Passionate about innovation, problem-solving, and driving operational excellence, I continuously explore new technologies and best practices to enhance system efficiency and reliability. Always open to collaborating on exciting projects and networking with like-minded professionals in the SRE, DevOps, and cloud community.

📩 Let’s connect and discuss how we can drive innovation and reliability in cloud & DevOps together!

Experience

Mobile Premier League (MPL)

Site Reliability Engineer 2

Jun 2025 – Present · 9 mos · On-site

Career break

Health and well-being

Jun 2024 – May 2025 · 11 mos

Akamai Technologies

3 roles

Site Reliability Engineer II

Promoted

May 2023 – May 2024 · 1 yr · Bengaluru, Karnataka, India · On-site

  • Site Reliability Engineer II in the Security Engineering Team, managing customer-facing applications and deploying various components.
  • Part of the Enterprise Threat Protector Team with a strong understanding of SDLC and agile methodology.
  • Proficient in source code management with Git and CI/CD integration using Jenkins.
  • Hands-on knowledge of software containerization platforms like Docker.
  • Experienced in creating and managing Kubernetes clusters across environments, including deploying Kubernetes on Linode with Rancher.
  • Deployed Linode Kubernetes Engine clusters and static sites using Terraform.
  • Used Prometheus, Grafana, and Lens for monitoring Kubernetes clusters, ensuring efficient troubleshooting.
  • Served as POC for server migrations, overseeing physical-to-cloud transitions.
  • Knowledgeable in cloud platforms like Azure, AWS, and Linode.
  • Skilled at automating OS-level tasks using shell scripting.
Site Reliability Engineering, SDLC, Agile Methodology, Git, CI/CD, Docker, +6 more

Platform Operations Engineer II

Promoted

Jul 2022 – Apr 2023 · 9 mos · Bengaluru, Karnataka, India · On-site

  • Planning, deploying, and maintaining the Akamai EDGE Network.
  • Familiar with security technology, processes, and concepts, including security event management and security compliance.
  • Working with and deepening knowledge of machines, networking, Linux, routers, and switches.
  • Troubleshooting complex network and content delivery-related issues.
  • Resolving customer escalations and acting as a technical point of contact.
  • Providing support for live network broadcast events to ensure smooth streaming and content replication.
  • Working with various third parties (System Architects, Infrastructure Vendors, Customers, and Developers) to narrow down problems and achieve resolution.
  • Handling system, application, and network-related alerts and complex issues for deployed machines.
Network Engineering, Security Technology, Linux, Troubleshooting, Customer Escalations, DevOps & Automation, +1 more

Platform Operations Engineer

Apr 2021 – Jun 2022 · 1 yr 2 mos · Bengaluru, Karnataka, India · On-site

  • Dedicated Platform Operations Engineer who enjoys cultivating long-term partnerships with vendors and clients. Expertise in installing, configuring, and monitoring complex systems and infrastructures.
  • Implementing application and system monitoring with Grafana.
  • Handling system, application, and network-related alerts for deployed machines.
  • Worked on the configuration management system (UMP).
  • Handling emails directed to Akamai from various ISPs and customers.
  • Experienced in handling complex networks of around 4 lakh (400,000) Linux servers.
  • Diagnosing problems identified by network monitoring and working to resolve them.
  • Escalating appropriately during incidents and acting as phone SME.
  • Ensuring SLAs are achieved and NOCC work quality expectations are met.
  • Handling multiple concurrent tasks with minimal supervision and low escalation level.
System Monitoring, Configuration Management, Linux, Network Management, DevOps & Automation, Linux System Administration

Randstad

Platform Operations Engineer Associate

Sep 2019 – Apr 2021 · 1 yr 7 mos · Bengaluru, Karnataka, India · On-site

  • Deputed to Akamai Technologies.
  • Hands-on experience with Jira for bug tracking, issue tracking, and project management.
  • Linux system administration.
  • Unix shell scripting.
  • Experienced in handling incident management bridges and coordinating with different teams.
  • Worked on troubleshooting server install failures.
  • Experienced in implementing DevOps practices, automating CI/CD pipelines, cloud infrastructure, and monitoring systems for enhanced performance and reliability.
Bug Tracking, Incident Management, DevOps Practices, Cloud Infrastructure, DevOps & Automation, Linux System Administration

Education

Dayananda Sagar Institutions

Bachelor of Engineering (BE), Electronics and Communications Engineering

Jan 2013 – Jan 2017
