Nikhil Kumar — SRE (Site Reliability Engineer)

As a Site Reliability Engineer, I specialise in building and maintaining highly scalable, resilient, and secure infrastructure. With a strong background in platform operations, DevOps, and cloud computing, I thrive in optimising system reliability, performance, and automation. With expertise in CDN networks, Kubernetes, Jenkins, Git, Ansible, Terraform, and containerization technologies like Docker, I work on enhancing infrastructure as code (IaC), continuous integration/continuous deployment (CI/CD), and cloud solutions across Microsoft Azure, Google Cloud Platform (GCP), and Linode. 🔹 Key Areas of Expertise: ✅ Site Reliability Engineering (SRE): Ensuring high availability, observability, and incident response for large-scale distributed systems. ✅ DevOps & Automation: Streamlining deployment pipelines, improving CI/CD workflows, and managing version control with Git and Jenkins. ✅ Cloud & Infrastructure as Code (IaC): Architecting, deploying, and managing infrastructure using Terraform, Ansible, and Kubernetes. ✅ CDN & Network Optimization: Enhancing content delivery, reducing latency, and ensuring seamless user experience. ✅ Linux System Administration: Strong foundation in system performance tuning, security hardening, and automation. 💡 Passionate about innovation, problem-solving, and driving operational excellence, I continuously explore new technologies and best practices to enhance system efficiency and reliability. Always open to collaborating on exciting projects and networking with like-minded professionals in the SRE, DevOps, and cloud community. 📩 Let’s connect and discuss how we can drive innovation and reliability in cloud & DevOps together!

Stackforce AI infers this person is a Site Reliability Engineer specializing in cloud infrastructure and DevOps automation.

Location: Bengaluru, Karnataka, India

Experience: 6 yrs 4 mos

Skills

Site Reliability Engineering
Cloud & Infrastructure As Code (iac)
Devops & Automation
Linux System Administration

Career Highlights

Expert in Site Reliability Engineering and Cloud Infrastructure.
Proficient in Kubernetes and CI/CD automation.
Strong background in network optimization and security.

Work Experience

Mobile Premier League (MPL)

Site Reliability Engineer 2 (11 mos)

Career Break

Health and well-being (11 mos)

Akamai Technologies

Site Reliability Engineer II (1 yr)

Platform Operations Engineer II (9 mos)

Platform Operations Engineer (1 yr 2 mos)

Randstad

Platform Operations Engineer Associate (1 yr 7 mos)

Education

Bachelor of Engineering - BE at Dayananda Sagar Institutions

Nikhil Kumar

SRE (Site Reliability Engineer)

Bengaluru, Karnataka, India6 yrs 4 mos experience

Key Highlights

Expert in Site Reliability Engineering and Cloud Infrastructure.
Proficient in Kubernetes and CI/CD automation.
Strong background in network optimization and security.

Stackforce AI infers this person is a Site Reliability Engineer specializing in cloud infrastructure and DevOps automation.

Contact

nikhilkumar07.1994@gmail.com LinkedIn

Skills

Core Skills

Site Reliability EngineeringCloud & Infrastructure As Code (iac)Devops & AutomationLinux System Administration

Other Skills

Agile MethodologyAkamaiAmazon Web Services (AWS)AutomationAzure DevOps ServerBorder Gateway Protocol (BGP)Bug TrackingCICI/CDCloud InfrastructureCloud PlatformsCloud-Native ApplicationsClouderaCollaborative Problem SolvingCommunication

About

Experience

6 yrs 4 mos

Total Experience

1 yr 9 mos

Average Tenure

11 mos

Current Experience

Mobile premier league (mpl)

Site Reliability Engineer 2

Jun 2025 – Present · 11 mos · On-site

Career break

Health and well-being

Jun 2024 – May 2025 · 11 mos

Akamai technologies

3 roles

Site Reliability Engineer II

Promoted

May 2023 – May 2024 · 1 yr · Bengaluru, Karnataka, India · On-site

Site Reliability Engineer II in the Security Engineering Team, managing customer-facing applications and deploying various components.
Part of the Enterprise Threat Protector Team with a strong understanding of SDLC and agile methodology.
Proficient in source code management with Git and CI/CD integration using Jenkins.
Hands-on knowledge of software containerization platforms like Docker.
Experienced in creating and managing Kubernetes clusters across environments, including deploying Kubernetes on Linode with Rancher.
Deployed Linode Kubernetes Engine clusters and static sites using Terraform.
Used Prometheus, Grafana, and Lens for monitoring Kubernetes clusters, ensuring efficient troubleshooting.
Served as POC for server migrations, overseeing physical-to-cloud transitions.
Knowledgeable in cloud platforms like Azure, AWS, and Linode.
Good in automating Os-level tasks using shell scripting.

Site Reliability EngineeringSDLCAgile MethodologyGitCI/CDDocker+6

Platform Operations Engineer II

Promoted

Jul 2022 – Apr 2023 · 9 mos · Bengaluru, Karnataka, India · On-site

Planning, deploying, and maintaining the Akamai EDGE Network.
Familiar with security technology, processes and concepts, security event management or security compliance.
Working and enhancing knowledge on machines, networking, and Linux, Routers, Switches.
Troubleshooting complex network and content delivery-related issues.
Resolving customer escalations and acting as a technical point of contact.
Providing live network broadcasting event support to ensure smooth streaming and content replication.
Working with various third parties (System Architects, Infrastructure Vendors, Customers, and Developers) to narrow down problems and achieve resolution.
Handling (system, application) and network-related alerts and complicated issues for deployed machines.

Network EngineeringSecurity TechnologyLinuxTroubleshootingCustomer EscalationsDevOps & Automation+1

Platform Operations Engineer

Apr 2021 – Jun 2022 · 1 yr 2 mos · Bengaluru, Karnataka, India · On-site

Dedicated Platform Operations Engineer who enjoys cultivating long-term partnerships with vendors and clients. Expertise in installing, configuring, and monitoring complex systems and infrastructures.
Implementation application and system monitoring with Grafana
Handling of (system, application) and network-related alerts for deployed machines
Worked on configuration management system (UMP)
Handling emails that are directed towards Akamai from different ISPs and customers.
Having experience in handling complex networks of around 4 lakh servers (Linux)
Diagnosing troubles identified by network, monitoring and working to resolve issues.
Escalating properly during the incident – acting as phone SME.
Ensuring SLAs are achieved and NOCC work quality expectations are met.
Handling multiple concurrent tasks with minimal supervision and low escalation level.

System MonitoringConfiguration ManagementLinuxNetwork ManagementDevOps & AutomationLinux System Administration

Randstad

Platform Operations Engineer Associate

Sep 2019 – Apr 2021 · 1 yr 7 mos · Bengaluru, Karnataka, India · On-site

==>>[Deputed to Akamai Technologies]
Hands on experience on the bug tracking, issue tracking and project management tool Jira
Linux System Administration
Unix Shell Scripting.
Experienced in handling incident management bridges and coordinate with different teams.
Worked on Install-failure troubleshooting of servers.
Experienced in implementing DevOps practices, automating CI/CD pipelines, cloud infrastructure, and monitoring systems for enhanced performance and reliability.

Bug TrackingIncident ManagementDevOps PracticesCloud InfrastructureDevOps & AutomationLinux System Administration