Sravan Kumar

SRE (Site Reliability Engineer)

Hyderabad, Telangana, India9 yrs experience
Most Likely To Switch

Key Highlights

  • Over five years of experience in SRE.
  • Expertise in AWS and Azure cloud technologies.
  • Passionate about automation and DevOps principles.
Stackforce AI infers this person is a Site Reliability Engineer with expertise in cloud infrastructure and automation in SaaS environments.

Contact

Skills

Core Skills

Site Reliability EngineeringAmazon Web Services (aws)DevopsAutomationContinuous Integration And Continuous Delivery (ci/cd)

Other Skills

Service-Level Agreements (SLA)24x7 Production SupportDockerPython (Programming Language)LinuxSLIKubernetesInfrastructureTerraformMTTRJavaConcourse CIAnsibleGitlabMicrosoft Azure

About

“Failure is simply the opportunity to begin again, this time more intelligently.” – Henry Ford This quote is a reminder that failure is not the end of the world; it is an opportunity to learn and grow. As an SRE, I am constantly learning from my mistakes and finding new ways to improve our systems and processes. I'm a dedicated Site Reliability Engineer (SRE) with over five years of experience in ensuring the reliability, scalability, and performance of critical systems. I'm passionate about automation, scalability, DevOps, and cloud technologies. Throughout my career, I've had the privilege of working on a wide range of projects, from startups to large-scale enterprises. My expertise encompasses: Automation is not just a tool for me; it's a way of life. I am passionate about automating tasks and processes to reduce manual toil, enhance system scalability, and minimize downtime. This includes the automation of deployments, configuration management, and orchestration. I have a deep understanding of cloud technologies, including Amazon Web Services (AWS) and Azure, with a track record of designing and implementing scalable architectures. I firmly believe in the principles of DevOps, where collaboration and communication between development and operations teams are paramount. I've played a central role in bridging this gap, ensuring that code is seamlessly deployed and that infrastructure is code-driven, resulting in more efficient, reliable operations. In the ever-evolving tech landscape, I'm dedicated to staying at the forefront of technology and seeking opportunities for process improvement. I excel at tackling complex challenges and turning them into opportunities for improvement, with a particular focus on root-cause analysis. I enjoy fostering collaboration across cross-functional teams, and promoting a culture of shared responsibility for system reliability and performance in the AWS cloud environment. I'm always open to connecting with like-minded professionals, sharing experiences, and exploring new opportunities for collaboration, especially in the realm of SRE, Cloud, and DevOps. If you're interested in discussing SRE strategies, exploring potential projects, or simply connecting within the tech community, please don't hesitate to reach out. Write to me at Email: sravan.dsk6@gmail.com Thank you for visiting my profile, and I look forward to connecting with you.

Experience

9 yrs
Total Experience
1 yr 6 mos
Average Tenure
1 yr 11 mos
Current Experience

Apple

Site Reliability Engineer

Jun 2024Present · 1 yr 11 mos · Hyderabad, Telangana, India · Hybrid

Vmware

Site Reliability Engineer II

Mar 2022Jun 2024 · 2 yrs 3 mos · Bengaluru, Karnataka, India

  • Product: https://vmc.vmware.com/infrastructure/aws/overview
  • ★ Initiated collaboration with development teams to design and implement automated systems that significantly enhanced application reliability and availability.
  • ★ Developed and executed best practices for incident management, resulting in timely resolution of critical incidents.
  • ★ Continuously monitored and optimized systems to ensure peak performance and availability, proactively identifying and addressing issues before impacting end-users.
  • ★ Effectively managed and streamlined deployment pipelines, reducing release cycles, and enhancing overall system performance.
  • ★ Conducted comprehensive root cause analysis of issues and executed remediation plans to prevent recurrence.
  • ★ Introduced Chaos Engineering practices to proactively identify and mitigate potential failures in production environments.
Service-Level Agreements (SLA)24x7 Production SupportDockerPython (Programming Language)LinuxAmazon Web Services (AWS)+9

Loginradius

Sr. DevOps Engineer

Sep 2021Mar 2022 · 6 mos · Hyderabad, Telangana, India

  • ★ Spearheaded continuous improvements and introduced solutions to enhance the existing infrastructure, tools, and processes.
  • ★ Utilized Sumo Logic to configure alerts and dashboards for streamlined customer onboarding.
  • ★ Successfully orchestrated the migration of Azure function apps from version 1 to 3, implementing automation through GitLab pipelines.
  • ★ Introduced the Karpenter autoscaler to all production clusters, enabling event-based autoscaling.
  • ★ Optimized GitLab pipelines by implementing autoscaling using a Fargate custom executor, significantly improving parallel and concurrent builds.
  • ★ Implemented Docker image caching to reduce the number of calls to DockerHub from the CI/CD infrastructure, mitigating the impact of Docker Hub pull request limits.
  • ★ Elevated the observability and reliability of LoginRadius systems by managing the monitoring and alerting infrastructure using Sumo Logic and Datadog.
AnsibleGitlabMicrosoft AzureDockerShell ScriptingBash+11

Nokia

R & D Engineer(DevOps)

Mar 2021Aug 2021 · 5 mos · Bengaluru, Karnataka, India

  • Project: SDL & PGW CNF'S, https://www.nokia.com/networks/core-networks/subscriber-data-management/
  • ★ Implemented the deployment of SDL and PGW CNFs on Kubernetes via Helm charts.
  • ★ Expanded the AutoFWK and Jenkins master server by incorporating new racks/clouds.
  • ★ Utilized Jenkins for end-to-end automation in the CNF domain, encompassing job creation and configuration modifications as per requirements.
  • ★ Managed the entire CI process for PGW, including overseeing testing cycles, script development, and coordinating user acceptance testing.
AnsibleJenkinsGitlabDockerBashContinuous Integration and Continuous Delivery (CI/CD)+6

Fico

DevOps Engineer I

Feb 2019Mar 2021 · 2 yrs 1 mo · Bengaluru, Karnataka

  • Project: FAWb (SaaS-Product), https://www.fico.com/en/fico-platform
  • ★ Automated and improved the CI/CD process with Git, Jenkins, shell scripting, Docker, and Kubernetes for smoother application deployments.
  • ★ Enhanced security by configuring Docker containers with non-root user settings.
  • ★ Boosted security compliance for Docker images and open-source libraries using Black Duck and AquaSec with Jenkins.
  • ★ Led migration from Ubuntu-based images to Amazon Corretto-based images for optimized infrastructure and performance.
AnsibleJenkinsAWSMicrosoft AzureDockerPython (Programming Language)+10

Teamlease services limited

Customer Support Engineer

Aug 2015Jun 2017 · 1 yr 10 mos · Visakhapatnam, Andhra Pradesh, India

  • Worked for iQor:
  • ★ Provided exceptional customer support, resolving technical issues and inquiries promptly and effectively.
  • ★ Collaborated with cross-functional teams to ensure customer satisfaction and issue resolution.

Education

Jawaharlal Nehru Technological University, Kakinada

Bachelor of Technology - BTech — Electronics and Communications Engineering

Jan 2011Jan 2015

Stackforce found 100+ more professionals with Site Reliability Engineering & Amazon Web Services (aws)

Explore similar profiles based on matching skills and experience