S

Saikiran Reddy Cheruku

DevOps Engineer

Detroit, Michigan, United States9 yrs 10 mos experience
Most Likely To Switch

Key Highlights

  • Expert in Kubernetes and CI/CD pipeline optimization.
  • Proficient in multi-cloud environments including AWS and Azure.
  • Strong background in Site Reliability Engineering and incident management.
Stackforce AI infers this person is a DevOps and Site Reliability Engineer specializing in multi-cloud infrastructure and automation.

Contact

Skills

Core Skills

KubernetesCi/cdSite Reliability EngineeringAws

Other Skills

AWS Command Line Interface (CLI)Amazon EKSAmazon Simple Notification Service (SNS)Amazon Web Services (AWS)AnsibleAzure DevOpsBitbucketButbucket pipelineCentOSCloud Cost optimizationConfluenceContinuous Integration and Continuous Delivery (CI/CD)DiwoDockerElastic Stack (ELK)

About

Experienced Devops Engineer and a Site Reliability Engineer with a demonstrated history of working in the information technology and services industry. Skilled and had a real time working knowledge in Docker, Kubernetes, AWS, Ansible, Git , Terraform, Jira, Jenkins, Prometheus, Grafana, Kibana, Kafka, Datadog, Linux, Confluent and Zenoss.

Experience

9 yrs 10 mos
Total Experience
1 yr 4 mos
Average Tenure
3 yrs
Current Experience

United wholesale mortgage

Data Platform Engineer II

Jun 2023Present · 3 yrs · Pontiac, Michigan, United States · On-site

  • Configured and deployed Confluent for Kubernetes across diverse
  • environments, managing components such as Kafka, Connect, Schema
  • Registry, ksqlDB, and Control Center.
  • Authored Helm charts for seamless integration with Argo CD, optimizing
  • CI/CD pipelines for deploying Confluent for Kubernetes in multiple
  • environments.
  • Orchestrated Redis on Kubernetes to enhance caching capabilities,
  • ensuring optimal performance and resource utilization.
  • Implemented Prometheus and Grafana for comprehensive monitoring of
  • Kafka and Redis, integrating with PagerDuty to promptly notify on-call
  • engineers during incidents.
  • Configured connectors and schemas for Confluent for Kubernetes,
  • facilitating efficient message consumption and production.
  • Actively participated in the on-call rotation for Kafka and Redis production
  • issues, contributing to swift incident resolution.
  • Applied Golang for automation within Kubernetes, demonstrating
  • proficiency in programming languages for streamlined operational
  • processes.
PythonKubernetesRedisKafkaContinuous Integration and Continuous Delivery (CI/CD)CI/CD

Diwo

Lead DevOps Engineer

Mar 2022Feb 2023 · 11 mos · Michigan, United States · On-site

  • ▪Experienced in setting the application from the scratch on AWS cloud
  • and Azure cloud.
  • ▪Experienced in using the native Kubernetes, RKE, K3's, RKE
  • ▪Built the RKE cluster on Rancher and also imported the K3's cluster into
  • Rancher and implemented the prometheus and grafana for monitoring
  • ▪Built the complete CI/CD pipeline in bitbucket and integrated with the
  • nexus for storing artifacts, AWS ECR for storing docker images,
  • sonarqube for code scanning and Rancher for deploying the latest
  • images into the kubernetes cluster.
  • ▪Written the Terraform and Ansible scripts to build the kubernetes cluster
  • and to setup the istio gateway as servicemesh .
  • ▪Experienced in setting up the singlestore database and integrating it with
  • the kubernetes cluster.
  • ▪Experienced in implementing redis for caching as statefulsets in
  • kubernetes.
  • ▪Fixed all the vulnerabilites of the docker images which are scanned using
  • ECR , Jfrog and Prisma.
  • ▪Configured the standalone kafka cluster for logging and also configured ELK cluster.
  • ▪Optimised the infrastructure cost in AWS and Azure in a safely manner and able to decrease the cost by about 40% annually
Cloud Cost optimizationContinuous Integration and Continuous Delivery (CI/CD)KafkaElastic Stack (ELK)Promethean BoardGitOps+9

Vmware

Site Reliability Engineer

Dec 2019Nov 2021 · 1 yr 11 mos · Chennai, Tamil Nadu, India · Remote

  • ▪Automated the infrastructure and day to day operation tasks using Python , API's , AWS and GCP
  • modules, DevOps tools such as Terraform, Ansible and VMware
  • developed tools.
  • ▪ Containerising and Orchestrating the VM’s using docker and Kubernetes.
  • ▪ Building the complete infrastructure and maintaining the application on multiple clouds (AWS and GCP)
  • ▪ Involved in configuring the build pipelines in the Jenkins.
  • ▪ Written ansible files for the automation and provisioning infrastructure on cloud.
  • ▪ Following agile methodology for the Jira Sprint with the regular team meeting and retrospectives
  • ▪ Providing on call support for the development and production issues using pagerduty and also handling the incident management.
Incident ManagementPythonSite Reliability EngineeringKubernetesPagerDutyProject Management

Lifion by adp

Platform Engineer

Jul 2019Dec 2019 · 5 mos · Chennai, Tamil Nadu, India

  • ▪ Building the the Lifion applications on AWS
  • ▪ Containerised and Orchestrated the application as microservices using Docker and Kubernetes.
  • ▪ Built the CI CD pipeline using Jenkins and handled the deployment process from Development to Production.
  • ▪ Built the automations for datadog and Jenkins using shell and python scripting.
  • ▪ Involved in setting up the dashboards in Grafana and Prometheus for AWS cost usage monitoring.
  • Sprint planning using Jira.
  • ▪ Used Stash as the repository for storing the scripts and backups of the infrastructure configuration.
  • ▪ Configured Elastic Search and Kibana for logging in the kubernetes cluster.
  • ▪ Installed and configured datadog for the realtime kubernetes cluster monitoring and integrated it with the chat application for the notifications.
  • ▪ Setting up the tools and implementing new devops technologies for leveraging the infrastructure.
Amazon Web Services (AWS)KubernetesJenkinsAWS

F5 networks

Site Reliability Engineer

Jan 2019Jun 2019 · 5 mos · Hyderabad, Telangana, India

  • ▪ Provisioned the the F5 applications as microservices on AWS EKS cluster using Terraform.
  • ▪ Deployed the applications and services on Kubernetes using EKS under different namespaces using
  • helm.
  • ▪ Automated the monitoring and deployment scripts using Python and Terraform
  • ▪ Promoting the deployments from Development to Test and from Test to Production
  • ▪ Setting up the complete monitoring using Prometheus and Grafana using node exporters to export metrics.
  • ▪ Involved in setting up the alert rules using the alert managers in prometheus and integrating with the Microsoft teams to notify teams when application is down.
  • ▪ Used git as a repository for automation scripts , helm charts.
  • ▪ Setting up the job in Jenkins for automation builds
  • ▪ Setting up the Blackbox monitoring application in kubernetes for the Endpoint monitoring and sending metrics to Prometheus.
  • ▪ Adding the Prometheus metrics in Grafana using promql (Prometheus Query Language).
  • ▪ Created Snapshot of the RDS instances using Python script.
  • ▪ Created unified Grafana for monitoring the kubernetes pods in AWS different regions
  • ▪ Automated the AWS billing alerts using Helm
  • ▪ Created Helm packages for the applications in Kubernetes
  • ▪ Used Siebel for adding the new users who are signed up for F5 applications using the AWS marketplace.
  • ▪ Maintained the uptime of the Kubernetes cluster and confirming all the nodes and pods and up and running.
  • ▪ Documented all the process in the confluence.
  • ▪ Created the Sprint plans using Jira and organised the kanban board.
Amazon Web Services (AWS)GitHubKubernetesJenkinsTerraformAWS

Teradata

DevOps Engineer

Jan 2018Jan 2019 · 1 yr · Pune, Maharashtra, India

  • ▪ Built the infrastructure in AWS and integrated with Teradata database on cloud.
  • ▪ Used Ansible to create cloud formation templates which are used to provision the infrastructure on the AWS as per customer’s requirement.
  • ▪ Automated scripts for AWS CLI using Bash which are used for creating cloudwatch alarms for multiple instances at once.
  • ▪ Worked on AWS EC2 , VPC , S3, Cloudtrail, Cloudwatch, SNS, Cloudformation, AMI’s.
  • ▪ Setting up the life cycle policies for storage buckets of Teradata Backup.
  • ▪ Involved in taking Teradata backup’s of the customers using Teradata internal applications.
  • ▪ Monitoring the backup’s using the monitoring tool DSU and CIMIC.
  • ▪ Performed manual patchings on the servers for the upgradation of packages and services
  • ▪ Triggering the backup jobs and storing them in target groups using viewpoint.
  • ▪ Involved in setting up autoscaling launch configuration in AWS for Node Failure Recovery.
  • ▪ Restoring the data on customer’s requirement.
  • ▪ Performed the backup and restore setup in the viewpoint to take the backups and the restore of the
  • customer data.
  • ▪ Implementation of containerisation and orchestration in the newly building products .
  • ▪ Handled cases with the AWS support for the hardware level issues.
Amazon Web Services (AWS)TeradataAWS

Netenrich, inc.

2 roles

Associate Analyst

Apr 2017Dec 2017 · 8 mos · Hyderabad, Telangana, India

  • ▪ Used Git as source code repository.
  • ▪ Creating the service accounts in Google Cloud Platform and providing the permissions with IAM
  • ▪ Creating Google cloud storage bucket
  • ▪ Also provides permissions to the users in Active Directory
  • ▪ Creating Kubernetes pods and services in YML also monitors Kubernetes dashboard,
  • ▪ Experience in Application containerizations like Docker Swarm, Kubernetes.
  • ▪ Hands on experience in automating CI & CD pipeline using Jenkins tools.
  • ▪ Experience in using version controller tools like Git
  • ▪ Configured Git with Jenkins and schedule jobs using POLL SCM option.
  • ▪ Knowledge on Google Container Engine and creating pods and service in Kubernetes
  • ▪ Expertise in Google compute engine, creation of service accounts and providing permissions to the user with IAM both GCP & AWS platforms.
  • ▪ Hands on experience to setup, configure continuous build processes using Jenkins,
  • ▪ Perform Build activities usingJenkins tool.
Google cloudJenkins

Linux Administrator

Oct 2015Dec 2017 · 2 yrs 2 mos · Hyderabad, Telangana, India

  • Linux administrator with hands on experience as NOC engineer for Disney Consumer Products Interactive servers in google cloud using the monitoring tools such as zenoss and ticketing tool as manage engine
  • ▪Monitoring the servers in Zenoss tool
  • ▪Creating the ticket with Manage Engine tool and reach the L2 team
  • ▪Performs basic Linux tasks such as clearing the disk space and take the backup everyday using some bash scripts
  • ▪Creating the user and groups and performs user management
  • ▪Giving the permissions to the users in the company
  • ▪Monitor the network performance with different tools
  • ▪Monitoring the alerts using Zenoss monitoring tool which is working using SNMP.
  • ▪Working on the tickets queue and creating new tickets using manage engine (Tickets
  • creating tool)
  • ▪Automate routine tasks using Bash/Python scripting.
  • ▪Troubleshoot OS level issues like high system load, configuring log rotation,
  • configuring cron backup jobs, apache or tomcat issues.
  • ▪Prepare weekly and monthly service availability and statistic reports of games by
  • analyzing ticker data
  • ▪Monitor game tickers and analysis data and suggest recommendations for game
  • performance and scaling needs
  • ▪Communicate and help customers resolve technical issues and outages via SLACK
  • channels and escalate to on call support personnel appropriately
  • ▪Taking action on the tickets queue in a timely manner
  • ▪Working on auto patching using jenkins
  • ▪Prioritizing alerts and working on them using standard operation procedures
  • ▪Monitoring automated dashboards.
  • ▪Clearing disk space by log rotating or archiving the files.
  • ▪Assigning permissions to the users using active directory.
  • ▪Taking console of the servers using cloud stack console.
  • ▪Checking the replications of the database servers and taking the required action
  • ▪Providing temporary read only access to prod projects through wolverine
  • ▪Providing viewer and logs access to users for non- prod projects using console.
ZenossLinux

Education

Trine University

Master of Science - MS — Information Science/Studies

Feb 2022May 2023

JNTUH College of Engineering Hyderabad

Btech Mechanical — Mechanical Engineering

Jan 2011Jan 2015

gowtham model school

10th

Jan 2008Jan 2009

Stackforce found 100+ more professionals with Kubernetes & Ci/cd

Explore similar profiles based on matching skills and experience