Sudheer Atchuta

SRE (Site Reliability Engineer)

Dallas, Texas, United States8 yrs 9 mos experience
Highly StableAI Enabled

Key Highlights

  • Expert in AI-powered DevOps automation.
  • Proven track record in cloud cost optimization.
  • Skilled in transforming traditional DevOps practices.
Stackforce AI infers this person is a DevOps and AI automation expert in the SaaS industry.

Contact

Skills

Core Skills

DevopsAi-powered SolutionsObservabilityCloud OptimizationInfrastructure ManagementData EngineeringCloud MigrationTelecommunicationsSoftware DevelopmentNetworking

Other Skills

KubernetesGitHubAIGenAIIncident ManagementNatural Language QueryOpenObserveSplunkPrometheusSQL TuningCost MonitoringAnsibleTerraformSQLAzure DevOps

About

As a seasoned DevOps and Platform Engineer, I specialize in combining cloud-native engineering with the power of AI and GenAI automation to drive performance, cost efficiency, and reliability at scale. My current focus is on building and scaling AI-powered solutions across infrastructure, observability, database, incident management, and developer productivity domains. I have hands-on expertise in automating large-scale cloud operations across hybrid clouds (GCP, Azure, and internal cloud platforms), Kubernetes platforms, and legacy infrastructure. From streamlining postmortem creation to detecting idle cloud resources and optimizing build pipelines with AI, I help fast-moving teams stay resilient and cost-effective. I'm passionate about transforming traditional DevOps into Autonomous DevOps, leveraging GenAI to cut through noise, reduce toil, and make systems smarter.

Experience

8 yrs 9 mos
Total Experience
3 yrs 5 mos
Average Tenure
1 yr 10 mos
Current Experience

Walmart global tech

Senior Site Reliability Engineer / Senior Software Engineer

Jul 2024Present · 1 yr 10 mos · Bentonville, Arkansas, United States · Hybrid

  • 🔹Infrastructure Automation at Scale
  • Managing hybrid cloud platforms (public + native) using Kubernetes, Helm, OneOps, GitHub-driven deployment pipelines.
  • Oversaw compute migrations, cert renewal automation, and cluster modernization.
  • 🔹 AI-Powered Observability & RCA
  • Implemented GenAI solutions for root cause analysis across application, infra, and database layers.
  • Automatically generate postmortem drafts and incident timelines using logs, metrics, and alert patterns.
  • Natural Language Query (NLQ) interface for querying logs and dashboards using OpenObserve, Splunk, and Prometheus.
  • 🔹 Cloud & DB Cost Optimization with AI
  • Built AI systems to detect idle/zombie resources, recommend right-sizing, and monitor cost anomalies.
  • Enabled automated SQL tuning and Liquibase policy violation detection/fixing using LLMs.
  • 🔹 Build & Deployment Intelligence
  • Analyzed build logs and deployment changes using AI to optimize CI/CD performance.
  • Developed bots to identify flaky tests, unused namespaces, and redundant cluster deployments.
  • Developer Productivity with Knowledge Bots
  • Created internal copilots to answer questions on cert renewals, flows, and Jira-based solutions.
  • AI bots analyze historical tickets and logs to auto-suggest resolutions and generate BDD test cases.
  • GenAI/ML Technologies Used:
  • LLMs (Mistral, LLaMA, OpenDevin, Phi-3), LangChain, LlamaIndex, Qdrant
  • OpenObserve, Prometheus, Splunk, Grafana
  • FastAPI, Slack Bots, GitHub Actions, Argo Workflows
  • Custom RAG pipelines, embedding stores, and prompt evaluation guards
KubernetesGitHubAIGenAICloud OptimizationIncident Management+2

Ericsson

ICT DevOps Engineer

Mar 2024Jun 2024 · 3 mos · Dallas, Texas, United States

  • Created and maintained documentation for data pipelines, workflows, and lineage.
  • Creating and maintaining infrastructure using configuration management tools like Ansible and Terraforms.
  • Highly proficient with SQL queries and querying the data according to requirements producing business reports.
  • Proven track record in optimizing the execution time from stored procedures using indexes.
  • Create and maintain continuous delivery pipelines for setting up and running test and production environments for customers using Azure DevOps and create and maintain CI/CD workflow to track and resolve issues with products.
  • Creating and maintaining the Load Balancers using the Azure LB and monitoring the traffic flow in child nodes
  • Creating Scripted and Declarative Pipelines in Jenkins and Azure DevOps to build deployments without human intervention in pre–prod and Production environments.
  • Integrating Sonar Cube into the existing CI/CD pipelines to eradicate compliance & security issues in the code.
  • Actively review and explore opportunities to improve, optimize, and enhance working methods. Provide status communications to project and business teams.
  • Demonstrated ability to present ideas clearly and concisely and prepare presentations for senior-level client stakeholders.
AnsibleTerraformSQLAzure DevOpsCI/CDDevOps+1

Hcltech

2 roles

Senior DevOps Engineer

Jan 2023Feb 2024 · 1 yr 1 mo · On-site

  • Worked as a build and release engineer, deployed the services by (AWS DevOps) pipeline. Created and Maintained pipelines to manage the IAC for all the applications.
  • Created highly scalable and fault-tolerant multi-tier AWS environments spanning across multiple availability zones using Cloud Formation templates.
  • Developed Ansible Playbooks to manage Web applications, environment configuration files, Users, Mount points, and Packages. Customized Ansible modules for finding facts about AWS Monitor alarms and taking actions to manage those alarms during deployments.
  • Worked on setting up Docker to automate container deployment and worked on Docker container to create Docker images for different environments.
  • Involved in Configuration Automation and Centralized Management with Ansible and Implemented Ansible to manage all existing servers and automate the build/configuration of new servers.
  • Implemented Jenkins pipelines into AWS pipelines to drive all microservices builds out to the Docker registry and then deployed to Kubernetes, Created Pods, and managed using Amazon EKS.
  • Used Terraform to set up infrastructure in PCF and Azure Environments. Converted existing Terraform modules that had version conflicts to utilize cloud formation during Terraform deployments to enable more control or missing capabilities.
  • Installed Prometheus and Grafana using Helm to monitor the application performance in the Kubernetes cluster. Created clusters for Microservice applications deployed in different availability zones for high availability.
  • Used Bitbucket to host and manage the source code in private Repositories and configured Jenkins for integrating these Repositories into the CI/CD process. Created Jenkins jobs and distributed load on the Jenkins server by configuring Jenkins nodes, which will enable parallel builds.
AWSDockerAnsibleTerraformKubernetesDevOps+1

Senior Consultant

Apr 2021Jan 2023 · 1 yr 9 mos · On-site

  • Handling E2E migration Finance Applications into SaaS products
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc. ( Currently using Automic)
  • CI/CD to maintain the Git Code ( Currently using a looper, concord)
  • Design, build and launch efficient & reliable data pipelines to move and transform data (both large and small amounts).
  • Design and develop new systems in partnership with software engineers to enable quick and easy data consumption.
  • Optimize existing pipelines and maintain all domain-related data pipelines.
  • Ownership of the end-to-end data engineering component of the solution.
  • Support on-call shift as needed to support the team.
  • Experience with SQL performance tuning and e2e process optimization.
  • Experience working with cloud (Azure) and on-prem
  • Attended the SOX and GRC compliance audit as a Deloitte external auditor and EY.
AWSCI/CDSQLAzureAnsibleDevOps+1

Tata consultancy services

4 roles

DevOps Engineer

Promoted

Jan 2021Apr 2021 · 3 mos

  • Configured AWS application deployment infrastructure using sources, i.e., Virtual Private Cloud (VPC), EC2, Elastic Bean Store (EBS), Identity Access Management (IAM), S3, Dynamo DB, Mongo DB, Route53, Simple Notification Service (SNS), Simple Email Service (SES), Simply Queue Service (SQS), CloudWatch, CloudTrail, Security Group, Auto Scaling Group (ASG), and RDS using CloudFormation, Terraform templates.
  • Created required reliable architectures and end-to-end migration plan for migrating Linux/Windows servers along with web applications into AWS cloud platform using services as IPSec tunnel, VPN gateway, Customer Gateway, and Data Pipeline.
  • Deployed and configured Azure virtual machines to ensure the best possible performance and dependability for key energy infrastructure applications.
  • Developed and maintained automation scripts using tools like PowerShell or Azure CLI to streamline routine tasks and enhance operational efficiency.
  • Implemented virtual networks, such as subnets, route tables and security groups to enable secure and efficient communication across infrastructure components.
  • Set up Jenkins’s server and built jobs to provide continuous automated builds based on polling the GIT SCM during the day and periodically scheduled the builds overnight to support development.
  • Managed Azure Active Directory and AVD related systems.
  • Used Shell, Bash, and Python scripts to supplement automation provided by Ansible and Terraform for encryption.
  • Implemented data retention policies and ensure compliance with regulatory requirements for data storage and management.
  • Responded to security incidents and conducted regular security assessments.
  • Collaborated with cross-functional teams to understand business requirements and align technology solutions with the strategic goals of the energy infrastructure company.
  • Communicated effectively with stakeholders to provide updates on Azure infrastructure projects and initiatives.
AWSTerraformCloudFormationJenkinsPowerShellDevOps+1

Information Technology Analyst

Promoted

Oct 2018Dec 2020 · 2 yrs 2 mos

  • Dealt with multiple vendors for the client and collaborated with teams stationed all over the globe to lead the CPOR.
  • Evaluated and adopted upcoming technologies such as Power BI, and RPA to address changing industry needs and reduce manual efforts by 40%
  • Developed a few API solutions to assist various telecommunication customers of Ericsson, namely AT&T Verizon, MTN, NBN, and Airtel to raise & process their sales orders and billing documents.
Azure Data LakeTelecommunicationsProblem SolvingData Engineering

Assistant Systems Engineer

Oct 2016Sep 2018 · 1 yr 11 mos

  • The Order Office is an internal application developed on .Net by IBM in 1998. Its purpose is to raise orders against FAS orders and tag existing and incoming Po. However, the application faced significant performance issues, being slow and often failing to complete transactions.
  • To address these challenges, the Cloud and Innovation team proposed integrating this functionality into the existing CPOR application. This solution enables end users to raise orders using the CPOR tool with the assistance of FAS orders. I spearheaded the migration of the entire solution to CPOR, ensuring a seamless transition and improved functionality for users.
  • Undertaking sales origination, including working with client teams to identify opportunities. Leverage and improve various frameworks, methodologies, and solutions specific to our offerings.
  • Applied automation techniques (SQL) for reporting and system monitoring, thereby bolstering the accuracy and improving the efficiency by 30%
  • Resolved malfunctions across systems and programs for a 12k+ customer base through troubleshooting, code enhancements, and bug fixes.
Azure Data LakeTelecommunicationsProblem Solving

Internship

Jan 2016Jun 2016 · 5 mos · Greater Hyderabad Area

  • Developer in Cloud Platform and Support Team
  • Worked on R&D and Innovation Projects with Ericsson Client and found the pain areas in the Sales & Marketing portfolio to address that issue I have spent my Intern time to find the solution and proposed to the client on Purchase Order Life Cycle management.
Azure Data LakeTelecommunicationsProblem Solving

Freelance (self employed)

Project Coordinator

Jun 2014Jan 2016 · 1 yr 7 mos · Hyderabad, Telangana, India · Hybrid

  • Built and Deployed Java/J2EE to a web application server in an Agile, continuous integration environment and automated the complete process and responsible for the Build and Release management.
  • Configured AWS Config for setting up CloudTrail and compliance check on AWS Resources like S3.
  • Experienced in creating Security groups, VPC with customized Subnets, Internet Gateways, and Routing tables for Stack setup as well as VPN Tunnelling in AWS cloud environment.
  • Managed to set up EC2 instances with Nginx, Tomcat servers, and installed Docker in the AWS cloud.
  • Configured Minions, Pods along with Docker engine in AWS EC2 instances.
  • Created customized AMI’s and installed EC2 stack using CloudFormation and Terraform templates.
  • Implementing and maintaining a Continuous Delivery process using GitHub(hooks), Build tools like maven, Jenkins and management tools like Chef, Puppet.
  • Implemented Maven builds to automate .JAR and .WAR files and written builds using XML formatted files.
  • Written build and deployment scripts using MAVEN and ANT as build tools in Jenkins to move from one environment to other environments written in XML formats.
  • Also responsible for creating Docker containers using docker images to test the application and created custom images using Docker Files with different servers and differs Operating Systems and maintained Docker Containers to package the application into a standardized unit for Software Development.
  • Written SQL Queries for generating different reports and for data mining.
  • Learned and worked with Ansible to manage the containers and the environments around the containers using the YAML files and experienced in deployment automation using multiple tools like Chef, Puppet, Jenkins, GIT, TFS, SonarQube, Maven and ANT.
JavaAWSDockerCI/CDDevOpsSoftware Development

Cms computers limited (india)

Network Engineer

May 2013Apr 2014 · 11 mos · Hyderabad, Telangana, India · On-site

  • Managing user accounts and providing security-level permissions in Active Directory.
  • Migration of On-premises AD server to AWS
  • Maintaining the AWS EC2 backups and server maintenance activities
  • VPC creation with basic infrastructure for the AD server management.
  • EC2 volume attachments or upgrading the existing volumes.
  • Creating and removing the mail IDs in the mail server.
  • Configuring the IP Phone lines.
  • Patch Management using WSUS services and Group Policies.
  • Installing the Client and Server operating systems as per the management requirement.
  • Maintains weekly backups manually.
  • Arranging the systems and troubleshooting the system issues related to hardware and software.
  • Configuring the mail IDs in mail clients (Outlook Express and Outlook 2007) and troubleshooting the issues related to mail clients.
  • Updating the Group Policies in AD respective to the different departments related to security, customization, and installing the applications in the client systems.
  • Configuring the TCP/IP settings and troubleshooting the network issues.
  • Installing the Printers and troubleshooting the issues with the printers in the network.
  • Troubleshooting the LAN and WLAN connections.
CCNACisco NetworkingNetwork ServicesNetworkingInfrastructure Management

Education

Carnegie Mellon University

Master's Degree — Master of Science in Information Technology

Jan 2014Jan 2016

Rajamahendri Institute of Engineering & Technology, Rajahmundry

Bachelor's Degree — Information Technology

Jan 2009Jan 2013

St.Anns, Rajahmundry

Tenth — High School/Secondary Diplomas and Certificates

Jan 2006Jan 2007

Stackforce found 100+ more professionals with Devops & Ai-powered Solutions

Explore similar profiles based on matching skills and experience