Ajay Agrawal

Software Engineer

Bengaluru, Karnataka, India6 yrs 8 mos experience
Most Likely To Switch

Key Highlights

  • Expert in AWS services and cloud infrastructure.
  • Proficient in Python and Linux server automation.
  • Recognized by Linux Foundation and IBM for contributions.
Stackforce AI infers this person is a Cloud Operations Engineer with expertise in AWS and Linux infrastructure automation.

Contact

Skills

Core Skills

PythonLinuxMachine LearningWindows Server

Other Skills

FastAPICeleryRedisMongoDBJiraServiceNowPagerDutyDockerSplunkDynatracePrism ProMonitoringBashEC2Shell

About

Passionate, innovative, and dedicated to constant development and deep exploration, I am a Cloud Operations Engineer specializing in driving AWS services to new heights. Join me on an exciting journey where each day presents unique opportunities for growth and meaningful contributions. Experience: - Currently working at a distinguished Product Based Multi-National Company, driving AWS services to new heights. - Leading server automation projects, developing Linux software/utilities, scripting with Bash, and fine-tuning server performance. - Creating automation tools and bots, integrating APIs like Jira and PagerDuty. Expertise in Linux Administration: - Skilled in managing Linux infrastructure, orchestrating the intricate dance of LVM, and administering various applications. - Continuous quest for efficiency ensures smooth system operations and optimal performance. Professional Achievements: - Received authorized badges from the Linux Foundation, IBM, and Red Hat for "Linux & Private Cloud Administration on IBM Power Systems" and "Open Source Software Development, Linux, and Git." - Demonstrates commitment to growth, continuous learning, and support from mentors and peers. Windows Server Proficiency: - Developed PowerShell scripts for server automation projects. - Skills include automated service restarts, log management and rotation, and customized application/utility configuration. Python Expertise: - Proficient in Python, utilizing its versatility to create intelligent bots and automation scripts. - Strives to streamline processes and enhance operational efficiency. Unwavering passion for technology combined with a humble mindset drives me forward. Thrives on creating, innovating, and collaborating with others. Commitment to lifelong learning and an open mind paves the way for exploring vast possibilities. Thank you for gaining insights into my professional journey. Connect with me to learn more or explore potential collaborations. Together, let's make a positive impact. Cheers!

Experience

6 yrs 8 mos
Total Experience
2 yrs 2 mos
Average Tenure
2 yrs 3 mos
Current Experience

Walmart global tech india

2 roles

Software Engineer III

Promoted

May 2025Present · 11 mos · Bengaluru, Karnataka, India · On-site

  • Designing and implementing an automated backend framework using Python, FastAPI, Celery, Redis, and MongoDB that is detecting and resolving hardware defects across Walmart’s private datacenters, improving hardware uptime and reducing manual incident response.
  • Architecting and deploying on-prem agent monitoring systems on Linux servers that are continuously tracking hardware health and service status, enabling real-time fault detection and accelerating remediation workflows.
  • Developing a machine learning–powered recommendation engine analyzing long-term usage metrics on Nutanix AHV and VMware ESXi platforms, generating actionable optimization reports integrated with Jira, ServiceNow, and PagerDuty to drive resource efficiency and cost savings.
  • Building scalable microservices and automation pipelines using Docker, Langflow, and Google ADK that are enhancing datacenter operational resilience and reducing system downtime.
  • Leveraging advanced observability and monitoring tools such as Splunk, Dynatrace, and Prism Pro to proactively manage infrastructure performance and detect anomalies.
  • Collaborating with cross-functional software and infrastructure teams to automate operational workflows, improve system reliability, and support seamless integration with enterprise IT tools.
PythonFastAPICeleryRedisMongoDBLinux+7

Systems and Infrastructure Engineer III

Jan 2024May 2025 · 1 yr 4 mos · Bengaluru, Karnataka, India · On-site

  • Architecting and implementing scalable microservices-based solutions on cloud platforms, directly impacting millions of customers by ensuring high system reliability and availability.
  • Managing a vast infrastructure of over 20,000 Linux servers, continuously optimizing performance to maintain high uptime and operational efficiency.
  • Developing and automating operational workflows using Python and Bash, including creating custom tools for EC2 instances, hardware remediation, and Slack bots to streamline processes and reduce manual effort.Leading Proof of Concept (POC) and Proof of Technology (POT) evaluations to validate and deploy innovative technologies, collaborating with DevOps and SRE teams to implement monitoring, self-healing, and disaster recovery mechanisms.
  • Overseeing the provisioning and migration of virtual machines across hybrid cloud environments, ensuring seamless operations and minimal disruption during transitions.
  • Conducting mass security patching across critical systems with a focus on minimizing impact on live environments, demonstrating strong incident management and resolution skills.
  • Streamlining Jira ticket management and enhancing documentation on Confluence and ServiceNow, contributing to the continuous improvement of operational workflows and knowledge sharing.
  • Spearheading automation initiatives, including the integration of monitoring tools and CI/CD pipelines, to accelerate software delivery and enhance system resilience.
  • Coordinating with vendors like ParkPlace and Thrive for hardware dispatches and integrating tools to improve system performance and visibility.
PythonBashLinuxEC2JiraServiceNow

Acquia

Cloud Operations Engineer

Mar 2022Jan 2024 · 1 yr 10 mos · Pune, Maharashtra, India · Remote

  • Leveraged expertise in Python, Shell, and PowerShell scripting for impactful server automation and performance tuning.
  • Specialized in developing scalable solutions and custom tools to optimize server operations.
  • Led Python bot development, integrating Jira and PagerDuty APIs for efficient workflows.
  • Demonstrated proficiency in scripting Bash and Python for Amazon EC2 instances.
  • Managed a core Drupal infrastructure of 20k+ Linux servers, ensuring high uptime.
  • Implemented disaster recovery strategies with a focus on minimal downtime and zero data loss.
  • Proactively solved problems, analyzed issues, and proposed effective solutions.
  • Aligned technical activities with ITIL processes, SLAs, and change management best practices.
  • Streamlined Jira tickets per Scrum requirements, actively managing impediments.
  • Contributed to documentation and improvement of knowledge base articles.
PythonShellPowerShellBashLinuxJira+1

Capgemini

3 roles

Associate Consultant

Promoted

Aug 2021Mar 2022 · 7 mos

  • Worked as a part of the Enterprise Content Management Team, where I handled Image & Workflow and Document Management activities on both Production and Non-Production Environments.
  • Built Server Automation Scripts and administered both Linux-based and Windows Server Performance while taking important measures to fine-tune the environments as part of daily activities. Managed Volumes on Linux (LVM and fdisk) and Windows Servers, Installed/Managed/Administered different kinds of Applications on Linux and Windows Servers in a clustered environment.
  • For Filenet, Different kinds of deployments like Solutions, Security Manifests, and Audit Manifests were part of the daily routine activities. Apart from this, I set up and configured P8 Object Stores, Doc Classes, Access Roles, and imported/exported desktops between the environments, which added to my responsibilities. Tools like vwtool, Filenet Deployment Manager, and IBM Find came in handy for many activities.
  • In the Middleware side, I handled Websphere related tasks such as Deployment of EAR & WAR files on WebSphere Application Server, monitored, analyzed, and fine-tuned application resources, which constituted a major component of my daily workflow. I was also responsible for all Administrative tasks, which included components like WebSphere Application Server, IHS web server, and Apache webserver.
LinuxWindows ServerFilenetWebSphereServiceNow

Senior Analyst

Aug 2020Aug 2021 · 1 yr

ServiceNowLinuxWindows Server

Analyst

Aug 2019Aug 2020 · 1 yr

ServiceNowLinuxWindows Server

Byju's (think & learn pvt. ltd.)

Business Development Intern

Jan 2019Apr 2019 · 3 mos · Pune, Maharashtra, India

  • It was a great experience working with BYJU's as it enhanced my communication and presentation skills which is for sure the most important aspect everyone must hold.
  • As an intern, I enjoyed it thoroughly and got to learn many things about how to deal with complicated and difficult situations whether it be regarding the work or even the personal life. Also learned how to learn to have a balance between work and life simultaneously along with the completion of all the targets and the tasks provided by the managers.
  • Overall, it was a great experience if I look from the point of view of Personality Development.

National service scheme

Cultural Contributer

Jun 2018Jun 2019 · 1 yr · Bhilai, Chhattisgarh, India

Nerdy academy

Senior Educator Intern

Apr 2017Sep 2017 · 5 mos · Bhilai, Chhattisgarh, India

Self employed

Educator

Jan 2017Dec 2019 · 2 yrs 11 mos · Bhilai, Chhattisgarh, India · On-site

LinuxServer AdministrationPython

Education

Shri Shankaracharya Institute of Technology and Management

Bachelor's degree — Computer Science

Jan 2015Jan 2019

Stackforce found 100+ more professionals with Python & Linux

Explore similar profiles based on matching skills and experience