A

Arun Kumar Singh

SRE (Site Reliability Engineer)

Dublin, County Dublin, Ireland12 yrs 6 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Led automation tools development at Google, boosting cloud adoption by 20%.
  • Managed high-availability clusters for Adobe Marketing Cloud products.
  • Mentored teams to develop generative AI tools, enhancing productivity.
Stackforce AI infers this person is a Cloud Computing and AI/ML expert with strong leadership in Site Reliability Engineering.

Contact

Skills

Core Skills

Site Reliability EngineeringTechnical LeadershipCloud ArchitectureGenerative Ai

Other Skills

Algorithm DesignAlgorithmsAmazon CloudFrontAmazon EBSAmazon EC2Amazon RDSAmazon S3Amazon Web Services (AWS)AngularJSAnsibleApache KafkaArtificial Intelligence (AI)AutomationAvailabilityBusiness Strategy

About

Experienced Leader in SRE, DevOps, and AI/ML with Proven Expertise in Driving Innovation and Team Growth With over 11 years of experience, I have built a career at the intersection of technology and strategy, excelling as a Site Reliability Engineer, DevOps expert, and infrastructure specialist. My expertise spans AI/ML, generative AI, automation, cloud computing, and tool development. Throughout my career, I have consistently delivered scalable, efficient, and innovative solutions, while managing and mentoring teams to drive organizational success. At Google, I spearheaded the design and development of automation tools that accelerated Google Cloud adoption by 20%, demonstrating my ability to align technical initiatives with strategic business objectives. Prior to this, at Adobe, I contributed to Adobe Marketing Cloud products like Adobe Social and Adobe Analytics, driving operational excellence and ensuring service reliability for enterprise-scale systems. My technical expertise includes managing high-availability clusters of MongoDB, Cassandra, RabbitMQ, and Elasticsearch, ensuring their performance and reliability. I have led the implementation of automation strategies using tools like Salt Stack, Puppet, and Terraform, significantly improving deployment speeds and operational efficiency. My proficiency with container orchestration through Kubernetes and Docker reflects my commitment to modern software deployment methodologies. In addition to my technical capabilities, I have a proven track record of leadership and team management. I have successfully led a team of 20+ engineers to develop generative AI-powered tools using platforms like Google Vertex AI, leveraging large language models to enhance productivity and collaboration. I have also worked closely with stakeholders and senior leadership to create strategic roadmaps, ensuring seamless alignment between technological advancements and business goals. Mentoring and empowering teams has been a cornerstone of my career, fostering a culture of collaboration and continuous improvement. Currently, I am pursuing an MBA from IIM Lucknow (2023-25) to further strengthen my leadership and strategic management skills. My goal is to excel in managing high-performing technical teams, driving innovation, and delivering measurable business outcomes at the intersection of technology and leadership.

Experience

12 yrs 6 mos
Total Experience
3 yrs 1 mo
Average Tenure
4 yrs 6 mos
Current Experience

Google

2 roles

Senior Site Reliability Engineer

Mar 2025Present · 1 yr 3 mos

  • Managing availability and reliability of Google Search and Gemini AI/ML models
Machine LearningArtificial Intelligence (AI)Technical LeadershipEngineering ManagementProduct DevelopmentMonitoring+8

Senior Cloud Architect and AI/ML Engineer

Dec 2021Mar 2025 · 3 yrs 3 mos

  • Design and develop automation tools for faster GCP adoption using Boq, Goa, Go, Typescript and Soy
  • Work with multiple LLM models (text-bison, chat-bison, Gemini etc) and Generative AI functionalities is existing and new tools
  • Assist teams working closely with customers on google cloud adoption with automation needs (Design terraform templates, build pipeline, data migration etc)
  • Improve existing CI/CD pipeline
Technical LeadershipArtificial Intelligence (AI)Engineering ManagementMicroservicesGenerative AITeam Leadership+14

Adobe

3 roles

Computer Scientist (SRE)

Feb 2019Dec 2021 · 2 yrs 10 mos

TerraformTechnical LeadershipArtificial Intelligence (AI)Engineering ManagementTeam LeadershipPython+5

Software Development Engineer - 2 (SRE)

Feb 2017Jan 2019 · 1 yr 11 mos

TerraformEngineering ManagementTeam LeadershipPython (Programming Language)AutomationSite Reliability Engineering

Software Development Engineer (SRE)

May 2015Jan 2017 · 1 yr 8 mos

  • Currently Working for Adobe Marketing Cloud and taking care of 2 products i.e Adobe Social and Adobe Analytics
  • Highly available Infrastructure, Cluster designing and maintenance.
  • Managing High Availability Clusters of MongoDB, Cassandra, Apache Solr,RabbitMQ, Elasticsearch
  • Remote Execution and Configuration management on thousands of servers using Salt Stack, Capistrano, Puppet and Fabric
  • Automation and innovation in deployment process and auto-remediation of daily alerts
  • Monitoring of all servers using Splunk, Nagios, Pingdom, Scout, New Relic etc
  • Implementation of Mesosphere Cluster
  • Working on Docker and Container architecture
TerraformEngineering ManagementPythonPython (Programming Language)AutomationTroubleshooting+3

Delhivery

Infrastructure Specialist

Mar 2015Apr 2015 · 1 mo · Gurugram, Haryana, India

  • Analysis and Planning for the Architecture for application deployment and implementation of the design
  • Web servers and databases benchmark and tuning
  • Automation of deployment and backup using python and shell scripting
  • Server Management on AWS using EC2, Route53, RDS, S3, VPC, Elastic Cache, ELB and Auto Scaling
  • Hosting of Application using Python, Django, PostgreSQL, Nginx
  • Aggregation queries on MongoDB for Data Analysis
  • Maintenance and monitoring of heavy traffic servers and architectures
  • Database, apache and Nginx fine tuning for better performance
  • Automated backups of server and databases
PythonPython (Programming Language)Automation

Freelance

Freelancer (Linux System Administrator)

Jul 2014Feb 2015 · 7 mos · Gurugram, Haryana, India

  • Create and maintained of personal and client servers on Amazon Web Services
  • Maintenance and handle issue related with FreePBX VoIP server
  • Installation, update, upgrade and apply patches on Linux Servers
  • Daily local user calls and Issues
  • Installation and Maintenance of Server 2003, 2008 and 2012 Servers and Virtual Machines
  • Installation and troubleshooting of Client Systems (Win XP/7/8 and Ubuntu)
  • Implementation and troubleshooting of the Network Switches, Firewall and Wi-Fi Devices
  • Management of File Server, Symantec Antivirus Server, FTP, Backups, etc
  • Management of Cyberoam CR100i NG Firewall, Ruckus Wi-Fi Zone Director Controller
  • Managing 300+ workstations respective to their Group Policies
PythonPython (Programming Language)Automation

Primary modules.com

System Administrator

Jun 2013Jun 2014 · 1 yr · Greater Noida, India

  • Remote Server maintenance based on Centos, Redhat and Ubuntu
  • Creating server and maintenance on Amazon Web Services
  • Update code on server using GIT and SVN
  • CRM management (MaaxFrame)
  • Database Management, Backup and optimization (MySQL and PostgreSQL)
  • OpenERP installation, backup and maintenance
  • Daily local user calls and Issue
  • Resolve issues related with DNS, DHCP, Mailing Server, FTP etc
  • Firewall Maintenance and applying rules (Pfsense and Peplink)
  • VoIP server using FreePBX
  • Optimization of websites, database and apache

Hewlett-packard

2 roles

Summer Trainee

Jun 2012Jul 2012 · 1 mo · Noida, Uttar Pradesh, India

  • Summer Training on Network Management and Security. Handled a project of creating Secured Network Architecture on using Cisco Packet Tracer.

Summer Trainee

Jun 2011Jul 2011 · 1 mo · Noida, Uttar Pradesh, India

  • Summer Training on ASP.NET with C#. Handled a project of social networking website development based on ASP.NET named as Bonjour.

Education

Indian Institute of Management, Lucknow

Master of Business Administration - MBA

Apr 2023Mar 2025

IÉSEG School of Management

MBA - International Immersion Program — Artificial Intelligence and Business Negotiation

Jun 2024Jul 2024

Birla Institute of Technology and Science, Pilani

Master of Technology - MTech — Computer Software Engineering

Jan 2019Jan 2021

Galgotias College of Engg & Tech

Bachelor of Technology (B.Tech.) — Information Technology

Jan 2009Jan 2012

Kendriya Vidyalaya, New cantt , Allahabad

Secondary and higher secondary

Jan 2005Jan 2009

Stackforce found 100+ more professionals with Site Reliability Engineering & Technical Leadership

Explore similar profiles based on matching skills and experience