Abhishek Kumar 'AK'

DevOps Engineer

Noida, Uttar Pradesh, India · 17 yrs 9 mos experience
Highly Stable

Key Highlights

  • 18+ years in cloud infrastructure and operational excellence.
  • Expert in scaling multi-region Kubernetes platforms.
  • Proven success in aligning engineering velocity with business outcomes.
Stackforce AI infers this person is a SaaS Infrastructure and DevOps expert with extensive experience in cloud operations.

Skills

Core Skills

Platform Engineering · Operations Management · DevOps · Continuous Integration and Continuous Delivery (CI/CD) · Project Management · Infrastructure Technologies

Other Skills

Global Infrastructure Management · Amazon Web Services (AWS) · Identity and Access Management (IAM) · Keycloak · Kubernetes · Security · Infrastructure as a Service (IaaS) · Configuration Management · CI/CD · OCI · Git · Jenkins · Kibana · Oracle Cloud · Python (Programming Language)

About

Strategic and hands-on leader with 18+ years of experience driving platform reliability, cloud infrastructure, and operational excellence across global cloud environments. Proven success in scaling multi-region Kubernetes platforms, leading SRE and DevOps transformations, and aligning engineering velocity with business outcomes. Adept at building high-trust teams, instituting observability and compliance frameworks, and delivering resilient, cost-optimized cloud ecosystems. Passionate about developer experience, data governance, and platform-as-a-product strategy. Provides decisive support during major customer-impacting incidents to ensure resilience and continuity. Skilled in cloud operations, DevOps, SRE, change management, incident management, on-call management, Amazon Web Services (AWS), Oracle Cloud Infrastructure (OCI), test strategy, Agile methodologies, and container technologies such as Kubernetes and Docker. Holds a Bachelor of Technology (BTech) in Electronics and Communications Engineering from Rajiv Gandhi Prodyogiki Vishwavidyalaya.

Experience

Unknown

Senior Development Manager

Oct 2025 - Present · 5 mos · Pune City · Hybrid

  • Lead a 25+ member global platform organization (DevOps, SRE, Architects, Managers), delivering shared services across 20+ flagship products used by 1000s of enterprise customers.
  • Spearheading Identity Platform enabling Identity First, unified SSO for all products, reducing login fragmentation and improving cross-sell conversion readiness.
  • Defined and executed platform-as-a-product strategy, consolidating 5+ disparate systems into standardized platforms, reducing duplication by ~80%.
  • Built centralized Audit Platform (OpenSearch-based) processing 10M+ events/day, improving audit traceability and reducing investigation time by 60%.
  • Delivered plug-and-play observability framework (logging-first), enabling 80% faster onboarding for new services and standardizing telemetry across 20+ products.
  • Established FinOps platform providing cost visibility across multi-cloud (AWS/Azure/GCP/OCI) environments, driving $2.5M+ annual savings (~20–30% optimization).
  • Leading development of AI/ML-based cost recommendation engine, targeting additional 10–15% cost reduction via predictive optimization.
  • Designed Talay (Terraform abstraction layer) adopted by 70%+ teams, reducing infrastructure provisioning time by 60% and eliminating configuration drift.
  • Improved developer experience through golden paths and self-service platforms, reducing environment setup time from days to hours (~75% faster).
Global Infrastructure Management · Amazon Web Services (AWS) · Identity and Access Management (IAM) · Operations Management · Platform Engineering · Keycloak

Oracle

Senior Development Manager

Mar 2022 - Present · 4 yrs

  • Lead SRE, DevOps, and Operations for the Identity and Access Management/IDCS service.
  • Spearheaded multi-region cloud infrastructure programs across 100+ data centers. Accomplished 40% cost savings on a $2M+ annual budget through strategic resource planning.
  • Engineered Remote Disaster Recovery (RDR) architecture for IDCS services on OCI. Boosted service uptime to 99.9999% via automation and architectural upgrades.
  • Automated patching, certificate rotation, and scaling, saving 5 FTEs worth of manual effort.
  • Technical
  • Drive strategy, set goals, mentor managers and senior technical leaders. Lead 30+ resources across geographies.
  • Foster team innovation through emerging tech and AI in operations.
  • Executive program management. Coordination and collaboration across teams and business units.
  • Cost governance, optimization & Budget Management, Change Management.
  • Resiliency engineering, disaster recovery, and high availability. Risk management, change leadership, incident command.
  • Platform engineering and Infrastructure as Code (Terraform, Helm, Kubernetes)
  • DevOps Strategy & CI/CD (GitHub, Jenkins, Python, shell, Java), progressive delivery, golden paths, chaos engineering
  • Database provisioning, high availability, Monitoring & Observability (Grafana, Kibana, Telemetry, Prometheus)
  • Security and compliance based on SOC 2 (FedRAMP, FIPS, HIPAA, Policy as Code).
  • SLAs / SLOs / Error Budgets and business continuity. Focus on MTTR, lead time, and failure rate.
  • OCI, AWS, service meshes, WAF/CDN, secrets management, VPC, subnets, routing, load balancing
  • production, ensuring consistency and scalability.
  • Integrated Generative AI and MCP agentic workflows into platform and SRE, improving MTTR by 25%.
  • Manage SLAs, SLOs, Error Budget. Reduced IAM region build SLA from 15 days to 12 hours using MFO.
  • Partnered with Security and Legal to embed SOC 2/HIPAA controls and regional compliance playbooks.
  • Shape strategy, mentor leaders, manage region builds, and ensure resilience during critical incidents.
Operations Management · Infrastructure Technologies · Kubernetes · Security · DevOps · Infrastructure as a Service (IaaS) · +12

Barco

DevOps Architect

May 2021 - Feb 2022 · 9 mos

  • Building and maintaining scalable, distributed, and resilient development, deployment, and delivery pipelines for Barco's flagship product, OCS, applying CI and CD best practices.
  • Increasing product quality and reliability by adding smoke and integration testing before production release.
  • Planning, architecting, and implementing everything from scratch.
  • Cross-site team communication and collaboration.
  • Hiring the right people at the right time with a long-term vision.
  • DevOps project management using agile methodology.
Jenkins · Python (Programming Language) · DevOps · Shell Scripting · Continuous Integration and Continuous Delivery (CI/CD) · Test Strategy · +9

Tokopedia

Engineering Manager

Dec 2020 - May 2021 · 5 mos · Noida, Uttar Pradesh, India

  • Led DevOps transformation across 50+ functional teams.
  • Delivered key initiatives: Beta on Demand, Git Governance, Chaos Lab.
  • Enhanced alerting and detection systems, reducing TTD/TTR by over 20%.
Amazon Web Services (AWS) · Automation · Configuration Management · Continuous Integration and Continuous Delivery (CI/CD) · DevOps · Docker Products · +15

Oracle

5 roles

Software Development Manager

Aug 2016 - Dec 2019 · 3 yrs 4 mos

  • Managing and triaging tickets. Driving prioritization and execution of work based on impact.
  • Work in concert with service developers and enable the team to evolve systems/products for better scalability, reliability, and development velocity.
  • Drive new runbooks to help reduce mean triage time of incidents.
  • Prioritize and automate high-hit-count runbooks.
  • DevOps assessment.
  • Conducting system study and coordinating with team members for product documentation, system design, integration, coding, application maintenance, etc.
  • Foster a high-performing Agile culture of trust, teamwork, empowerment, accountability, responsiveness, and communication. Set boundaries and success criteria, and measure progress.
  • Minimize roadblocks and maximize opportunities to keep all team members productive, engaged, and fulfilled in their roles.
  • Call management, availability management, incident management, change management, and release and deployment management.

Principal Software Developer

Promoted

Aug 2014 - Aug 2016 · 2 yrs

  • Architecting robust, stable, and reliable DevOps processes using industry-standard software and tools. Broad knowledge of the cloud stack helps in designing CI/CD pipelines and the monitoring platform.
  • Jenkins, Git, Groovy, Java, ELK, JUnit, Selenium, Shell, Ansible, Terraform, DevOps, WebLogic, microservices, basic Oracle DBA, understanding of networking, Docker, Kubernetes.
  • Implementing operational tools for deployment, provisioning, monitoring, and performance measurement.
  • Implementing scale-out (horizontal scaling) infrastructure architectures.
  • Good exposure to monitoring tools such as Kibana, Grafana, and Prometheus, supporting high reliability and predictable preventive and corrective actions.
  • Speeding up site reliability work and incident resolution by using Jupyter-notebook-style runbooks and Slack bots.
  • Experience in customer-facing production environments: monitoring, performance, release/deployment, security, reliability, availability, capacity, latency, and other non-functional concerns.
Analytical Skills · Ansible · Automation · Configuration Management · Continuous Integration and Continuous Delivery (CI/CD) · DevOps · +18

Project Lead

Promoted

Aug 2012 - Aug 2014 · 2 yrs

Senior Software Developer

Promoted

Dec 2009 - Aug 2012 · 2 yrs 8 mos

Software Developer

Jun 2007 - Dec 2009 · 2 yrs 6 mos

Education

Kendriya Vidyalaya

Rajiv Gandhi Prodyogiki Vishwavidyalaya

Bachelor of Engineering - BE
