Aditya Munjal

Senior Software Engineer

Bengaluru, Karnataka, India4 yrs 3 mos experience
Highly Stable

Key Highlights

  • Led a team of SREs managing 20+ EKS clusters.
  • Developed observability stack with open-source tools.
  • Implemented disaster recovery infrastructure for business continuity.
Stackforce AI infers this person is a DevOps and SRE expert in SaaS environments, focusing on infrastructure management and CI/CD optimization.

Contact

Skills

Core Skills

DevopsInfrastructure ManagementSite Reliability EngineeringCi/cd OptimizationData Engineering

Other Skills

Amazon S3ArgoCDCognosEKSElasticsearchGitLabGitLab CIGrafanaHadoopIBM BigInsightIBM Infosphere WarehouseIBM SPSSIBM SPSS StatisticsJenkinsKafka

About

With over 7 years in DevOps and SRE, I excel in designing, deploying, and maintaining robust infrastructure. At Uni Cards, I lead a team of five SREs, manage over 20 EKS clusters, and developed observability stack with Loki, Tempo, Mimir, and Grafana. We provision our infra using Terraform on AWS and GCP, and optimize CI/CD pipelines with GitLab CI and ArgoCD. Previously at TO THE NEW, I set up Jenkins libraries for AWS ECS, managed 1.5TB daily log ingestion, and implemented disaster recovery with Terraform. I’m passionate about cost optimization, knowledge sharing, and fostering a culture of continuous improvement. Connect with me to innovate in DevOps and SRE!

Experience

4 yrs 3 mos
Total Experience
1 yr 5 mos
Average Tenure
--
Current Experience

Roku

Senior Software Engineer

Feb 2025Sep 2025 · 7 mos · Bengaluru, Karnataka, India · On-site

  • Part of the DevTools team at Roku, responsible for maintaining a self-hosted GitLab instance supporting 2,500+ engineers, ensuring zero-downtime upgrades and deployments across global sites.
  • Migrated GitLab from an ECS monolith (single containerized application) to a Kubernetes-based microservices architecture, improving scalability, maintainability, and fault isolation.
  • Designed and implemented end-to-end monitoring for GitLab services, including SLI/SLO dashboards and synthetic monitoring to track performance and availability.
  • Developed a custom monitoring agent deployed across all Roku offices worldwide to measure GitLab performance and latency from different geographic regions.
GitLabKubernetesMonitoringPerformance TrackingDevOpsInfrastructure Management

Uni cards

2 roles

Site Reliability Engineer III

Promoted

Aug 2022Jan 2025 · 2 yrs 5 mos

  • As an SRE3 at Uni Cards, I lead a talented team of 5 SREs in designing and managing infrastructure for multiple products and verticals. My key responsibilities include:
  • Infrastructure Leadership: Spearheading the design and creation of resilient and scalable infrastructure solutions, ensuring seamless operations across diverse product lines.
  • CI/CD Pipeline Optimization: Implementing and maintaining an auto-scalable, secure, and fast CI/CD pipeline using GitLab CI and ArgoCD, streamlining the development and deployment processes.
  • Monitoring and Alerting Setup: Setting up end-to-end monitoring and alerting on Grafana using Grafana Operator, ensuring proactive issue detection and resolution.
  • Data Pipeline Management: Managing an ELT data pipeline, pushing changelogs from databases to Snowflake via multiple Kafka source and sink connectors to ensure efficient data processing and analytics.
  • Infrastructure as Code (IaC) and GitOps: Transitioning all infrastructure to IaC and adhering to GitOps practices by deploying everything via ArgoCD, enhancing consistency, reliability, and version control.
TerraformGitLab CIArgoCDGrafanaKafkaSite Reliability Engineering+1

Site Reliability Engineer II

Sep 2021Aug 2022 · 11 mos

  • As an SRE2, I Worked on the following:
  • EKS Cluster Management: Oversaw the management of 20+ EKS clusters, ensuring high availability, performance, upgrades and security.
  • Observability Stack Development: Developed a comprehensive, homegrown observability stack using open-source tools like Loki, Tempo, Mimir, and Grafana to provide end-to-end monitoring and visibility.
  • End-to-End Infrastructure Provisioning: Utilised terraform to provision infrastructure on AWS and GCP, ensuring efficient, repeatable, and scalable deployments.
TerraformEKSGrafanaLokiTempoSite Reliability Engineering+1

To the new

2 roles

DevOps Engineer

Aug 2019Jul 2021 · 1 yr 11 mos

  • During my tenure at TO THE NEW, I played a key role in enhancing and optimizing our DevOps practices. My responsibilities included:
  • Jenkins Shared Library Setup: Developed Jenkins shared libraries to streamline the deployment of Python and Java services to AWS ECS, improving deployment efficiency and consistency.
  • Logging Infrastructure: Designed and implemented a comprehensive logging setup capable of ingesting 1.5TB of logs daily, utilizing Kafka as a buffer and Elasticsearch for log storage to ensure reliable log management and analysis.
  • Disaster Recovery Infrastructure: Set up the entire disaster recovery (DR) infrastructure using Terraform, enabling one-click DR deployment and ensuring business continuity.
  • Cost Optimization: Implemented various cost optimization strategies, including the use of spot instances, Savings Plans, reservations, and right-sizing, significantly reducing AWS costs.
  • Knowledge Sharing: Conducted multiple knowledge-sharing sessions on Kafka, Jenkins, and AWS ECS, fostering a culture of continuous learning and development within the team.
  • Interviewing: Participated in the recruitment process by interviewing college freshers for DevOps intern roles, helping to identify and onboard new talent.
JenkinsTerraformKafkaElasticsearchDevOpsInfrastructure Management

Devops Trainee

Feb 2019Jul 2019 · 5 mos

Makemebuilder.com

Web Developer

Jul 2018Sep 2018 · 2 mos · Gurugram, Haryana, India

Gd goenka university

Cloud Architect Training

Jun 2018Jul 2018 · 1 mo

Education

GD Goenka University

Bachelor of Technology - BTech — Computer Science

Jan 2015Jan 2019

Stackforce found 100+ more professionals with Devops & Infrastructure Management

Explore similar profiles based on matching skills and experience