Akash Deep Yadav

DevOps Engineer

Noida, Uttar Pradesh, India · 7 yrs 10 mos experience

Key Highlights

  • Over 8 years of experience in cloud-native infrastructure.
  • Led cross-functional DevOps teams to drive CI/CD transformation.
  • Architected observability frameworks processing 60M+ daily API requests.
Stackforce AI infers this person is a Cloud Computing Infrastructure Expert with a focus on DevOps and CI/CD practices.

Skills

Core Skills

Cloud Infrastructure, DevOps, Monitoring & Logging

Other Skills

Amazon Web Services (AWS), Ansible, Bash, Git, New Relic, Reliability Engineering, Terraform, Kubernetes, Prometheus, Aerospike, MongoDB, Kafka, Elastic Stack (ELK), Grafana, Telegraf

About

I am Akash Deep Yadav, a Senior DevOps Lead with over 8 years of experience in architecting and managing cloud-native infrastructure across AWS, GCP, and Kubernetes. My expertise lies in driving CI/CD transformation, implementing Infrastructure as Code, and enabling microservices orchestration, which has led to significant efficiency improvements and enhanced deployment scalability. I have a proven track record of leading cross-functional DevOps teams, providing technical direction aligned with business objectives, and collaborating with stakeholders at all levels of the organisation.

Throughout my career, I have built and managed enterprise-grade observability frameworks that process over 60 million daily API requests, enhancing real-time system visibility and performance tuning. I am passionate about infrastructure modernisation and DevSecOps integration, driving resilience and scalability in dynamic cloud environments.

  • Cloud Platforms: AWS (EKS, EC2, RDS, Lambda, Glue, Control Tower, Route 53)
  • Containerization & Orchestration: Docker, Kubernetes (EKS), Helm, ECS, GKE
  • CI/CD & DevOps: Jenkins, GitHub Actions, ArgoCD, GitOps
  • Infrastructure as Code: Terraform, Ansible
  • Scripting & Automation: Python, Shell Scripts
  • Monitoring & Logging: Prometheus, Grafana, ELK Stack, Datadog, New Relic, CloudWatch
  • Security & Optimisation: Trivy, OpenVPN, Cloudflare, AWS WAF, Trend Micro, AWS GuardDuty, Cost Management, Access Control
  • Databases & Caching: PostgreSQL, MongoDB, MySQL, Redis, Aerospike

I operate with an ownership mindset, focusing not just on deployments but on architecture, product impact, and user experience. Alongside DevOps practices, I am actively exploring AI-based use cases by building solutions such as vulnerability remediation systems with Claude, LangGraph, and MCP, bridging infrastructure and intelligent automation with fine-tuned guardrails.

Experience

Total Experience: 7 yrs 10 mos
Average Tenure: 1 yr 5 mos
Current Experience: 9 mos

Paytm

Senior DevOps Lead

Sep 2025 - Present · 9 mos · Noida · On-site

Tripjack

Senior DevOps Engineer

Jul 2024 - Sep 2025 · 1 yr 2 mos · Delhi, India · On-site

  • Migrated the complete infrastructure (EC2, RDS, ECS cluster, ECR repositories, S3 buckets, and DNS records) from one AWS account to another.
  • Implemented automation in Python and shell scripts to smooth operations and reduce manual effort.
  • Set up an Aerospike cluster for high availability and better performance.
  • Set up a MongoDB cluster as part of the move from PostgreSQL to a NoSQL database to improve performance.
  • Implemented an ELK cluster with Kafka for continuous log streaming and improved business reporting.
  • Implemented Auto Scaling groups with custom policies, including predictive scaling, to handle 6 crore (60 million) requests per day.
  • Implemented monitoring for all production servers, capturing NGINX metrics for every API request on the platform, including status code and response time.
  • Architected a monitoring pipeline handling 60M+ daily API requests using Telegraf, Prometheus, and Grafana. Built percentile latency metrics (P50–P99), created normalized dashboards with instance auto-discovery, optimized slow queries with Prometheus caching, and enabled long-term metric retention via Thanos and S3.
  • Built a containerized S3 log viewer platform using FastAPI and React, enabling on-demand log access, smart previewing, and keyword-based error highlighting to streamline debugging for developers.
  • Integrated the GitHub Actions CI pipeline with a Robot Framework-based automation suite.
  • Conducted a complete AWS infrastructure review, prepared cost optimisation recommendations, and implemented best-practice changes to reduce cloud costs.
  • Owned the complete infrastructure and managed the DevOps team's weekly tasks in Jira.
  • Developed modern CI/CD pipelines and improved legacy ones, enabling faster deployments with minimal API request cancellation during deployments to production and lower environments.
Amazon Web Services (AWS), Ansible, Bash, Cloud Infrastructure, Git, New Relic, +9

To The New

Senior Cloud Engineer/SRE

Jul 2021 - Jun 2024 · 2 yrs 11 mos · Noida, Uttar Pradesh, India · Hybrid

  • As a Senior Cloud Engineer at To The New, I deployed microservices-based applications on AWS, significantly improving scalability and system performance. I applied CI/CD best practices using Jenkins and ArgoCD, which reduced deployment time by 20%. My role involved utilising various AWS services for secure infrastructure provisioning and remediating vulnerabilities through automation, which reduced operational effort by 30%. I also established infrastructure observability, decreasing downtime and enhancing performance metrics.
  • Achievements:
  • Improved scalability and system performance through effective microservices deployment on AWS.
  • Reduced deployment time by 20% by implementing CI/CD best practices.
  • Automated routine tasks, leading to a 30% reduction in operational effort.
  • Decreased downtime by 10% through the establishment of robust infrastructure observability.
  • Drove 30% of the data lake implementation using AWS Glue jobs and connections, with Airflow for data pipeline management.
  • Owned the complete cost optimisation initiative and reduced costs by 20% over a period of 3 months.
Elasticsearch, Ansible, Customer Service, Linux, Terraform, Cloud Infrastructure, +10

Magic FinServ

DevOps Engineer

Jul 2020 - Jul 2021 · 1 yr · Noida, Uttar Pradesh, India · Remote

  • In my role as a CloudOps Engineer, I automated daily operations, which reduced manual workload by 40% and improved system uptime. I resolved critical application issues across various environments, ensuring stable and reliable deployments. My contributions significantly enhanced operational efficiency and reduced incident response times.
  • Achievements:
  • Achieved a 40% reduction in manual workload through automation of daily operations.
  • Decreased incident response time by 30% by resolving critical application issues effectively.
Reliability Engineering, Datadog, Amazon Web Services (AWS), Git, Elasticsearch, Production Deployment, +2

Newgen Software

System Engineer

Sep 2019 - Jun 2020 · 9 mos · Noida Area, India · On-site

  • As a System Engineer, I automated operational tasks, resolving numerous issues across Linux and Windows environments. I deployed and configured client-facing applications, ensuring seamless go-live experiences. My role also involved managing AWS server configurations, which reduced manual errors and ensured consistent updates.
  • Achievements:
  • Enhanced system uptime by automating operational tasks and resolving over 50 issues.
  • Ensured seamless go-live experiences for client-facing applications through effective deployment strategies.

Aeris Communications

System Engineer

May 2018 - Aug 2019 · 1 yr 3 mos · New Delhi Area, India

  • Set up Nagios to monitor multiple hosts and a Grafana dashboard to monitor Jenkins pipelines.
  • Automated installation of the application on multiple servers using Ansible.
  • Worked with AWS IAM to manage access for the team.
  • Set up security groups, provisioned Elastic Load Balancers and target groups, and created launch templates to autoscale instances in the lower environments.
  • Troubleshot production issues and coordinated with the development team to streamline deployments.
  • Developed a Python script using Selenium WebDriver to automate the web page login procedure.
  • Took part in telecom testing of new products, services, and rate plans.
Reliability Engineering

Education

G.L. BAJAJ INSTITUTE OF ENGINEERING AND TECHNOLOGY, GREATER NOIDA

Bachelor of Technology (BTech), Electronics and Communications Engineering

Jan 2013 - Jan 2017
