Najumudeen M

Platform Engineer

Karnataka, India16 yrs 11 mos experience
Highly Stable

Key Highlights

  • Expert in building scalable CI/CD pipelines.
  • Proven track record in cloud cost optimization.
  • Strong experience in automation and infrastructure management.
Stackforce AI infers this person is a SaaS-focused Site Reliability Engineer with strong automation and cloud infrastructure expertise.

Contact

Skills

Core Skills

Cloud InfrastructureSite Reliability EngineeringAutomationCost OptimizationCi/cdDevopsSystem AdministrationTroubleshooting

Other Skills

AWSAWS CloudFormationAWS LambdaAmazon CloudWatchAmazon EBSAmazon EC2Amazon EKSAmazon Route 53Amazon S3Amazon Simple Notification Service (SNS)Amazon VPCAmazon Web Services (AWS)AnsibleArgoBash

About

Hi, My name is Najumudeen M, and I am a passionate SRE/Devops Engineer with experience in diving seamless automation, optimizing cloud infrastructures, and accelerating software delivery cycles. With a strong basis in information technology and systems. I specialise in developing Continuous Integration and Continuous Delivery (CI/CD) pipelines, automating complex deployments, and leveraging cloud services to create scalable and dependable solutions. I enjoy leveraging tools like Terraform, Docker, Kubernetes, Jenkins, Packer and Ansible to turn manual processes into effective automated workflows. My objective is to assist organizations in increasing productivity by providing accurate, timely, and high-quality mission-critical software. 📝 Key Expertise: ☁ Cloud Platforms: AWS 🔧 Automation & Iac: Terraform, Terragrunt, Ansible ⌛ CI/CD: Jenkins, ArgoCD ✄ Containerisation: Docker, Kubernetes 🔍 Monitoring & Logging: Prometheus, Grafana, CloudWatch 💾 Version Control & SCM: Git, GIt HUB 📺 Scripting & Automation: Shell Scripting, Golang, Python, Test Automation I'm dedicated to continual development and am always keen to learn about new technologies that promote innovation in the SRE/DevOps arena. Let us connect! Please feel free to DM me here.

Experience

Xebia

Platform Engineer

Jul 2025Present · 8 mos · India · Hybrid

Syniti

Principal Site Reliability Engineer

Feb 2022Mar 2024 · 2 yrs 1 mo · Bengaluru, Karnataka, India

  • ▪️Contributed to SYNITI's cloud infrastructure design, as well as being accountable for essential systems/applications uptime and availability.
  • ▪️Build and upgraded over 100+ EKS clusters for multiple environment like development, testing and production on AWS cloud used terraform and github-actions.
  • ▪️Created Golden AMI for several EKS client versions and merged it with an EC2 Linux worker node using Packer to reduce cloud costs by 40%.
  • ▪️Automated end-to-end monolithic application deployment across multi regions, dedicated and shared platforms for SYNITI's SaaS-based products using Terraform in the AWS cloud.
  • ▪️Implemented systems, endpoints and application monitoring for microservices using tools like cloud-watch, Prometheus, Grafana, Loki and Promtail agent.
  • ▪️Experience putting up multi-region Terraform projects using Terragrunt and working with Terraform workspaces.
  • ▪️Managed and maintained the Kubernetes cluster to ensure high availability for containerized applications. Rolling updates and automatic scaling were included to deal with increased demands during peak usage periods.
  • ▪️Implemented solutions with an emphasis on cloud security, cost optimization, and automation.
  • ▪️Developed custom script to install and configure cloud security agents tools like Tenable, CarbonBlack and Splunk integrated with AWS AMI image.
  • ▪️Supporting a 24 x 7 online environment as a part of an on-call rotation.
AWSTerraformGitHub ActionsPackerPrometheusGrafana+4

Sabre corporation

Senior Devops/SRE

Apr 2016Feb 2022 · 5 yrs 10 mos · Bengaluru, Karnataka, India

  • ▪️Implemented and maintained Jenkins CI/CD pipelines to automate and orchestrate the SaaS and hosted cloud operations environment.
  • ▪️Responding to a production incident and determining ways to prevent it in the future.
  • ▪️Experience setting Jenkins master and slave nodes are highly available across many environments on the AWS cloud.
  • ▪️Developed a custom script using Python and shell scripts. saved 90% of the engineer's time spent manually on deployments.
  • ▪️Implemented an automated workflow procedure to improve deployment time for spin groups of stacks in Jenkins parallel deployment using Declarative approaches.
  • ▪️Part of on call rotation and participated in blame-less postmortem activities.
  • ▪️Developing and maintaining technical documentation, runbooks, and procedures.
  • ▪️Application deployments with CI/CD tools, code repository, code scanning, artifact repository, compliance scanning, packaging, deployment, and configuration management.
  • ▪️Developed a CI/CD pipeline for JAVA based a spring boot application and used API Gateway, Jenkins, Bit Bucket, S3, and Lambda to deploy it on AWS.
  • ▪️Integrated tools across CI/CD stage including Nexus, SonarQube for code quality and security checks.
  • ▪️Provisioning AWS cloud infrastructure using Cloud formation and Configuring them using Ansible.
  • ▪️Managed source code using Git, Bit Bucket, and AWS CodeBuild, resolving merge conflicts and controlling branches and permissions to keep development workflows organized.
  • ▪️Streamlined the CI/CD pipeline for java-based and spring boot applications using maven, resulting in a 20% reduction in build and deployments times.
  • ▪️Implemented application and system monitoring and alerting using APP Dynamics and Dashboard.
  • ▪️The Cloud Cost Optimization (CCO) project initiative Created the Auto CloudFormation Stack (ACS), an in-house solution that automatically sets up stacks during business hours and shuts them down during off hours. That has saved the company $100k on its annual budget.
JenkinsPythonShell ScriptingAWSGitBit Bucket+4

Yahoo

Senior Site Reliability Engineer

Jul 2013Mar 2016 · 2 yrs 8 mos · Bengaluru Area, India

  • ▪️Develop tool to automate the deployment, administration and monitoring of a large-scale linux environment.
  • ▪️Part of Yahoo's Native Ads SE on call rotation and working closely with SME/DEV to find out the root cause and applying for the permanent fix.
  • ▪️Collaborated with the Business Continuity Plan (BCP) team to ensure that the internal product was fully certified by BCP prior to coming online.
  • ▪️System SRE team is responsible for working closely with DNS/STORAGE SE team to under stand the Service better and provide support escalation from the Yahoo! Operations Center.
  • ▪️As part of Platform SRE on-call rotation of product support, I contributed to the sustainability and uptime of Yahoo's Sherpa internal distributed key and value data store system, Object Storage Mobstor.
  • ▪️Function as a technical generalist responsible for the overall health and performance of our platform.
  • ▪️Identifying and automating manual process developing and maintaining technical documentation run books and procedures.
  • ▪️Developed more than 25+ run books using ruby programming language and integrated with yahoo's in house tool to fix occurring automatically alerts.
LinuxRubyMonitoringAutomationSite Reliability Engineering

Four interactive pvt ltd (asklaila)

Senior System Engineer

Nov 2007Jun 2013 · 5 yrs 7 mos · Bengaluru Area, India

  • ▪️Linux Server system administration across various environments from deployment to production.
  • ▪️Proven excellent troubleshooting abilities by promptly locating and fixing network and server problems.
  • ▪️Developed shell scripts to automate node health checks, monitor CPU, memory, and other vital metrics, and maintain system stability.
  • ▪️Examining software, hardware, and equipment with the purpose of improving performance.
LinuxShell ScriptingTroubleshootingSystem Administration

Education

V.L.B. JANAKIAMMAL COLLEGE OF ARTS AND SCIENCE

Bachelor’s Degree — Computer Application

Jan 2004Jan 2007

Stackforce found 100+ more professionals with Cloud Infrastructure & Site Reliability Engineering

Explore similar profiles based on matching skills and experience