Simranjeet Singh

SRE (Site Reliability Engineer)

London, Ontario, Canada8 yrs 10 mos experience

Key Highlights

  • Expert in Kubernetes and cloud-native architectures.
  • Led significant cloud migrations and automation initiatives.
  • Proven track record in optimizing operational efficiency.
Stackforce AI infers this person is a Cloud Infrastructure Engineer with expertise in DevOps and Site Reliability Engineering.

Contact

Skills

Core Skills

Infrastructure ManagementContinuous Integration And Continuous Delivery (ci/cd)

Other Skills

AWS CloudFormationAmazon Web Services (AWS)AnsibleApacheBashCactiClusterCommunicationContinuous IntegrationDBMSDHCPDNSDevOpsDjangoDocker Products

About

Passionate about Kubernetes and cloud-native architectures, I bring extensive experience in designing and managing scalable, reliable, and automated cloud infrastructures. I specialize in leading containerization initiatives and migrating applications to Kubernetes, driving significant improvements in operational efficiency and system resilience. With a strong focus on optimizing complex cloud environments, I deliver solutions that meet critical reliability and availability targets. I leverage automation to streamline processes, reduce manual effort, and enhance observability with intelligent alerting. I thrive on solving challenging technical problems , helping businesses run their mission-critical applications smoothly and efficiently.

Experience

8 yrs 10 mos
Total Experience
1 yr 7 mos
Average Tenure
--
Current Experience

Cisco

Site Reliability Engineer

Oct 2022Present · 3 yrs 8 mos · Canada

Infrastructure ManagementCommunication

Microsoft

Site Reliability Engineer

Dec 2021Oct 2022 · 10 mos

  • Cloud Deployment & Ownership: Planned and executed deployment of application components from public to GOV cloud. Owned deployment, availability, reliability, and customer escalation targets for sovereign and public cloud environments.
  • Alert Optimization: Optimized on-call alerts, adding enrichment to improve alerting quality and reduce noise.
  • Microservices Management: Deployed and managed microservices on Kubernetes clusters.
  • CI/CD Pipeline Support: Assisted developers in creating and building build and release pipelines using Azure DevOps.
  • Production Support: Supported production systems during on-call periods and provided immediate solutions.
  • Documentation & Standards: Maintained up-to-date documentation on deployments, processes, and standard operating procedures/run-books.
Infrastructure ManagementInfrastructureContinuous Integration and Continuous Delivery (CI/CD)AnsibleCommunicationDocker Products

Expedia group

Software Development Engineer III (Infra & Devops)

Sep 2019Dec 2021 · 2 yrs 3 mos · Gurgaon, India

  • Cloud Migration & CI/CD: Led application migration from ECS/Docker/EC2 to EKS via Jenkins pipelines and Helm charts. Automated deployments and assisted developers.
  • Infrastructure Automation: Developed and implemented infrastructure automation solutions using Terraform and Ansible for provisioning and deployment. Automated ECS cluster patching with Python/Terraform.
  • Cloud Transformation & Cost Optimization: Planned and executed application migrations from datacenter to cloud. Successfully reduced cloud bills by 60% through strategic cost optimization.
  • Proactive Incident Management: Served as primary point of contact for production incidents, performed root cause analysis, identified problem patterns, and developed automated, self-healing solutions.
  • Operational Excellence & Documentation: Investigated system failures, identified root causes, and implemented remedies for continuous improvement. Maintained up-to-date documentation on deployments, processes, and SOPs.
  • Production Monitoring & Remediation: Monitored production systems during on-call periods, providing immediate remediation for issues.
  • Python-Driven Automation: Automated various operational tasks using Python programming, significantly reducing redundant tickets and improving turnaround times.
Infrastructure ManagementInfrastructureLinux System AdministrationContinuous Integration and Continuous Delivery (CI/CD)AnsibleCommunication+1

Adobe

Site Reliability Engineer

Apr 2017Aug 2019 · 2 yrs 4 mos · Noida Area, India

  • Infrastructure Automation: Developed and implemented infrastructure automation solutions using AWS CloudFormation and Ansible for efficient application deployment.
  • Operational Tooling & Self-Service: Created tools, operational enhancements, and automated solutions to enable self-service configurations, accelerate deployments, and enhance monitoring for critical SaaS applications.
  • Incident Management & Automation: Acted as the primary contact for production incidents, performed root cause analysis, identified problem patterns, and developed automated, self-healing solutions.
  • Documentation & Standards: Maintained up-to-date documentation for deployments, processes, and standard operating procedures/run-books.
  • Continuous Improvement: Investigated system failures, identified root causes, and implemented remedies for ongoing system improvement.
  • Production Monitoring & Remediation: Monitored production systems during on-call periods and provided immediate remediation for issues.
  • Cloud Region Bootstrapping: Executed bootstrapping of AWS regions using Terraform.
  • Automation Backend Development: Developed a provisioning automation backend utilizing Python Django.
Infrastructure ManagementInfrastructureLinux System AdministrationContinuous Integration and Continuous Delivery (CI/CD)AnsibleCommunication+1

Hughes systique corporation (hsc)

Senior Engineer

Apr 2017Aug 2019 · 2 yrs 4 mos · Gurugram, Haryana, India

Infrastructure ManagementInfrastructureLinux System AdministrationContinuous Integration and Continuous Delivery (CI/CD)AnsibleCommunication+1

Global analytics

Sr Devops Engineer

Aug 2016Apr 2017 · 8 mos

  • Operational Oversight: Efficiently manage daily health checks, incident response, and change management liaison, ensuring timely resolution of infrastructure and application issues.
  • Technical Analysis & Reporting: Analyze, communicate, and present on complex technical and system issues, providing data-driven reports and insights on application behavior to aid development.
  • DevOps & Automation: Play a major role in driving DevOps initiatives and automating redundant tasks through shell/Python scripting to enhance operational efficiency.
  • Incident Management & Support: Accountable for day-to-day support, including incident investigations, root cause analysis, fixes, and proactively incorporating new processes for continuous improvement.
  • Quality & Efficiency: Support continuous improvement and quality of operations, actively working to decrease turnaround times.
Infrastructure ManagementInfrastructureLinux System AdministrationAnsibleCommunicationDocker Products

Jaarvis

Team Lead IT

May 2014Aug 2016 · 2 yrs 3 mos · Gurgaon, India

  • Leading Deployment: Strategically directed Python Django application deployments on Linux, ensuring seamless integration and optimal performance.
  • Driving Automation: Championed system administration automation via shell/Python scripting, significantly enhancing efficiency.
  • Orchestrating Monitoring: Pioneered Nagios server configuration and maintenance for robust system/service health oversight.
  • Guiding Server Management: Led end-to-end server installation, rebuilding, and meticulous configuration to organizational standards.
  • Directing Issue Resolution: Provided decisive leadership in resolving complex support tickets, ensuring rapid diagnosis and effective solutions.
  • Overseeing Maintenance & Security: Directed comprehensive system maintenance and patching, fortifying security and performance.
  • Implementing Key Infrastructure: Spearheaded the successful installation/configuration of vital enterprise systems (GIS, Asset Management).
  • Developing Procedures: Authored, disseminated, and enforced clear installation/configuration procedures for consistency and scalability.
  • Orchestrating Monitoring: Instituted daily system monitoring protocols, verifying hardware, resources, processes, logs, and successful backups.
  • Enhancing Security: Led proactive security monitoring to identify/mitigate intrusions, strengthening defensive capabilities.
  • Strategizing Data Protection: Directed daily backup operations, ensuring data integrity and robust disaster recovery.
  • Driving Performance Optimization: Orchestrated ongoing performance tuning, hardware upgrades, and resource optimization (CPU, memory, disk).
  • Pioneering Config Management: Initiated and managed Salt/Ansible adoption, revolutionizing deployment via advanced automation.
Infrastructure ManagementInfrastructureLinux System AdministrationAnsibleCommunicationDocker Products

Estel technologies

Trainee System Engineer

Nov 2013May 2014 · 6 mos · Gurgaon, India

  • Application Deployment & Management: Spearheading the deployment of critical system applications on Linux environments, ensuring robust and reliable operations.
  • Server Provisioning: Meticulously preparing and configuring application servers to facilitate smooth and efficient software deployments.
  • Proactive System Monitoring: Continuously monitoring system logs and conducting thorough health checks to preemptively identify and resolve system anomalies, maintaining high availability.
  • Client Support & Communication: Delivering exceptional technical support to clients through email and chat, addressing their needs and ensuring a positive experience.
  • Technical Issue Resolution: Taking ownership of support tickets, diagnosing problems, and implementing effective solutions to restore service promptly.
  • System Upkeep & Security: Performing essential system maintenance and applying application patches as required, contributing to overall system integrity and security.
Infrastructure ManagementInfrastructureLinux System AdministrationContinuous Integration and Continuous Delivery (CI/CD)AnsibleCommunication

Education

Indira Gandhi National Open University

Post Graduate Diploma In Information Security — Computer and Information Systems Security/Information Assurance

Jan 2019Jan 2020

Guru Gobind Singh Indraprastha University

Bachelor of Technology (B.Tech.) — Computer Science Engineering

Jan 2009Jan 2013

Guru nanak Public school

Jan 2006Jan 2009

Stackforce found 100+ more professionals with Infrastructure Management & Continuous Integration And Continuous Delivery (ci/cd)

Explore similar profiles based on matching skills and experience