G

Gurpreet Singh

DevOps Engineer

India14 yrs experience

Key Highlights

  • Over 14 years of DevOps/SRE experience.
  • Expert in AWS and Kubernetes migrations.
  • Proven track record in cost optimization.
Stackforce AI infers this person is a SaaS Infrastructure Engineer with extensive experience in cloud optimization and automation.

Contact

Skills

Core Skills

Amazon Web Services (aws)KubernetesDatabases

Other Skills

AWS GlueShell ScriptingAthenaPythonLinux System AdministrationTerraformPostgreSQLLinuxDockerAWS EKSAuto ScalingHigh AvailabilityFabricAutomationContainers

About

Experienced DevOps/SRE with more than 14 years in designing, implementing, managing, optimizing and overhauling infrastructures. Seeking role as a Principal DevOps, SRE or Solution Architect.

Experience

14 yrs
Total Experience
1 yr 4 mos
Average Tenure
--
Current Experience

Simpl

Senior Staff DevOps Engineer

Mar 2022Jun 2024 · 2 yrs 3 mos · Remote · Remote

  • Led migration of Flink pipelines from EC2 cluster to EKS for improved efficiency & resource isolation.
  • Introduced IP address based whitelisting for User IAM credentials to block access from outside known networks.
  • Troubleshot and fixed numerous issues in infrastructure over my tenure:
  • ◦ Built Grafana dashboards for quicker identification of issues across clusters.
  • ◦ Resolved critical issues in Opensearch, Dask, Jupyterhub and Concourse clusters.
  • ◦ Fixed dependency issues in Dask & Airflow cluster deployments, multiple times.
  • Enhanced PostgreSQL performance:
  • ◦ Implemented Table partitioning.
  • ◦ Migrated data from existing table of 3TB to a new partitioned table.
  • ◦ Automated partition management/lifecycle with pg_partman & pg_cron.
  • Cost Optimization and Performance:
  • ◦ Implemented Karpenter on EKS utilizing Spot instances.
  • ◦ Migrated RDS PostgreSQL databases to Aurora IO‑Optimized.
  • ◦ Started the migration of a 6TB DynamoDB table to S3, Athena & Glue based solution.
  • ◦ Implemented Blue‑Green migrations for upgrading AWS RDS engine versions.
  • ◦ Implemented VPC Endpoints for S3, DynamoDB to ensure security & optimize costs.
  • ◦ Identified and added lifecycle rule for cleanup of incomplete multipart uploads.
  • ◦ Migrated big‑data infrastructure to Spot instances: Dask EC2 and ECS clusters.
  • ◦ Contributed to the setup of scalable Jupyter clusters on ECS backed by Spot instances.
AWS GlueShell ScriptingAthenaPythonLinux System AdministrationTerraform+2

Ordway - billing and revenue automation

SRE Consultant

Mar 2021Mar 2022 · 1 yr · Online · Remote

  • Hardened Docker images and EC2 AMIs.
  • Configured auto‑scaling on EKS clusters with custom metrics.
  • Introduced Spot Instances for non‑prod workloads to optimize AWS cost.
  • Orchestrated migration from Heroku to AWS EKS for improved scalability:
  • ◦ performed dozens of trial migrations
  • ◦ identified issues in elasticsearch, database migrations and fixed them before actual migration
  • ◦ minimized downtime by figuring out optimal ordering of migration steps
  • ◦ completed migration with zero critical issue
Shell ScriptingDatabasesPythonLinux System AdministrationLinuxAmazon Web Services (AWS)+1

Arcesium

SRE Lead

Apr 2019Mar 2021 · 1 yr 11 mos · Hyderabad, Telangana, India

  • Led Incident Management and Root Cause Analysis.
  • Automated internal workflows and assisted L1 support.
Linux System Administration

Adobe

Site Reliability Engineer

Feb 2017Apr 2019 · 2 yrs 2 mos · Noida Area, India

  • Saved significant AWS cost by optimizing AWS S3 storage.
  • Handled on‑call and regular SRE tasks, including toil reduction.
  • Developed an internal Chaos testing service using Flask, Celery and SaltStack.
PythonLinux System Administration

Gofro.com

Lead Devops

Nov 2015Feb 2017 · 1 yr 3 mos · Delhi

  • Implemented monitoring with OMD server and check_mk.
  • Managed AWS and in‑house infrastructure, including deployment automation with Jenkins and SaltStack.
PythonLinux System Administration

Adobe

Site Reliability Engineer

Apr 2015Oct 2015 · 6 mos · Noida, Uttar Pradesh, India

  • • Created SaltStack states and CloudFormation templates for Mesos cluster provisioning.
PythonLinux System Administration

Snapdeal

Senior DevOps

Feb 2014Apr 2015 · 1 yr 2 mos · Okhala, New Delhi

  • Maintained production and staging servers, implemented autoscaling for Tomcat servers.
  • Automated cloud infrastructure tasks using Boto and Shell scripts.
  • Reduced AWS expenses significantly through infrastructure optimization.
  • Awarded twice for both individual and team contributions.
PythonLinux System Administration

Bquobe

DevOps Engineer

Jan 2013Feb 2014 · 1 yr 1 mo

  • Developed web‑frontend and server‑side technologies for a distributed video encoding infrastructure.
  • Used ffmpeg for video encoding and Nginx for video streaming.
PythonLinux System Administration

Heymath (sankhyaa learning pvt. ltd)

Associate Systems Engineer

May 2011Dec 2012 · 1 yr 7 mos

  • Automated tasks using Crontab and Bash scripts
  • Created and managed a clustered NAS
  • Setup log aggregation with Rsyslog
Linux System Administration

Synechron technologies pvt. ltd

Software Analyst

Mar 2010Apr 2011 · 1 yr 1 mo

  • • Managed client infrastructure and created internal knowledge‑sharing wiki.
Linux System Administration

Education

International Institute of Information Technology, Pune

Master of Business Administration (MBA) — Information Technology

Jan 2007Jan 2009

Kolhapur Institute of Technology's College of Engineering, GOKUL-SHIRGAON

B.E — Electronics

Zero To Mastery Academy

Stackforce found 100+ more professionals with Amazon Web Services (aws) & Kubernetes

Explore similar profiles based on matching skills and experience