Devashish Gupta

DevOps Engineer

Gurugram, Haryana, India · 4 yrs 5 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Expert in architecting scalable cloud infrastructure.
  • Proven track record in MLOps and AI platform engineering.
  • Strong focus on security and compliance in cloud environments.
Stackforce AI infers this person is a Cloud Infrastructure and DevOps specialist in the AI and SaaS sectors.

Skills

Core Skills

Cloud Architecture & Resilience · MLOps & AI Infrastructure · Cloud Security · DevSecOps & Automation · Governance & Compliance · Network Engineering · Network Administration

Other Skills

Amazon Web Services (AWS) · Security · Redis · SQS · Step Functions · AWS Batch · Observability · Terraform · AWS CodePipeline · AWS Systems Manager · AWS CloudHSM · Ansible · Computer Networking · Server Administration · Data Governance

About

Strategic Senior DevOps Engineer with a proven track record of architecting mission-critical, scalable, and secure cloud infrastructure for high-growth AI startups. An expert in translating complex business requirements into robust technical solutions that enhance performance, ensure reliability, and control costs.

Core Competencies:

  • MLOps & AI Infrastructure: Designing and deploying high-density model serving platforms (EKS, Triton, ModelMesh) and event-driven batch processing pipelines (AWS Batch, Step Functions).
  • Cloud Architecture & Resilience: Architecting multi-region disaster recovery strategies, database optimization patterns, and highly available, cost-efficient AWS environments.
  • DevSecOps & Automation: Implementing "Shift Left" security in CI/CD, automating developer workflows (GitHub Bots), and leading security analysis and remediation (AWS GuardDuty).
  • Governance & Compliance: Driving initiatives to reduce technical debt, implement cloud governance (AWS Control Tower), and ensure adherence to standards like SOC 2 and ISO 27001.

GitHub: https://github.com/dcgmechanics
Medium: https://dcgmechanics.medium.com

Updated on 16 Nov 2025

Experience

Spyne

DevOps Engineer 2

May 2025 - Present · 10 mos · Gurugram, Haryana, India · On-site

  • Cloud Architecture & Performance Optimisation:
      • Resolved RDS performance bottlenecks by architecting a "Cache as a Gatekeeper" pattern (Redis), an Aurora migration strategy, and SQS decoupling for high-volume writes.
      • Mitigated L7 DDoS attacks by implementing an AWS WAF rate-limiting solution, successfully handling traffic spikes exceeding 50,000 requests per minute.
      • Designed ECS Service Connect to reduce latency, cut data transfer costs, and secure internal service communication.
  • AI/ML Platform Engineering (MLOps):
      • Architected a scalable, event-driven ML batch processing pipeline (SQS, Step Functions, AWS Batch) for asynchronous model training/inference.
      • Designed a high-density MLOps platform on AWS EKS using ModelMesh and NVIDIA Triton to efficiently serve hundreds of multi-modal AI models.
      • Developed scaling strategies for ML workloads, defining principles for "warm pools" and Spot Instance usage for up to 90% cost savings.
  • Automation, Security & Disaster Recovery:
      • Engineered a GitHub App (Bot) from scratch to automate JIRA validation within pull requests, enforcing development best practices and blocking unlinked merges.
      • Designed a comprehensive, multi-region "Warm Standby" disaster recovery (DR) plan for all critical AWS services (ECS, RDS, SQS) to ensure business continuity.
      • Conducted security analysis with AWS GuardDuty, implementing remediation plans for high-severity risks like data exfiltration and exposed resources.
      • Authored a strategic roadmap to reduce technical debt, proposing migrations from Jenkins to AWS CodePipeline and adopting AWS Control Tower for a multi-account strategy.
  • Cost Optimisation & Cloud Migration:
      • Led complex cloud migrations, including moving a memory-intensive database to GCP using Terraform and strategically migrating all AI workloads from GCP back to AWS.
      • Achieved significant cost savings by migrating GPU instances to more cost-effective types (g5 to g6) and downsizing under-utilised Redis & RDS instances.
Amazon Web Services (AWS) · Security · Cloud Architecture & Resilience · MLOps & AI Infrastructure

SMC Group

2 roles

Member of Technical Staff - II (Cloud & DevOps)

Sep 2024 - May 2025 · 8 mos · New Delhi, Delhi, India · Hybrid

  • Engineered a cost-efficient GitOps CI/CD pipeline using Terragrunt and Terraform on AWS CodePipeline, standardizing infrastructure deployment with:
      • Enhanced consistency in infrastructure changes
      • Improved auditability and version control
      • Reduced manual intervention and potential human errors
  • Transformed incident response capabilities by strategically implementing AWS Systems Manager, resulting in:
      • Dramatically reduced Mean Time to Recovery (MTTR)
      • Proactive application performance management
      • Streamlined operational efficiency through advanced monitoring and rapid incident mitigation
  • Implemented an advanced data protection strategy utilizing AWS CloudHSM, achieving:
      • Robust encryption key management compliant with SEBI Bring Your Own Key (BYOK) guidelines
      • Minimal infrastructure overhead
      • Enterprise-grade security using Hardware Security Modules (HSMs)
      • Comprehensive protection of sensitive organizational data
  • Integrated the LGTM (Loki, Grafana, Tempo, Mimir) observability stack with our AWS infrastructure:
      • Automated deployment of the entire observability stack using Infrastructure as Code for seamless integration with AWS application infrastructure
      • Achieved smooth integration with AWS ECS tasks using Grafana Alloy as an agent sidecar
      • Implemented custom Grafana dashboards and data sources (Loki for logs, Mimir for metrics, and Tempo for traces)
Observability · DevSecOps & Automation

Senior Software Engineer (Cloud & DevOps)

Mar 2024 - Aug 2024 · 5 mos · New Delhi, Delhi, India · Hybrid

  • Migrated on-premises data center infrastructure to AWS, significantly enhancing scalability and cost-efficiency.
  • Championed the adoption of GitOps, achieving 96% Infrastructure as Code (IaC) coverage using Terraform, Ansible, Packer, and CloudFormation. This cut infrastructure provisioning time from days to minutes.
  • Cultivated a DevOps and DevSecOps culture, integrating security controls and compliance checks into the software delivery lifecycle. Leveraged Shift Left security practices to shorten SDLC time from weeks to minutes.
  • Designed and implemented cloud infrastructure compliant with stringent SEBI regulations, strengthening security posture and data governance.
Cloud Security · Amazon Web Services (AWS) · Cloud Architecture & Resilience

Codelogicx

3 roles

DevOps Engineer

Promoted

Apr 2022 - Mar 2024 · 1 yr 11 mos · Kolkata, West Bengal, India

  • Provisioned, managed, and optimized EC2 instances on AWS for various workloads.
  • Implemented robust CI/CD pipelines using AWS services such as CodePipeline, CodeDeploy, and Elastic Beanstalk.
  • Configured and managed core AWS services including VPC, S3, IAM, RDS, and ElastiCache.
  • Implemented cost optimization strategies using AWS CloudWatch and billing alerts.
  • Enforced security best practices by leveraging AWS services such as GuardDuty and Inspector.
  • Reduced infrastructure provisioning time by 30% through automation.
  • Increased application deployment frequency by 2x with CI/CD pipeline implementation.

Jr. DevOps Engineer

Sep 2021 - Apr 2022 · 7 mos · Kolkata, West Bengal, India

  • Contributed to UEM implementation using SureMDM and endpoint security with Bitdefender GravityZone.
  • Integrated endpoint security solutions into the company infrastructure.

DevOps Engineer Trainee

Jun 2021 - Sep 2021 · 3 mos · Kolkata, West Bengal, India

  • Implemented version control using Bitbucket, managing repositories and user access.
  • Deployed and configured a Pritunl VPN server for secure remote access.

IEC Education Ltd

Network Engineer

Jan 2018 - Apr 2018 · 3 mos · New Delhi, Delhi, India · On-site

  • Installed, configured, and maintained CCProxy servers.
  • Monitored proxy server performance and identified issues.
  • Troubleshot connection problems and performance bottlenecks.
  • Optimized proxy server settings for speed and efficiency.
  • Implemented security measures to protect the proxy server.
  • Provided technical support to users.
  • Created and maintained documentation.
Network Engineering · Computer Networking

Hard Shell Technologies Pvt Ltd

Network Administrator

Jun 2017 - Jul 2017 · 1 mo · Noida, Uttar Pradesh, India · On-site

  • Network Engineer with a focus on firewall administration and network infrastructure maintenance.
  • Proven ability to maintain and monitor on-premises Endian Firewall for optimal performance and security.
  • Skilled in network troubleshooting and problem-solving to ensure network uptime and reliability.
  • Experienced in designing and implementing efficient network layouts using tools like Boson NetSim.
Network Administration · Server Administration

Education

Maharshi Dayanand University

Bachelor of Technology - BTech — Computer Software Engineering

Apr 2014 - Oct 2018

College of Commerce

Higher Secondary — Science

Apr 2012 - Mar 2014
