V

Vijay Gosai

DevOps Engineer

Delhi, India15 yrs 1 mo experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Expert in cloud-native architecture and large-scale migrations.
  • Reduced production outages by 90% through observability.
  • Developed automated deployment frameworks enhancing productivity.
Stackforce AI infers this person is a SaaS Infrastructure Engineer with expertise in cloud-native solutions and DevOps practices.

Contact

Skills

Core Skills

Cloud-native ArchitectureAwsObservabilityCi/cd PipelineMicroservicesCi/cdCloud ComputingLinux Administration

Other Skills

AIOpsAgentic AIAlgorithmsAmazon Web Services (AWS)AnsibleApacheApache TomcatBashBig DataChefContinuous DeliveryContinuous Integration and Continuous Delivery (CI/CD)Data CentersData StructuresDesign Patterns

About

I build infrastructure and AI systems that reduce toil, boost reliability, and scale with confidence. With over 14 years of experience across cloud-native platforms, large-scale cloud migrations, developer productivity tooling, automated delivery pipelines, and now GenAI-powered agents — I focus on building systems that are not just resilient, but intelligent. My work spans building self-healing infrastructure, scalable CI/CD frameworks, deep observability pipelines, and AI copilots for on-call automation — all aimed at empowering engineers to move faster with less friction and more confidence. Lately, I’ve been exploring how LLMs and Agentic systems are transforming the future of DevOps and AIOps, enabling platforms that are proactive, autonomous, and deeply context-aware. I’m passionate about solving real-world engineering problems with simplicity, scale, and purpose at the core.

Experience

Adobe

3 roles

Senior Computer Scientist

Feb 2024Present · 2 yrs 1 mo

  • Currently focused on driving operational excellence, cloud cost optimisation, and exploring the intersection of AI and DevOps.

Computer Scientist - II

Promoted

Jul 2021Feb 2024 · 2 yrs 7 mos

  • Led the migration of a high-impact, business-critical application from an on-premise DC to a fully cloud-native architecture on AWS.
  • Redesigned the infrastructure for scalability and cost-efficiency, refactored key components for cloud readiness, and led the deployment to production.
  • The application scaled seamlessly to handle 55 billion+ requests during the last holiday season — with notable cost savings and zero performance bottlenecks.
Cloud-Native ArchitectureAWSInfrastructure ArchitectureDevOps Practices and Principles

Computer Scientist - I

Jun 2019Jul 2021 · 2 yrs 1 mo

  • Introduced Observability into the systems that helped reducing the Production outages by 90%.
  • Designed and implemented multi-master SaltMaster architecture to reduce system provisioning time in DataCenters by 80%.
  • Designed, developed and implemented a self-service tool - "OnePortal" to reduce day-to-day SRE's operation toils up to 80%.
  • Reduced cloud infra costs by ~$100K yearly by eliminating unnecessary servers and downsizing capacity based on usage.
  • Wrote several custom exporters in Python to expose the critical application metrics on Prometheus.
  • Designed a single-click release deployment framework ( CI/CD Pipeline) to deploy the latest artifacts.
ObservabilitySaltStackPythonCI/CD Pipeline

Red hat

Senior Software Engineer

Sep 2017Jun 2019 · 1 yr 9 mos · Greater Delhi Area

  • Designed and implemented a fully automated Single Click Release Deployment framework to increase release productivity by 75%.
  • Implemented Opensource Monitoring and Logging solution to optimized infrastructure upfront cost up to 40%.
  • Accelerated Ops team's productivity up to 50% by developing a Python web application to self-provision VM(s) for dev team's testing.
  • Containerised more than 40 micro-services and deployed to Openshift with the help of fully automated Jenkins pipelines.
  • Responsible for designing a resilient, self-healing and scalable infrastructure for MicroServices.
PythonJenkinsOpenShiftMicroservicesCI/CD

Ge digital

2 roles

Software Engineer

Promoted

Jul 2016Sep 2017 · 1 yr 2 mos

  • Automated and orchestrated infrastructure, release deployments and middleware provisioning on the cloud (AWS) using Scalr, Jenkins2, and Chef.
  • Designed and implemented a container orchestration solution with the help of Docker Swarm.
  • Designed and implemented a centralized logging tool - Elastic Stack (ELK) in production for the cloud-hosted application.
  • Designed and developed an app for DevOps team using Python Django.
  • Designed and developed a mission-critical application for ITOPs team using Python Flask.
  • Designed and implemented Continuous Integration (CI) tool - Jenkins Pipeline as Code to automate the release process of cloud-native applications.
  • Designed and implemented Continuous Deployment (CD) tool – Chef to orchestrate cloud-native infrastructure.
  • Designed and implemented a secure, reliable, scalable and highly available architecture of multi-tier applications on AWS.
  • Automated repetitive tasks, batch jobs with the help of Python & Bash.
AWSDockerPythonJenkinsCloud ComputingCI/CD

Software Engineer

Nov 2014Jun 2016 · 1 yr 7 mos

  • Managed more than 500+ Linux servers with multiple websites in heterogeneous environments.
  • Migrated many applications from on-premises to AWS cloud.
  • Configured and managed LAMP middleware stack on cloud
  • Designed application infrastructure provisioning using Puppet/Chef.
  • Managed SQL/NoSQL (MySQL & MongoDB) databases in on-prem environments.
  • Monitored applications/infrastructure using Nagios, ICINGA, New Relic and Splunk.
  • Implemented a clustered RabbitMQ solution for messaging.
  • Automated several ad-hoc tasks using Shell, Python, and Ruby.
LinuxPuppetShell ScriptingMySQLLinux AdministrationCloud Computing

Tetra information services pvt. ltd

Senior System Engineer

Apr 2014Oct 2014 · 6 mos · Delhi Area, India

  • Providing L3 remote support more than 200 clients.
  • Hands on experience in setup, configuration, upgrade, maintenance, performance monitoring and troubleshooting of servers running on different OS platforms like Linux (Centos, Ubuntu, RHEL etc.).
  • Hands on experience on Zimbra, Qmail Mail server, SAMBA4 AD DC, Openldap2.X, Apache, Nagios Server, DNS & DHCP Server, Squid Proxy Server, NFS Server, PXE-Kickstart, Redhat Cluster Suite (RHCS), DRBD & Heartbeat Cluster etc.
  • Writing shell scripts to automate and streamline various tasks.
  • Performance tuning & load management of servers using iostat, top, htop, vmstat, sar etc.
  • Implementation of software Firewalls through Shorewall, TCP wrappers & IP tables.
LinuxShell ScriptingApacheLinux Administration

Dr. shroff's charity eye hospital

Linux System Engineer

Nov 2010Mar 2014 · 3 yrs 4 mos · New Delhi Area, India · On-site

  • Provided L3 remote support of more than 200 to the clients.
  • Handled application configurations/upgradations and performance monitoring.
  • Implemented DNS, DHCP & Squid Proxy Servers.
  • Performed application tuning and load management of servers.
  • Implemented software Firewalls through Shorewall, TCP wrappers & IPtables.
  • Implemented a production-grade open source application like mail servers, Samba4 AD, OpenLDAP, NFS, PXE-Kickstart, Redhat Cluster Suite (RHCS), DRBD and Heartbeat.
  • Automated and streamlined various ad-hoc tasks using Shell scripting.
LinuxShell ScriptingApacheLinux Administration

Education

Indira Gandhi National Open University

Master of Computer Applications - MCA

Jan 2013Jan 2016

Indira Gandhi National Open University

Bachelor of Computer Applications - BCA

Jan 2010Jan 2013

Stackforce found 100+ more professionals with Cloud-native Architecture & Aws

Explore similar profiles based on matching skills and experience