Sarvesh .

SRE (Site Reliability Engineer)

Lehi, Utah, United States15 yrs 6 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • 15 years of experience in complex systems engineering.
  • Expert in Site Reliability Engineering and Database Management.
  • Proven track record in automating deployment processes.
Stackforce AI infers this person is a SaaS Infrastructure Engineer with extensive experience in cloud and database technologies.

Contact

Skills

Core Skills

Database ManagementCloud TechnologiesService EngineeringInfrastructure Management

Other Skills

ACK controllersAPI IntegrationAWSAmazon Web Services (AWS)AnsibleApache DruidAzureCassandraCloudWatchCommunicationCursorDockerGeneral OperationsGrafanaHadoop

About

Dynamic and strategic-minded engineer with 15 years of extensive experience in architecture, design, and implementation of complex systems in platform and infrastructure specializing in Site Reliability Engineering(SRE) and Databases. I believe my strength lies in thinking around a problem, find its every loophole, try to solve it in best possible way and always look for improvements. My specialities include:- • RDBMS concepts including language like SQL Server, MySQL, Postgres. • Windows/Linux system administration.Programming languages like Python, Shell scripting. • Frontend programming languages like ReactJS, Javascript. • Monitoring and dashboarding tools like Nagios, Icinga, NewRelic, Prometheus, Grafana using Cortex • Deployment configuration tools like Saltstack, Rundeck, Ansible • Cloud technologies like AWS, Azure. • BIGDATA technologies like Apache Druid, Hadoop, MapReduce etc. • Containerization technologies like Docker/Kubernetes • NOSQL technologies like Cassandra, MongoDB

Experience

Adobe

4 roles

Senior Database Reliability Engineer

Promoted

Sep 2024Present · 1 yr 6 mos

  • My daily responsibilities include being part of Creative Cloud and Business platforms:
  • Managing a large fleet of databases and clusters with respect to their end-to-end lifecycle using automations, performance tunings, benchmarking, etc.
  • Technologies we use are Cassandra, MySQL (RDS, Aurora, Serverless), MongoDB,Postgres, etc.
  • Managing Infra in which databases are hosted like AWS, Azure and look for Database related cost optimizations wherever necessary.
  • Automating deployment and management processes using Terraform, Shell, Python scripting, Chef cookbooks, including blue-green deployments for zero downtime and other tools to minimize manual intervention.
  • Enhancing observability with advanced monitoring tools such as Grafana, CloudWatch, Prometheus exporters, and alerting systems like ServiceNow Pagers.
  • Syncing with engineering counterparts to discuss architecture, schema changes, performance tuning for databases, and set guidelines.
  • Leading root cause analysis and developing self-healing solutions to minimize downtime.
  • Key Projects:
  • Terraform automation for Aurora Serverless & RDS disaster recovery
  • Multi-account database inventory system with KLAM API integration and MySQL analytics
  • TB-scale Cassandra-to-Keyspaces migration with AI-powered validation
  • Kubernetes-native monitoring: Custom CRDs, CloudWatch automation, Helm/Argo CD deployments
  • LangChain agents and MCP servers for AWS infrastructure monitoring
TerraformShell ScriptingPythonKubernetesAmazon Web Services (AWS)ACK controllers+5

Production Service Engineer 5

Oct 2021Sep 2024 · 2 yrs 11 mos

  • Associated with Adobe Connect team which is a leader in meeting/web-conferencing.
  • Proactively monitoring application and database alerts via key tools like Nagios, Prometheus, Splunk for logging, Grafana for dashboarding, Service Now for Pagers, creation of custom metrics using Python and running them as service and exporting them dashboards etc.
  • Automating certain day to day repetitive tasks using scripting (Python, ShellScript, etc.)
  • Working on different cloud technologies like AWS to manage our infrastructure created using Terraform, use AWS SSM to manage nodes and deploy changes, configure cloudwatch dashboards, manage application and network load balancers, cost optimization for tags, EKS cluster upgrades, RDS and lambda configurations, SSM, creation of components using Terraform etc.
  • Working on different database technologies like SQL Server, Postgres, Cassandra etc.
  • Deployment of new versions of application from end to end using automation, CI/CD tools like Saltstack, Rundeck, Jenkins etc.
  • Exposure to containerization technologies like Docker/Kubernetes for creation of statefulsets, config changes, deployments, etc.
  • Creating and auditing run books for auditing/knowledge sharing.
Root CauseSystem AdministrationInfrastructure as a Service (IaaS)CommunicationGeneral OperationsService Engineering+2

Software Development Engineer 3

Promoted

May 2014Oct 2021 · 7 yrs 5 mos

  • Worked in multiple product teams Adobe Connect, AEMM, Primetime, Adobe Campaign
  • Managed large fleet of Postgres databases our SaaS product.
  • Managed large Druid clusters for Campaign reporting using HBase, Hadoop technologies, hosted in AWS.
  • Writing all configurations of product in javascript and Python to avoid manual product onboarding.
  • Proactively monitoring application and database alerts via key tools like New Relic monitors, Nagios, Icinga, Splunk for logging, Grafana for dashboarding etc.
  • Automating certain day to day repetitive tasks using scripting (Python, ShellScript, TSQL etc.)
  • Developing tools for daily activities or projects e.g. provisioning of customers.
  • Working on different database technologies like SQL Server, MySQL, Postgres etc. as part of different products.
  • Deployment of new versions of application from end to end using automation, CI/CD tools like Rundeck, Ansible etc.
Root CauseSystem AdministrationInfrastructure as a Service (IaaS)CommunicationGeneral OperationsService Engineering+2

Database Administrator

Jul 2013May 2014 · 10 mos

  • We maintain internal databases for various Adobe applications with full accountability working in a 24*7 environment having more than 100 SQL Server instances.
  • Ensuring 24x7 availability.
  • Handling databases around 2TB in size.
  • Attending meetings and understanding requirements of the clients and designing environment according to their needs.
  • Working on different tools as per the requirement.
  • Configured monitoring tool for my environment (SCOM 2007R2) and currently undergoing (Sharepoint 2010) training.
  • Undergo various POC’s from time to time before presenting a valid case study for a particular product before a meeting.

Accenture

Software engineer

May 2013Jun 2013 · 1 mo · Gurgaon, India

  • SQL Server DBA

Hcl technologies (infrastructure services division)

SQL Server Specialist

Aug 2010May 2013 · 2 yrs 9 mos · Noida Area, India

  • Extensive experience in Database Administration, BI Solutions and SQL Server Clustering. Hands on experience in maintenance and administration of databases, configuration, disaster recovery and database security. Proficient in managing a wide range of database environments consisting of multiple SQL Server versions with terabyte sized databases. Possess strong communication, leadership, team management, analytical and relationship management skills. Always keen to learn new technologies and willing to share knowledge among the team.

Education

Northern India Engineering College

B.Tech — Computer Science and Engineering

Jan 2006Jan 2010

HAL

XIIth CBSE

Jan 2004Jan 2005

St Dominic Savio College

Xth ICSE

Jan 2002Jan 2003

Stackforce found 100+ more professionals with Database Management & Cloud Technologies

Explore similar profiles based on matching skills and experience