
Vishal Uderani

SRE (Site Reliability Engineer)

Mumbai, Maharashtra, India · 17 yrs 3 mos experience
Most Likely To Switch · Highly Stable

Key Highlights

  • 7 years of experience in system administration and operations engineering.
  • Expertise in cloud infrastructure automation and monitoring.
  • Proven track record in managing high-availability systems.
Stackforce AI infers this person is a Cloud Infrastructure and Operations Engineer with expertise in SaaS environments.

Skills

Core Skills

Cloud Infrastructure, Automation, Monitoring, Configuration Management, Cloud Management, System Administration, Storage Management, Systems Management, Technical Support

Other Skills

AWS, Datadog, Ansible, Docker, Elasticsearch, MongoDB, Scripting, Puppet, New Relic, Nagios, DHCP, NetApp, Cacti, Kickstart, Ubuntu

About

An architect-level System Administrator/Linux Administrator/Operations Engineer with 7 years of experience and advanced technical and interpersonal skills.

Experience

Total Experience: 17 yrs 3 mos
Average Tenure: 4 yrs 3 mos
Current Experience: 10 yrs 4 mos

Opentable

2 roles

Staff Site Reliability Architect

Mar 2021 - Present · 5 yrs 2 mos

Senior Site Reliability Architect

Jan 2016 - Present · 10 yrs 4 mos

Webengage

Lead DevOps Engineer

Mar 2015 - Dec 2015 · 9 mos

  • Write scripts to automate the creation and maintenance of distributed cloud infrastructure on AWS.
  • Experiment with new technologies to optimize the reliability and performance of our infrastructure automation.
  • Design and implement proactive monitoring using Datadog to ensure the health, performance and security of production and non-production cloud infrastructure.
  • Use the Datadog API to send metrics and time-series events.
  • Plan automated backups.
  • Design and execute capacity planning for new and existing clusters.
  • Deploy code across servers using Ansible with rolling updates.
  • Set up MongoDB clusters with replication and sharding.
  • Dockerize the dev stack so developers can run production setups on their laptops.
  • Scale Elasticsearch clusters with Ansible.
AWS, Datadog, Ansible, Docker, Elasticsearch, MongoDB, +2

Haptik Inc

Sr DevOps Engineer

Oct 2014 - Mar 2015 · 5 mos · Mumbai Area, India

  • Haptik allows you to chat with experts and get help with customer support issues, FAQs, information, and almost anything else within minutes. Think WhatsApp, but with a certified expert who knows the company inside out.
  • Manage/Maintain/Monitor services and instances deployed in EC2
  • Configuration Management and Application Deployment using Puppet/Ansible
  • Automate EC2 snapshots using aws-cli
  • Automate monitoring using New Relic
  • Deploy GitLab and migrate existing Git repositories to it
Puppet, Ansible, AWS, New Relic, Configuration Management, Cloud Management

Directi

3 roles

Sr Systems Administrator

Promoted

Sep 2011 - May 2014 · 2 yrs 8 mos · Mumbai Area, India

  • Maintained uptime of various production and management servers, managed deployments spanning hundreds of Linux servers, conducted RCAs, and worked with senior System Administrators on deployment automation, product development, and permanent resolutions for issues.
  • Troubleshot system/application faults using event logs or error logs. Coordinated with DC techs to manage server hardware specs.
  • Worked with the following technologies/tools: Nagios, DHCP, Kickstart, Cobbler, Yum, RPM, Git, Pulp, Puppet, cPanel.
  • Storage Administration
  • Solid understanding of the NetApp FAS3240 SAN, including but not limited to installation, management and configuration of NetApp filers
  • Planning, preparing and implementing Data ONTAP upgrades
  • Storage LUN provisioning; qtree and NFS file system management, creation, allocation and configuration
  • Physical storage management: assigning disk ownership, aggregates, FlexVol, qtrees, RAID-DP, etc.
  • NetApp LUN and volume management using tools such as Snapshot, SnapMirror, SnapRestore, vol copy, ndmpcopy, deduplication, etc.
  • Automated LUN lifecycle management using the NetApp SDK and Puppet
Nagios, DHCP, Puppet, NetApp, System Administration, Storage Management

Sr. Systems Engineer

Promoted

Mar 2010 - Aug 2011 · 1 yr 5 mos · Mumbai Area, India

  • Evaluate software/hardware products (free/commercial) as per needs.
  • Prepare & present proposals with respect to systems/software infrastructure.
  • Prioritize and allocate tasks to juniors as per their efficiency.
  • Ensure regular follow ups and completion of tasks.
  • Planning and executing maintenance/migration of services/servers/applications with other engineers
  • Developing better monitoring and systems management practices to maximize uptime, using Nagios, Cacti and Hyperic HQ across multiple sites.
  • Develop/Adopt new & efficient strategies to minimize manual/repetitive work, operational costs, deployment times, response times etc.
  • Troubleshooting complex systems/network issues with developers for their staging/live deployments
  • Responsible for managing/maintaining/monitoring the uptime of all critical internet applications, including Jira/Confluence/SVN/Git/TeamCity/Samba, using a cool stack of GlusterFS/Pacemaker and an Active/Active MySQL cluster
  • Automated CentOS installations using Kickstart, with support for multiple OSes and architectures
  • Configured local Ubuntu and CentOS mirror repositories to reduce bandwidth costs and integrated the repos with Kickstart using post-install scripts
  • Automated Ubuntu installations using Clonezilla
  • A (non)certified script kiddie who writes minor scripts in Bash and indulges in laziness thereafter
  • Mass deployment of Snom 300 VoIP phones using auto-provisioning via PXE and DHCP
  • Fair understanding of HP ProCurve 2910/2610 switches. Familiar with tagging/untagging, link aggregation and creating VLANs
  • Deploying RANCID (Really Awesome New Cisco confIg Differ) to back up switch configurations across multiple sites.
Nagios, Cacti, Kickstart, Ubuntu, Systems Management, Monitoring

Systems Engineer

Jul 2008 - Mar 2010 · 1 yr 8 mos · Mumbai Area, India

  • Managing a Helpdesk environment of 500+ users
  • Communicated electronically or in person with computer users experiencing difficulties to determine, diagnose, and resolve problems onsite or remotely
  • Participated in implementation of Windows 2003/2008 domains and services (AD, DNS, DHCP…)
  • Installed and configured Windows operating systems, common software (MS Office, Adobe…), professional applications (MS Project, Primavera, AutoCAD…) and antivirus solutions
  • Installed, configured, upgraded, and maintained hardware including computer systems, scanners, printers, network switches, access points
  • Installed and troubleshot hardware peripherals, accessories and parts (CPU, RAM, HDD, VGA, NIC, cartridges…)
  • A proven record of reliability, the ability to perform under time constraints, and good judgment under pressure.
  • The ability to lift 50 lbs and install computer and electronic equipment in a safe and careful manner.
  • Ability to maintain a reliable and methodical approach to support and documentation.
  • Helped and participated in implementation or renovation of wired & wireless LANs
  • Provided network, Internet and email support to users in response to identified difficulties
  • Performed backup/restore of important company and user data as per the schedule
  • System analysis, improvement and Documentation
  • Designed and developed functional forms and procedures such as “device naming standard”, “technical specification form”, “backup & restore procedures”, etc.
Windows Server, Active Directory, Networking, Technical Support, System Administration

Education

Mumbai University

TYBCOM — Commerce

Jan 2008 - Jan 2010
