Vaitheeswaran S

CTO

Bengaluru, Karnataka, India · 10 yrs 2 mos experience
Most Likely To Switch · AI ML Practitioner

Key Highlights

  • Led a team of 30+ engineers in Site Reliability Engineering.
  • Architected PCI-DSS compliant environments from scratch.
  • Developed a cost management dashboard for AWS infrastructure.
Stackforce AI infers this person is a Site Reliability Engineer with expertise in Fintech and Infrastructure.

Skills

Core Skills

Site Reliability Engineering, Leadership, AWS, Platform Engineering, Infrastructure Orchestration, System Architecture, CI/CD, Cost Management, Java

Other Skills

Amazon Web Services (AWS), Android Development, Ansible, C, C++, CI/CD, CSS, Capacity Planning, Cascading Style Sheets (CSS), Chef, Cloud Applications, Cloud Computing, Cloud cost, Collaboration, Collectd

About

SRE @ CRED. Ex-Ola, Directi, FreeCharge

Experience

Total Experience: 10 yrs 2 mos
Average Tenure: 2 yrs 6 mos
Current Experience: 7 yrs 6 mos

Cred

2 roles

Head - Site Reliability Engineering & AI Labs

Promoted

Jul 2021 - Present · 4 yrs 10 mos

  • Heading multiple charters (30+ engineers): Core-Infra, Data-Infra, Platform Engineering, Infrastructure Security and AI Labs.
Leadership, Team Building, Solution Architecture, Cloud cost, Data infra, CICD, +5

Site Reliability Engineer

Oct 2018 - Jun 2021 · 2 yrs 8 mos

  • Served as the primary point of responsibility for the overall health, performance, and capacity of all services.
  • Owned the complete AWS infrastructure, continually optimising it for efficiency, security and cost.
  • Primary infrastructure stakeholder for compliance requirements, designing systems that adhere to them (PCI-DSS, ISO 27001:2013, Data-Localization (RBI) and NPCI guidelines).
  • Collaborated across technical and business functions to support and achieve business objectives.
AWS, Compliance, Infrastructure Security, Collaboration, Site Reliability Engineering

Ola (ANI Technologies Pvt. Ltd.)

Platform Engineer

Feb 2018 - Oct 2018 · 8 mos · Bengaluru Area, India

  • Wrote Python modules for infrastructure orchestration in Microsoft Azure.
  • Planned and executed the migration of the JFrog Artifactory instance used by a Jenkins server that runs thousands of jobs.
  • Wrote Python modules for fetching utilisation and pricing data for components running in AWS.
  • Was part of the on-call rotation responsible for the uptime of the consumer mobile application.
Python, Microsoft Azure, Jenkins, Infrastructure Orchestration, Platform Engineering

Endurance International Group

System Architect

Jul 2017 - Feb 2018 · 7 mos · Bengaluru Area, India

  • Part of the Systems Architect team that owns the business-critical component "Orderbox".
  • Architected a PCI-DSS compliant environment from scratch.
  • Designed a secure CI/CD pipeline using Atlassian Bamboo, Docker Registry and Vault.
  • Used HashiCorp Vault as Encryption-as-a-Service for encrypting the CHD.
  • Logging and monitoring: rsyslog-EK, Nagios.
  • Ran a POC and implemented a company-wide single sign-on solution (using FreeIPA and AD) for SSH access across hundreds of servers, which came in handy for PCI-DSS compliance requirements 7 and 8. Used the google-authenticator PAM module for MFA.
  • Used Puppet for configuration management and orchestration.
PCI-DSS, CI/CD, Docker, Configuration Management, Puppet, System Architecture

Freecharge

3 roles

Site Reliability Engineer II

Apr 2017 - Jun 2017 · 2 mos

Site Reliability Engineer

Jun 2016 - Mar 2017 · 9 mos

  • As part of the Site Reliability Engineering team, was responsible for uptime, scalability, optimization and monitoring of an infrastructure consisting of hundreds of servers. Involved in capacity planning and performance tuning during rigorous campaigns. Actively involved in cost-saving activities at the infrastructure level. Available 24/7 for pager duty and an active member of the SWAT force for immediate fixing of any production issue. Also involved in operations activities such as infra provisioning and maintenance (AWS CloudFormation).
  • SME for merchant support and merchant solution infrastructure; carried out architecture reviews of existing and new features/components to build a robust and efficient architecture.
  • SME for the deployment pipeline using Chef, Ansible and Jenkins.
  • Carried out infrastructure migration across AWS regions and accounts to meet compliance requirements.
  • Automated various day-to-day operations tasks using configuration management tools (Ansible, Chef) and scripts (Python, Bash, Ruby).
  • Won Devops-HackDay for building a cost dashboard for AWS infrastructure (grouped by cost centre, region and account) using Python, MySQL and Metabase.
AWS, Capacity Planning, Performance Tuning, Configuration Management, Site Reliability Engineering

SRE Intern

Oct 2015 - Jun 2016 · 8 mos

  • Evidence collection tool using Elastic Map Reduce (EMR), AWS and Java for law-enforcement queries.
  • Contributed to SSO setup for PCI DSS compliance (integrating Mac with IPA and setting up an OS X server).
  • Modules to automate operations tasks using configuration management tools such as Ansible and Chef.
  • POCs on open-source tools such as the TICK stack, Zabbix, OSSIM and the ELK stack.
  • Coordinated the migration from SVN to Git.
  • Available on call for production issues.
Elastic Map Reduce (EMR), Java, Configuration Management, SSO, Site Reliability Engineering

Education

Indian Institute of Management, Calcutta

EPGM

Visvesvaraya Technological University

Bachelor’s Degree — Computer Science and Engineering

Kendriya Vidyalaya

High School
