M

Maheswaran Veluchamy

SRE (Site Reliability Engineer)

Bengaluru, Karnataka, India20 yrs 8 mos experience
Highly Stable

Key Highlights

  • 18+ years of IT experience in high-traffic environments.
  • Expertise in Azure Kubernetes Service and CI/CD pipelines.
  • Strong troubleshooting skills enhancing system reliability.
Stackforce AI infers this person is a seasoned Site Reliability Engineer with a focus on cloud infrastructure and DevOps in the SaaS industry.

Contact

Skills

Core Skills

Site Reliability EngineeringCloud InfrastructureDevopsSystem Administration

Other Skills

ApacheAppiumAutomation using pythonAzure AdministratorAzure Kubernetes Service (AKS)BashCMDBCapacity PlanningDHCPDNSData CenterDatadogDistributed ComputingDockerGit

About

IT professional with 18+ years of experience aligning IT strategies with business goals, managing high-traffic web services in Unix/Linux environments, and optimizing IP networking (TCP/IP, HTTP, DNS). Proficient in Azure Kubernetes Service (AKS), Jenkins, GitLab, Datadog, Grafana, Prometheus, ELK stack, PagerDuty, observability, CI/CD pipelines, and incident management. Adept at leveraging strong troubleshooting skills to enhance system reliability and performance. Known for effective communication and building strong relationships across all organizational levels.

Experience

20 yrs 8 mos
Total Experience
3 yrs 7 mos
Average Tenure
2 yrs 6 mos
Current Experience

Onetrust

Principal SRE

Dec 2023Present · 2 yrs 6 mos · Bengaluru, Karnataka, India · Hybrid

Azure Kubernetes Service (AKS)DatadogTerraformPagerDutyGitlabJenkins+2

Linkedin

2 roles

Senior Site Reliablity Engineer

Promoted

Oct 2015Nov 2023 · 8 yrs 1 mo · Bangalore

  • Supporting the development and customer engagements of LinkedIn's next-generation distributed system – Databus(https://github.com/linkedin/databus)
  • Work closely with Databus development teams to ensure that platforms are designed with "operability" in mind
  • Assist in restoring stability to services during site critical issues
  • Working with LinkedIn's extensive monitoring, writing custom monitoring services when necessary
  • Drive or participate in technical design and operational acceptance exercises for new services with Engineering and SRE teams
  • Building knowledge base and documentation for day to day operational tasks for the team
Service AvailabilitySite Reliability EngineeringCapacity PlanningProactive MonitoringIncident ManagementTroubleshooting+11

Site Reliability Engineer

Dec 2012Sep 2015 · 2 yrs 9 mos · Bangalore

Service AvailabilityDevOpsTroubleshootingPython (Programming Language)Red Hat Enterprise Linux (RHEL)Microsoft Azure+2

Time inc.

Linux System Engnieer

Nov 2009Dec 2012 · 3 yrs 1 mo · Bangalore, India

  • User and group administration on Linux/Solaris servers
  • Point of contact for all TimeInc Websites issues which have large level of impact around the world.
  • Creating , updating , removing TimeInc Domain names - DNS using bind
  • Coordinating with other teams during launch/deployment
  • Configuring virtual hosts on Apache web server
  • Deploying and launching templates in QA and PROD environment using Vignette Development Center
  • Perl and UNIX scripting, automating jobs.
  • Redirecting URL’s in apache web server
  • Troubleshooting/Working on alerts from Keynote, Nagios, Proactive Net, Sitescope
  • Working on DNS, NFS, TCP/IP and other Internet protocols in Linux/Solaris
  • Understanding of routers, switches, and have dealt with network debugging tools to check the trace routes, packet loss, TCP dumps and relate it to the applications impact
DevOpsSystem AdministrationTroubleshootingLinux System AdministrationRed Hat Enterprise Linux (RHEL)

Yahoo!

NOC Engineer

Nov 2007Nov 2009 · 2 yrs · Banglaore, India

  • Managing user and sudo accounts on Linux/FreeBSD servers
  • Monitoring disk space, cpu and memory usage on all servers
  • Troubleshooting all production Linux servers/network devices
  • Monitoring Yahoo servers and network using monitoring tool like netcool, gomez, Argus, nagios, MRTG etc.
  • Taking individual responsibility for resolving issues reported to the NOC from multiple sources including trouble tickets, phone calls, IMs and automated alerts
  • Resolving critical system issue, including notification, coordination and dispatch of individuals from various functional groups within the organization
Troubleshooting

Verismo networks

Linux System Administrator

Nov 2006Nov 2007 · 1 yr · Bangalore, India

  • Crash recovery, OS reloading and restoring data from backups
  • Perform software builds like kernel compilation, Apache/PHP recompilation and other software installation according to customer requirement.
  • Resolve issues related to web, mail or ftp via email through ticketing system (RT)
  • Regularly updating the patches for Linux servers
  • Server hardening for Linux Servers.
  • Monitoring server performance and disk space using shell scripts
  • Installing and configuring Raid Controller cards and IPMI cards
  • Building Verismo server operating system using LFS (Linux from Scratch)
  • RAID & LVM: Creating and maintaining Raid and Lvm.
System AdministrationTroubleshootingLinux System AdministrationRed Hat Enterprise Linux (RHEL)

Netdevices

Linux System Administrator

Aug 2005Nov 2006 · 1 yr 3 mos · Net Devices India Private Ltd

  • Installing, Configuring & Administering DNS, DHCP & Mail Server for Production and QA
  • Installing, Configuring & Administering Web Server (Apache) & FTP for Production and QA
  • Managing Network systems , Servers uptime
  • Maintenance of logs / files / documentation
  • Troubleshooting Linux Desktops/Servers
  • Backup /Restore - Daily / weekly / monthly
  • Managing user accounts
  • OS reloading and restoring data from backups
System AdministrationTroubleshootingLinux System AdministrationRed Hat Enterprise Linux (RHEL)

Education

Madurai Kamaraj University

M.Sc — IT&M

Jan 2003Jan 2005

Keswick Public School , Madurai

Higher Secondary

Jan 1996Jan 2000

Stackforce found 100+ more professionals with Site Reliability Engineering & Cloud Infrastructure

Explore similar profiles based on matching skills and experience