Tamil Selvan Palani

CTO

Chennai, Tamil Nadu, India · 18 yrs 8 mos experience
Highly Stable

Key Highlights

  • Doubled deployment frequency with CI/CD modernization
  • Reduced MTTR by 40% through automation
  • Led global teams in high-stakes fintech environments
Stackforce AI infers this person is a Fintech expert with strong capabilities in Site Reliability Engineering and DevOps.

Skills

Core Skills

Engineering Management · Site Reliability Engineering · DevOps · Leadership · Java Development

Other Skills

Engineering Leadership · Strategic Partnerships · Continuous Integration and Continuous Delivery (CI/CD) · People Management · GitHub Actions · Kubernetes · CI/CD · Automation · Monitoring · Collaborative Leadership · Datadog · Splunk · SignalFx · Collaboration Solutions · Alerting

About

As a Senior Engineering Manager at Walmart Global Tech India, I lead global engineering teams focused on building highly reliable, secure, and scalable platforms. These qualities are critical for fintech and retail systems, where uptime, data integrity, and rapid delivery are non-negotiable. I manage a team of 10+ engineers and oversee 100+ microservices supporting payment gateways, transaction processing, and financial reporting systems. By modernizing our CI/CD pipelines with GitHub Actions, Artifactory, and Kubernetes, we doubled deployment frequency, reduced MTTR by 40%, and improved change failure rates, all while maintaining strict governance and auditability standards.

A key part of my work is embedding Site Reliability Engineering (SRE) practices into the software delivery lifecycle: establishing service-level objectives (SLOs), improving observability with tools like Splunk, Prometheus, and Grafana, and leading high-severity incident response with a focus on containment, root cause analysis, and follow-through.

I oversaw the Resiliency Platform, orchestrating automated resiliency tests through CI/CD pipelines in non-production environments and conducting scheduled production resiliency tests for Tier 0 and Tier 1 applications to ensure constant service readiness and disaster recovery preparedness. I also developed an intuitive internal portal where application owners can monitor and access detailed resiliency test results.

A key strength throughout my time at Walmart and PayPal has been my collaborative approach. I partnered extensively with Directors from various domains to identify their distinct reliability pain points, then developed tailored, co-owned roadmaps with their engineering teams to improve system reliability.

I thrive in regulated, high-stakes environments where security, compliance, and resilience intersect with innovation. I collaborate closely with product, risk, and InfoSec teams to ensure engineering solutions are robust, auditable, and aligned with broader business and regulatory goals. At my core, I'm passionate about growing future leaders, building trust-driven engineering cultures, and delivering software that powers confident financial decisions at scale.

Experience

Walmart Global Tech India

Senior Software Engineering Manager

Oct 2023 - Present · 2 yrs 5 mos · Chennai, Tamil Nadu, India · Hybrid

  • Led end-to-end CI/CD modernization across 100+ microservices, boosting release frequency by 3x and improving deployment reliability for mission-critical retail and payment platforms.
  • Spearheaded the management of a critical Resiliency Platform, automating resiliency tests via playbooks within CI/CD pipelines for lower environments and scheduling periodic production tests for Tier 0/1 applications to ensure continuous disaster recovery readiness. Developed an internal portal for real-time tracking and reporting of test results to application owners.
  • Defined and enforced standardized deployment practices and change management workflows, reducing release rollback incidents by 40% and ensuring compliance with audit requirements.
  • Partnered with enterprise architects, SREs, and product teams to align technical delivery with business priorities, resulting in faster time-to-market and reduced operational risk.
  • Drove automation-first initiatives that cut manual deployment overhead by 60%, enhancing developer productivity and enabling consistent delivery pipelines across teams.
  • Mentored a global team of engineers to adopt platform reliability best practices, fostering a high-performance engineering culture.
Engineering Leadership · DevOps · Strategic Partnerships · Site Reliability Engineering · Continuous Integration and Continuous Delivery (CI/CD) · Engineering Management

PayPal

4 roles

Manager 2, Site Reliability Engineering

Mar 2023 - Oct 2023 · 7 mos

  • Directed global SRE teams to implement observability and auto-remediation solutions, reducing incident detection time by 50% and improving system uptime across high-traffic fintech services.
  • Leveraged strong collaborative skills, partnering with Directors across multiple domains to evangelize the critical importance of reliability.
  • Proactively identified unique reliability challenges within each domain and co-created joint roadmaps with their engineering teams to define and improve their specific reliability needs.
  • Automated diagnostics, monitoring, and fault injection workflows using Datadog, Splunk, SignalFx, and custom tools, enabling faster root cause analysis and recovery.
  • Influenced platform architecture to embed resilience at the design stage, leading to a 35% drop in critical incidents over two quarters.
  • Championed an SLO-driven reliability culture across engineering teams, resulting in higher service health scores and measurable improvements in customer experience.
DevOps · Strategic Partnerships · Collaborative Leadership · Site Reliability Engineering

Manager, Embedded Site Reliability Engineering

Promoted

Jun 2021 - Mar 2023 · 1 yr 9 mos

  • Strategically injected SRE practices across various PayPal verticals by integrating SREs directly into their scrum teams, effectively embedding reliability into core SDLC workflows and running multiple parallel tracks.
  • Ensured every new feature release adhered to defined service-level objectives and rigorous failure recovery standards.
  • Enabled proactive incident management through the implementation of robust monitoring and alerting pipelines, which improved first-response time by 45%.
  • Led cross-functional alignment on performance and reliability KPIs, helping teams reduce customer-impacting incidents by 30% and streamline on-call practices.
  • Coached engineers in chaos engineering, observability, and operational readiness, fostering a culture of continuous reliability improvement.
Collaboration Solutions · Leadership · Strategic Partnerships · Site Reliability Engineering

MTS 1, Software Engineer

Apr 2017 - Jun 2021 · 4 yrs 2 mos

  • Engineered scalable backend services in Java and NodeJS to support high-volume payment processing, contributing directly to PayPal’s core financial transaction systems.
  • Developed internal operational dashboards using React to improve real-time visibility into system health, ownership, and deployment pipelines—reducing incident resolution time for on-call teams.
  • Ensured high availability and system resilience across distributed services, consistently meeting 99.99% uptime goals for customer-facing platforms.
  • Collaborated closely with Site Reliability Engineering (SRE) teams to implement monitoring, alerting, and auto-remediation strategies, leading to improved platform observability.
  • Supported live site incident response efforts and participated in post-incident reviews, helping teams identify root causes and implement long-term fixes.

Software Engineer 3

May 2014 - Mar 2017 · 2 yrs 10 mos

  • Led triaging and debugging efforts across critical customer-impacting issues, collaborating with cross-functional teams to ensure fast, root-cause-driven resolutions in high-pressure environments.
  • Developed and maintained tools for incident tracking, alert routing, and operational analytics, contributing to improved engineering response times and reduced MTTR.
  • Delivered full-stack features and internal platforms using Python, NodeJS, React, and Java, supporting both backend workflows and operational dashboards for engineering and support teams.
  • Worked across multiple platforms, including internal monitoring tools, CI/CD systems, and service ownership dashboards, improving system visibility and engineering efficiency.
  • Streamlined support processes by identifying recurring issues, contributing code fixes, and driving automation to eliminate manual intervention.

Xerox

Associate Lead Engineer

Nov 2011 - Apr 2014 · 2 yrs 5 mos · Chennai

  • Led product development efforts for enterprise applications using Eclipse RCP, EJB 3, and EclipseLink, contributing to the successful delivery of solutions for document and workflow automation.
  • Designed and implemented modular, component-based user interfaces within the Eclipse Rich Client Platform, improving reusability and reducing UI development time across product lines.
  • Collaborated with cross-functional teams in an Agile development environment, actively participating in sprint planning, backlog grooming, and daily stand-ups to ensure timely and high-quality deliverables.
  • Mentored junior developers, conducted code reviews, and contributed to architecture discussions to promote clean, maintainable, and scalable code practices.
  • Optimized data persistence layers and improved overall application performance by fine-tuning JPA configurations and SQL queries.

Infosys Technologies Ltd

2 roles

Technology Analyst - Java, J2EE

Promoted

Sep 2009 - Nov 2011 · 2 yrs 2 mos · On-site

  • Worked on multiple enterprise-level Java and J2EE projects across financial and public sector domains, contributing to both development and production support lifecycles.
  • Developed scalable backend components using Java, JSP, Servlets, and EJBs, ensuring performance and reliability in high-volume transactional systems.
  • Performed in-depth bug analysis and resolution, collaborating with QA and client teams to meet SLAs and improve system stability.
  • Contributed to full SDLC phases, including requirements analysis, technical design, coding, unit testing, and deployment support.
  • Adhered to best practices in code quality, version control, and documentation, while working in globally distributed Agile teams.

Software Engineer

Jun 2007 - Sep 2009 · 2 yrs 3 mos · On-site

  • Contributed to multiple projects for one of the largest American banks, focusing on small-to-medium feature development and critical bug fixes in Java and JavaScript.
  • Developed and maintained backend components, front-end enhancements, and integration modules supporting financial workflows and customer-facing systems.
  • Collaborated with onshore teams to analyze change requests, implement solutions, and ensure timely deployments aligned with compliance standards.
  • Gained strong foundational experience in enterprise application development, troubleshooting, and client communication within a fast-paced, regulated environment.
Core Java · JavaScript · EJB · Agile Methodologies · Java Development

Education

College of Engineering, Guindy - Anna University

M.Sc.

Jan 2002 - Jan 2007

Blessing Matric Hr Sec School, Keerapakkam

HSC — Computer Science

Jan 2000 - Jan 2002
