E

Eswar Krishnan

DevOps Engineer

Singapore, Singapore19 yrs 6 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Over 18 years of experience in DevOps and SRE.
  • Expert in building and leading high-performing engineering teams.
  • Proven track record in driving cloud solutions and innovation.
Stackforce AI infers this person is a DevOps and Site Reliability Engineering expert in the E-commerce and Fintech sectors.

Contact

Skills

Core Skills

DevopsSite Reliability EngineeringCloud Solutions

Other Skills

Technical LeadershipCross-functional CollaborationProject ManagementCloud EngineeringSolution ArchitectureCloud-native PlatformsDevOps ImplementationSRE OperationsCI/CDInfrastructure ManagementTechnical ManagementDevOps PracticesCloud OperationsCapacity PlanningInfrastructure Optimization

About

- 18+ years of progressive experience in E-commerce, Internet, Fin-tech, and start-up sectors, specializing in DevOps and Site Reliability Engineering. - Expert in technical leadership and management of core DevOps/SRE practices, including strategy definition, roadmap development, and execution. - Extensive hands-on experience in designing and building scalable, multi-tenant distributed systems across various platforms. - Proven ability to build, lead, and mentor high-performing global engineering teams, fostering strong cross-functional collaboration with product and management stakeholders. - Passionate about driving new initiatives and continuously redefining DevOps/SRE practices to build performant and resilient architectures.

Experience

Rakuten symphony

3 roles

Senior Manager - Cloud Solutions

Promoted

Jan 2025Present · 1 yr 2 mos

  • Led cross-functional technical teams in the successful implementation of Proof of Concepts (PoCs) for diverse customers, directly contributing to new business acquisition and expansion.
  • Spearheaded customer and sales team engagements, consistently exceeding expectations in solution delivery and fostering strong, collaborative relationships.
  • Actively engaged with key customers to identify and capitalize on opportunities for expanding Rakuten Symphony's presence across various Operating Companies (OpCos) and Cloud Business Units (BUs).
  • Collaborated extensively with internal Business Units and external partners to achieve organizational objectives and drive synergistic solution development.
  • Systematically identified and resolved critical technical and operational issues impacting product quality, reliability, and project timelines.
  • Proactively planned and mitigated project risks, ensuring streamlined implementations and maintaining transparent communication with stakeholders.
  • Championed a culture of innovation and agility, consistently delivering high-quality solutions with speed and efficiency.
  • Demonstrated strong ownership in maximizing team output, fostering a diverse and inclusive environment that empowered team members.
  • Enhanced customer value by anticipating needs, enriching engagement experiences, and aligning technical solutions with strategic business objectives.
  • Exercised quick and accurate decision-making, consistently driving outcomes aligned with organizational goals.
Cloud SolutionsTechnical LeadershipCross-functional CollaborationProject ManagementDevOpsSite Reliability Engineering

Architect - Cloud Engineering

Jun 2022Dec 2024 · 2 yrs 6 mos

  • Device, architect and manage solutions for the Rakuten Symphony's customers and help them to on-board and take them through the cloud journey.
Cloud EngineeringSolution ArchitectureDevOpsCloud Solutions

Principal Engineer - Site Reliability Engineering (SRE)

Jan 2021May 2022 · 1 yr 4 mos

  • Rakuten Symphony, a Rakuten Group business organization with operations across Japan, the United States, Singapore, India, Europe and the MEA region, develops and brings to the global marketplace cloud-native, open RAN telco infrastructure platforms, services and solutions, including the Rakuten Communications Platform.
Site Reliability EngineeringCloud-native PlatformsCloud Solutions

Dfs group limited

Manager, DevOps and Site Reliability (SRE) Transformation

Jun 2019Dec 2020 · 1 yr 6 mos · Singapore · Hybrid

  • Technical management of DFS's retail luxury online store platform catering towards customers across Asia-pacific.
  • Strategizing the DevOps implementation & SRE operations for the Travel Incentive online portal, which becomes a vital part of the entire travel retail experience.
  • Technologies used:
  • DevOps & CI/CD - Proficient in Jenkins, GitLab CI/CD, Nexus (artifact management), BitBucket/GitHub/Stash (version control), Vault (secrets management), & Harbor (image scanning).
  • Infrastructure & Configuration Management - Experience with Terraform & Ansible for infrastructure provisioning & application stack management.
  • Containerization & Orchestration - Expert in Docker, Kubernetes, and Docker Swarm for microservices deployment, including Istio for service mesh implementation.
  • Cloud Platforms - Deep expertise in AWS Cloud operations and initiatives, with working knowledge of Azure and Alibaba Cloud services.
  • Testing & Quality Assurance - Implemented automated SAST with SonarQube for early defect detection and comprehensive code coverage; experienced with Gatling for performance testing and bench-marking.
  • Data & Stream Processing - Managed distributed Kafka clusters and possesses strong knowledge of stream processing platforms.
  • Observability & Monitoring - Deploying and managing comprehensive monitoring solutions including Datadog, ELK stack, Nagios, New Relic, Rundeck, Uptrends, CloudWatch, Prometheus, and Grafana.
  • Databases: Proficient in RDBMS (MySQL, Oracle) & NoSQL databases (Cassandra, Couchbase).
  • Leadership & Methodologies:
  • Team & Technical Leadership: Led teams of 4-6 SRE/DevOps engineers, providing technical guidance & project delivery oversight.
  • Agile Methodologies: Applied ITIL principles & Agile methodologies (Kanban/Scrum) for efficient project management.
  • Continuous Learning: Demonstrated passion for technology adoption, continuous learning, & adapting to evolving tech landscapes to contribute to business vision & product roadmaps.
DevOps ImplementationSRE OperationsCI/CDInfrastructure ManagementDevOpsSite Reliability Engineering

Rakuten

2 roles

Technical Assistant Manager (DevOps/SRE)

Jan 2017May 2019 · 2 yrs 4 mos · Tokyo, Japan

  • Technical management of core DevOps/SRE practices & new initiatives thereby building highly evolving teams in-sync with broader group-wide strategies and company's vision
  • Part of the Rakuten Super Point Platform DevOps team which award loyalty points to encourage customer retention.
  • Through Rakuten Super Point, enhancing & transforming everyday experiences of millions of users connected to Rakuten's e-commerce services
  • Managing technical team of size: 4-6 members co-located & spread remotely
  • Capacity planning & Infrastructure optimization, BCP planning and implementation
  • Device tech strategy/vision for DevOps teams aligning with group's broader goals
  • Expand DevOps practices and Automate Infrastructure Delivery, Provide Self-service capability
  • Technical Project Management,end to end agile/scrum management,budgeting for the projects and team
  • Build DevOps/Production-Support/Site-Reliability teams from ground-up,Co-located and remote hiring,retaining,personal development, progression & welfare of your team
  • Technical management and technical designation of projects,maintain rich talent pool as evolving teams
  • Promote self-service platforms,self-healing architectures,micro-service based implementations,reduce time to market practices
  • Work closely with cross-functional team members to coordinate operations effort in development of product and deliver new features to the market
  • Resolving conflicts by demonstrating leadership and appropriate decision-making competencies
  • Active involvement in initiating and accomplishing Cloud Platform initiatives and Evangelize Cloud Operations
Technical ManagementDevOps PracticesCloud OperationsDevOpsSite Reliability Engineering

Technical Architect (DevOps/SRE)

Jul 2015Dec 2016 · 1 yr 5 mos · Tokyo, Japan

  • Technically Architect & implement new tech initiatives to build & transform core DevOps/SRE practices
  • Capacity,Infrastructure optimization,BCP
  • High availability architecture from downtime based patterns
  • Automate Infra provisioning via Ansible, Terraform
  • Config management (Infrastructure As Code) via Chef & Ansible
  • Version control via Git & integration to DevOps tools
  • Batch scheduling framework via Mesos/Chronos, Apache Airflow
  • Fully automated Jenkins CI/CD & Continuous Deployment pipelines
  • Kafka for real-time streaming of data from a variety of sources
  • Kubernetes cluster for running various API based applications for Self-Healing, Scaling of services
  • Datadog, ELK, Nagios, NewRelic, Rundeck,Pingdom,Uptrends,Greylog,Cloudwatch,Grafana,Splunk>
  • Incident Resolution via PagerDuty. Analyze patterns & fine-tune processes with postmortems,best practices
Technical ManagementDevOps PracticesCapacity PlanningDevOpsSite Reliability Engineering

Yahoo

3 roles

Technical Lead - Search and E-commerce, Yahoo! Shopping

Mar 2014Jun 2015 · 1 yr 3 mos

  • Technical Lead in Operations for Yahoo's Search Business properties, mainly Toolbar (toolbar.yahoo.com), Downloads (downloads.yahoo.com) and Shopping (shopping.yahoo.com) generating a yearly revenue of about 400 million USD, all the three put together
  • Manage end to end Technical operations for the properties on a global scale and directly reporting to the Operations Director for Search vertical
  • Single point of Operations contact globally for the above mentioned properties. All property related issues, escalations, change requests, bugs, security issues, upgrades, on-call, access, build, release, hardware, performance, monitoring, site-up, component ownership, architecture design/changes and revamps are owned and executed by me.
Infrastructure OptimizationAutomationDevOpsSite Reliability Engineering

Technical Lead - Media

Jun 2013Mar 2014 · 9 mos

  • As Technical Lead, managing end-to-end technical/DevOps operations under Service Engineering for various hosted media properties/websites.
Technical OperationsE-commerceDevOpsSite Reliability Engineering

Technical Lead - Listings and Marketplace (E-commerce)

Mar 2011May 2013 · 2 yrs 2 mos

  • Managing end-to-end technical/DevOps under service engineering for various hosted properties/websites.
  • Lead for various properties like Yahoo Shopping, Games, Toolbar, Deals, Downloads, Sports, Finance and Weather.
  • Development of new properties, involving a Vespa hosted e-commerce backend, from scratch.
  • Functional lead for a team of 4 members involved in day-to-day site up issues and pro-active incident handling.
  • CI process by automating the builds/releases using various config management tools like Jenkins, Hudson.
  • DevOps role at various levels with Core Development Team, Network team, Infrastructure team, Platforms team
Technical OperationsDevOpsSite Reliability Engineering

Wipro

Senior Engineer - Server Management

Jan 2007Mar 2011 · 4 yrs 2 mos · Bengaluru Area, India

  • Being part of the Global System Operations – UNIX team at Goldman Sachs, technically helped equities team to handle huge trading volumes smoothly without compromising on service levels
  • Worked in client location as a part of Global System Operations – UNIX team in Goldman Sachs, Bangalore (Jan 2007 – March 2011). Involved in DevOps/ Production support for Goldman Sachs equities trading Infrastructure catering to multiple stock exchanges round the world
  • Was also part of Goldman Sachs Operation team – Tokyo, Japan. Gained real time experience by working/coordination with Goldman Sachs, equities Operations in Tokyo and also with offshore team with regards to Asia System Operations.
  • Member for the initial onsite transition team for the knowledge transfer of Systems project from Goldman Sachs, New York.
Server ManagementUNIXDevOps

Ipsoft

Systems Management

Jan 2006Dec 2006 · 11 mos · Bengaluru Area, India

  • As a System team member, played active role in empowering various client partners via IPcenter, IPsoft’s Autonomic IT Service Management Platform which puts the benefits of automation at the heart of their IT service delivery leaving time for accelerating spirit of innovation
  • Team Member of Global Systems Management team handling operations involving complex administrative tasks in the front edge Data Center environment having 19 data centers with 1400 servers.
IT Service ManagementAutomation

Education

NUS Business School Executive Education

Future Leaders Programme — Organizational Leadership

Mar 2023May 2023

University College of Engineering (University of Kerala)

Bachelor of Technology (BTech) — Electronics and Communication Engineering

Jan 2001Jan 2005

St. Thomas Central School, Trivandrum

12th standard CBSE — Mathematics and Computer Science

Stackforce found 100+ more professionals with Devops & Site Reliability Engineering

Explore similar profiles based on matching skills and experience