Soumya Ranjan Rout

Director of Engineering

Calgary, Alberta, Canada1 yr 8 mos experience
Most Likely To Switch

Key Highlights

  • Led architecture for high-throughput event processing systems.
  • Designed multi-tenant observability platforms for scalability.
  • Expert in cloud-native architecture and DevOps automation.
Stackforce AI infers this person is a SaaS-focused Infrastructure Architect with strong expertise in cloud-native solutions.

Contact

Skills

Core Skills

Technical ArchitectureCloud DesignDistributed SystemsTechnical LeadershipCloud Solutions ArchitectObservabilitySoftware DevelopmentSite Reliability Engineering

Other Skills

AWSAWS CloudFormationAWS Command Line Interface (CLI)AWS Identity and Access Management (AWS IAM)AWS LambdaAgile MethodologiesAmazon CloudWatchAmazon EC2Amazon EKSAmazon Identity and Access Management (IAM)Amazon Relational Database Service (RDS)Amazon S3Amazon VPCAmazon Web Services (AWS)Ansible

About

I design and lead delivery of production-grade, multi-tenant, multi-region infrastructure and backend platforms: multi-cloud Kubernetes, secure API & data planes (API Gateway / WAF / mTLS mesh), high-throughput streaming pipelines (Pulsar / Kafka), and low-latency Golang/Python services backed by Redis Cluster, Postgres, Neo4j, and time-series/observability stacks. I’ve owned Platform Engineering, Site Reliability Engineering (SRE), Cloud Infrastructure, DevOps, and backend development for systems handling millions of events per minute and 100+ microservices across AWS, GCP, Azure, and on-prem. Focus areas: Developer Productivity (CI/CD, testing, environments & tooling, code standards), Reliability Engineering (SLIs/SLOs, error budgets, MTTR), Observability Platforms (metrics, logs, tracing, topology graph), Data & Storage Services (streaming, blob, graph, time-series), and Online Services (traffic management, capacity planning, load testing). I’ve built and operated: • Event streaming & ingestion: Apache Pulsar / Kafka, partitioning, tenant isolation, retention strategy, backpressure, consumer scaling. • Async orchestration & queues: Redis / Asynq / Temporal patterns for idempotency, retries, and workload shaping under failure. • Storage & observability: Neo4j graph models for dependency and RCA, Postgres/Redis sharding for scale, TSDB / VictoriaMetrics / VictoriaLogs for high-cardinality telemetry, practical incident forensics. • AI / MLOps runtime: containerized inference services with SLOs, autoscaling, and cost/latency budgets enforced at the edge. • Delivery: Golang & Python backend services, gRPC/REST APIs, production Helm + GitOps + Terraform automation, zero-downtime rollout (canary / blue-green), SRE playbooks for resilience. I helped take an event-driven, graph-backed, observability-heavy platform from zero to seed and Series A as system architect and infrastructure leader, owning both the platform and the reliability story behind it. What I care about: clear invariants, measurable trade-offs (cost vs latency vs safety), isolation between noisy tenants, and automating the boring parts so teams ship faster without burning error budget. Open to: Director of Platform / Infrastructure Architecture / Head of SRE / Cloud Engineering & DevOps leadership / MLOps Infra partnerships.

Experience

Manifest

2 roles

Director of Engineering

Promoted

Aug 2025Present · 7 mos · Bengaluru, Karnataka, India · On-site

  • Technology Leadership: Cloud-native architecture, distributed systems, microservices, DevOps automation, site reliability engineering, API platforms, observability & monitoring.
  • Product Strategy: Vision & roadmap definition, market-aligned innovation, UX & design thinking, data-driven decision-making.
  • Business Impact: KPI-driven execution, operational excellence, budget optimization, GTM alignment.
  • People Leadership: Scaling high-performing engineering teams, executive mentorship, culture transformation, inclusion & diversity advocacy.
  • Architected distributed event processing handling 10M+ concurrent tasks using Apache Pulsar, Redis Cluster/Sentinel, Neo4j, and PostgreSQL with sub-second response times.
LeadershipTechnical ArchitectureDistributed SystemsSystems DesignCloud DesignGo (Programming Language)+1

Founding Principal Engineer

Jul 2024Aug 2025 · 1 yr 1 mo · Bengaluru, Karnataka, India · On-site

  • Architected and implemented distributed systems ensuring high availability and fault tolerance by optimizing messaging with Apache Pulsar and the Asynq Redis library. Leveraged a centralized singleton design to reduce redundant connections across applications, which significantly increased queue throughput and accelerated message processing.
  • Independently designed, developed, and implemented a Protocol Buffers–based application to build a CLI Executor for AWS, GCP, Azure, and a Monitoring app for Kubernetes and ArgoCD. Achieved a 70% reduction in latency and a 90% decrease in memory usage by eliminating in-memory JSON marshaling/unmarshaling.
  • Designed and deployed scalable architectures across AWS, Azure, GCP, and OCI, creating high-availability infrastructures that process over 10 million records daily and deliver a seamless client experience.
  • Engineered event-driven applications that boosted system scalability by 50% and improved responsiveness by 40%, enabling rapid adaptation to evolving business needs.
  • Leveraged gRPC to streamline microservice communication, cutting inter-service latency by 35% and increasing data exchange efficiency by 40%.
  • Designed and launched a multi-tenant observability platform ingesting 550 K resources / instance and 10 M Asynq tasks daily, achieving P95 latency < 80 ms.
Technical LeadershipGolangPythongRPCCloud Solutions ArchitectKubernetes+45

Renoworks software

2 roles

Cloud Architect

Promoted

Jan 2023Jul 2024 · 1 yr 6 mos · Calgary, Alberta, Canada · Remote

  • Developed RESTful APIs using Golang: Engineered over 15 high-performance APIs that streamlined communication between frontend and backend systems, reducing response times by 30% and increasing throughput to handle 50% more traffic.
  • Designed secure, scalable, and cost-optimized AWS & GCP architectures: Architected cloud solutions that cut operational costs by 25% while ensuring robust security and scalability, supporting a 3x increase in concurrent users.
  • Built RESTful APIs with Python Django: Developed scalable APIs handling over 10,000 daily requests with 99.9% uptime, facilitating seamless integration across multiple client platforms.
  • Engineered high-availability AWS VPC and Subnet designs: Designed resilient VPC architectures across multiple availability zones, achieving 99.99% uptime and ensuring fault tolerance for mission-critical applications.
  • Applied advanced data science and deep learning frameworks: Utilized scikit-learn, NumPy, SciPy, Pandas, and PyTorch to develop predictive models that boosted accuracy by 40%, driving improved data-driven decision making.
Linux ServerApache pulsarTest AutomationCloud Solutions ArchitectAWS Command Line Interface (CLI)Linux+67

Lead Cloud Engineer

Jan 2022Jan 2023 · 1 yr · Calgary, Alberta, Canada · Remote

  • Design, develop and maintain automated CI/CD pipelines using tools such as Jenkins, and Bitbucket Pipeline.
  • Collaborate with cross-functional teams to deploy and operate systems on cloud infrastructure platforms such as AWS.
  • Implement infrastructure as code using tools such as Terraform, Ansible, or Puppet to provision and configure cloud resources.
  • Monitor and analyze system performance using tools such as AWS cloud watch, and implement proactive measures to prevent issues.
  • Troubleshoot and resolve issues in production and non-production environments.
  • Continuously improve processes and automation to enhance system reliability, availability, and scalability.
KubernetesLinux ServerMavenCloud Solutions ArchitectGitAWS Command Line Interface (CLI)+62

Oracle

Senior Member Technical Staff

Jun 2021Jan 2022 · 7 mos · Bengaluru, Karnataka, India · Remote

  • Developed a Golang-based BDT framework for the OCI SDK: Streamlined integration testing processes, reducing cycle times by 30% and improving overall efficiency.
  • Built distributed job-scheduler in Golang & gRPC orchestrating 15 K VM jobs/min with Kafka exactly-once semantics.
  • Created CI/CD reference stack (Jenkins, Spinnaker, SonarQube) adopted by 40+ internal services.
KubernetesLinux ServerMavenCloud Solutions ArchitectGitLinux+47

Walmart global tech

Senior Software Engineer

Jul 2020Jun 2021 · 11 mos · Bengaluru, Karnataka, India · Remote

  • As a Platform Engineer, I developed foundational tools and services that enabled secure, scalable, and efficient development workflows across cloud and on-premises environments.
  • Designed and developed a robust Python Command-Line Interface (CLI) tool to facilitate seamless, secure access to on-prem cloud resources, empowering engineers to authenticate, provision, and manage infrastructure from local environments.
  • Led the architectural design and implementation of a Spring Boot application focused on enterprise-grade Identity and Access Management (IAM), integrating with SAML2 and OAuth2 providers for user authentication and authorization.
  • Automated infrastructure provisioning using Infrastructure-as-Code (IaC) tools such as Terraform and Ansible, supporting both Azure cloud and on-premises deployments.
Linux ServerMavenGitAWS Command Line Interface (CLI)LinuxGolang+46

Rubrik

Automation Engineer

May 2019Jul 2020 · 1 yr 2 mos · Bengaluru, Karnataka, India · On-site

  • Developed a Golang-based backup application that reduced processing times by 40%, significantly enhancing data reliability and recovery speed.
  • Engineered a Rubrik Cloud Fabric (RCF) platform using Django and ReactJS, streamlining data workflows and handling over 500 daily transactions with 99.9% uptime.
  • Orchestrated multi-cluster Kubernetes deployments on AWS EKS & GCP GKE with Ingress controllers and Kong API Gateway (OAuth2.0), boosting system scalability by 60% and ensuring 99.95% uptime.
  • Automated infrastructure provisioning with Ansible and Terraform, accelerating Kubernetes cluster deployments by 60% while ensuring consistent, error-free environments.
  • Built a full-stack application using Python Django and React for VMware resource management, reducing resource provisioning times by 50% and increasing operational efficiency by 40%.
Go (Programming Language)KubernetesLinux ServerMavenCloud Solutions ArchitectGit+54

Philips

Software Technologist II

Oct 2017May 2019 · 1 yr 7 mos · Bengaluru, Karnataka, India · On-site

  • Developed a one-click deployment service using Python, Ansible, and Terraform, reducing deployment times by 80% and streamlining service rollouts.
  • Built and operated distributed logging systems with ELK and EFK stacks, enhancing log query performance by 60% and cutting incident response times by 50%.
  • Implemented zero-downtime continuous deployment pipelines that achieved 99.99% uptime and reduced deployment-related errors by 90%.
  • Automated rolling server restarts with zero downtime, ensuring uninterrupted service and boosting overall system availability by 30%.
  • Automated server creation, configuration, and database provisioning using Terraform, cutting setup times by 70% while enhancing environment consistency and security.
KubernetesLinux ServerMavenGitAWS Command Line Interface (CLI)Linux+47

Fis

Programmer Analyst II

Sep 2016Sep 2017 · 1 yr · Bengaluru, Karnataka, India · On-site

  • Developed the “SDP-Build” system: Designed and implemented a Docker-based build system for the SDP platform that accelerated artifact creation by 60% and improved deployment consistency.
  • Optimized SDP platform installation: Engineered Vagrant configuration files and shell provisioning scripts to automate the installation process, reducing setup time by 70% and ensuring reliable, reproducible environments.
Linux ServerGitAWS Command Line Interface (CLI)LinuxSoftware Configuration ManagementContinuous Integration and Continuous Delivery (CI/CD)+23

Flipkart

Software Engineer

Aug 2011Sep 2016 · 5 yrs 1 mo · Bengaluru, Karnataka, India · On-site

  • Engineered a Python monitoring application using the Django MVC model to track near real-time metrics for 3,000 Debian VMs, reducing incident detection times by 70% and significantly boosting operational efficiency.
  • Developed a Python API for auto-heal functionality that automates remote restart and redeployment processes, cutting system recovery time by 80% and enhancing service reliability.
  • Architected and implemented CI/CD pipelines by integrating Git, GitHub, Maven, Jenkins, and Puppet, which streamlined deployments and reduced release cycle times by 50%, accelerating time-to-market.
  • Provided 24/7 on-call production support: Actively participated in Dev on-call rotations, improving incident response times by 40% and minimizing downtime.
Linux ServerMavenCloud Solutions ArchitectGitAWS Command Line Interface (CLI)Linux+35

Education

Liverpool John Moores University

Master of Science - MSc — Mathematics and Computer Science

International Institute of Information Technology Bangalore

Postgraduate Degree — Full Stack Development

Biju Patnaik University of Technology, Odisha

Bachelor of Technology - BTech — Electronics and Instrumentation Engineering

May 2006Jun 2010

Stackforce found 100+ more professionals with Technical Architecture & Cloud Design

Explore similar profiles based on matching skills and experience