Narendran G

Software Engineer

Chennai, Tamil Nadu, India9 yrs 8 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Scaled concurrent connections by 2.5x on AWS.
  • Reduced observability costs by over 40%.
  • Achieved 99% improvement in customer issue turnaround.
Stackforce AI infers this person is a DevOps and SRE expert in SaaS environments.

Contact

Skills

Core Skills

Site Reliability EngineeringCloud ComputingDevopsSoftware ObservabilityBackend DevelopmentSoftware EngineeringInfrastructure AutomationApplication Development

Other Skills

RabbitMQNATSKubernetesAWS EKSOpenTelemetryDatadogSplunkGitHub WorkflowsGolangPythonDjangoHTMLCSSJavaScriptTerraform

About

DevOps and SRE engineer with 10 years of experience delivering cloud-native platforms, infrastructure automation, and observability at scale on AWS. Proven outcomes include scaling concurrent connections by 2.5x (40k to 100k), cutting observability cost by over 40% through Datadog-to- Splunk adoption, reducing operational overhead by more than 70% via EKS migration, and shrinking customer-issue turnaround from days to minutes (approximately 99% improvement) through internal tooling and GitHub workflows. Strong hands- on background in Kubernetes, Terraform, Python, Node.js, Golang, messaging (RabbitMQ, NATS), and full-stack reliability practices from CI/CD through DR, SLOs, and on-call.

Experience

9 yrs 8 mos
Total Experience
1 yr 11 mos
Average Tenure
4 yrs
Current Experience

Cisco

2 roles

Software Engineer IV

Promoted

Oct 2025Present · 8 mos

  • Project: Security Service Exchange (SSE) — Core & Eventing
  • RabbitMQ→NATS for SSE Core; scaled concurrent online capacity 40k→100k (Blue-Green); improved throughput/headroom.
  • Self-managed Kubernetes→AWS EKS: >70% less day-2 operational load; better scalability and standardized cluster ops.
  • SOC 2 BCP: DR exercises—rebuilt full SSE clusters from scratch in hours, E2E validation, monitoring/logging end-to-end.
  • Expansion: PROD in India, Australia, UAE, and FedRAMP (US)—infra, monitoring, pipelines, logging, E2E probes.
  • OpenTelemetry on SSE Core/Eventing for tracing and faster MTTD.
  • Datadog→Splunk (metrics, alerts, dashboards); >40% observability cost reduction.
  • Coordinated internal/external stakeholders for steady-state and new-region launches.
  • Project: Security Cloud Control — Platform Engineering
  • GitHub Workflow tools for TAC: turnaround ~3 days→under five minutes (~99%).
  • Splunk adoption for platform telemetry (>40% vs prior Datadog); RUM/APM for user/API issue detection.
  • Splunk anomaly detection (sudden change, historical anomaly).
  • SLIs/SLOs in Splunk from APM/RUM: uptime, UI/API latency, critical workflow success/error rates.
  • Reliability dashboards + SLO alerts; on-call for production health.
  • Golang middleware (SCC Provisioning Service): endpoint RED metrics (rate, errors, duration).
RabbitMQNATSKubernetesAWS EKSOpenTelemetryDatadog+5

Software Engineer III

Jun 2022Oct 2025 · 3 yrs 4 mos

Blackboard

Software Engineer

Dec 2021Jun 2022 · 6 mos

  • Delivered features for BBComms (parent–teacher communications) using Python, Django, a custom ORM, and front-end HTML, CSS, and
  • JavaScript, improving engagement workflows for education customers.
  • Led junior developers on feature delivery and code quality, shortening review cycles and reducing rework through clearer patterns and shared
  • standards.
PythonDjangoHTMLCSSJavaScriptBackend Development+1

Hcl technologies

Senior Software Engineer

Jun 2019Dec 2021 · 2 yrs 6 mos · Chennai, Tamil Nadu, India

  • Upgraded IaC from Terraform 0.8.7 to 0.11.14 across SSE components, reducing drift risk and enabling safer, repeatable releases.
  • Templated deployment code with Jinja2 and improved Node.js-based SSE deployment CLI tools, cutting manual steps and release errors for
  • operators.
  • Built a performance test harness for RabbitMQ using Python, Node.js, Shell/Bash, Terraform, Ansible, and Packer; partnered with consultants
  • to land a stable broker configuration under load.
  • Evolved Kubernetes deployments to run a new RabbitMQ cluster in parallel with legacy messaging for safer cutovers and rollback options.
  • Added housekeeping automation (metrics collection, backup, rebalance), packaged as Docker images, and ran them on the SSE Kubernetes
  • cluster for operational hygiene.
  • Authored Blue-Green migration scripts in Python and Node.js for SSE deployments to minimize downtime during upgrades.
  • Delivered RabbitMQ CLI utilities with sub-commands using Python argparse for consistent operator workflows.
  • Implemented CI/CD for Dexaas (Data Exchange as a Service) with Jenkins and Groovy, increasing release cadence and auditability.
  • Configured Datadog dashboards, monitors, and alerts for Dexaas and RabbitMQ to shorten MTTD and MTTR for incidents.
  • Wrote unit and integration tests with unittest, mock, and pytest to protect critical automation paths.
  • Produced monthly SSE Eventing trend reports using requests, Jinja2, and pandas, and surfaced them in the SSE DevOps portal (MEAN stack)
  • for leadership visibility.
  • Added E2E automation to verify Cisco firewall devices successfully push events into SSE Cloud, improving confidence in the ingestion path.
TerraformNode.jsPythonAnsiblePackerDocker+2

Think42 labs (p) ltd

Application Developer

Oct 2018Jun 2019 · 8 mos · Guindy

  • Delivered custom ERP solutions on Python, Odoo, and JavaScript/jQuery, and exposed RESTful APIs via Odoo and Django REST Framework for mobile and web clients, accelerating product integrations and partner onboarding.
  • Integrated PayU and Twilio for payments and messaging, improving conversion reliability and reducing manual reconciliation for finance and operations teams.
PythonOdooJavaScriptRESTful APIsApplication DevelopmentSoftware Engineering

Aspirant labs india pvt ltd

Software Developer

Sep 2016Sep 2018 · 2 yrs · Arumbakkam

  • Built and extended Odoo ERP modules (Sales, Purchase, Inventory, Accounting, Manufacturing) in Python, streamlining order-to-cash and inventory workflows for customers.
  • Shipped an online scheme payment portal for Kalyan (Python, Odoo, Angular 5) and an inventory analytics dashboard (Django, Pandas, JavaScript, Highcharts), improving payment throughput visibility and decision-making for merchandising and finance stakeholders.
PythonOdooAngular 5JavaScriptApplication DevelopmentSoftware Engineering

Education

KLN college of Engineering, Madurai.

Bachelor's Degree — Electricals and Electronics

Jan 2011Jan 2015

Stackforce found 100+ more professionals with Site Reliability Engineering & Cloud Computing

Explore similar profiles based on matching skills and experience