Venkatesh Madala

DevOps Engineer

Bengaluru, Karnataka, India · 11 yrs 2 mos experience
Most Likely To Switch · Highly Stable

Key Highlights

  • 10+ years of experience in Cloud & DevOps engineering.
  • Expert in Infrastructure as Code and Kubernetes management.
  • Proven track record in ML model deployment and data engineering.
Stackforce AI infers this person is a Cloud & DevOps Engineer specializing in AI and scalable infrastructure solutions.

Skills

Core Skills

Infrastructure As Code, Kubernetes, Machine Learning, Data Engineering, Microservices, Site Reliability Engineering, Azure, DevOps, Infrastructure Management, Containerization, Middleware Management

Other Skills

ADF, AWS, AWS SageMaker, Airflow, Ambari, Ansible, Ant, Apache Airflow, Apache Flink, Apache Kafka, Apache ZooKeeper, Architecture, Artificial Intelligence (AI), Azure Active Directory, Azure DevOps

About

Cloud & DevOps Engineer with 10+ years of experience designing, automating, and managing scalable infrastructure across AWS, Azure, and hybrid environments. Expert in Infrastructure as Code (Terraform, Ansible), Kubernetes (deployment, scaling, upgrades), and CI/CD pipelines for rapid, reliable delivery. Proven track record in ML model deployment on Amazon SageMaker, Azure ML, and Kubernetes, with strong expertise in data engineering and vector search pipelines for AI/RAG applications. Skilled in migrating monoliths to microservices, high-availability architecture, security compliance (PCI, SOC 2, ISO), and performance optimization. Adept at leading cross-functional initiatives to enhance system reliability, reduce operational overhead, and drive innovation in AI/ML-powered products.

Experience

Total Experience: 11 yrs 2 mos
Average Tenure: 2 yrs 2 mos
Current Experience: 4 yrs

Coschool

Platform Engineering Lead

Jun 2022 - Present · 4 yrs · Remote

  • 1) Infrastructure Automation on AWS: Designed and provisioned scalable cloud infrastructure using Terraform, integrating services like EC2, Application Load Balancer (ALB), API Gateway, Lambda, Cognito, and SageMaker to support AI/ML workflows and secure API access.
  • 2) Model Lifecycle Management on SageMaker: Deployed multiple decision models for training and inference using Amazon SageMaker, streamlining the end-to-end machine learning lifecycle from dataset ingestion to model versioning and deployment.
  • 3) Kubernetes-based Model Deployment: Deployed and managed ML models in Kubernetes (K8s) clusters to ensure high availability, load balancing, and CI/CD integration for iterative updates.
  • 4) Vector Search for RAG: Engineered a pipeline to generate and push embedding vectors to MilvusDB, enabling Retrieval-Augmented Generation (RAG) capabilities for AI-driven applications.
  • 5) Content Deduplication System: Developed a content deduplication module to identify and flag redundant entries in a centralized content database, improving data quality and reducing storage overhead.
  • 6) Data Engineering for ML: Built robust data extraction and transformation workflows to source datasets from multiple systems, preparing and delivering curated datasets for the ML team to support model training and experimentation.
  • 7) Business Intelligence Reporting: Created interactive business reports and dashboards to derive actionable insights from application-level metrics, supporting strategic decision-making across product and operations teams.
  • 8) Monolith to Microservices Migration: Led the architectural migration of the Teacher App (AI Assist) from a monolithic Liferay platform to a scalable microservices-based architecture, resulting in improved maintainability, deployment speed, and team agility.
Terraform, AWS, Kubernetes, Ansible, Data Engineering, Machine Learning, +1 more

Citrix

Senior Site Reliability Engineer

Apr 2021 - Jun 2022 · 1 yr 2 mos · Bengaluru, Karnataka, India

  • 1) Ensured 24×7 availability of the Citrix Analytics Platform, consistently maintaining 99.95% uptime across production environments.
  • 2) Provisioned and managed Azure resources including VMs, VM Scale Sets, Load Balancers, AKS, App Services, Event Hub, Redis, and HDInsight Clusters, supporting high-throughput, scalable analytics workloads.
  • 3) Automated infrastructure deployment using Terraform, and performed configuration management and patching via Ansible, improving infrastructure consistency and reducing manual errors.
  • 4) Led customer onboarding to the Citrix Analytics Platform for Security and Performance Analytics, acting as SME and resolving onboarding and operational issues across multiple regions.
  • 5) Conducted data recovery and disaster recovery drills, validating business continuity plans and ensuring resilience under failure scenarios.
  • 6) Maintained compliance with PCI, SOC 2, and ISO standards across all analytics environments through regular audits and proactive risk mitigation.
  • 7) Installed security agents and performed vulnerability scans across all compute and data resources, reducing attack surface and improving security posture.
  • 8) Deployed ML models using Azure Data Factory (ADF) pipelines for production inference and integration with analytics workflows.
  • 9) Developed and deployed Spark Streaming jobs on HDInsight, enabling real-time processing of telemetry and behavioral data.
  • 10) Managed Airflow DAG deployments for orchestration of batch and streaming pipelines, ensuring timely data availability for downstream applications.
  • 11) Monitored platform health and performance with New Relic, and implemented robust log management and alerting via Splunk.
Azure, Terraform, Ansible, Data Recovery, Compliance, Site Reliability Engineering

Harman India

Senior DevOps Engineer

Apr 2019 - Apr 2021 · 2 yrs · Bengaluru Area, India

  • 1) Provisioned and Managed AWS Infrastructure across Development, QA, Staging, and Production environments for multiple enterprise clients using services including EC2, S3, ALB, Route53, DynamoDB, IAM, KMS, SES, and SNS.
  • 2) Managed Virtualized Environments by provisioning and maintaining virtual machines on VMware ESXi and KVM, supporting critical Dev and QA operations.
  • 3) Installed and Maintained Kubernetes Clusters in both online and air-gapped (offline) environments, enabling consistent deployment of enterprise applications.
  • 4) Led Kubernetes Upgrade Projects, migrating legacy clusters from v1.9 to v1.15, ensuring compatibility, stability, and minimal downtime.
  • 5) Developed and Maintained Infrastructure as Code (IaC) using Terraform for automated provisioning and management of AWS and VMware infrastructure, enhancing deployment efficiency and standardization.
  • 6) Automated Configuration Management with Ansible, creating playbooks to streamline software installation, application deployments, and version upgrades, reducing manual intervention and errors.
  • 7) Designed and Managed Custom Helm Charts for microservices and third-party applications, enabling consistent and environment-specific deployments.
  • 8) Administered WSO2 Identity Server, managing authentication/authorization policies, security configurations, and platform upgrades.
  • 9) Managed Enterprise Product Deployments, installing and upgrading the Ignite Product for diverse client environments, adhering to release timelines and custom deployment specifications.
  • 10) Implemented Monitoring and Alerting Solutions using Prometheus, Grafana, and AlertManager, providing proactive system visibility and incident response.
  • 11) Provided L3 Production Support, troubleshooting critical issues, performing root cause analysis, and delivering timely resolutions under strict SLAs.
AWS, Kubernetes, Terraform, Ansible, Monitoring, DevOps, +1 more

Accenture

DevOps Engineer

May 2017 - Apr 2019 · 1 yr 11 mos · Bengaluru Area, India

  • 1) Designed and Deployed Containerized Applications using Docker and Kubernetes, including writing production-grade Dockerfiles and Kubernetes manifests tailored for scalable microservice architectures.
  • 2) Engineered Kubernetes Clusters with advanced configurations including Horizontal Pod Autoscaling (HPA) and Service Discovery for optimized deployment of distributed Spring Boot and web applications.
  • 3) Automated End-to-End Release Management processes using Bash scripting and Jenkins, standardizing deployments and reducing operational overhead across all environments.
  • 4) Established Enterprise Monitoring and Alerting Frameworks leveraging Prometheus and AlertManager, ensuring real-time health monitoring and proactive issue resolution.
  • 5) Developed Log Processing Pipelines using Logstash with custom Grok filters and built Kibana Dashboards for centralized operational insights and performance monitoring.
  • 6) Led Build and Release Automation Initiatives, collaborating with cross-functional teams to streamline application delivery pipelines and resolve complex build and deployment challenges.
  • 7) Managed Large-Scale Spark Workloads on HDP Clusters, overseeing build, deployment, and execution of data processing pipelines aligned with SLA requirements.
  • 8) Oversaw Data Integration into Hadoop Ecosystems, orchestrating data workflows and Spark job execution, with regular performance reporting to leadership.
Docker, Kubernetes, Jenkins, Monitoring, DevOps, Containerization

Hewlett Packard Enterprise

DevOps Engineer

Mar 2015 - Apr 2017 · 2 yrs 1 mo · Bengaluru Area, India

  • 1) Led Migration Projects, overseeing the end-to-end upgrade of Oracle WebLogic, WebCenter Portal (WCP), UCMS, and WCS platforms from 11g to 12c, ensuring minimal downtime, performance optimization, and full environment validation.
  • 2) Administered and Managed Oracle WebLogic Server Clusters, WebCenter Portal (WCP), UCMS, WebCenter Sites (WCS), Coherence, and Solr, including installation, configuration, capacity planning, patch management, and performance tuning across development, QA, and production environments.
  • 3) Provisioned and Scaled WebLogic Clusters, supporting environment expansion and application growth as per evolving enterprise requirements.
  • 4) Developed Ansible Playbooks to automate installation and configuration of WebCenter Portal, UCMS, Coherence 12c, and related middleware components, enhancing consistency and reducing setup time.
  • 5) Automated Application Deployments using Shell scripting and WLST for middleware and portal applications on WebLogic servers, standardizing release procedures and reducing operational risk.
  • 6) Set Up and Administered Code Quality Tools including SonarQube, PMD, and CheckStyle, integrating them into CI pipelines to enforce coding standards and maintain code health.
  • 7) Supported L2 and L3 Teams in diagnosing and resolving production-critical issues across WebLogic, WCP, UCMS, WCS, Coherence, and Solr platforms, contributing to system stability and uptime.
Oracle WebLogic, Ansible, Shell Scripting, Middleware Management, DevOps

Education

University of Mysore

Master of Business Administration - MBA

Jan 2016 - Jan 2018

Jawaharlal Nehru Technological University Kakinada (JNTUK)

Bachelor of Technology (B.Tech.) — Electrical and Electronics Engineering

Jan 2010 - Jan 2014
