Suman Kedala

Product Manager

Hyderabad, Telangana, India20 yrs 1 mo experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Expert in AIOps and observability for enterprise-scale operations.
  • Proven track record in driving operational efficiency through AI/ML.
  • Strong leadership in building high-performing IT teams.
Stackforce AI infers this person is a seasoned IT Operations Manager specializing in AIOps and observability in enterprise environments.

Contact

Skills

Core Skills

AiopsObservabilityIt Operations ManagementIt Infrastructure OperationsNetwork EngineeringIt Operations

Other Skills

AI/ML platformsoperational frameworksautomationadvanced analyticsmachine learningpredictive insightscontextual intelligenceexecutive dashboardsobservability platform strategyDatadogSplunkAIOps Solutionadvanced monitoringIT infrastructurenetwork design

About

Senior IT Manager | Enterprise AIOps & Observability | AI Driven Operations | IT Operations & KPIS Senior IT Manager specializing in enterprise scale AIOps and observability, playing a critical role in defining, implementing, and optimizing operational frameworks, platforms, and processes across complex technology ecosystems. Focusing on applying AI/ML platforms to observability, operational efficiency, and large-scale enterprise environments to drive measurable business outcomes. Leading AI Operations, AIOps, and Observability initiatives that enable resilient, secure, and data driven operations. My work centers on building strong governance models, scalable platforms, and automation first practices that support proactive issue detection, intelligent remediation, and high availability for mission critical systems—including modern AI/ML workloads. A key part of my journey has been designing and architecting a next generation observability AI solution, built to unify signals across infrastructure, applications, data, and security domains. This solution applies advanced analytics and machine learning to move organizations from reactive monitoring to predictive insights, contextual intelligence, and autonomous operations, significantly improving reliability, performance, and MTTR. I strongly believe in treating observability as a product, not a tool. Driving standardization of metrics, logs, traces, and events across the enterprise to deliver actionable dashboards, business aligned SLOs, and data backed insights that leadership teams can trust. This product centric approach enables consistent outcomes across hybrid, cloud native, and distributed architectures. Partnering closely with engineering, data science, Infrastructure, cybersecurity, and service management teams to operationalize AI at scale and embed observability into the full technology lifecycle—from design and deployment to operations and optimization. As a people leader, Building and mentoring high performing AIOps and observability teams, foster a culture of continuous improvement, and deliver cost effective, scalable solutions through disciplined vendor strategy and budget management. My passion lies in transforming operations into a strategic capability that accelerates innovation while maintaining reliability at enterprise scale.

Experience

20 yrs 1 mo
Total Experience
3 yrs 8 mos
Average Tenure
13 yrs 11 mos
Current Experience

Qualcomm

5 roles

Senior IT Manager

Promoted

Dec 2025Present · 4 mos · On-site

  • Senior IT Manager specializing in Enterprise AIOps and Observability for Qualcomm playing a crucial role in defining, implementing, and optimizing operational frameworks, platforms and processes. AI/ML platforms for Observability, operational efficiency and enterprise technology landscape.
  • Leading enterprise-scale AI Operations, AIOps, and Observability initiatives, driving resilient, secure, and data-driven operations aligned to business outcomes. My focus is on building governance frameworks, scalable platforms, and automation-first practices that enable proactive issue detection, intelligent remediation, and high availability for mission‑critical systems, including AI/ML workloads.
  • Specialized in treating observability as a product—standardizing metrics, logs, and traces across infrastructure, applications, security, and data platforms to deliver predictive insights, actionable dashboards, and measurable improvements in performance, reliability, and MTTR. I partner closely with engineering, data science, infrastructure, cybersecurity, and service management teams to operationalize AI at scale.
  • As a people leader, I build and mentor high-performing AIOps teams, foster a culture of continuous improvement, and deliver cost-effective solutions through strong vendor and budget management.
observabilityAIOpsAI/ML platformsoperational frameworksautomationObservability

IT Manager

Promoted

Dec 2020Dec 2025 · 5 yrs · On-site

  • Lead IT Operations Management (ITOMA) initiatives, delivering executive and operational dashboards that provide real‑time visibility into service health, incidents, MTTR, change success rates, and operational trends.
  • Drive observability platform strategy and adoption, including Datadog and Splunk, supporting SRE and application teams with standardized onboarding, dashboards, and best practices.
  • Partner with global stakeholders to design management and drill‑down dashboards, enabling leadership to quickly assess application availability while empowering engineers with deep operational metrics.
  • Play a key role in ITOMA program reviews, contributing to roadmap discussions, platform evolution, and cross‑team alignment on operational priorities.
  • Support CMDB and service mapping initiatives, helping improve dependency visibility, outage readiness, and audit/compliance reporting through structured data and dashboards.
  • Collaborate with finance, security, and platform owners to improve cost transparency and governance for observability tooling, ensuring usage aligns with business value.
  • Mentor and lead high‑performing teams in India, fostering a culture of ownership, automation, and continuous improvement across IT operations.
  • Areas of Expertise:
  • IT Operations Management (ITOM) • Observability & Monitoring (Datadog, Splunk) • SRE Enablement • Executive Dashboards & Metrics • Incident, Change & Problem Management • CMDB & Service Mapping • Platform Modernization • Global Stakeholder Collaboration
IT Operations Managementexecutive dashboardsobservability platform strategyDatadogSplunkObservability

IT Engineer, Staff

Dec 2017Nov 2020 · 2 yrs 11 mos · On-site

  • Building of AIOps Solution for Advanced monitoring of IT Infra.
AIOps Solutionadvanced monitoringIT infrastructureAIOpsIT Infrastructure Operations

Lead IT Engineer

Dec 2014Dec 2017 · 3 yrs · On-site

  • Handling NOC Globally and Monitoring Tools Teams for Telemetry and better Operations efficiency with Telemetry and automation.

Sr IT Engineer

Apr 2012Dec 2014 · 2 yrs 8 mos · On-site

  • Handling NOC Operations at Qualcomm

Infosys

TL

May 2010Apr 2012 · 1 yr 11 mos · Hyderabad · On-site

  • Accountabilities:
  • Accepting Service Requests for implementation and design changes for General Motors.
  • Receive the Pre Sale Architecture requirement, taking inputs from Customer GM.
  • Designing as per the need and presenting it to the customer (Done in Visio).
  • As per the requirement ordering the equipment from Cisco.
  • Configuring the devices and handling the rack and stack and implementing the initial test and turn up.
  • Configuring the devices as per the need and making the devices go live as per the scheduled time from customer.
  • Handling the design and implementation end to end.
network designservice requestsCisco equipmentNetwork EngineeringIT Operations

Infosys technologies

LCM Architect Engineer

May 2010Apr 2012 · 1 yr 11 mos

  • LCM Engineer for ATT.

Wipro technologies

2 roles

Sr.Network Engineer

May 2008May 2010 · 2 yrs

  • Senior TAC Engineer at British Telecom.
  • Routing issues (LAN/WAN).
  • Troubleshooting of routing protocols such as RIP, EIGRP, OSPF and BGP/MBGP.
  • Resolving HSRP related problems along with taking backup, restore and upgrade of IOS.
  • Checking issues for PE configurations and PE- CE configuration mismatches also.
  • Solving ISDN related faults.
  • Implementing and deleting access lists on routers as per the requirement.
  • Troubleshooting of connectivity issues, packet loss, latency issues and over utilization issues.
  • Changing the CE configurations as per the customer requirement and routing policy/topology changes or as per the special request by the customer.

Sr Network TAC Engineer (Service Assurance 2nd line Technical Support Engineer)

May 2008May 2010 · 2 yrs

  • Accountabilities:
  • Accepting faults assigned & change requests from BT Global operational teams.
  • Performing the fault diagnostics and provide proper resolution.
  • Providing remote resolution of faults along with performing changes to Router configurations as per client requirements.
  • Understanding the network and joining con calls with customers and resolving the issues also doing the required changes for resolving the issue.
  • Working with BT suppliers for field support and resolution.
  • Managing frontline updates and updating the ticket throughout the fault handling process.
  • Executing the changes accordingly and joining conference calls with customers for faster fault resolution.
  • Launching ISDN script for few of the specific customers and do troubleshooting the links for unsuccessful sites along with performing monitoring in order to identify abnormal links behaviours as soon as possible.
  • Handling responsibilities as a SPOC (Single Point of Contact) the present working location.
  • Generating SLA (Service Level Agreement) performance reports for the faults raised against BTGS customers.
  • Analysing and generating RCA's (Root cause Analysis) for major outages and providing proper reason for outage, resolution and corrective actions for the faults raised.
  • Highlights:
  • Being a SPOC mentoring the team and handling shifts and creating shift rosters, and training new comers in the team and participated in Technical discussions which improved team's performance as a whole.
  • Bagged the Best performer award in Wipro for training new comers and solving critical issues for the customers.
  • Projects Undertaken

Hcl comnet, hyderabad

Network Engineer

Apr 2007May 2008 · 1 yr 1 mo

  • Complete hardware setup and configuration of Site to Site IPSEC VPN and maintenance of the network equipments 2811 routers and 6509 switches.
  • Only network engineer at the site for any network issues for that particular client.
  • Change implementations, report generations, bandwidth utilization monitoring and chasing the Service Provider for RFO for unstable link related issues.
  • Implemented tools like MRTG with advancements of RRD and automated the complete process of report generation using batch and Perl scripting.
  • Handled complete network transition of the Client network.
  • Highlights:
  • Bagged the award for The Best Performer for outstanding performance and for maintaining 100% quality in the work & reducing the customer setup cost by implementing network management tool from open sources, from HCL Comnet, Hyderabad.

Cms ltd

Resident Engineer

Feb 2006Apr 2007 · 1 yr 2 mos

  • Designated as a Network L2 Engg for managing the computer networks (LAN & WAN) installation, Configuration, Maintenance, administration and monitoring.
  • Conducting regular maintenance of LAN & WAN, generating regular reports from CISCO Works & LAN manager.
  • Configuring and monitoring MRTG and checking for utilization of the links
  • Co-ordinating with AT&T, Sprint & MCI and also chasing local PTT Bharti/VSNL for the ongoing link issues or link fluctuations.
  • Regular health checks of the network and also the MGX devices.

Education

Indian Institute of Management, Kozhikode

Executive Management Development Programme — Senior Management Programme

Sep 2021Sep 2022

University of Hyderabad

Postgraduate Degree

Apr 2016Apr 2017

SICET

2005 B-Tech — Information Technology Communications

Jan 2001Jan 2005

DAV Public School

School

Jan 1988Jan 1999

Stackforce found 100+ more professionals with Aiops & Observability

Explore similar profiles based on matching skills and experience