Manmeet Kaur

Associate Consultant

India8 yrs 9 mos experience

Key Highlights

8 years of experience in site reliability engineering.
Expert in incident management and application support.
Proficient in AWS and GCP cloud platforms.

Stackforce AI infers this person is a Site Reliability Engineer with expertise in SaaS and Fintech industries.

Contact

Skills

Core Skills

Site Reliability EngineeringIncident ManagementRelease ManagementInfrastructure Monitoring

Other Skills

AWSAgile MethodologiesAgile Project ManagementAmazon Web Services (AWS)Analytical SkillsApplication SupportBusiness AnalysisBusiness StrategyC++CSSCloudwatchCompetitive AnalysisData VisualizationDatadogEmotional Intelligence

About

IT professional with around 8 years of experience in site reliability engineering,application support,incident management,customer centricity,system health monitoring. Languages:Core Java,Python Scripting Databases: MYSQL,PostgresSQL Devops:Docker,Kubernetes SRE Tools:Splunk|Catchpoint|Graphana|Jenkins|Github|Cloudwatch|BigPanda|Datadog|PagerDuty|New Relic|Kibana Cloud Platforms:AWS,GCP Ticketing Tools:CDP,JIRA Agile Framework-Scrum Other Tools-ServiceNow,Microsoft Teams,Slack,Trello,Confluence,Putty,Swagger,Postman

Experience

8 yrs 9 mos

Total Experience

1 yr 5 mos

Average Tenure

1 yr 8 mos

Current Experience

Infosys

Senior Consultant

Sep 2024 – Present · 1 yr 8 mos · Mohali district, India · Hybrid

Ukg

Lead Site Relability Engineer

Jan 2023 – Jul 2024 · 1 yr 6 mos · Noida, Uttar Pradesh, India · Hybrid

KibanaGitDatadogGoogle Cloud Platform (GCP)New RelicSite Reliability Engineering+1

Protiviti

Deputy Manager

Dec 2021 – Dec 2022 · 1 yr · Gurugram, Haryana, India

Perfom release management process for production, pre-prod environment using
Jenkins,git,bitbucket
Kubernetes cluster management using command line,Lens
Maintain documentation describing processes and system requirements for all
systems
Configuration changes in code using git and bitbucket
Infrastructure Monitoring using Grafana,Kubernetes, Slack,Cloudwatch,Kibana

KubernetesAmazon Web Services (AWS)Release ManagementInfrastructure Monitoring

Unify technologies

Site Reliability Engineer

Nov 2020 – Dec 2021 · 1 yr 1 mo · Gurugram, Haryana, India

Worked for Mobile Financial Services Team in Airtel Money Project and setup SRE team from scratch for the below digital products and working with stakeholders for
improving application availability.-
Developer portal - https://developers.airtel.africa
Enterprise payments platform - https://enterprise.airtel.africa
Cross border money transfer switch for P2P transfer & merchant payments
Fully configurable integrations platform to support bill payments, international remittances, loans
Roles & Responsibilities-
● Site reliability support across 14 countries of Africa across Airtel Money
digital products and working with stakeholders for
improving application availability.
● Handling production deployments using Jenkins and ensuring the stability of infrastructure.
● Debugging application logs, incident management, ticketing management using
Kibana,Kubernetes,JIRA
● Configuration changes in code using git and bitbucket
● Testing of RESTAPI using Postman
● Monitoring using tools Grafana,Kubernetes

KubernetesSite Reliability Engineering

Expedia group

International Operations and Traffic Analyst II

Jun 2018 – Feb 2020 · 1 yr 8 mos · Gurgaon, Haryana, India

Monitoring live site issues, system health monitoring for Expedia group POS(Point of
Sales) across 36 countries.
Incident Management -Handling P0,P1,P2 incidents to reduce MTTD,MTTR.
Release Support – Supporting daily/weekly release support , performing pre,during
and post the release analysis
Change Support – Support network changes,patching,server upgrade,migration
activities to AWS
Custom Alerts/Dashboard Creation for product
Catchpoint Synthetic Monitoring & Testing , deep diving to find root cause by
analysing waterfall trends
Lead Lab environment support and escalating issues in soak environments before
release.
Error Report Analysis –Analyzing booking trends,error codes,service failures and
working with product teams to optimize,automate processes for better site reliability
Customer Centricity using Dog Food –Identify,investigate,escalate customer pain
points & ensure resolution in an end-to-end manner
Bot Analytics –Identify bot attacks on livesite,creating rules to prevent malicious
attacks on live sites.
Custom Alerts/Dashboard Creation for product teams/internal use on adhoc basis
using Splunk

Amazon Web Services (AWS)SplunkIncident Management

Xerox

Support Analyst

Aug 2016 – Jun 2018 · 1 yr 10 mos · Gurgaon, Haryana, India

Daily server and application availability checkouts, document,track and categorize
incidents reported for LATAM region.
Tracking server reboot/patching activity by Hardware team and supporting network
changes by HCL/ATOS
Ensure effective information security,governance and controls, application support to
ensure application is up and running
Debugging application logs,incident management,ticketing management using CDP