Manmeet Kaur

Associate Consultant

India · 8 yrs 9 mos experience

Key Highlights

  • 8 years of experience in site reliability engineering.
  • Expert in incident management and application support.
  • Proficient in AWS and GCP cloud platforms.
Stackforce AI infers this person is a Site Reliability Engineer with expertise in SaaS and Fintech industries.

Skills

Core Skills

Site Reliability Engineering, Incident Management, Release Management, Infrastructure Monitoring

Other Skills

AWS, Agile Methodologies, Agile Project Management, Amazon Web Services (AWS), Analytical Skills, Application Support, Business Analysis, Business Strategy, C++, CSS, CloudWatch, Competitive Analysis, Data Visualization, Datadog, Emotional Intelligence

About

IT professional with around 8 years of experience in site reliability engineering, application support, incident management, customer centricity, and system health monitoring.

Languages: Core Java, Python scripting
Databases: MySQL, PostgreSQL
DevOps: Docker, Kubernetes
SRE Tools: Splunk, Catchpoint, Grafana, Jenkins, GitHub, CloudWatch, BigPanda, Datadog, PagerDuty, New Relic, Kibana
Cloud Platforms: AWS, GCP
Ticketing Tools: CDP, JIRA
Agile Framework: Scrum
Other Tools: ServiceNow, Microsoft Teams, Slack, Trello, Confluence, PuTTY, Swagger, Postman

Experience

Total Experience: 8 yrs 9 mos
Average Tenure: 1 yr 5 mos
Current Experience: 1 yr 8 mos

Infosys

Senior Consultant

Sep 2024 - Present · 1 yr 8 mos · Mohali district, India · Hybrid

UKG

Lead Site Reliability Engineer

Jan 2023 - Jul 2024 · 1 yr 6 mos · Noida, Uttar Pradesh, India · Hybrid

Kibana, Git, Datadog, Google Cloud Platform (GCP), New Relic, Site Reliability Engineering

Protiviti

Deputy Manager

Dec 2021 - Dec 2022 · 1 yr · Gurugram, Haryana, India

  • Perform the release management process for production and pre-prod environments using Jenkins, Git, and Bitbucket
  • Kubernetes cluster management using the command line and Lens
  • Maintain documentation describing processes and system requirements for all systems
  • Configuration changes in code using Git and Bitbucket
  • Infrastructure monitoring using Grafana, Kubernetes, Slack, CloudWatch, and Kibana
Kubernetes, Amazon Web Services (AWS), Release Management, Infrastructure Monitoring

Unify Technologies

Site Reliability Engineer

Nov 2020 - Dec 2021 · 1 yr 1 mo · Gurugram, Haryana, India

  • Worked for the Mobile Financial Services team on the Airtel Money project and set up the SRE team from scratch for the digital products below, working with stakeholders to improve application availability:
  • Developer portal - https://developers.airtel.africa
  • Enterprise payments platform - https://enterprise.airtel.africa
  • Cross-border money transfer switch for P2P transfers and merchant payments
  • Fully configurable integrations platform to support bill payments, international remittances, and loans
  • Roles & Responsibilities:
  • Site reliability support for Airtel Money digital products across 14 countries in Africa, working with stakeholders to improve application availability.
  • Handling production deployments using Jenkins and ensuring the stability of infrastructure.
  • Debugging application logs, incident management, and ticket management using Kibana, Kubernetes, and JIRA
  • Configuration changes in code using Git and Bitbucket
  • Testing REST APIs using Postman
  • Monitoring using Grafana and Kubernetes
Kubernetes, Site Reliability Engineering

Expedia Group

International Operations and Traffic Analyst II

Jun 2018 - Feb 2020 · 1 yr 8 mos · Gurgaon, Haryana, India

  • Monitoring live-site issues and system health for Expedia Group POS (Point of Sale) across 36 countries.
  • Incident Management – Handling P0, P1, and P2 incidents to reduce MTTD and MTTR.
  • Release Support – Supporting daily/weekly releases and performing pre-, during-, and post-release analysis.
  • Change Support – Supporting network changes, patching, server upgrades, and migration activities to AWS.
  • Catchpoint Synthetic Monitoring & Testing – Deep diving to find root causes by analysing waterfall trends.
  • Leading lab environment support and escalating issues in soak environments before release.
  • Error Report Analysis – Analyzing booking trends, error codes, and service failures, and working with product teams to optimize and automate processes for better site reliability.
  • Customer Centricity using Dog Food – Identifying, investigating, and escalating customer pain points and ensuring end-to-end resolution.
  • Bot Analytics – Identifying bot attacks on live sites and creating rules to prevent malicious attacks.
  • Custom Alerts/Dashboard Creation – For product teams and internal use on an ad hoc basis, using Splunk.
Amazon Web Services (AWS), Splunk, Incident Management

Xerox

Support Analyst

Aug 2016 - Jun 2018 · 1 yr 10 mos · Gurgaon, Haryana, India

  • Daily server and application availability checkouts; documenting, tracking, and categorizing incidents reported for the LATAM region.
  • Tracking server reboot/patching activity by the hardware team and supporting network changes by HCL/ATOS.
  • Ensuring effective information security, governance, and controls, and providing application support to keep the application up and running.
  • Debugging application logs, incident management, and ticket management using CDP.

Education

The University of Texas at Austin

Post graduate program — Cloud computing

Nov 2020 - Jul 2021

Slicksoft Technologies, Patiala

6 months internship

Jan 2015 - Jan 2015

Shaheed Udham Singh College of Engineering and Technology, Tangori (Mohali)

Bachelor of Technology (B.Tech.) — Computer Science

Jan 2011 - Jan 2015

SGTBPS

Jan 2009 - Jan 2011

St. Peters Academy, Patiala

Jan 2009 - Present
