Vinit Bhat

SRE (Site Reliability Engineer)

New Delhi, Delhi, India · 4 yrs 5 mos experience
AI Enabled · AI ML Practitioner

Key Highlights

  • Expert in DevOps and Site Reliability Engineering.
  • Proven track record in automation and incident management.
  • Strong background in IoT and AI technologies.
Stackforce AI infers this person is a DevOps and SRE expert in SaaS and IoT industries.

Skills

Core Skills

Site Reliability Engineering · DevOps & SRE · Automation Testing

Other Skills

Change Management · Product design and development · Python (Programming Language) · Amazon Web Services (AWS) · Grafana · Prometheus · Elasticsearch · Appium · JMeter · Postman · Internet of Things (IoT) · ESP32 Microcontrollers · UiPath · IBM Bluemix

About

Energetic, curiosity-driven DevOps & SRE professional with 4+ years of experience spanning hardware and software. I blend low-level understanding of computer hardware with practical software engineering to build reliable, automated systems that scale. Today I focus on creating tools and processes, instrumenting infrastructure with product-facing metrics, and eliminating operational toil through automation.

Core strengths:

  • Android system hacking & porting
  • RPA (UiPath certified)
  • IoT development (Raspberry Pi, Intel boards)
  • Automation testing: APIs, web apps, Android apps
  • DevOps & SRE: tool development, observability, process improvement, infrastructure automation
  • Working with new AI tools and creating frameworks around them

I enjoy solving messy problems end-to-end, from hardware bring-up to production monitoring, and I’m always looking to learn new technologies and collaborate on impactful projects. Open to connecting and exploring opportunities in DevOps, SRE, IoT, AI and automation engineering.

Experience

Total Experience: 4 yrs 5 mos
Average Tenure: 2 yrs
Current Experience: 3 mos

Microsoft

Site Reliability Engineer II

Feb 2026 - Present · 3 mos · Noida, Uttar Pradesh, India · Hybrid

  • Azure Cosmos DB
Site Reliability Engineering

Yahoo

SRE I

Jul 2024 - Jan 2026 · 1 yr 6 mos · Bengaluru · Remote

  • Built a Chrome extension to streamline alert triage and reduce response times
  • Developed a Python automation framework to cut manual triaging by 50%
  • Automated third-party vendor outage detection and incident workflows
  • Delivered MBR reporting on OKRs, SLAs, and operational performance
  • Created KPI dashboards for real-time service health and team visibility
  • Served as primary on-call for multiple services, ensuring high availability
  • Led global incidents as Incident Manager, driving comms and RCAs
  • Executed infrastructure upgrades with minimal downtime via change management
  • Improved troubleshooting by authoring and maintaining team runbooks
Change Management · Product design and development · Site Reliability Engineering

Impressico Business Solutions

2 roles

Engineer - DevOps and Cloud Technologies (SRE)

Promoted

Feb 2023 - Jun 2024 · 1 yr 4 mos · Noida

  • Bootstrapped real-time monitoring and alerting using Grafana, Prometheus, Elasticsearch, and CloudWatch
  • Led the migration of Elasticsearch with over 2.5 TB of data, ensuring zero data loss and optimizing it for future growth
  • Implemented Grafana-as-code while contributing to the open-source library Grafanalib, working on a Grafana-as-config framework using Python
  • Set up Grafana dashboards for infrastructure, real-time data, and various SLA/SLO metrics, along with alerts
  • Optimized monitoring scripts by converting them to Prometheus exporters, improving performance 10x through SQL optimization, and reducing costs by migrating from EC2 to AWS Lambda
  • Migrated scripts to the Prometheus push model, implemented distributed tracing with Jaeger and OpenTelemetry, and integrated HubSpot's CRM with Thanos metrics
  • Deployed 30+ AWS Lambda and Batch jobs based on the Serverless framework to improve data cleaning, helping reduce costs by 20%
Python (Programming Language) · Amazon Web Services (AWS) · DevOps & SRE

Programmer Analyst (SDET)

Sep 2021 - Jan 2023 · 1 yr 4 mos · Noida

  • Joined as an automation expert and later took charge as SDET. Highlights:
  • Helped debug a critical issue in the production system, on which 1,000,000 beacons were dependent. Later improved the unit test cases of the Python microservice
  • Set up REST API automation testing using Postman and JMeter, and worked with Venom as the API integration test tool for new products
  • Established pipeline and regression test scripts for mobile app automation testing using Appium and Selenium with Python and Pytest
  • Defined the structure for web app automation testing using Cypress and JavaScript, with complete reporting on each release
Appium · JMeter · Automation Testing

Beyondalphabets

Intern

May 2019 - Jul 2019 · 2 mos · Remote

  • Developed and deployed a cloud-based bot using UiPath for automated email management, OCR data processing, and storage.
  • Implemented an IoT Proof of Concept (POC) using ESP32 microcontrollers and IBM Bluemix cloud for real-time data management.
Internet of Things (IoT) · ESP32 Microcontrollers

Education

Jamia Millia Islamia

Master of Computer Applications - MCA — Computer Science

Jul 2021 - Jul 2023

Delhi University

Bachelor of Science - BSc — Computer Science

Aug 2017 - Aug 2020
