Harsh Kumar

Software Engineer

Bengaluru, Karnataka, India · 12 yrs 10 mos experience
Highly Stable

Key Highlights

  • 11 years of backend service development experience.
  • Led modernization of ETL pipelines at planet-level scale.
  • Expert in distributed computing and microservices.
Stackforce AI infers this person is a Backend-heavy SaaS expert with extensive experience in data engineering and compliance.

Skills

Core Skills

Cloud Computing · Data Engineering · Data Protection · API Development · Data Compliance

Other Skills

AWS · Apache Flink · Apache Hudi · Confluent Kafka · Elasticsearch · Data Loss Prevention · Microsoft Graph REST APIs · Optical Character Recognition · Azure Commerce · Data Classification Service · Traffic Shaping · Apache Spark · Redis · OData · REST APIs

About

Seasoned backend service developer with 11 years of experience at Microsoft and two startups, Jugnoo and WizCal. Experienced in designing, productionizing, and managing new and existing services at planet-level scale. Well-versed in distributed computing and microservices. Quick to ramp up on new technologies and concepts and to deliver impact.

Experience

12 yrs 10 mos
Total Experience
2 yrs 7 mos
Average Tenure
2 yrs 5 mos
Current Experience

Cohesity

Senior Staff Engineer

Dec 2023 - Present · 2 yrs 5 mos · Bengaluru, Karnataka, India · On-site

  • Modernizing Cohesity's ETL and reporting pipeline | AWS | Technical Lead
  • As Technical Lead, responsible for facilitating cross-team collaboration, driving architectural decisions, guiding team members, and maintaining engineering velocity across feature development and production operations.
  • Leading the design, implementation, CI/CD, and monitoring of Helios's North-Star architecture for ETL pipelines on the big data streaming platforms Apache Flink and Apache Hudi. Used Confluent Kafka REST Proxy and Schema Registry to enable schema-driven validation with backward-transitive compatibility.
  • Led the modernization of Elasticsearch from version 6.8 to 8.15 for Helios, coordinating with 20+ engineers across 10+ teams. Delivered with zero field incidents and minimal downtime, enabling deployment on new platforms.
  • Enhanced throughput by 7x for data persistence into Elasticsearch by implementing efficient batching strategies with optimistic concurrency control.
AWS · Apache Flink · Apache Hudi · Confluent Kafka · Elasticsearch · Cloud Computing

Microsoft

4 roles

Principal Software Engineer Manager

Promoted

Aug 2022 - Oct 2023 · 1 yr 2 mos · Hyderabad, Telangana, India

  • Data Protection & OCR Monetization on Microsoft Substrate | People management
  • Led execution of reclassification of Data-at-Rest (DAR) for customer content in Microsoft Substrate, ensuring compliance with Data Loss Prevention (DLP) policies dynamically updated by administrators.
  • Designed and published Microsoft Graph REST APIs to enable detection of OCR (Optical Character Recognition) capabilities for customer content containers (e.g., users, groups, sites), supporting protection of sensitive data in images.
  • Delivered a novel pay-as-you-go monetization model for OCR usage that charges enterprises by the volume of images processed, using Azure Commerce as the billing platform.
Data Loss Prevention · Microsoft Graph REST APIs · Optical Character Recognition · Azure Commerce · Data Protection · API Development

Principal Software Engineer

Mar 2022 - Jul 2022 · 4 mos · Hyderabad, Telangana, India

  • Enterprise Compliance | QoS for Data-in-Transit (DIT)
  • Designed and prototyped a solution for Data-at-Rest (DAR) reclassification.
  • Implemented traffic shaping for the Data Classification Service (DCS) to prioritize Data-in-Transit (DIT) workloads while supporting opportunistic DAR processing.
Data Classification Service · Traffic Shaping · Data Compliance

Senior Software Engineer

Promoted

Sep 2018 - Jun 2022 · 3 yrs 9 mos · Hyderabad, Telangana, India

  • M365 Auditing ETL pipeline | Cost reduction | Azure | OData | REST APIs
  • Architected cost and scale improvements for the Auditing pipeline (handling 16B+ audit records/day) by leveraging Apache Spark and Redis to optimize batching, reduce storage write operations, and enable downstream service down-scaling, achieving a 40% cost reduction.
  • Designed OData-compliant Auditing APIs aligned with Microsoft REST API guidelines; published to Microsoft Graph, enabling 1st-party and 3rd-party apps (e.g., Microsoft Word) to stream audit records from edge devices.
  • Developed near-real-time identification of sensitive information in text added to O365 documents (Word, PowerPoint, Excel) to enable application of compliance policies using Microsoft’s Augmentation Loop platform.
  • Developed a time-based assistant on Substrate to help compliance admins analyze sensitive content distribution across the enterprise; built with sandboxed code to streamline onboarding for other teams.
Apache Spark · Redis · OData · REST APIs · Data Engineering · API Development

Software Engineer 2

Jun 2017 - Aug 2018 · 1 yr 2 mos · Hyderabad, Telangana, India

WizCal

Architect

Mar 2016 - Jul 2017 · 1 yr 4 mos · Hyderabad, Telangana, India

Socomo Technologies Private Limited

VP Engineering

Oct 2014 - Sep 2015 · 11 mos · Chandigarh, India

Microsoft

Software Development Engineer

Aug 2012 - Sep 2014 · 2 yrs 1 mo · Hyderabad, India

Royal Bank of Scotland

Summer Intern

May 2010 - Jul 2010 · 2 mos · Gurugram, Haryana, India

Education

Indian Institute of Technology, Delhi

Dual Degree (B.Tech and M.Tech) in Computer Science and Engineering

Jan 2007 - Jan 2012
