Harshita Singh

Data Engineer

Bengaluru, Karnataka, India · 4 yrs 7 mos experience

Key Highlights

  • Reduced manual QA efforts by 50%
  • Achieved 20% improvement in data ingestion efficiency
  • Enhanced decision-making through actionable data delivery
Stackforce AI infers this person is a Data Engineering expert in cloud-based solutions, focusing on Azure technologies.

Skills

Core Skills

Azure Data Engineering, Data Pipeline Development, Disaster Recovery, Data Ingestion, Data Processing Optimization

Other Skills

Analytical Skills, Analytics, Apache Arrow, Apache Sedona, Attention to Detail, Automation, Azure, Azure API Management, Azure Active Directory, Azure Automation, Azure Backup, Azure CLI, Azure Data Lake Storage Gen2, Azure Databricks, Azure DevOps

Experience

Total Experience: 4 yrs 7 mos
Average Tenure: 4 yrs 7 mos
Current Experience: 4 yrs 7 mos

Boeing

Associate Software Engineer

Nov 2021 - Present · 4 yrs 7 mos · Bengaluru, Karnataka, India · Hybrid

  • Disaster Recovery (DR) Plan
  • A disaster involves the prolonged or permanent loss of an Azure service, region, or datacenter due to events like fire, flood, or earthquake. Unlike temporary outages, disasters require a strategic recovery approach. This plan defines the procedures to restore the Aviation Insights platform during such critical incidents.
  • Skillset Used:
  • Azure, Azure Site Recovery, Azure Backup, High Availability, Azure Monitor, Azure Service Health, Log Analytics, Azure Resource Manager, Azure CLI, Azure Storage, Azure Virtual Machines, PowerShell, Azure Automation, Virtual Networks, Identity and Access Management, Role-Based Access Control, SLA Management
  • DaaS HotSpot Project
  • Developed an Azure Function App to replicate data from a source Azure Event Hub to multiple receivers. Built a CI/CD pipeline using ARM templates for complete infrastructure deployment, integrated with Azure Key Vault for secure secret management. Enabled data capture to Azure Data Lake Storage (ADLS Gen2) for downstream processing.
  • Skillset Used:
  • Azure Event Hub, Azure Function App, CI/CD, Azure Resource Manager Templates, Azure Key Vault, Azure Data Lake Storage Gen2, Azure DevOps, Infrastructure as Code, PowerShell, Git, YAML Pipelines
  • Phase of Flight Detection
  • Redesigned the Phase of Flight prototype for distributed processing on Azure Databricks using Apache Spark. Converted a fuzzy logic prediction model into a Pandas UDF leveraging Apache Arrow, improving processing speed by 30x. Enhanced accuracy by deriving additional geospatial parameters using Apache Sedona and SparkSQL joins. Modularized Python and Scala codebases following OOP principles and implemented stateful processing using flatMapGroupsWithState and withWatermark. Established unit tests and CI/CD pipelines for deployment.
  • Skillset Used:
  • Azure Databricks, PySpark, Pandas UDF, Apache Arrow, Scikit-Fuzzy, Apache Sedona, SparkSQL, Geospatial Processing, Python, GitLab CI/CD, Unit Testing, Real-time Data Processing

Education

CDAC Bangalore

DBDA

May 2021 - Sep 2021

Dr. RMLAU INSTITUTE OF ENGINEERING AND TECHNOLOGY

B.TECH — Computer Technology/Computer Systems Technology

Jan 2016 - Jan 2020
