Parv Rastogi

Data Engineer

Bengaluru, Karnataka, India3 yrs 8 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Achieved 60% reduction in data processing time.
  • Increased data pipeline reliability by 20%.
  • Crafted custom PowerBI reports for enhanced data visualization.
Stackforce AI infers this person is a Data Engineer specializing in SaaS and Healthcare data solutions.

Contact

Skills

Core Skills

Data EngineeringSqlData Quality

Other Skills

Apache SparkApache HiveData MigrationAIPySparkPowerBIData VisualizationPostgreSQLLinuxJavaScalaPiperAzureSnowflakeExtract

About

Hello! I'm Parv Rastogi, a results-driven Data Engineer with 3.5 years of experience in data acquisition, management, and pipeline optimization, seeking full-time data engineering roles. Proven ability to optimize data pipelines, achieving a 40% reduction in processing time, 20% increase in reliability and 40% reduction in individual job cost through Spark Jobs amd ETL Optimizations. ๐™†๐™š๐™ฎ ๐˜ผ๐™˜๐™๐™ž๐™š๐™ซ๐™š๐™ข๐™š๐™ฃ๐™ฉ๐™จ: - Reduced data processing time by 60% through automation scripts, enhancing operational efficiency by 25%. - Increased data pipeline reliability by 20% by contributing to infrastructure management during FaaS execution. - Identified and resolved 15% of infrastructure-related incidents, minimizing system downtime. - Improved data quality and downstream application performance by 40% through proactive data pipeline maintenance. ๐™๐™š๐™˜๐™๐™ฃ๐™ž๐™˜๐™–๐™ก ๐™Ž๐™ ๐™ž๐™ก๐™ก๐™จ: - Cloud Services (ADLS, Azure Databricks) - Programming Languages (Python, SQL) - Data Engineering (Hadoop, Spark, DataBricks) - Databases (MySQL, PostgreSQL, Presto) - Data Warehouses (Hive, Snowflake) ๐™Ž๐™ค๐™›๐™ฉ ๐™Ž๐™ ๐™ž๐™ก๐™ก๐™จ: - Communication - Teamwork - Cross-Team Collaboration ๐™‹๐™ง๐™ค๐™›๐™š๐™จ๐™จ๐™ž๐™ค๐™ฃ๐™–๐™ก ๐™…๐™ค๐™ช๐™ง๐™ฃ๐™š๐™ฎ: In my previous role at Innovaccer Analytics Pvt Ltd., a US-based healthcare company, I engineered and maintained data pipelines, ensuring the ingestion of clean and validated data onto the platform. I proactively identified and resolved issues across the data processing pipeline, demonstrating expertise in SQL commands and scripting for platform specifications. Notably, I developed Python automation scripts, integrating real-time Slack alerts, to optimize data load processes, showcasing my problem-solving abilities. ๐™’๐™๐™ฎ ๐™ˆ๐™š? As a quick learner with a passion for the data engineering domain, I'm confident in my potential to be a valuable addition to any Data Engineering team. My experience crafting custom PowerBI reports and interactive dashboards underscores my ability to conceptualize, design, and develop solutions tailored to specific business requirements.

Experience

3 yrs 8 mos
Total Experience
2 yrs 4 mos
Average Tenure
1 yr 4 mos
Current Experience

Indium

Data Engineer

Feb 2025 โ€“ Present ยท 1 yr 4 mos ยท Bengaluru, Karnataka, India ยท Hybrid

  • Data Migration & Optimization: Migrated critical Uber trip datasets from GDW to DI, optimizing SQL queries and boosting query performance by 25%.
  • Framework Contribution: Enhanced Sparkle (Spark) framework by adding bucketing support, enabling consistent data distribution and faster joins for large ETL pipelines across DI teams.
  • AI-driven Migration: Developed a POC using Cursor and Claude AI for GDW migrations, cutting manual effort by 40% and accelerating migration timelines.
  • Feature Development: Engineered and maintained Python and Java Spark jobs, resolving data inconsistencies and delivering business-driven features.
  • Legacy Datasets Deprecation: Streamlined data ecosystem by assisting deprecation of legacy datasets, migrating user ETLs and queries to modern SOTs.
  • Customer Support and Migration: Assisted data science and business teams through dataset adoption, ensuring smooth SQL query migration with minimal disruptions.
Apache SparkApache HiveData EngineeringSQL

Uber

External Consultant

Feb 2025 โ€“ Present ยท 1 yr 4 mos ยท Bengaluru, Karnataka, India ยท Hybrid

Apache SparkApache Hive

Career break

2 roles

Professional development

Dec 2024 โ€“ Feb 2025 ยท 2 mos

Health and well-being

Dec 2024 โ€“ Feb 2025 ยท 2 mos

Innovaccer

2 roles

Data Analyst - Data Engineering

Jul 2022 โ€“ Nov 2024 ยท 2 yrs 4 mos ยท Noida, Uttar Pradesh, India

  • Engineered and maintained data pipelines, ensuring the ingestion of clean and validated data onto the Datashop Platform, demonstrating expertise in SQL commands and scripting for platform specifications.
  • Identified bottlenecks in data ingestion from diverse sources, such as SFTP and ADLS. Developed Python scripts to streamline data flow, achieving a 75% reduction in processing time. Integrated real-time Slack alerts within scripts to identify and resolve data load issues, leading to a further 25% improvement in operational efficiency.
  • Contributed to infrastructure management during Function as a Service (FaaS) execution, guaranteeing end-to-end ownership, preventing failures or interruptions in analytics job execution and increasing the overall reliability of the system by 20%.
  • Leveraged monitoring and observability tools to identify and resolve 15% of infrastructure-related incidents during the initial investigation phase. This minimized system downtime and ensured stability, demonstrating a resourceful approach to incident management and system reliability.
  • Orchestrated seamless data acquisition and management processes by implementing robust data cleaning and preprocessing strategies, utilizing PostgreSQL and Snowflake for efficient extraction from diverse sources.
  • Proactively identified and mitigated data pipeline issues, leading to improved data quality and a 40% increase in downstream application performance.
  • Played a key role in crafting a custom PowerBI report and an interactive dashboard for a client, showcasing directed efforts to conceptualize, design and develop solutions to meet specific business requirements. The dashboard streamlined data exploration and analysis, providing the client with an accessible platform for in-depth study of the chronic disease population.
PySparkSQLData EngineeringData Quality

Associate Software Engineer Intern

Jan 2022 โ€“ Jul 2022 ยท 6 mos ยท Noida, Uttar Pradesh, India

  • โ€ข This 6-month training program covered the various tools and technologies required to fulfill the role of a backend developer.
PostgreSQLLinux

Education

KIET Group of Institutions

Bachelor of Technology - BTech โ€” Information Technology

Aug 2018 โ€“ Jun 2022

Seth Anandaram Jaipuria School

12th

Jan 2017 โ€“ Jan 2018

Seth Anandram Jaipuria School

10th

Jan 2015 โ€“ Jan 2016

Stackforce found 100+ more professionals with Data Engineering & Sql

Explore similar profiles based on matching skills and experience