Rahul Patel

Data Engineer

Noida, Uttar Pradesh, India · 2 yrs 10 mos experience

Key Highlights

  • 3+ years of experience in cloud-native data solutions.
  • Expert in real-time data processing and analytics.
  • Proven track record in optimizing ETL pipelines.
Stackforce AI infers this person is a Data Engineer specializing in Fintech and Cloud Data Solutions.

Skills

Core Skills

Data Engineering · Cloud Computing

Other Skills

AWS Lambda · Amazon Connect · Amazon Web Services (AWS) · Apache Kafka · Azure Cloud · Azure Cloud Data Engineering · Azure Data Lake · Azure Databricks · Azure DevOps · Big Data · Big Data and Apache Spark · Business Analytics · Communication · Computer Science · Data Analysis

About

📌 About Me

🎯 Data Engineer | Azure + AWS | Real-Time & Big Data Professional
📍 Noida, India | 📞 +91-8382020852 | 📧 rahulpatelue188076@gmail.com
🔗 Portfolio: codebasics.io/portfolio/RAHUL-PATEL

I’m Rahul Patel, a passionate and performance-oriented Data Engineer with 3+ years of hands-on experience in building cloud-native, scalable, and real-time data solutions using Azure, Databricks, ADF, Spark, and Power BI.

💡 My mission: Transform raw data into business value through clean architecture, optimized pipelines, and actionable insights.

🚀 Expertise At a Glance

  • 🔹 Cloud Platforms: Azure Data Factory, Databricks, Azure Data Lake
  • 🔹 Big Data Tools: Apache Spark (PySpark), Delta Lake, Kafka, Pub/Sub
  • 🔹 Programming & Querying: Python, SQL
  • 🔹 Data Visualization: Power BI, Excel (Advanced Dashboards)
  • 🔹 Development Tools: Azure DevOps, CI/CD, Git
  • 🔹 Key Strengths: ETL/ELT, Data Modeling, Real-Time Streaming, Data Governance

🏆 Key Projects

📍 HDFC Digital Banking (Feb 2025 – Present)

  • Built real-time ingestion pipelines using Kafka + Pub/Sub
  • Architected a 3-layer Delta Lakehouse in Databricks:
      • Staging – Raw JSON capture
      • Curated – Parsed & transformed data by txn_id
      • Service – Report-ready aggregated tables using mapping logic
  • Delivered end-to-end analytics for Net Banking & Mobile Banking platforms

📍 Microsoft Bing Analytics

  • Developed and managed large-scale telemetry pipelines using ADF + Spark (PySpark)
  • Processed multi-terabyte datasets to extract user interaction insights
  • Ensured pipeline optimization and minimal latency for Bing usage analytics
  • Created reusable data frameworks for telemetry standardization

📍 Tokyo Olympics Analytics

  • Designed real-time dashboards and data pipelines to monitor Olympic events globally
  • Ingested and modeled data from live event feeds using Azure Data Lake + Databricks
  • Enabled stakeholders to track performance metrics and logistics in near real-time

📍 Retail Business 360° (Brick & Mortar + E-commerce)

  • Integrated data from multiple channels (POS systems, websites, customer apps)
  • Modeled unified customer and sales views using ADF + SQL + Power BI
  • Developed executive dashboards for sales trends, inventory, and customer behavior
  • Improved reporting speed and data refresh rates by 40% through optimization

🤝 Let’s Connect

I love working on real-world data problems and collaborating with teams driving innovation through data. Open to networking, learning, and new opportunities in the Cloud/Data Engineering domain. Let’s build data systems that power the future! 💼📊

Experience

Nexgen tech solutions

Data Engineer

Feb 2025 – Present · 1 yr 1 mo · Noida, Uttar Pradesh, India · On-site

  • 🔹 Leading data engineering initiatives on the HDFC Net Banking and Mobile Banking project, handling high-volume real-time streaming data.
  • 🔹 Ingesting and processing streaming data using Apache Kafka and Google Cloud Pub/Sub, ensuring reliable and scalable data flow.
  • 🔹 Architected and maintained a robust three-layer data architecture (Staging, Curated, Service) on Azure Cloud using Azure Databricks and Delta Lake.
      • Staging Layer: Capturing and storing raw JSON payloads directly from streaming sources.
      • Curated Layer: Parsing and transforming data into well-defined structured tables based on transaction types (e.g., txn_id).
      • Service Layer: Creating high-performance report tables using predefined mapping logic for business consumption.
  • 🔹 Built and optimized ETL pipelines using Databricks (PySpark) and Azure Data Factory, reducing latency and improving data availability.
  • 🔹 Developed interactive Power BI dashboards to deliver actionable insights to business users and stakeholders.
  • 🔹 Integrated CI/CD workflows using Azure DevOps, streamlining deployment and version control of data pipelines.
  • 🔹 Collaborated closely with cross-functional teams including business analysts, DevOps engineers, and QA to align on data requirements and delivery.
  • 🔹 Ensured data governance, quality, and security standards were met by implementing best practices and audit mechanisms across the pipeline.
Apache Kafka · Google Cloud Pub/Sub · Azure Databricks · Delta Lake · ETL · Power BI · +3 more
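
The three-layer flow described in this role (Staging, Curated, Service) can be illustrated with a minimal sketch. It uses plain Python in place of Kafka, Databricks, and Delta Lake so that it runs anywhere. The sample payloads, the `channel` and `amount` fields, the `CHANNEL_LABELS` mapping, and the choice to keep one row per `txn_id` are all hypothetical assumptions made for illustration, not details of the actual project.

```python
import json
from collections import defaultdict

# Hypothetical raw payloads, standing in for messages read from a stream.
RAW_MESSAGES = [
    '{"txn_id": "T1", "channel": "net_banking", "amount": 250.0}',
    '{"txn_id": "T2", "channel": "mobile_banking", "amount": 100.0}',
    '{"txn_id": "T1", "channel": "net_banking", "amount": 250.0}',  # duplicate delivery
    'not valid json',
]

# Hypothetical mapping logic: channel code -> report label.
CHANNEL_LABELS = {"net_banking": "Net Banking", "mobile_banking": "Mobile Banking"}


def staging(messages):
    """Staging: keep every payload exactly as received."""
    return [{"raw": message} for message in messages]


def curated(staged_rows):
    """Curated: parse JSON, skip unparseable rows, keep one row per txn_id."""
    by_txn = {}
    for row in staged_rows:
        try:
            record = json.loads(row["raw"])
        except json.JSONDecodeError:
            continue  # assumption: bad payloads stay in staging only
        by_txn[record["txn_id"]] = record
    return list(by_txn.values())


def service(curated_rows):
    """Service: aggregate amounts per mapped channel label."""
    totals = defaultdict(float)
    for record in curated_rows:
        label = CHANNEL_LABELS.get(record["channel"], "Other")
        totals[label] += record["amount"]
    return dict(totals)


report = service(curated(staging(RAW_MESSAGES)))
print(report)
```

Keeping the raw payload untouched in the first layer means the later layers can be rebuilt from it if the parsing or mapping rules change; that is one common motivation for this kind of layering, though the profile itself does not state the reasoning.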

HCLTech

3 roles

Data Engineer

Promoted

Jan 2024 – Jan 2025 · 1 yr · Noida, Uttar Pradesh, India

  • Data Integration and Modeling: Successfully integrated and modeled complex datasets from multiple sources, ensuring seamless data flow and enabling comprehensive analysis across business units.
  • Enhanced ETL Efficiency: Optimized ETL processes by applying advanced data engineering techniques, boosting data management efficiency and cutting processing time.
  • Leveraged Cloud Solutions: Utilized Azure Cloud technologies to develop scalable data solutions, seamlessly integrating cloud-based workflows for enhanced system performance.
  • Collaborative Project Delivery: Worked cross-functionally to deliver impactful projects, using expertise in data modeling, ETL, and cloud solutions to drive successful business outcomes.
Azure Cloud · ETL · Data Modeling · Data Engineering · Cloud Computing

Software Engineer

Mar 2023 – Dec 2023 · 9 mos · Noida, Uttar Pradesh, India

  • Developed automated data processing pipelines using Python, MS Excel, and SQL, reducing manual data entry time by 60% and improving data accuracy by 35%.
  • Implemented foundational data principles to enhance data processing efficiency.
  • Kept current with emerging technology trends to optimize software solutions.
  • Collaborated cross-functionally to deliver impactful projects, leveraging technical expertise for business success.
Python · MS Excel · SQL · Data Engineering

Graduate Engineering Trainee

Oct 2022 – Mar 2023 · 5 mos · Noida, Uttar Pradesh, India

  • During my tenure as a Graduate Software Engineer Trainee, I gained comprehensive experience in software development, working within a dynamic and collaborative environment. My role involved engaging with senior engineers and cross-functional teams to design, develop, and deploy software solutions that met business and user needs.

Education

Panjab University, Chandigarh

Bachelor of Engineering - BE — Information Technology

Jun 2018 – Jun 2022
