Gaurav Chauhan

Data Scientist

Gurugram, Haryana, India · 3 yrs 7 mos experience

Key Highlights

  • Expert in Cloud Data Engineering and Data Analytics.
  • Architected scalable data pipelines using Data Mesh architecture.
  • Achieved AWS Developer Associate and Cloud Practitioner certifications.
Stackforce AI infers this person is a Data Engineering expert in SaaS and Healthcare industries.

Skills

Core Skills

Data Engineering · AWS

Other Skills

SQL · Python · Data Build Tool (DBT) · Snowflake · Data Quality · Data Governance · Airflow · Big Data · Data Warehouse Architecture · Amazon Web Services (AWS) · Fivetran ETL Tool · Looker · Data Visualization · Amazon S3 · Professional Mentoring

About

A goal-oriented professional with 4 years of experience in the IT industry, specializing in Cloud Data Engineering and Data Analytics.

  • Proficient in tools and techniques such as SQL, DBT, Python, Snowflake, ETL/ELT, Airflow, Data Modelling, Data Transformation, Data Warehousing, Data Visualization (Looker), AWS Services (Redshift, Glue, MWAA, Lambda), PySpark, and Excel.
  • Experienced in designing and building data pipelines using the Data Mesh architecture, implementing efficient data ingestion, transformation, and loading across Medallion layers (pre-bronze, bronze, silver, gold) to enable informed decision-making and strategic insights.
  • Adept at data modeling using DBT; partnered with business stakeholders and PMs to translate requirements into technical specifications and execution workflows, ensuring scalable reporting solutions.

Tech stack worked on:

  • 🌟 Languages: Python, SQL, C/C++
  • 🌟 Data Engineering Tools: DBT, Snowflake, Airflow, Fivetran, ETL/ELT, PySpark, Looker, Data Pipelines, BigQuery, OLTP/OLAP, Atlan, Oracle, Excel, Linux, CI/CD
  • 🌟 AWS Services: S3, Glue, Redshift, Airflow, AppFlow, EC2, RDS (PostgreSQL), IAM, Secrets Manager
  • 🌟 Databases: SQL Server, MySQL, Oracle DB, PostgreSQL
  • 🌟 Version Control: Git/GitHub

📈 Achievements:

  • Received the 'On The Spot' award at Gemini Solutions Private Limited for exceptional leadership, character, and commitment to work.
  • Achieved AWS Developer Associate Certification and AWS Cloud Practitioner Certification.

Experience

Total Experience: 3 yrs 7 mos
Average Tenure: 1 yr 3 mos
Current Experience: 1 yr 1 mo

Topmate.io

Mentor

Jun 2025 – Present · 11 mos · Remote

  • Guiding individuals on Resume Building, Job Search Strategies, Securing Internships, Cracking Interviews, and Optimizing LinkedIn Profiles to maximize opportunities. Connect 1:1 with me for personalized guidance!
Professional Mentoring · Mentoring · Career Counseling

Tide

Senior Data Engineer

Apr 2025 – Present · 1 yr 1 mo · Gurugram · Remote

  • Connecting SMEs with the right funding options.
  • Architected and implemented scalable, automated pipelines for data extraction, transformation, and analysis within a Data Mesh framework, optimizing data flow and ensuring high availability and performance across the Partner Credit Services domain.
  • Proficient in data storytelling, transforming complex data insights into simple, easy-to-understand narratives and presentations for cross-functional teams.
SQL · Python · Data Build Tool (DBT) · Snowflake · Data Quality · Data Governance · +10

ZS

Data Engineer

Aug 2024 – Apr 2025 · 8 mos · Gurugram · Hybrid

  • Engineered data pipelines to support high-volume processing, leveraging MWAA, SQL, DBT, and AppFlow, tailored to client needs and optimizing business outcomes across multiple pharmaceutical subdomains, including Sales, Customer, Marketing, and Engagement.
  • Architected multi-layered DBT models leveraging ETL/ELT best practices, implementing snapshots and incremental strategies (SCD1/SCD2), and collaborated with cross-functional teams to deliver scalable data solutions supporting key business requirements.
  • Built a centralized data repository for a pharmaceutical client by ingesting multi-domain data (Customer, Sales, Marketing, Engagement, etc.) from sources including Veeva CRM, IQVIA, and Salesforce, then modelled the data to deliver analytics-ready datasets that enabled downstream reporting and insights generation.
  • Leveraged DBT materializations (incremental, materialized views, tables) and implemented snapshots and custom macros to design efficient load strategies including SCD1, SCD2, and truncate-load, optimizing data transformations, maintainability and model refresh performance.
  • Conducted data quality checks on datasets exceeding 20M records, collaborated with the QE team to identify and resolve 150+ DQ issues, ensured readiness for production deployment, and achieved 99.9% data accuracy.
  • Collaborated with cross-functional teams to deliver solutions that met client requirements and optimized their business outputs.
SQL · Python · Data Build Tool (DBT) · Extract, Transform, Load (ETL) · Amazon Redshift · Amazon Web Services (AWS) · +12

Gemini Solutions Pvt Ltd

4 roles

Senior Software Engineer L1 - PIMCO LLC

Promoted

Apr 2024 – Jul 2024 · 3 mos

  • Integrated enterprise data from diverse sources (SFTP, APIs, databases) into Oracle and Snowflake via Python, Airflow, DBT, and AWS ETL pipelines, enabling scalable storage and unified access.
  • Designed and implemented a scalable data warehousing and transformation framework, optimizing Airflow pipelines, managing feature delivery and rollback strategies, and maintaining system reliability and documentation.
  • Led the modernization of a legacy on-prem Sybase IQ database by orchestrating the migration of 120+ production tables (avg. 30GB each) to Snowflake, using Airflow pipeline templates for seamless data transfer via AWS S3 and reducing operational costs from several million to under $290K.
  • Designed robust and scalable batch data pipelines to ingest equity, stock, index, and securities data from on-prem Sybase IQ into Snowflake via Amazon S3 as an intermediate storage layer, processing CSV, JSON, and Parquet formats, and optimizing data storage with date-based partitioning and columnar compression for efficient querying and cost-effective processing.
  • Orchestrated data pipelines in Airflow to perform historical and incremental data loads, implementing sharding for parallel processing of high-volume datasets, retry and failure handling, and building fact and dimension tables in Snowflake; leveraged zero-copy cloning and Time Travel for data recovery, and enforced robust data quality checks to ensure accuracy, reliability, and trusted reporting.
  • Implemented data governance protocols to ensure data quality and compliance, resulting in improved data accuracy and security measures.
  • Engineered a stats aggregation process to monitor DBT's Snowflake Warehouse utilization, automating monthly and quarterly reporting and eliminating 4+ hours of manual effort per report.
Data Engineering · SQL · Python · Snowflake · Data Build Tool (DBT) · Apache Airflow · +13

Software Engineer L2 - PIMCO LLC

Apr 2023 – Jun 2024 · 1 yr 2 mos

Data Engineering · Amazon S3 · Amazon Web Services (AWS) · CI/CD · Data Pipelines · Data Build Tool (DBT) · +10

Software Engineer L1 - PIMCO LLC

Aug 2022 – Jun 2023 · 10 mos

Python · Data Engineering · Apache Airflow · Amazon Web Services (AWS) · Data Pipelines · Snowflake · +11

Technical Trainee

Mar 2022 – Aug 2022 · 5 mos

  • Started delivering quickly straight out of college and was consistently appreciated for fast learning and proactiveness.
Python · Apache Airflow · Amazon Web Services (AWS) · Snowflake · Extract, Transform, Load (ETL) · Data Engineering · +1

F13 Technologies

AWS Cloud Intern

Sep 2021 – Jan 2022 · 4 mos · New Delhi, Delhi, India · Remote

  • Assisted in the design and implementation of AWS cloud solutions for clients.
  • Collaborated with the team to troubleshoot and resolve technical issues on AWS platforms.
  • Conducted research and provided recommendations on AWS best practices and cost optimization strategies.
Amazon S3 · Problem Solving · Amazon Web Services (AWS) · Teamwork · Amazon Relational Database Service (RDS) · Extract, Transform, Load (ETL) · +1

Education

GALGOTIAS COLLEGE OF ENGINEERING AND TECHNOLOGY, GREATER NOIDA

Bachelor of Technology - BTech — Electronics and Communications Engineering

Jul 2018 – Jul 2022
