Harman Bhatia

Data Engineer

Bengaluru, Karnataka, India7 yrs 8 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in Generative AI and Large Language Models.
  • Reduced testing time by 70% through automation.
  • Led successful data migration projects exceeding 40 TB.
Stackforce AI infers this person is a Data Engineering expert specializing in AI and Big Data solutions.

Contact

Skills

Core Skills

SnowflakePythonPysparkAwsData IngestionMarket Research

Other Skills

Looker (Software)Python (Programming Language)Amazon Web Services (AWS)DevOpsScrumBig DataMySQLData VisualizationWeb ScrapingETLHivePostgreSQLHDFSSpark SQLSQL

About

I am a Data Engineer with specialized expertise in Generative AI and Large Language Models (LLMs). With a strong background in data research, analysis, and transformation, I have successfully led and executed multiple AI projects and hold certifications in LLMs and Prompt Engineering. Key Achievements: ● AI & LLM Expertise: Developed and implemented state-of-the-art solutions using Generative AI and LLMs to drive innovation and efficiency in various projects. ● Automation & Efficiency: Reduced testing time by 70% per metric (handled 240+ metrics) by developing a testing automation framework using web scraping and Python. ● Data Pipeline Development: Built a real-time pipeline to migrate historical data (>1TB) and incremental data (2 million records per day) from RDS Postgres to Snowflake using AWS DMS, Amazon S3, and AWS Lambda. ● Data Warehouse Design: Designed a Snowflake Data Warehouse framework with a star schema for the platform migration of 280 tables. ● ETL Implementation: Implemented an ETL pipeline to migrate 40+ TB of data across 1000+ tables from Teradata to Snowflake using TPT, AWS S3, and AWS EMR. ● Redshift to Databricks Migration: Migrated data from Redshift to Databricks, leveraging DBT and Airflow for efficient data transformation and orchestration. ● Data Ingestion: Created an ingestion module to process historical data (>10 GB) into HDFS from various heterogeneous sources. ● Data Transformation: Developed PySpark code to transform 67 million records using Spark SQL. ● Code Debugging & Testing: Enhanced data accuracy for decision-making by 90% through rigorous unit testing, system integration testing, and performance benchmarking. Leadership & Mentorship: ● Team Leadership: Led a team of 5 associates, effectively delegating tasks and delivering projects ahead of deadlines through strong communication and teamwork. ● Mentorship: Trained and mentored over 20 new colleagues, providing them with essential knowledge of tools and technologies such as Python, SQL, Snowflake, AWS, and Big Data stack. Publications: Check out my blogs on Medium: > Python Series: "Just Python" > Data Migration: "Postgres to Snowflake — Migrate Real-time and Historical Data" > Salesforce to Snowflake: "Migrate Data from Salesforce to Snowflake" I am open to working as an independent contractor or in a full-time role, with flexibility across multiple time zones and a willingness to travel. This adaptability ensures that I can meet the diverse needs of global clients and projects. Feel free to contact me at - imharmanbhatia@gmail.com

Experience

7 yrs 8 mos
Total Experience
2 yrs 6 mos
Average Tenure
3 yrs 10 mos
Current Experience

Coursera

Senior Data Engineer

Jun 2022Present · 3 yrs 10 mos

Zs

Senior Data Engineer

Nov 2020Jun 2022 · 1 yr 7 mos · Pune, Maharashtra, India

  • ● Reduced the testing time by 70% per metric (handled 240+ metrics) by developing the testing automation
  • framework using web scraping and python.
  • ● Migrated 442 data files to AbbVie’s CDL in just 4 months covering detailed data analysis, automation of ingestion
  • scripts and data quality checks.
  • ● Designed and developed the Snowflake Data Warehouse framework for platform migration.
  • ● Implemented a pipeline to migrate 40+ TB data of 1000+ tables from Teradata to Snowflake using TPT, S3 and EMR.
  • ● Provided multiple Snowflake and Python training sessions to 300+ colleagues.
  • ● Led the team of 5 associates, delegated tasks and delivered the project days before deadline with the
  • effective communication and teamwork.
  • ● Develops and maintains effective relationships with others in order to encourage and support the team
  • Technology Stack includes -
  • > Snowflake
  • > Python
  • > SQL
  • > Hadoop
  • > Hive
  • > Spark
  • > AWS
PySparkLooker (Software)SnowflakePython (Programming Language)Amazon Web Services (AWS)DevOps+5

Infosys limited

2 roles

Big Data Engineer

Oct 2018Oct 2020 · 2 yrs

  • ● Implemented an ingestion module to ingest historical data (>10 GB) into HDFS from various heterogeneous sources.
  • ● Developed Pyspark code using Spark SQL for the transformation of data with 67 million records.
  • ● Debugged the code using unit testing, system integration testing, and performance benchmarking.
  • ● Created and managed documents relating to business rules, wireframes, and functional requirements.
  • ● Mentored and trained 20+ new colleagues with the required knowledge of tools and technologies such as python and big data.
  • ● Collaborate with various teams and management to gather business requirements and design data
  • pipelines as per client expectations.
PySparkSnowflakePython (Programming Language)Amazon Web Services (AWS)DevOpsScrum+4

System Engineer

Jul 2018Oct 2018 · 3 mos

  • Worked on Python, Data Structure and Algorithms.
SQLPython (Programming Language)ScrumMySQLAlgorithmsData Structures+2

The financial doctors

Research Intern

Jun 2018Jul 2018 · 1 mo · New Delhi Area, India

  • Developed Technical Strategies for stock market.
SQLMarket ResearchData ScienceMachine LearningAnalytical Skills

Gs auto international ltd

Industrial Trainee

Jan 2018May 2018 · 4 mos · Ludhiana Area, India

  • This was my first industrial exposure and got a chance to work on the development of the Electric Car Prototype (1:1).
  • ● Starting in January with just the design in the laptop to the electric car running on the road in May, the journey was priceless.
  • ● A team of 5 worked 50 hours a week to complete the prototype within the deadline.
  • ● Worked on the cutting of rods, welding, moulding, casting to automate all the internal features of the car using Python.

Punjab communications limited

Summer Intern

Jun 2016Jul 2016 · 1 mo · Chandigarh Area, India

  • Worked on Embedded Microcontroller Program.

Thinknext technologies pvt. ltd. - india

Industrial Training

Jun 2016Jul 2016 · 1 mo · Chandigarh Area, India

  • Completed the Live project in Raddison Hotel Management in Web Designing

Education

Lovely Professional University

Master of Science - MS — Information Technology

Aug 2021Feb 2023

Guru Nanak Dev Engineering College, Ludhiana

Bachelor of Technology — Electronics and Communications Engineering

Jan 2014Jan 2018

Stackforce found 100+ more professionals with Snowflake & Python

Explore similar profiles based on matching skills and experience