Suvam Mondal

Data Engineer

Bengaluru, Karnataka, India · 4 yrs 5 mos experience

Highly Stable · AI Enabled

Key Highlights

3.5+ years of expertise in Data Engineering.
Skilled in ETL development using PySpark and Informatica.
Passionate about transitioning to AI-ML Engineering.

Stackforce AI infers this person is a Data Engineer specializing in ETL processes within the Healthcare sector.

Skills

Core Skills

Data Engineering · ETL Development · Data Integration · Data Integrity · Data Visualization

Other Skills

AWS Redshift · Amazon S3 · Amazon Web Services (AWS) · Artificial Intelligence (AI) · Azure Databricks · Bootstrap · C++ · Cascading Style Sheets (CSS) · Communication Skill · Data Build Tool (DBT) · Data Warehouse · DataStage · Databases · Databricks · ETL Tools

About

Hi, I'm Suvam, a Data Engineer at PwC with over 3.5 years of experience in data engineering. I completed my B.Tech at NIT Silchar and enjoy solving complex ELT/ETL data challenges. I aspire to become an AI-ML Engineer and have a strong foundation in ML algorithms. My experience covers ELT development with Data Build Tool (DBT) and Snowflake, and ETL development using PySpark on Databricks, supported by Python and Pandas. I have also designed data integration pipelines in Informatica/IICS. Open to collaborations, new opportunities, and discussions on all things data!

Experience

Total Experience: 4 yrs 5 mos

Average Tenure: 3 yrs 3 mos

Current Experience: 1 yr 1 mo

PwC

Data Engineer

May 2025 – Present · 1 yr 1 mo · Bengaluru, Karnataka, India

Data Build Tool (DBT) · Snowflake · PySpark · Databricks · Python · Pandas · +4

Cognizant

Data Engineer

Jan 2022 – May 2025 · 3 yrs 4 mos · Bengaluru, Karnataka, India · Remote

3+ years of experience in ETL development using PySpark on Databricks, supported by Python/Pandas. Experienced in designing data integration pipelines with Informatica/IICS.
## CLIENT - U.S.-BASED HEALTHCARE ##
|| PySpark Developer - Databricks, Pandas ||-------------------------------------------------------------------
Implemented SCD Type 2 pipelines and CDC processes using PySpark/Python, utilizing MD5 hash functions for data integration.
Developed transformations such as Lookup and Join in PySpark DataFrames, moving data from S3 buckets to RDBMS targets such as Redshift and Snowflake.
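For readers unfamiliar with the pattern, the SCD Type 2 and MD5 hashing mentioned above can be illustrated roughly as follows. This is a minimal sketch in plain Python and not the project's actual PySpark code: the function names (`row_hash`, `apply_scd2`) and the columns `row_hash`, `valid_from`, `valid_to`, and `is_current` are hypothetical, chosen only for illustration.

```python
import hashlib
from datetime import date


def row_hash(record, tracked_cols):
    """MD5 digest over the tracked attribute values, joined with a delimiter."""
    joined = "|".join(str(record.get(col, "")) for col in tracked_cols)
    return hashlib.md5(joined.encode("utf-8")).hexdigest()


def apply_scd2(dimension, incoming, key, tracked_cols, load_date):
    """Return a new dimension list with SCD Type 2 changes applied.

    Unchanged records are skipped, changed records expire the current
    version and add a new one, and unseen keys are inserted as current.
    """
    result = [dict(row) for row in dimension]  # copy so the input is untouched
    current = {row[key]: row for row in result if row["is_current"]}
    for rec in incoming:
        new_hash = row_hash(rec, tracked_cols)
        existing = current.get(rec[key])
        if existing is not None and existing["row_hash"] == new_hash:
            continue  # same hash: no attribute changed, nothing to do
        if existing is not None:
            # Hash differs: close out the currently active version.
            existing["is_current"] = False
            existing["valid_to"] = load_date
        new_row = {key: rec[key]}
        new_row.update({col: rec.get(col) for col in tracked_cols})
        new_row.update(
            row_hash=new_hash, valid_from=load_date, valid_to=None, is_current=True
        )
        result.append(new_row)
        current[rec[key]] = new_row
    return result


# Hypothetical sample data: one member changes plan, one member is new.
dim = apply_scd2([], [{"member_id": 1, "plan": "A"}],
                 "member_id", ["plan"], date(2024, 1, 1))
dim = apply_scd2(dim, [{"member_id": 1, "plan": "B"}, {"member_id": 2, "plan": "A"}],
                 "member_id", ["plan"], date(2024, 2, 1))
```

Comparing a single hash per row avoids comparing every tracked column individually, which is one common reason MD5 is used in CDC-style loads.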
|| Informatica / IICS - ETL Developer ||-------------------------------------------------------------------
Designed and implemented end-to-end ETL pipelines using IICS to integrate data from multiple sources (e.g., flat files from an S3 bucket) into a centralized cloud data warehouse, Redshift.
Developed reusable Mappings, Mapplets, and Workflows to streamline data integration processes.
|| AWS REDSHIFT, S3 BUCKET, SNOWFLAKE ||-------------------------------------------------------------------
Snowflake, Redshift: Used SQL queries (via DBeaver) to correct mismatches and eliminate duplicate records across various layers, ensuring data integrity and accuracy.
S3 Bucket: Used S3 as cloud storage for efficient, reliable retrieval of raw data files (.dat).
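The duplicate-removal work described above could look something like the sketch below. It is an assumption-laden illustration only: it uses Python's built-in `sqlite3` so that it runs anywhere, relies on SQLite's `rowid` (Redshift and Snowflake have no such column, where a `ROW_NUMBER()`-based query is the usual substitute), and the `remove_duplicates` helper and `claims` table are hypothetical.

```python
import sqlite3


def remove_duplicates(conn, table, key_cols):
    """Delete all but the first row for each combination of key_cols.

    Identifiers are interpolated directly, so this sketch assumes
    trusted table and column names. Returns the number of rows removed.
    """
    cols = ", ".join(key_cols)
    cur = conn.execute(
        f"DELETE FROM {table} "
        f"WHERE rowid NOT IN (SELECT MIN(rowid) FROM {table} GROUP BY {cols})"
    )
    conn.commit()
    return cur.rowcount


# Hypothetical sample table with one duplicated record.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE claims (claim_id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO claims VALUES (?, ?)", [(1, 100.0), (1, 100.0), (2, 50.0)]
)
removed = remove_duplicates(conn, "claims", ["claim_id", "amount"])
remaining = conn.execute("SELECT COUNT(*) FROM claims").fetchone()[0]
```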
|| POWERBI AND MS SQL SERVER ||-------------------------------------------------------------------
Created and managed dashboards for the team, providing actionable insights. Developed and modified SQL queries using SSMS for database management.
___________________________________________