Sushma Jadhav

Associate Consultant

Pune, Maharashtra, India
6 yrs 4 mos experience

Most Likely To Switch · AI Enabled

Key Highlights

Led successful data-driven projects in fast-paced environments.
Enhanced data validation accuracy by 40% in analytics.
Reduced processing times by 60% through pipeline migration.

Stackforce AI infers this person is a Data Engineering expert with extensive experience in cloud-based data solutions.

Skills

Core Skills

Data Engineering, Cloud Computing, ETL Processes, Data Quality

Other Skills

Databricks, PySpark, Spark SQL, Data Modeling, AWS, Data Lake/Lakehouse, GitLab, ETL, Data Warehouse, Automated Data Cleansing, Airflow, Python, Oracle Database, Recommendation Engine, Natural Language Processing

About

Senior Data Engineer with 5+ years of experience in building and optimizing scalable data pipelines and architectures. Proficient in working with technologies such as SQL, Python, Spark, Databricks, and cloud platforms (AWS, Azure). Adept at transforming raw data into valuable insights by implementing efficient ETL processes, data warehousing, and automation solutions. Strong problem-solving skills with a focus on improving data quality, optimizing performance, and ensuring reliable data delivery. Experienced in leading teams, collaborating cross-functionally, and driving successful data-driven projects in fast-paced environments.

Experience

6 yrs 4 mos

Total Experience

3 yrs 2 mos

Average Tenure

4 yrs 5 mos

Current Experience

ZS

2 roles

Business Technology Solutions Associate Consultant

Promoted

Jun 2023 – Present · 3 yrs

Role: Senior Data Engineer
Tech Stack: Databricks, PySpark, Spark SQL, Data Modeling, AWS, Data Lake/Lakehouse, GitLab
Led a team of 4 and handled cross-functional collaboration, ensuring seamless project execution.
Designed and implemented a high-volume data pipeline to process structured and semi-structured data, and built a data lakehouse.
Managed 50+ deliverables monthly, ensuring 100% on-time delivery and high-quality results.
Enhanced data validation accuracy by 40%, reducing defects in downstream analytics.
Used GitLab for version control and collaboration across teams, ensuring smooth integration and deployment of code changes with minimal conflicts and delays.
Led the migration of data pipelines to Databricks and implemented Databricks SQL, reducing processing times by 60% and cutting compute costs by 30%.

Databricks, PySpark, Spark SQL, Data Modeling, AWS, Data Lake/Lakehouse (+3 more)

Senior Business Technology Solutions Associate

Jan 2022 – Jun 2023 · 1 yr 5 mos

Developed an ETL process using Spark SQL to clean and transform data from multiple sources and load it into a data warehouse.
Developed an automated data cleansing process to reduce manual errors and improve data quality.
Automated the report generation process using PySpark.
Created an end-to-end data load pipeline using Databricks and Airflow.
Converted native data to the OMOP CDM for multiple claims and EHR datasets.
Led a team of 4 people.
Migrated OMOP conversion pipelines from SQL to Databricks (Delta implementation), which reduced processing time by 40%.

Spark SQL, ETL, Data Warehouse, Automated Data Cleansing, Databricks, Airflow (+2 more)

Capgemini

3 roles

Associate Consultant

Oct 2021 – Dec 2021 · 2 mos

Cleaned, standardized, and transformed unruly raw data using a custom-built Python ETL application. Generated exception reports based on set criteria and provided them to business users for further assessment.
Developed Python scripts to extract, transform, and load data into the corresponding tables in an Oracle database. Interpreted the entity-relationship diagram and built table relations accordingly. Built a recommendation engine to suggest values such as RCA, turnaround time, and action taken for each exception. Developed a Python script to update existing exceptions in the database as part of a feedback loop.

Python, ETL, Oracle Database, Recommendation Engine, Data Engineering

Senior Analyst

Promoted

Oct 2020 – Sep 2021 · 11 mos

Python, Natural Language Processing, Azure Databricks, Data Engineering

Analyst

Sep 2019 – Sep 2020 · 1 yr

Skills: AWS Redshift, AWS CloudWatch
Worked on syncing data across environments, handling blocking queries, unloading data, monitoring CloudWatch and setting alarms, granting user access, and cleaning up backup tables.
Skills: Hadoop, HDFS, AWS S3, AWS EMR, Oozie
Orchestrated Hadoop jobs with Oozie.
Ran Apache Solr queries to search data in Cassandra tables.
Wrote scripts to copy data from AWS S3 to HDFS.
Ran commands to trigger EMR in case of failure.
Ran Hadoop commands as needed.
Queried Redshift tables to analyze data.
Skills: Data Warehouse, Informatica, Tableau
Worked on mini projects in ETL technology during my training period.