Sushma Jadhav

Associate Consultant

Pune, Maharashtra, India · 6 yrs 4 mos experience
Most Likely To Switch · AI Enabled

Key Highlights

  • Led successful data-driven projects in fast-paced environments.
  • Enhanced data validation accuracy by 40% in analytics.
  • Reduced processing times by 60% through pipeline migration.
Stackforce AI infers this person is a Data Engineering expert with extensive experience in cloud-based data solutions.

Skills

Core Skills

Data Engineering, Cloud Computing, ETL Processes, Data Quality

Other Skills

Databricks, PySpark, Spark SQL, Data Modeling, AWS, Data Lake/Lakehouse, GitLab, ETL, Data Warehouse, Automated Data Cleansing, Airflow, Python, Oracle Database, Recommendation Engine, Natural Language Processing

About

Senior Data Engineer with 5+ years of experience in building and optimizing scalable data pipelines and architectures. Proficient in working with technologies such as SQL, Python, Spark, Databricks, and cloud platforms (AWS, Azure). Adept at transforming raw data into valuable insights by implementing efficient ETL processes, data warehousing, and automation solutions. Strong problem-solving skills with a focus on improving data quality, optimizing performance, and ensuring reliable data delivery. Experienced in leading teams, collaborating cross-functionally, and driving successful data-driven projects in fast-paced environments.

Experience

Total Experience: 6 yrs 4 mos
Average Tenure: 3 yrs 2 mos
Current Experience: 4 yrs 5 mos

ZS

2 roles

Business Technology Solutions Associate Consultant

Promoted

Jun 2023 - Present · 3 yrs

  • Role: Senior Data Engineer
  • Tech Stack: Databricks, PySpark, Spark SQL, Data Modeling, AWS, Data Lake/Lakehouse, GitLab
  • Led a team of 4 and handled cross-functional collaboration, ensuring seamless project execution.
  • Designed and implemented a high-volume data pipeline to process structured and semi-structured data and built a data lakehouse.
  • Managed 50+ deliverables monthly, ensuring 100% on-time delivery and high-quality results.
  • Enhanced data validation accuracy by 40%, reducing defects in downstream analytics.
  • Utilized GitLab for version control and collaboration across teams, ensuring smooth integration and deployment of code changes with minimal conflicts and delays.
  • Led the migration of data pipelines to Databricks and implemented Databricks SQL, reducing processing times by 60% and cutting compute costs by 30%.
Databricks, PySpark, Spark SQL, Data Modeling, AWS, Data Lake/Lakehouse, +3 more

Senior Business Technology Solutions Associate

Jan 2022 - Jun 2023 · 1 yr 5 mos

  • Developed an ETL process using Spark SQL to clean and transform data from multiple sources and load it into a data warehouse.
  • Developed an automated data cleansing process to reduce manual errors and improve data quality.
  • Automated the report generation process using PySpark.
  • Created an end-to-end data load pipeline using Databricks and Airflow.
  • Converted native data to the OMOP CDM for multiple claims and EHR datasets.
  • Led a team of 4 people.
  • Migrated OMOP conversion pipelines from SQL to Databricks (Delta implementation), reducing processing time by 40%.
Spark SQL, ETL, Data Warehouse, Automated Data Cleansing, Databricks, Airflow, +2 more

Capgemini

3 roles

Associate Consultant

Oct 2021 - Dec 2021 · 2 mos

  • Cleaned, standardized, and transformed unruly raw data with a custom-built ETL application written in Python. Generated exception reports based on set criteria and provided them to business users for further assessment.
  • Developed Python scripts to extract, transform, and load data into the corresponding tables in an Oracle database. Studied the entity-relationship diagram and built table relations accordingly. Built a recommendation engine to suggest details such as RCA, turnaround time, and action taken for each exception. Developed a Python script that updates existing exceptions in the database to support a feedback loop.
Python, ETL, Oracle Database, Recommendation Engine, Data Engineering

Senior Analyst

Promoted

Oct 2020 - Sep 2021 · 11 mos

Python, Natural Language Processing, Azure Databricks, Data Engineering

Analyst

Sep 2019 - Sep 2020 · 1 yr

  • Skills: AWS Redshift, AWS CloudWatch
  • Worked on syncing data across environments, handling blocking queries, unloading data, monitoring CloudWatch and setting alarms, granting user access, and cleaning up backup tables.
  • Skills: Hadoop, HDFS, AWS S3, AWS EMR, Oozie
  • Orchestrated Hadoop jobs with Oozie.
  • Ran queries using Apache Solr to search data in Cassandra tables.
  • Wrote scripts to copy data from AWS S3 to HDFS.
  • Ran commands to trigger EMR in case of failure.
  • Ran Hadoop commands as needed.
  • Queried Redshift tables to analyze data.
  • Skills: Data warehouse, Informatica, Tableau
  • Worked on mini projects using ETL technology during my training period.
AWS Redshift, AWS CloudWatch, Hadoop, Informatica, Tableau, Data Engineering

Education

Pune Institute of Computer Technology

Bachelor of Engineering — Electronics and Telecommunications

Jan 2016 - Jan 2019

K.T.E.S English Medium School

Jan 2013 - Present
