Yashraj Garud

Data Engineer

Pune, Maharashtra, India4 yrs 9 mos experience

Key Highlights

  • Expert in optimizing ETL workflows and data pipelines.
  • Proficient in Azure Data Factory and Spark technologies.
  • Led successful data migration projects in healthcare.
Stackforce AI infers this person is a Data Engineer specializing in SaaS and Healthcare data solutions.

Contact

Skills

Core Skills

Data EngineeringEtl ProcessesData VisualizationData MigrationData Processing

Other Skills

Apache SparkAzure Data FactoryAzure Data LakeAzure DatabricksAzure DevOpsCommunicationData IntegrityEngineeringEnglishMarketingMicrosoft AzureMicrosoft Power BIMySQLPower BIProject Management

About

Skilled in implementing efficient data ingestion processes, designing Azure Data Factory pipelines, and optimizing Spark applications using PySpark and SparkSQL for large-scale ETL transformations. Proficient in migrating legacy SSIS and SQL processes to scalable Azure solutions, leveraging Delta Lake and Databricks Delta Tables, and utilizing Azure DevOps for version control and deployment.

Experience

Atlas copco

Data Engineer

Feb 2024Present · 2 yrs 1 mo · Pune, Maharashtra, India · On-site

  • Designed and developed a Power BI dashboard for 100+ stakeholders across sourcing, logistics, and finance, streamlining data visualization and decision-making.
  • Built and monitored scalable data pipelines using Azure Data Factory (ADF), integrating data from SAP BW, SAP HANA, and Salesforce.
  • Extracted and transformed data in Azure Data Lake Storage using Spark (PySpark) and SQL, creating enriched datasets for enhanced analytics.
  • Optimized ETL workflows with caching, partitioning, and cluster optimization techniques, reducing pipeline runtimes by over 30%.
  • Migrated legacy QlikView reports to Power BI, leveraging Azure’s Medallion architecture to modernize data storage and reporting systems.
  • Managed version control and deployments with Azure DevOps, handling pull requests and migrating code across development, QA, and production environments.
Azure Data FactoryPower BISparkPySparkSQLAzure DevOps+2

Atos syntel

Data Engineer

Jul 2021Feb 2024 · 2 yrs 7 mos · Pune, Maharashtra, India

  • Worked on a migration project for a Healthcare client, ensuring seamless transition and
  • data integrity throughout the process.
  • 07/2021 – present
  • Pune, India
  • Developed a generic ingestion process for different client files, improving processing
  • time by 60% and achieving better results.
  • Gained hands-on experience working in a production environment, ensuring smooth
  • operations and data workflows.
  • Provided on-call support to both the client-side team and the onshore team, ensuring
  • effective communication and issue resolution.
  • Mentored new joiners, facilitating their onboarding and contributing to their
  • professional development.
  • Managed a team of 4 developers, coordinating their tasks, providing guidance, and
  • ensuring project success.
  • Developed Spark applications using SparkSQL in Databricks, extracting, transforming,
  • and aggregating data from multiple file formats to derive valuable business insights.
  • Demonstrated a strong understanding of Spark architecture, including Spark Core,
  • Spark SQL, Data Frames, Spark Streaming, Driver Node, Worker Node, Stages,
  • Executors, and Tasks.
  • Utilized Azure DevOps to create repositories and branches, enabling regular code pushes
  • and maintaining version control with the Databricks repository.
  • Conducted performance tuning of Spark jobs, optimizing code execution, and
  • implementing user-defined functions (UDFs) in PySpark to meet specific business
  • requirements.
  • Implemented robust logging mechanisms to capture and analyze logs, including error
  • logs, counts, user logs, and filenames. Worked on code performance improvements
  • based on the insights gained.
  • Utilized SSIS packages and SQL stored procedures in SQL Server for data processing and
  • integration tasks.
  • Collaborated on the creation of test case scenarios for the testing team, ensuring
  • comprehensive testing coverage and quality assurance.
SparkSQLAzure DevOpsSSISSQL ServerData ProcessingData Engineering+1

Education

University of Mumbai

Bachelor of Engineering - BE — Information Technology

Jan 2016Jan 2020

Jawahar Navodaya Vidyalaya - JNV

Aug 2009Aug 2016

Stackforce found 100+ more professionals with Data Engineering & Etl Processes

Explore similar profiles based on matching skills and experience