Shubham Sagar

Associate Consultant

Delhi, India5 yrs 6 mos experience
Highly StableAI ML Practitioner

Key Highlights

  • 5+ years of experience in data engineering.
  • Achieved 90th percentile in GATE examination.
  • Designed scalable data solutions at EY.
Stackforce AI infers this person is a Data Engineering and Automation specialist in the IT Services industry.

Contact

Skills

Core Skills

Data EngineeringData TransformationAutomationSoftware DevelopmentWeb Development

Other Skills

Microsoft FabricPySparkSQLDatabricksData CleaningPythonPowerShellGroovyREST APIsBashHTML5GitHub CopilotMicrosoft CopilotCopilotAzure Databricks

About

As a Computer Science graduate from Netaji Subhash Engineering College with a DGPA of 8.77, I developed a strong foundation in technology and data analysis, earning a 90th percentile score in the GATE examination. I am passionate about problem-solving, exploring data-driven solutions, and leveraging automation to drive operational efficiency. With over 5+ years of experience in data engineering, I specialize in building scalable and high-performance data pipelines, data cleaning, transformation, reconciliation, and data modeling. At Ernst & Young (EY), I focus on designing data solutions that enhance accuracy, consistency, and accessibility, using tools like Microsoft Fabric, PySpark, Databricks, and AWS. I have implemented Late Arriving Dimensions (LAD), Slowly Changing Dimensions (SCD), and automated data reconciliation strategies, optimization ensuring that datasets are clean and ready for business intelligence. Previously, as a Senior Python Developer at TATA Consultancy Services (TCS), I developed automation solutions to streamline data integration, cleaning, and transformation for mission-critical AIOps projects. I utilized Python, SQL, and PySpark to reduce manual efforts by up to 95%, greatly improving data processing efficiency and operational workflows. Key Skills and Technologies: • Data Engineering: ETL, Data Transformation, Cleansing, and Reconciliation • Data Modeling: SCD Type 1 & Type 2, LAD, Dimensional Modeling • Technologies & Tools: Microsoft Fabric, PySpark, Databricks, AWS (S3, Lambda, Glue, EC2), Delta Lake, SQL, PLSQL • Automation: Python, Pandas, NumPy, REST APIs, Cloud Data Pipelines • Big Data Processing: PySpark, Data Pipeline Optimization, Performance Tuning • Data Integration: API Integration, SQL Query Optimization, Data Wrangling • Tools: Databricks, PowerShell, ServiceNow, Tableau I am deeply committed to driving innovation at the intersection of AI, data engineering, and automation, creating impactful solutions that enhance business performance and enable actionable insights. Always eager to learn, I look forward to contributing to cutting-edge projects and expanding my skill set in data architecture and AI-driven solutions. Feel free to connect or reach out if you'd like to collaborate or exchange ideas on AI, data engineering, and automation! You can contact me at shubhamthrills@gmail.com.

Experience

Ey

Senior Consultant - Artificial Intelligence & Data Analytics

Feb 2025Present · 1 yr 1 mo · Delhi, India · On-site

  • At EY, I led the design and implementation for US based Insurance company of data engineering solutions, with a strong focus on data quality, transformation, and performance optimization. Leveraging technologies such as Microsoft Fabric, PySpark, SQL, and Databricks, I developed scalable data pipelines and architectures that supported complex business analytics and decision-making processes. Key responsibilities and achievements included:
  • o Data Cleaning & Transformation: Engineered ETL workflows using PySpark and Microsoft Fabric to clean, transform, and structure large datasets for downstream analytics and reporting, ensuring data quality and consistency.
  • o Data Validation: Implemented automation validation processes, ensuring accuracy and integrity of data across multiple systems by validating all the tables over various factor and exporting the consolidated report to the lakehouse
  • o Late Arriving Dimensions (LAD): Utilized multiple approaches to manage and process late-arriving data, ensuring that late records were accurately integrated into data pipelines without compromising data integrity.
  • o Slowly Changing Dimensions (SCD): Implemented both SCD Type 1 and Type 2 strategies for dimension tables, enabling efficient historical data tracking and supporting time-sensitive analytics.
  • o Implemented soft delete logic for fact tables during incremental loads by joining with delete and dimension tables to identify records to deactivate, and using the merge method to update corresponding fact table entries accordingly.
  • o Designed and implemented an optimization script leveraging techniques like V-Order, Z-Order, and Vacuum to enhance query performance and storage efficiency in large-scale data pipelines.
Microsoft FabricPySparkSQLDatabricksData CleaningData Transformation+1

Tata consultancy services

3 roles

I.T. Analyst

Promoted

Feb 2024Feb 2025 · 1 yr · On-site

  • Working as Senior Python Developer for Digitate, a product-based venture of Tata Consultancy Services under Digital Cadre
  • As a Senior Developer, I lead the development efforts for our in-house product, Digitate – Ignio. I focus on enhancing Ignio's
  • functionality by developing optimized scripts aimed at improving efficiency and adding generic features to the product.
  • During my tenure, I engaged with diverse stakeholders, understood industry challenges, and conducted detailed process analyses
  • to identify key pain points. I spearheaded the design and implementation of optimal solutions, overseeing the lifecycle from
  • planning to deployment. This enhanced capabilities and delivered practical solutions for clients. Below are a few instances:
  • 1. SAP Success Factor: Designed and developed a scalable and reusable script that integrates the SAP SuccessFactors portal, utilizing
  • Selenium scripts to retrieve service health data. This optimized solution yielded an impressive 96% reduction in operational efforts,
  • empowering real-time anomaly detection and substantially minimizing downtime.
  • 2. SharePoint Domain Whitelisting: Designed a scalable script that helps in seamlessly integrate with SharePoint portals using
  • PowerShell scripting, automating the domain whitelisting process in strict adherence to organizational protocols. This deployment
  • resulted in a remarkable 97% reduction in manual workload while efficiently managing multiple domains with precision.
  • 3. Patching: Developed an optimized & reusable script for patching multiple servers and databases concurrently for various
  • technologies over 7K devices using Python, Java, Groovy and PowerShell scripting. This optimized workflow resulted in 99%
  • reduction in human effort, errors, and manual time, significantly enhancing operational efficiency and overall system performance.
PythonSQLAutomationPowerShellGroovyData Engineering

Systems Engineer

Jul 2021Feb 2024 · 2 yrs 7 mos · On-site

  • As a Senior Developer, I lead the development efforts for our in-house product, Digitate – Ignio. I focus on enhancing Ignio's
  • functionality by developing optimized scripts aimed at improving efficiency and adding generic features to the product.
  • During my tenure, I spearheaded the design implementation of optimal solutions, overseeing the lifecycle from planning to deployment. This enhanced capabilities and delivered practical solutions for clients. Below are a few instances:
  • 1. SharePoint Domain Whitelisting: Designed a scalable script that helps in seamlessly integrate with SharePoint portals using
  • PowerShell scripting, automating the domain whitelisting process in strict adherence to organizational protocols. This deployment
  • resulted in a remarkable 97% reduction in manual workload while efficiently managing multiple domains with precision.
  • 2. Patching: Developed an optimized & reusable script for patching multiple servers and databases concurrently for various
  • technologies over 7K devices using Python, Java, Groovy and PowerShell scripting. This optimized workflow resulted in 99%
  • reduction in human effort, errors, and manual time, significantly enhancing operational efficiency and overall system performance.
PythonPowerShellREST APIsSoftware DevelopmentAutomation

Assistant System Engineer - Trainee

Sep 2020Jul 2021 · 10 mos · On-site

  • As a Developer, I lead the development efforts for our in-house product, Digitate – Ignio. I focus on enhancing Ignio's
  • functionality by developing optimized scripts aimed at improving efficiency and adding generic features to the product.
  • During my tenure, I spearheaded the design implementation of optimal solutions, overseeing the lifecycle from planning to deployment. This enhanced capabilities and delivered practical solutions for clients. Below are a few instances:
  • 1. Event Management and Health Check: Designed an optimized and reusable script to automatically log an incident in the client's
  • ITSM tool upon detecting sustained spikes in CPU, memory, disk usage, or any irregularities observed over various servers,
  • databases, storage, and network devices using Python, PowerShell, Bash, Groovy, and REST API programming. This deployment
  • streamlined operational efficiency by diminishing manual intervention and time consumption by an impressive 95%.
  • 2. User Account Management: Designed an optimized, scalable and reusable script consisting of password reset, user reactivation, and
  • user creation in the database using PL/SQL programming. This deployment led to a remarkable 98% reduction in manual workload,
  • accomplishing the entire process within a 5-minute timeframe.
PythonPowerShellBashREST APIsSoftware DevelopmentAutomation

Aj business group pvt. ltd.

2 roles

Front-End Web Developer

Dec 2018Jan 2019 · 1 mo · New Delhi Area, India

  • Worked as a Front End Web Developer on a project "AJ Print World" (www.ajprintworld.com) during 20th Dec 2018 to 20th Jan 2019.
  • Modified the complete website using WordPress within a week.
HTML5Web Development

Front-End Web Developer

Jun 2018Dec 2018 · 6 mos · New Delhi Area, India

  • Worked as a Front End Web Developer on a project "AJ News" (www.ajnewscast.com) during 1st June 2018 to 30th July 2018(Summer Internship).
  • Developed the complete Website using WordPress within a week.
HTML5Web Development

Education

Netaji Subhash Engineering College

Bachelor of Technology - B.Tech — Computer Science and Engineering

Jan 2016Jan 2020

Stackforce found 100+ more professionals with Data Engineering & Data Transformation

Explore similar profiles based on matching skills and experience