Shashwata Saha

Data Engineer

4 yrs 8 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in Data Engineering and ETL processes.
  • Proficient in building scalable data pipelines.
  • Strong background in frontend development with React.js.
Stackforce AI infers this person is a Data Engineering and Frontend Development specialist in the SaaS industry.

Contact

Skills

Core Skills

Data EngineeringSnowflakeReact.jsData ScienceFrontend DevelopmentSalesforce DevelopmentBackend Development

Other Skills

AWSAWS AthenaAWS GlueAgile MethodologyAirflowAmazon AthenaAmazon Web Services (AWS)ApacheApache AirflowApache SparkApexArtificial Intelligence (AI)Azure DatabricksBig DataCascading Style Sheets (CSS)

About

Practitioner of Clean Code, SOLID Principles, and Test Driven Development (TDD). Experienced in Data Engineering building end-to-end pipelines using Apache Spark, Pandas, Python, SQL, NoSQL, AWS services(S3, Glue, Athena, Lambda), and Frontend technologies React.js, Router, and Redux. I also worked on Java Backends and understanding CI/CD pipeline docker, K8s, Terraform, and Github Workflows. DISCLAIMER: WHATEVER I SAY OR POST IS SOLELY AND FULLY MY OWN PERSONAL OPINION. I DON'T REPRESENT ANY ORGANISATION OR BODY.

Experience

4 yrs 8 mos
Total Experience
1 yr 9 mos
Average Tenure
1 yr 2 mos
Current Experience

Microsoft

SDE 2 (Data Engineer)

Apr 2025Present · 1 yr 2 mos · Hyderabad, Telangana, India · Hybrid

  • Ingesting Knowledge Data into Copilot for Support portals

Thoughtworks

3 roles

Senior Consultant Developer

Aug 2024Apr 2025 · 8 mos

  • Ingested 120M+ rows from 75+ different distributed databases/node sources, to centralized snowflake DBs, per week.
  • Ideated and Implemented migration from legacy Autosys job orchestration to Airflow. Integrated legacy containers with
  • Airflow to streamline the flow of alerts and logs from the jobs to other consumer systems.
  • Spearheaded ServiceNow integration with data pipelines with OAuth2.0 to directly load to Snowflake, eliminating
  • redundant storage layers and reducing overall processing time by 30%.
  • Reduced snowflake query time by 20% by clustering keys and helping data to be pruned for often-used queries.
Data EngineeringSnowflakeAirflowOAuth2.0Data PipelinesQuery Optimization

Consultant Developer

Promoted

Aug 2023Aug 2024 · 1 yr

  • Data Product/Mesh Creator Pack
  • Worked on building an end-to-end ETL data pipeline for data products in a data mesh environment.
  • Pipeline with quality checks, transformers, and input-output ports for abstracting the file system complexity leveraging Pyspark (Spark), Python, and AWS.
  • Secret managers for AWS and alerting capabilities on quality check failure and Pipeline failure
  • Checkpointing for storing metadata about job runs (avoid processing of already processed data), and output data in AWS Glue tables(catalog).
  • Native query system using AWS Athena.
  • Moreover building loggers
  • Future Trends Prediction Model from Data Mesh
  • Led the Predictive Analytics POC for clients using pandas, and scikit-learn Python packages.
  • Derived useful trends from unorganized data while EDA, to filter out potential models.
  • Trained multiple multistage models and Predicted Future trends for the business.
  • Engineered future data when data wasn’t present.
  • Access Manager Frontend
  • Identified and Designed unit components to reuse to reduce code duplicity.
  • Designed State Modeling and Managed single state of truth which creates other derived states using side-effects
  • Build modular and scalable Error handlers ensuring Consistency around the app.
  • Build modular, scaleable, and Performant Request handlers that are independent of tech stacks, pre-load the data to negate rerender cycles, and Improve Performance.
  • Override and unit test MUI building block components for custom requirements.
  • Contributed to multiple product design, and scope creation activities with clients, and experience-designers to understand the requirement and evolve the product from a technical feasibility lens, to reduce the feedback loops and faster delivery.
  • Tech Used: React.js, React Router, Material UI, TypeScript, RTL, Jest
  • Leadership Activities
  • Anchored Community within organization and held multiple sessions across the community
  • Drove more Engagement by introducing new topics
ETLPysparkPythonAWSReact.jsTypeScript+1

Consultant Graduate Developer

Aug 2022Aug 2023 · 1 yr

Cognizant

Programmer Analyst Trainee

Sep 2021Jul 2022 · 10 mos · India

  • Development of CRM Components in Salesforce Health Cloud
  • Developed Components using built-in Admin tools, Flows, and Lightning Web Components.
  • Understood and Modified existing Apex code and Triggers for automation according to new requirements
  • Coordinated with Deployment teams to deploy changes to Pre-Production environments with Copado
  • Involved in new Production version releases.
  • Used Tools like Translation Workbench and Data Loader for version releases
  • Unit tested user stories and Documented Changes in Microsoft Office Tools
  • Followed the Agile Methodology in the entire process
SalesforceApexAgile MethodologySalesforce Development

Nichesoft inc

Intern

May 2021Sep 2021 · 4 mos · Bengaluru, Karnataka, India

  • Created API using Python Flask, Pymongo(MongoDB), Threads.
  • Grafana Plugin Development using React.js[JSX,TSX].
  • Designed UIs and Logos of Mobile apps using Figma.
PythonFlaskMongoDBReact.jsBackend Development

Zero dollar security

Graphics Designer

May 2020Aug 2020 · 3 mos · India

Education

RCC Institute of Information Technology

Bachelor of Engineering - BE

Jan 2017Jan 2021

Stackforce found 100+ more professionals with Data Engineering & Snowflake

Explore similar profiles based on matching skills and experience