Santanu Kumar Sahu

Associate Consultant

Pune, Maharashtra, India5 yrs 8 mos experience
Highly StableAI Enabled

Key Highlights

  • Proven expertise in cloud data migration and engineering.
  • Successfully led multiple high-priority data projects.
  • Snowflake Snow-Pro Core Certified professional.
Stackforce AI infers this person is a Data Engineering specialist with extensive experience in cloud migration and analytics across various industries.

Contact

Skills

Core Skills

Cloud MigrationData EngineeringData AnalyticsData MigrationCloud SolutionsPerformance Optimization

Other Skills

AWSAWS Boto3AWS DMSAWS GlueAWS LambdaAWS Step FunctionsAgile MethodologiesAmazon AthenaAmazon EC2Amazon KinesisAmazon RedshiftAmazon Simple Notification Service (SNS)Amazon Web Services (AWS)Apache AirflowApache Kafka

About

Versatile and solution-oriented Data & Gen AI Developer, Snowflake Snow-Pro Core Certified, Associate in General Insurance (AINS) 101 Certified, with more than 5.10+ years of experience in Analysis, Design, Data Modeling, Data Governance, ETL, Development, Implementation, Testing, Bigdata Analytics and maintenance of Data warehouse. Currently in Senior Data Engineer role to design & develop a solution for building Data lake/ Data Warehouse by using required tools & technologies in multiple use-cases within a project. Proven ability to manage multiple simultaneous high-priority tasks within tight deadlines while maintaining the highest quality. Technology Stack: Cloud Platforms: • Amazon Web Services (AWS) • Google Cloud Services (GCP) AWS Services: • S3 • Airflow • Lambda • Athena • IAM • EC2 • SES • SNS • OpenSearch Service • GraphQL API • Boto3 • Step Function • EMR GCP Services: • GCS • Cloud function • GCP VM • GCP DataProc • GCP Cloud Composer • GCP Data Fusion Data Warehouses: • Snowflake • AWS Redshift • GCP Bigquery ETL: • Informatica Cloud (IICS) • Databricks • AWS Glue • GCP Data Proc ML & AI Services : • Snowflake Cortex AI • GCP Vertex AI • AWS Sagemaker • AWS Tensorflow Data Modeling: • Erwin Data Modeler BI & Visualization: • PowerBI Languages and Skills: • SQL • NoSQL • Scala • Python • Pyspark • Pandas Bigdata Tools: •Apache Spark •Apache Spark Streaming •Map Reduce •Hadoop •Hbase •Sqoop •Hive •Cassandra Methodologies: • Agile

Experience

5 yrs 8 mos
Total Experience
3 yrs 6 mos
Average Tenure
2 yrs 1 mo
Current Experience

Genpact

Lead Consultant

May 2024Present · 2 yrs 1 mo · Bengaluru, Karnataka, India · Hybrid

Quantiphi

3 roles

Senior Data Engineer

Promoted

Jul 2022May 2024 · 1 yr 10 mos

  • Experienced Senior Data Engineer | Cloud & Data Migration Specialist
  • Pharmaceutical Industry Snowflake Migration:
  • Successfully orchestrated a seamless transition from on-premises systems to the Snowflake cloud platform.
  • Utilized Python and Pandas with AWS boto3 for data ingestion, ensuring robust error-handling and data quality.
  • Collaborated on Snowflake SQL & Python Stored Procedures, enabling efficient data transfers with historical and Change Data Capture (CDC) capabilities.
  • Developed Python scripts for Salesforce CRM Postgres & Source File based data validation.
  • Managed complex cloud data migration and data modeling projects within Snowflake's data warehousing system.
  • Fortune 100 Insurance Company Data Analytics (Western Europe):
  • Led a large-scale cloud migration initiative for a Fortune 100 Insurance Company.
  • Conducted technical analysis for legacy systems hosted across diverse platforms.
  • Implemented data models across critical domains like Policy, Quotes, Pricing, Finance, Billing, and Claims.
  • Spearheaded the development of ETL pipelines using Step function, Athena , Redshift ,Glue, IICS, Python, and SQL, ensuring a seamless data migration.
  • Managed key aspects of the project, including effort estimation, agile project management, and provided mentorship to team members.
  • Fortune 100 Retail Company Data Migration (USA):
  • I contributed the data migration from Legacy Source and Redshift to Snowflake. This involved converting Redshift stored procedures to Snowflake SQL and developing new stored procedures to implement business logic within the Snowflake environment.
  • I built CDC pipelines to ingest data from S3 Parquet files into Snowflake’s ODS layer using JavaScript-based stored procedures. Dimension and Fact procedures were restructured to align with Snowflake standards. I performed extensive unit and blank testing across development and upper environments, resolved bugs, and validated data accuracy between source and target systems.
PythonPandasAWSSnowflakeSQLETL+4

Framework Engineer (Cloud Data Engineer)

Sep 2020Jun 2022 · 1 yr 9 mos

  • Worked as a Framework Data Engineer in Western Europe region for a Fortune 100 Insuarance & Financial company for its Data Analytics project:
  • Worked in different use-cases of Pricing,Rating,Policy,Quotes,Billing & Claims domain
  • Contribute in developing a global data lake platform soultion on AWS cloud
  • Design ETL flow for the data being ingested in the data platform using Glue & Python .
  • Design Data Model and deploy on Redshift data warehousing system
  • Develop the ETL based on business logic as per use cases and populate the DWH
  • Involved in preparing unit test cases and execution.
  • Analyzing the day to day Datalake production defects/bugs.
  • Fix the problem and test the Solution with all the required functionality.
  • Preparing and reviewing the documents related to the project
  • Created design documents - Data Flow Diagram, Data Model creation and Data Mapping Sheet
  • Insurance Company Data Migaration (USA)
  • Client is a global leader in risk management solutions, aimed to optimize complex SQL stored procedures that previously took 5–10 hours to execute in Azure/SQL Server. The objective of this project was to re-engineer and enhance query performance within the Snowflake environment.
  • I led the optimization efforts by rewriting and tuning the SQL logic for Snowflake, significantly reducing execution time. To support secure data transfer, I worked extensively with AWS services including SFTP, S3, EC2, and IAM to move gzip files from the client’s location into our environment and then into Snowflake.
  • Additionally, I developed Python scripts using the Snowflake Python connector to perform various operations (SELECT, UPDATE, DELETE) within Snowflake. I also integrated R Studio with Snowflake via the R connector, enabling seamless data extraction and pushing results back into Snowflake for further analysis.
AWSGluePythonData ModelingETLData Engineering+1

Framework Engineer - Intern

Jan 2020Aug 2020 · 7 mos

  • Google Cloud EDW DWH Modernaization :-
  • This project was focused on migration of data from various sources like Redshift, Teradata, Oracle, SQL Server, Netteza and many others .
  • ● Worked on the Setting up the source from scratch and populating with data of various sizes .
  • ● Deploying scripts the automated migration of data to bigquery by the following approaches : GCP'S Cloud Data Fusion GCP'S DataFlow GCP'S DataProc

Stackforce found 100+ more professionals with Cloud Migration & Data Engineering

Explore similar profiles based on matching skills and experience