Suvin Shah — Data Engineer
Senior Data Engineer with 10 years building production-grade data platforms across financial services. I design, optimize, and scale the pipelines that turn raw data into governed, trusted assets, from real-time streaming ingest to terabyte-scale batch ETL.
What I build:
➔ Cloud-native ETL/ELT pipelines on AWS (Glue, Redshift, S3, EMR, Lambda, Kinesis) and Snowflake
➔ Real-time streaming architectures with Kafka, Spark Structured Streaming, and Delta Lake
➔ Orchestration frameworks using Airflow and dbt for reproducible, tested transformations
➔ Data warehouse and lakehouse designs serving analytics, ML, and regulatory reporting
Impact at scale:
➔ Automated 1TB+ daily ETL pipelines, cutting runtime by 40% and shifting reporting from weekly to daily
➔ Unified customer datasets across 8 global business units, driving $3.5M in retention revenue
➔ Built an XGBoost fraud detection model achieving 99.9% accuracy on credit card transactions
➔ Reduced infrastructure costs through Spark tuning and cloud-native optimization
Stack: Python · SQL · PySpark · Apache Spark · Kafka · Airflow · dbt · AWS (Glue, Redshift, S3, EMR, Lambda, Kinesis) · Snowflake · Databricks · GCP (BigQuery) · Terraform · Docker · Tableau · Power BI
Credentials: M.S. Data Science, Pace University · AWS Cloud Practitioner · 10K+ LinkedIn followers
Open to Senior Data Engineer, Staff Data Engineer, Analytics Engineer, and Data Analyst roles.
Let’s talk: suvin.shah94@gmail.com
Stackforce AI infers this person is a Fintech Data Engineer specializing in scalable data pipelines and analytics platforms.
Location: San Jose, California, United States
Experience: 9 yrs 11 mos
Skills
- Data Engineering
- ETL Pipelines
- Data Visualization
- Data Quality Management
- Machine Learning
- Data Analysis
Career Highlights
- Automated 1TB+ daily ETL pipelines, cutting runtime by 40%.
- Unified customer datasets, driving $3.5M in retention revenue.
- Built fraud detection model achieving 99.9% accuracy.
Work Experience
Citi
Senior Data Engineer (2 yrs 11 mos)
Data Engineer Intern (9 mos)
Radix Software Services Pvt Ltd
Data Engineer (2 yrs 8 mos)
SoftAge Information Technology Limited
Data Analyst (2 yrs 9 mos)
DataCrops Software Private Limited
Data Analyst (1 yr 7 mos)
Education
Master's at Pace University - Seidenberg School of Computer Science and Information Systems
BE - Bachelor of Engineering at Silver Oak University