Tanya Diwan — Data Engineer
I design and build cloud-native data pipelines and dimensional models that power retail analytics across 7 countries, working end-to-end from business requirement to deployed pipeline.
At Capgemini, my core work spans:
- ELT pipeline development on AWS (Glue, S3, Lambda) using PySpark and Python
- Unified MDM solution consolidating 7 regional retail sources into one global data model
- 50+ SQL-based data quality checks covering anomaly detection, business rule validation, and completeness testing
- Fact and dimension table modelling using star schema in Snowflake
- SQL KPI queries powering 30+ production dashboard pages used by global business teams
- Automated report refresh via Snowflake stored procedures, improving data freshness
Core stack: Python, PySpark, Advanced SQL, AWS (Glue, S3, Lambda), Snowflake, Spark SQL
Currently learning: dbt, Apache Airflow, Databricks, Kafka
Stackforce AI infers this person is a Data Engineer specializing in Retail Analytics with expertise in cloud-native data solutions.
Location: Bengaluru, Karnataka, India
Experience: 2 yrs 5 mos
Skills
- Data Warehousing
- Python
- AWS
- SQL
- Snowflake
- PySpark
- Apache Airflow
Career Highlights
- Expert in building cloud-native data pipelines.
- Proficient in AWS services and Snowflake.
- Strong background in data quality and analytics.
Work Experience
Deloitte
Data Engineer I (0 mo)
Capgemini
Senior Analyst (10 mos)
Analyst (1 yr 7 mos)
Education
Bachelor of Technology - BTech at Lakshmi Narain College of Technology, Kalchuri Nagar, Raisen Road, Post Kolua, Bhopal-462021