Atul Matvar

Data Engineer

Delhi, Delhi, India · 9 yrs 6 mos experience
Highly Stable

Key Highlights

  • Improved query performance by 40% using Trino.
  • Reduced cluster overhead by 30% through optimized ETL execution.
  • Delivered trusted KPI datasets for business dashboards.
Stackforce AI infers this person is a Data Engineer specializing in Fintech data platforms and cloud architecture.

Skills

Core Skills

Data Platform Engineering · Lakehouse Architecture · Cloud-native Analytics · Data Engineering · Cloud Computing · Analytics

Other Skills

AWS Glue · AWS Lambda · Amazon Web Services (AWS) · Apache Iceberg · Apache Spark · Big Data · Big Data Analytics · Business Analytics · Business Intelligence (BI) · Communication · Cross-functional Team Leadership · Data Analysis · Data Analytics · Data Management · Data Quality

About

I’m a Data Engineer / Data Platform Engineer with 9+ years of experience building scalable Big Data and Cloud Data Platforms in FinTech and enterprise analytics domains. I specialize in designing modern Lakehouse architectures, building high-performance ETL/ELT pipelines, and delivering analytics-ready datasets that power business-critical dashboards, KPI reporting, and product decision-making.

I have worked in high-scale environments at One97 Communications (Paytm) and Paytm Payments Bank, where I gained hands-on exposure to enterprise-grade data ecosystems, governance frameworks, and large-scale reporting platforms.

Currently, I’m working at Algoworks Technology Pvt. Ltd., where I design and implement an AWS Lakehouse platform using Apache Iceberg on S3 following a Medallion Architecture (Bronze/Silver/Gold). I integrate complex data sources like OneStream (REST APIs), JDE, and TMS, and deliver curated Gold datasets for enterprise analytics and Power BI reporting.

What I Do Best

✅ Data Platform Engineering & Lakehouse Architecture
✅ Building scalable ingestion + transformation frameworks
✅ Batch + incremental pipeline design using Spark
✅ Data modeling, optimization & governance-ready datasets
✅ Cloud-native analytics on AWS (Glue, Athena, S3, Redshift Spectrum)
✅ Data Quality, reconciliation & monitoring frameworks

Tech Stack

Python | SQL | Apache Spark | PySpark | Spark SQL | AWS Glue | S3 | Athena | Redshift Spectrum | Iceberg | Kafka | Hadoop | Hive | Trino

Key Highlights

  • Improved query performance by 40% by driving migration and optimization using Trino.
  • Reduced cluster overhead by 30% through optimized ETL execution and tuning.
  • Supported migration of workloads from on-prem to AWS, improving scalability and cost efficiency.
  • Delivered trusted KPI datasets powering business dashboards through strong reconciliation & validation frameworks.

Career Goal

I’m passionate about building scalable, reliable, and governed data platforms that enable organizations to unlock real-time insights and faster decision-making. I’m particularly interested in working with product-based, data-driven organizations where data is treated as a core product asset.

Email: atulmatvar2@gmail.com

Open to connecting with professionals in Data Engineering, Data Platforms, Cloud Architecture, and Analytics Engineering.

Experience

Algoworks

AWS Data Architect

Jan 2026 - Present · 2 mos · Noida, Uttar Pradesh, India · Remote

  • Architecting a scalable data ingestion framework for OneStream Data Cube (REST APIs) with schema validation, error handling, and standardized transformations.
  • Building and optimizing AWS Glue + PySpark ETL pipelines supporting incremental processing and Iceberg-based Lakehouse storage.
  • Integrating multiple enterprise systems (JDE, TMS, OneStream) into a centralized AWS data platform, ensuring consistent modeling and governance readiness.
  • Designing curated Gold Layer datasets to support analytics consumption through Athena / Redshift Spectrum, enabling Power BI dashboards with improved reporting accuracy.
  • Implementing end-to-end data quality checks, a reconciliation framework, and monitoring to ensure trusted KPI reporting for business stakeholders.
Apache Iceberg · Apache Spark · Data Platform Engineering · Lakehouse Architecture

Paytm Payments Bank

3 roles

Technical Lead

Promoted

Apr 2023 - Jan 2026 · 2 yrs 9 mos · Noida, Uttar Pradesh, India

  • Built and maintained large-scale data pipelines supporting fintech reporting, analytics, and operational dashboards.
  • Worked on enterprise data platforms leveraging Spark, Hadoop, Hive, and SQL, enabling scalable batch processing and optimized query execution.
  • Supported cross-functional analytics requirements by delivering structured datasets for finance, compliance, and business reporting teams.
  • Performed query tuning, partitioning strategies, and performance optimization to improve reporting SLA adherence.
  • Contributed to platform modernization initiatives, including cloud adoption and distributed query performance improvements.
AWS Glue · Technical Project Leadership · Data Engineering · Cloud Computing

Senior Data Engineer

Promoted

Nov 2020 - Apr 2023 · 2 yrs 5 mos · Noida, Uttar Pradesh, India

Amazon Web Services (AWS) · Docker

Data Engineer

Oct 2019 - Nov 2020 · 1 yr 1 mo · Noida, Uttar Pradesh, India

Data Visualization · NumPy

One97 Communications Limited

Data Engineer

Aug 2016 - Sep 2019 · 3 yrs 1 mo · Noida Area, India

Data Warehousing · Analytics

Education

Liverpool John Moores University

Master of Science - MS — Data Science

Sep 2022 - Feb 2025

International Institute of Information Technology Bangalore

Executive PG in Data Science — Deep Learning

Aug 2022 - Sep 2023

Karnataka State Open University, Mysore

Bachelor of Science (B.Sc.) — Information Technology

Jan 2011 - Jan 2015
