Yasar Arafath A — Data Engineer
Professional Summary: 🚀 Scaling Data Excellence through Modern Architecture
As a Principal Data Engineer with 11+ years of experience, I specialize in designing and deploying large-scale distributed data platforms that bridge the gap between raw data and high-impact AI/ML workloads. My expertise spans the entire lifecycle of the Modern Data Stack, from real-time ingestion to sophisticated lakehouse orchestration.
I have a proven track record of navigating complex multi-cloud environments (AWS & GCP), ensuring that data infrastructure is not just functional, but optimized for performance, cost, and scalability.
🛠 Core Technical Expertise:
- Data Platforms: Snowflake, Databricks, BigQuery, Redshift
- Processing & Streaming: Apache Spark (EMR/Dataproc), Kafka, SQL Optimization
- Transformation & Ops: DBT, Airflow (Cloud Composer), Glue, GCS/S3
- Next-Gen Tech: Vector Databases for AI-ready platforms & Large-scale Lakehouse design
🏆 Key Career Milestones:
- Massive Scale: Architected distributed Spark pipelines on AWS EMR processing 2+ TB of daily data, directly enabling mission-critical ML workloads.
- Cloud Migration: Led a high-stakes migration from AWS/Snowflake to GCP, moving 1+ TB of analytical datasets with zero data loss and full reconciliation.
- Efficiency & Speed: Achieved a 30-40% reduction in ETL runtime by fine-tuning Spark workloads on Databricks and EMR.
- Modern ELT: Built a robust DBT architecture with 100+ models, drastically improving data lineage and development velocity.
- Real-Time Data: Designed Kafka-based event streaming to eliminate ingestion bottlenecks and reduce data latency for downstream consumers.
- Automation: Orchestrated 50+ production pipelines via Airflow with proactive monitoring and automated alerting.
📫 Let’s Connect: I am passionate about building the next generation of data infrastructure. If you're looking to discuss Lakehouse architectures, Cloud Migrations, or AI-Ready Data Platforms, feel free to reach out!
Stackforce AI infers this person is a Data Engineering expert specializing in cloud migration and large-scale data platforms.
Location: Chennai, Tamil Nadu, India
Experience: 11 yrs 4 mos
Skills
- Data Engineering
- Cloud Migration
Career Highlights
- Architected distributed Spark pipelines processing 2+ TB daily.
- Led migration from AWS/Snowflake to GCP with zero data loss.
- Achieved a 30-40% reduction in ETL runtime by tuning Spark workloads on Databricks and EMR.
Work Experience
LTIMindtree
Senior Data Specialist (9 mos)
Senior Specialist (9 mos)
Altimetrik
Technical Lead (9 mos)
Senior Software Engineer (4 yrs 4 mos)
UST
System Analyst (1 mo)
Cognizant
Associate (5 yrs 6 mos)
Education
Bachelor of Engineering at Chettinad College of Engineering & Technology