Anirban Goswami — CTO
Anirban is a Data Engineering Leader with 14+ years of hands-on experience building high-performance, large-scale data systems. His expertise spans the full depth of the Big Data stack, and he is known for solving complex engineering problems with precision, clarity, and architectural rigor.He specializes in designing distributed data systems optimized for throughput, latency, and cost efficiency—leveraging Spark (core/SQL/structured streaming), Scala, Python, Kafka, Hive, and advanced SQL. Anirban has architected and optimized data platforms across both on-prem clusters and cloud environments (AWS, Azure), with a strong focus on reliability, partitioning strategy, storage formats, and performance engineering.His technical footprint extends deeply into the Lakehouse ecosystem, especially Apache Iceberg. Anirban has built and optimized Iceberg-based data lakes, contributed advanced partitioning and metadata strategies, and engineered scalable ingestion and merge-path solutions tailored for large enterprise workloads.He has led the end-to-end engineering of production-grade pipelines using Databricks, AWS Glue, and Informatica PowerCenter—focusing on deterministic performance tuning, cluster resource optimization, metadata management, and resilient workflow orchestration. His approach blends system-level thinking with a strong command of internals, allowing him to identify bottlenecks and design solutions that scale. Anirban’s leadership style is technical-first: he mentors engineers through code reviews, architectural deep dives, performance-debugging sessions, and hands-on design collaboration. He has trained and guided 40+ data engineers, helping them develop strong fundamentals in Spark internals, distributed computing, storage formats, and data modeling.He is also an active technical writer on Medium, sharing deep-dive articles on Spark, Iceberg, data pipelines, and distributed system design—helping advance engineering practices in the broader community. Anirban brings a rare blend of engineering depth, architectural clarity, and hands-on problem-solving that makes him a trusted technical leader in modern data engineering.
Stackforce AI infers this person is a Data Engineering Leader with expertise in Big Data and Lakehouse architectures.
Location: Hyderabad, Telangana, India
Experience: 14 yrs 9 mos
Skills
- Big Data
- Generative Ai
Career Highlights
- 14+ years in data engineering leadership.
- Expert in architecting scalable data systems.
- Mentored over 40 data engineers in advanced technologies.
Work Experience
Apple
Engineering Leader (7 yrs 4 mos)
PwC India
Senior Data Engineer (1 yr 8 mos)
FICO
Data Engineer (10 mos)
American Express
Data Engineer (2 yrs)
Infosys
Lead Data Engineer (2 yrs 11 mos)
Education
Bachelor of Technology (B.Tech.) at Heritage Institute of Technology