Surbhi Gusain — Data Engineer
Data Engineer with expertise in designing and operating large-scale, distributed data pipelines. Skilled in Scala and Python, I build batch and workflow-driven pipelines using Apache Beam, Spark, and Airflow on GCP (Dataflow, Dataproc, BigQuery), with a focus on data correctness, performance optimization, and observability. I have experience with end-to-end pipeline ownership, covering schema design, transformation, deployment, and monitoring, as well as with designing scalable authentication and authorization systems for web and mobile applications across global markets. I hold a B.Tech in Machine Learning and have completed an ML internship, and I continue to explore AI/ML to integrate intelligent solutions into robust, data-driven systems.
Key Skills: Scala • Python • Apache Beam • Spark • Apache Airflow • GCP • BigQuery • Distributed Systems • Data Pipelines • Performance & Reliability • Machine Learning & AI
Stackforce AI infers this person is a Data Engineer with strong expertise in Machine Learning and Cloud Technologies.
Location: Pune, Maharashtra, India
Experience: 2 yrs 9 mos
Skills
- Apache Beam
- Google Cloud Platform (GCP)
- Java
- Systems Design
- Machine Learning
- Data Analysis
Career Highlights
- Expert in building scalable data pipelines.
- Strong foundation in machine learning and AI.
- Proficient in GCP and big data technologies.
Work Experience
HSBC
Senior Data Engineer (2 mos)
Data Engineer (11 mos)
Software Engineer (1 yr 4 mos)
Trainee Software Engineer (6 mos)
PhonePe
PhonePe Tech Scholar'22 (1 mo)
Oil and Natural Gas Corporation Ltd
Summer Intern (1 mo)
Education
Bachelor of Technology - BTech at Graphic Era Deemed to be University
Physics-Chemistry-Mathematics at The Army Public School