Varghese Roy

Data Engineer

Hyderabad, Telangana, India4 yrs 7 mos experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Expert in building cloud-scale data platforms.
  • Proven track record in optimizing data pipelines.
  • Strong background in software engineering and data integration.
Stackforce AI infers this person is a Data Engineer with expertise in cloud computing and data integration in SaaS environments.

Contact

Skills

Core Skills

Data EngineeringCloud ComputingSoftware EngineeringFrontend Development

Other Skills

PySparkPython (Programming Language)ScalaApache SparkGitDockerKubernetes(AWS EKS)AWS EMRJenkinsAgile MethodologiesApache IcebergSQLReact.jsWeb DevelopmentData Structures

About

I am a software engineer with around 5 years of experience building cloud-scale data platforms and pipelines. I like working in fast-paced environments, solving challenging and complex problems. I am a continuous learner and fascinated about new breakthroughs in software engineering.

Experience

4 yrs 7 mos
Total Experience
2 yrs 6 mos
Average Tenure
3 yrs
Current Experience

Apple

Data Engineer (Vendor)

May 2023Present · 3 yrs · Hyderabad, Telangana, India · On-site

  • I currently work in the Maps project of Apple as a Data Engineer (onsite vendor). I have been working on creating scalable, and robust data pipelines to extract data for Data Scientists in order to train and deploy their ML models.
  • One project I worked on (called the HLA NAR TODO Filter) is expected to save at least $1,50,000 for Apple over the next two years.
  • Created a pipeline to download and process large number of images using Spark to predict whether a road is wide enough for vehicles to pass.
  • Experimented and setup the pyspark infrastructure for large scale image processing for my team.
  • Helped migrating pyspark jobs from Apple's internal clusters to AWS graviton clusters.
  • Experimented with Agentic AI frameworks, such as LangGraph for development of an Agentic AI application at Apple Maps.
  • Tech stack used: Scala, Python, Apache Spark, Pyspark, Git, Docker, Kubernetes(AWS EKS), AWS EMR, Jenkins
PySparkPython (Programming Language)ScalaApache SparkGitDocker+5

Techsophy

Software Engineer

May 2023Present · 3 yrs · Hyderabad, Telangana, India · On-site

Dremio

Software Engineer

Jul 2021Feb 2023 · 1 yr 7 mos · Hyderabad, Telangana, India · Hybrid

  • I worked in the Datalake team of Dremio. I worked mainly on the integration of Apache Iceberg features into Dremio's query engine. It involved enabling iceberg DML features for Dremio like "CTAS", "ALTER TABLE", Partition Evolution, etc. One important project I worked on was to create a new command called "COPY INTO", which enabled the direct copying of data from CSV and JSON files into an iceberg table. I have also worked on creating an automation tool that enables engineers to run basic SQL queries quickly on their code changes before uploading them.
Agile MethodologiesGitSoftware Engineering

Trell

Software Devloper

Jul 2020Dec 2020 · 5 mos · Bangalore Urban, Karnataka, India

  • I worked at Trell Experiences pvt. ltd, Bangalore for a period of five months from July 27 to December 21 where I worked in the Front-end web development team of Trell. I also worked in maintaining Trell's database.
React.jsWeb DevelopmentFrontend Development

Csir-cimfr, dhanbad

Research Intern

May 2019Jul 2019 · 2 mos · Dhanbad Area, India

  • I worked at CSIR-CIMFR Dhanbad for two months as an intern where I worked on the development of an IoT based prototype of a motor control system to be used in mines.

Education

Georgia Institute of Technology

Master of Science - MS — Computer Science

Jan 2023May 2026

Birla Institute of Technology and Science, Pilani - Goa Campus

Bachelor's degree — Electrical and Electronics Engineering

Jan 2017Jan 2021

Stackforce found 100+ more professionals with Data Engineering & Cloud Computing

Explore similar profiles based on matching skills and experience