Aditya Shah

Senior Software Engineer

Sunnyvale, California, United States · 7 yrs 11 mos experience
Highly Stable

Key Highlights

  • Expert in Data Governance and Apache Spark.
  • Proven track record in developing Big Data applications.
  • Strong background in Hive and distributed systems.
Stackforce AI infers this person is a SaaS expert with a strong focus on Big Data and distributed systems.

Skills

Core Skills

Data Governance · Apache Spark · Hive

Other Skills

Apache Hive · Apache Iceberg · C++ · CSS · Deep Learning · Delta · Distributed File System (DFS) · Distributed Systems · HTML · Hadoop · Hudi · JavaScript · Leadership · Machine Learning · Microsoft Office

Experience

Google

Senior Software Engineer

Sep 2025 - Present · 6 mos

Amazon Web Services (AWS)

3 roles

Software Development Engineer II

Jul 2022 - Sep 2025 · 3 yrs 2 mos

  • Worked in the governance domain for Apache Spark. Key projects:
  • Developed, launched, and maintained fine-grained access control for Apache Spark, with support for all open table formats (OTFs), such as Iceberg, Hudi, and Delta.
  • Led, developed, and launched multi-dialect views with data access control for Apache Spark, with support for OTFs.
Apache Iceberg · Delta · Hudi · Distributed Systems · Data Governance · Hadoop

Software Development Engineer II

Promoted

Oct 2021 - Jul 2022 · 9 mos

  • Continued working on the Elastic MapReduce (EMR) team, focusing on development of big data analytics applications. Key projects:
  • Apache Hive currency, maintenance, optimization, security improvements, and telemetry
  • Apache Hive Metastore Service
  • Apache Hive Iceberg integration in EMR
  • Evaluation of new applications for EMR
Apache Iceberg · Distributed Systems · Hadoop · Apache Spark · Hive

Software Development Engineer

Jan 2021 - Oct 2021 · 9 mos

  • Worked on the Elastic MapReduce (EMR) team, focusing on development of big data analytics applications for Hive and Tez. Key projects: the Hive EMRFS S3-optimized committer, Hive on EMR Serverless, and performance optimization for Hive.
Apache Iceberg · Distributed Systems · Hadoop · Hive

Qubole

2 roles

Member Of Technical Staff, Hive Team

Jul 2019 - Dec 2020 · 1 yr 5 mos

  • Worked on transactional capabilities in Hive, with support for multiple engines such as Spark and Presto. This included designing and optimizing for cloud-native systems; developing in-house testing infrastructure, monitoring, and alerting; and contributing back to open source.
Distributed Systems · Hadoop · Hive

Intern

Jul 2018 - Jun 2019 · 11 mos

  • Worked on Apache Hive, a data warehouse infrastructure tool for processing structured data in Hadoop. Key assignments: adding an eventual consistency check for object stores, improving the initial mapper estimation in the Tez execution engine to account for auto-scaling, and a Hive version upgrade.
Distributed Systems · Hadoop · Hive

D. E. Shaw India Private Limited

Summer Intern

May 2018 - Jul 2018 · 2 mos · Hyderabad Area, India

  • Evaluated open-source, React-based libraries for mobile-specific UI components and integrated them into the DESCO JavaScript stack. Set up two full-stack apps, revamped mobile versions of InOut (a leave management portal) and DESFlow (a request tracker), to showcase the new UI components.

Web Intelligence and Social Computing Lab

Research Assistant

Aug 2017 - Dec 2017 · 4 mos · Pilani

  • Worked under Dr. Yashvardhan Sharma on event extraction from code-mixed social media text (Hindi, Malayalam, and Tamil).

NCFlexE, IIT Kanpur

SURGE Research Intern

May 2017 - Jul 2017 · 2 mos · Kanpur Area, India

  • Worked under Dr. Deepak Gupta to develop a cost-effective anti-counterfeiting technology. My project involved identifying 3D textures, authenticating images, and generating image signatures for indexing and searching. This was part of a larger project to commercialize a patented lab implementation that tackles counterfeiting for any product sold anywhere, with a plan to spin it out of the IITK lab as an independent startup.

Bhaskaracharya Institute for Space Applications and Geo-informatics

Summer Research Intern

May 2016 - Jul 2016 · 2 mos · Gujarat, India

  • Worked on vehicle detection to analyze traffic flow within the campus. Developed a system that, given a video, detects vehicles in each frame using an SVM and the Haar cascade algorithm. The system ran on a distributed file system for large-scale data processing, using the Dumbo API.

Education

Birla Institute of Technology and Science, Pilani

Master of Science — Mathematics

Jan 2014 - Jan 2019

Birla Institute of Technology and Science, Pilani

Bachelor of Engineering - BE — Computer Science

Jan 2014 - Jan 2019

Symbiosis School Nashik

Jan 2008 - Jan 2014
