Rajarshi Sarkar — Senior Software Engineer
Software Engineer with 9+ years of experience across Google, Amazon, and Walmart, specializing in Java, Big Data, Distributed Systems, and Microservices. Experienced in building scalable, high-performance, and secure distributed systems for Big Data, Cloud Computing, E-commerce, and Retail. At Amazon Web Services, I was part of the EMR team powering petabyte-scale analytics with Apache Spark, Hive, and Trino. My work there included optimizing Hive partition pruning, integrating Iceberg into EMR (with accompanying open-source contributions), making S3A the default file system, implementing Fine-Grained Access Control (FGAC), and optimizing EMR releases. Previously, at Walmart, I designed and developed a cumulative data repository that integrates real-time transactional data across the supply chain, enabling near-real-time analytical insights. I also led the development of Data Lake products, including Data Pipeline, Data Quality, Metadata Manager, and Data Acceleration tools, with a strong focus on data integrity, governance, and end-to-end lineage. Open-source contributor to Apache Iceberg, Trino, and Gimel.
Stackforce AI infers this person is a Big Data and Cloud Computing expert with extensive experience in E-commerce and Retail.
Location: Bengaluru, Karnataka, India
Experience: 9 yrs 6 mos
Skills
- Big Data
- Amazon Web Services (AWS)
Career Highlights
- 9+ years of experience at Google, Amazon, and Walmart.
- Expertise in Big Data and Distributed Systems.
- Proven track record in open-source contributions.
Work Experience
Sr. Software Engineer (8 mos)
Amazon
Software Development Engineer (4 yrs 4 mos)
Walmart
Sr. Software Engineer (1 yr 3 mos)
Software Engineer III (1 yr 11 mos)
Software Engineer II (1 yr 4 mos)
Indian Institute of Technology, Kharagpur
Intern (1 mo)
Indian Institute of Technology, Bombay
Intern (2 mos)
Education
Master of Computer Science at Arizona State University
Bachelor of Engineering at Birla Institute of Technology, Mesra