S

Shreya Dubey

Software Engineer

Redwood City, California, United States4 yrs 8 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Achieved a 50% reduction in response time for Apache Cassandra.
  • Developed advanced Machine Unlearning techniques with Microsoft.
  • Engineered a system to optimize tail latency in CockroachDB.
Stackforce AI infers this person is a Backend-focused Software Engineer with expertise in Cloud Computing and Database Management.

Contact

Skills

Core Skills

Machine LearningDistributed Sql DatabasesDistributed Nosql DatabasesSoftware Development

Other Skills

API DevelopmentAlgorithmsAmazon Web Services (AWS)Apache KafkaApache SparkArtificial Intelligence (AI)Back-End Web DevelopmentBenchmarkingBlockchainC (Programming Language)C++CMakeCascading Style Sheets (CSS)CockroachDBData Analysis

About

I am a dedicated Computer Science Master's student at the University of Massachusetts, Amherst (GPA: 3.93), on track to graduate in May 2024. I graduated with a B.Tech in Computer Science and Engineering from the Indian Institute of Technology Ropar(IIT Ropar), securing a respectable institute rank of 19 out of 120. As a former Software Engineer - 2 at Nutanix Technologies, I had the opportunity to lead initiatives aimed at improving Apache Cassandra's database. I engineered system throughput and scalability through Key Sharding and Key Versioning, leading to a 50% reduction in response time and enabling optimal Load Balancing. These work experiences have effectively honed my skills in Python, C++, Java, and various advanced technologies including Cassandra, Docker, AWS, and more. My project portfolio encompasses designing a user-interaction streaming data pipeline using Spark and Kafka and, developing a BERT-based Learning to Rank model for efficient document retrieval. My recent research at UMass Amherst focuses on optimizing tail latency during query execution for CockroachDB clusters hosted on the AWS cloud. Aiming for continual growth and innovation, I’m passionate about delving deeper into machine learning and distributed systems. My goal is to develop cutting-edge solutions that can handle the data demands of tomorrow while ensuring efficiency, reliability, and scalability.

Experience

Oracle

Member of Technical Staff

Jun 2024Present · 1 yr 9 mos · Redwood City, California, United States · On-site

Microsoft

Graduate Student Researcher

Feb 2024May 2024 · 3 mos · United States

  • Working alongside researchers from Microsoft within an Industry Mentorship Program, to develop advanced Machine Unlearning techniques. This ensures the efficient removal of specific user data from LLMs while maintaining strict compliance with privacy regulations.
Machine LearningNatural Language Processing (NLP)

University of massachusetts amherst

2 roles

Graduate Teaching Assistant

Sep 2023Dec 2023 · 3 mos · Amherst, Massachusetts, United States · On-site

  • Assisted Professor Hamed Zamani in grading assignments, exams, and coursework for the course Information Retrieval. Provided constructive student feedback, maintained grade records, and engaged in regular discussions with instructors to clarify evaluation criteria.

Graduate Student Researcher

Feb 2023Jan 2024 · 11 mos · Amherst, Massachusetts, United States · On-site

  • Assessed Query Data Access Causality in an AWS-hosted CockroachDB cluster, orchestrated with Docker Swarm.
  • Engineered a Distributed Database Management System that improved tail latency during query execution in a CockroachDB cluster; implemented gRPC for streamlined communication.
Python (Programming Language)Amazon Web Services (AWS)CockroachDBDocker SwarmBenchmarkingDistributed SQL Databases+1

Nutanix

Member of Technical Staff - 2

Jan 2019Jan 2021 · 2 yrs · Greater Bengaluru Area · On-site

  • Managed metadata using C++ within the Distributed NoSQL Database Management System, Apache Cassandra.
  • Leveraged my C++ programming skills to enhance database performance, including implementing key sharding, optimizing key management with key versioning, and introducing batch processing to improve write request efficiency.
  • Conducted performance benchmarking under real-time operational loads in Cassandra, determining ideal update batch sizes and further bolstering database efficiency.
  • These efforts resulted in increased system throughput, scalability, and reliability.
C++GitDistributed SystemsNoSQL - Apache CassandraMicroservicesBenchmarking+2

Amazon

Software Development Engineer Intern

May 2018Jul 2018 · 2 mos · Chennai, Tamil Nadu, India · On-site

  • Designed a JAVA-based Address Re-Resolution application that dynamically adapts to changes in reference address data or the Address Resolution Service logic, incorporating JUnit and Mockito for rigorous testing. This enhancement boosted delivery accuracy, enhancing customer satisfaction through precise and timely address updates.
GitObject-Oriented Programming (OOP)MockitoJavaJUnitSoftware Development

Education

University of Massachusetts Amherst

Master of Science - MS — Computer Science

Sep 2022May 2024

Indian Institute of Technology, Ropar

Bachelor of Technology - BTech — Computer Science

Jul 2015May 2019

Stackforce found 100+ more professionals with Machine Learning & Distributed Sql Databases

Explore similar profiles based on matching skills and experience