Ophelia S.

Data Engineer

4 yrs 1 mo experience

Key Highlights

Expert in building scalable data pipelines on AWS.
Proven track record of automating data workflows.
Strong mentor for early career tech talent.

Stackforce AI infers this person is a Data Engineer with expertise in SaaS and data infrastructure.

Contact

Skills

Core Skills

Data EngineeringAwsData VisualizationMachine LearningData Analysis

Other Skills

PySparkAmazon Web Services (AWS)SQLData ModelingBrazeAWS EMRTableauPythonOpenCVPILData WarehousingApache HadoopTrinoExtract, Transform, Load (ETL)CI/CD

About

I'm a Data Engineer at Grubhub with a master's degree in computer science, and here's what I've learned: most early-career folks don't know how to market themselves. You're doing personal projects, attending hackathons, crushing your co-ops and internships. But none of that translates into opportunities because you don't know how to tell your story. I was in the same spot until I learned to play the game - the numbers game on LinkedIn - not the 'cold apply to 1000 jobs' game. I landed three offers in 30 days, and now I help early tech talent do the same, translating what they've done into stories they can tell on LinkedIn and building a personal brand that leads to callbacks. Through posts and resources, I share what actually works for getting noticed in tech. Practical strategies from someone who navigated the same challenges and learned what works. Let's make you stand out on LinkedIn!

Experience

Grubhub

Data Engineer

Jun 2025 – Present · 9 mos

As a Data Engineer on the Growth CRM team, I partner across marketing ops, product, analytics, and merchant teams to build and scale the data infrastructure powering billions of daily customer communications. I design AWS-based pipelines, automate campaign workflows, and turn messy data into personalized, high-converting experiences.
Here are some of my accomplishments:
Built automated merchant partnership logic into CRM pipeline, unlocking self-serve campaign customization at scale (tens of millions of weekly diners) and eliminating engineering as a bottleneck for growth-focused sends
Engineered partition-based overwrite strategy for upstream Braze data, improving data availability by 70% and reducing pipeline runtime by 30%
Designed and maintained high-volume ETL pipeline for real-time event processing (100+ GB daily via PySpark on AWS EMR), transforming semi-structured JSON and managing data in S3 with Hive external metastore for CRM campaign performance tracking
Built automated Tableau dashboards that eliminated ~3 hours of manual reporting per week, enabling real-time campaign performance and self-service analytics for stakeholders

PySparkAmazon Web Services (AWS)SQLData ModelingBrazeData Engineering+1

Institute for advanced analytics

Graduate Teaching Assistant

Aug 2024 – Apr 2025 · 8 mos · Raleigh, North Carolina, United States

North Carolina State University’s Institute for Advanced Analytics houses the first M.S. in Analytics program in the U.S., known for a 93% job placement rate at graduation and an average base salary of $108K (Class of 2024, 3-year average).
As a Graduate Teaching Assistant, I mentored students in applied statistics and machine learning, supporting homework, projects, and remediation sessions.
I also collaborated with instructors to identify learning gaps and provided targeted support to strengthen students’ understanding of regression, classification, and model evaluation.
By the end of the course, students I mentored demonstrated improved performance on projects and assessments.
This experience strengthened my ability to explain complex quantitative concepts and foster student success.

Netflix

Data Engineering Intern

May 2024 – Aug 2024 · 3 mos · Los Angeles, California, United States · On-site

Netflix is the most subscribed to video streaming service worldwide. I worked on the Studio & Creative Production Data Engineering team, which is responsible for developing and maintaining data pipelines relevant to in-house productions. Netflix is scheduled to release 157 originals in 2024.
My main contribution was improving the efficiency of an internal tool’s data pipeline, reducing tech debt when incorporating this tool into the larger ecosystem of stable tools. This involves
deprecating R code and replacing it with PySpark code,
refactoring the existing data model to improve query performance, and
switching existing data sources to those of higher quality and maintainability.
While migrating source tables, I had to coordinate with a sister team to ensure that the required fields were available in the new sources and the data quality of the new source is better than the old one.
Through this experience, I learned the importance of assuring stakeholders that the change in data source will not degrade the data quality before making any significant code changes.
By the end of the project, I was able to
shorten the lines of code needed to execute the pipeline by 50%,
reduce the pipeline’s run time from 7 hours to 1 hour,
decrease disk space utilization by 90%, and
remove reliance on legacy views.

Data WarehousingData ModelingApache HadoopPySparkSQLTrino+3

North carolina state university

Machine Learning Researcher

Aug 2023 – Dec 2024 · 1 yr 4 mos · Raleigh, North Carolina, United States

During my first year as a CS student at North Carolina State University, I worked on an independent project under the supervision of Professor Ranga Raju Vatsavai.
The goal of the project was to determine if transfer learning can be applied to deforestation detection in satellite imagery.
The first part of the project involves collecting raw images and then applying normalization and image augmentation, using Python packages like OpenCV and PIL. 20+ carbon trading project sites across the globe were selected to ensure diverse representation of deforested regions in 6 countries.
The second part was training the model. I first used MobileNet-v2, a pre-trained CNN model without any modifications. Then I removed the top layers and retrained those layers, and compared the results between before and after retraining. To fine-tune the model, I tested over 50 parameter combinations.

World bank group

Data Analyst

Jun 2022 – May 2023 · 11 mos · Washington, District of Columbia, United States

The World Bank Group is an international organization that provides low interest loans to low and middle income countries to improve people’s quality of living.
I was the first data analyst hired in the IT Client Service Headquarters division, located in Washington D.C. The division’s main responsibility was to provide technical support to 5,000+ staff located in the US.
In my position, I leveraged datasets generated by different IT teams to develop data visualizations and reports used to inform managerial decisions.
For example, I found that 50%+ request tickets were associated with one configuration item, indicating overuse and lack of understanding of other configuration items. This led to a department-wide education campaign on proper ticket documentation.
Other projects include estimating ticket rerouting impact, assessing staff learning survey results, and developing reports on IT workload for regional managers.

Microsoft Power BIPythonSQLTableauData Analysis

Startup ucla entrepreneurship institute

Data Analytics Intern

Oct 2021 – Jun 2022 · 8 mos · Los Angeles, California, United States · On-site

Startup UCLA connects UCLA students and alumni and the Southern California startup community. It hosts programs like the Summer Accelerator, which provides mentorship and funding to early stage companies.
As a Data intern, I managed the website and provided analytics support to the Executive Director.
I was able to
redesign call-to-action buttons, resulting in 300% increase in booked consultation calls,
streamline data pipeline from various social media sources to the data sink, reducing overall time spent on data aggregation, and
analyze and report quarterly co-working space usage and social media engagement.

Ucla mobility lab

Research Assistant

Sep 2021 – Jun 2022 · 9 mos · Los Angeles, California, United States

The UCLA Mobility Lab focuses on developing cutting-edge transportation technologies for smart cities. Using AI, robotics, and machine learning, the lab conducts research in vehicle automation, urban mobility analytics, and smart transportation infrastructure.
As a Research Assistant, I:
used pandas and ArcPy to analyze broadband coverage for 2000+ transportation analysis zones in LA County,
helped design a survey to collect data on Los Angeles residents' telework choices for use in a multi-agent transportation simulation,
distributed a month-long survey to collect UCLA students' mode choice preferences with hypothetical bike share system implementation, and
developed discrete choice models to understand behavior and travel pattern changes before and after bike share implementation.

Statistical Data Analysis

Project on resources and governance (prg)

Research Assistant

Apr 2021 – Nov 2021 · 7 mos · Remote

The Project on Resources and Governance (PRG) is a non-profit research initiative that applies insights from the social and behavioral sciences to find effective solutions to governance problems in resource-rich countries
As a research assistant at PRG, I
identified artisanal and small-scale mining (ASM) sites in Central & West Africa with satellite imagery,
participated in developing an ASM identification protocol during weekly meetings with supervisor and three research assistants, and
webscraped Google Scholar data with Python and cleaned 3000 observations in Excel for literature review.

PythonR

Ucla sustainable la grand challenge

Research Scholar

Sep 2020 – Jun 2021 · 9 mos

The Sustainable LA Grand Challenge Undergraduate Research Scholars Program unites undergraduate students across all disciplines in their first research experiences in the field of urban sustainability.
As a research scholar, I participated in interdisciplinary group work and faculty-mentored independent research.
For independent research, I assisted with background research on the policy report 'Phasing Out Fossil Fuel Infrastructure in Los Angeles: Challenges for a Just Transition' by Andrea Furnaro and Dr. Kelly Kay from the Department of Geography. I also transcribed interviews and produced maps and charts.
For the interdisciplinary group work, I collaborated with a team of 5 to evaluate the feasibility of installing a geothermal heat pump on campus. I conducted literature review on geothermal installation case studies at other U.S. universities and presented our findings to the cohort, the Chief Sustainability Officer, and other stakeholders from campus energy production.