Valentina S.

AI Researcher

New York City, New York, United States1 yr 3 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in NLP and transformer-based modeling.
  • Developed scalable LLM-assisted data pipelines.
  • Achieved significant accuracy improvements in ML models.
Stackforce AI infers this person is a Machine Learning Engineer specializing in NLP and scalable AI solutions.

Contact

Skills

Core Skills

Large Language Models (llm)Data AnnotationNatural Language Processing (nlp)Machine LearningData AnalysisRobotic Process Automation (rpa)Data Science

Other Skills

AI AgentsGoogle Cloud Platform (GCP)Python (Programming Language)SQLModelingCritical ThinkingFeature EngineeringData WranglingStatistical ModelingData ManipulationRecommender SystemsPredictive ModelingData VisualizationExploratory Data AnalysisGit

About

💜Hi, I’m Valentina. I’m a Machine Learning Engineer specializing in NLP, LLM-assisted data curation, and transformer-based modeling. I build systems that extract structure from unstructured text — classification models, entity extractors, context-aware transformers, and large-scale semantic understanding pipelines. My work spans end-to-end ML development: dataset engineering, model design, fine-tuning, evaluation, and deployment using Python, PyTorch, Hugging Face, and GCP (Vertex AI, BigQuery, PySpark). I’m deeply interested in representation learning, embeddings, hierarchical modeling, and scalable ML systems that support real-world analytics, ranking, and personalization. I thrive in fast-moving environments where problems are ambiguous, datasets are messy, and the solution requires research-level thinking paired with practical engineering.

Experience

1 yr 3 mos
Total Experience
1 yr 3 mos
Average Tenure
--
Current Experience

Meta

Data Labeling Analyst IV

Jan 2026 – Present · 4 mos · New York City Metropolitan Area · Remote

  • Contract through Tundra Technical Solutions
AI AgentsLarge Language Models (LLM)Data Annotation

Julius

Machine Learning Engineer

Apr 2025 – Dec 2025 · 8 mos · New York City Metropolitan Area · Remote

  • Built transformer-based classifiers improving employer vs recruiter detection from 0.74 → 0.86 F1 across 90M+ job descriptions.
  • Designed hierarchical NLP architectures combining sentence, section, and document signals for fine-grained entity extraction.
  • Developed scalable LLM-assisted data pipelines, turning ~200 manual labels into thousands of high-quality annotations.
  • Processed and enriched 80M+ text records using BigQuery + PySpark to support BI and product analytics.
Large Language Models (LLM)Google Cloud Platform (GCP)Python (Programming Language)SQLNatural Language Processing (NLP)Modeling

Omdena

Junior Machine Learning Engineer

Jan 2024 – Jan 2025 · 1 yr · Remote

  • Built an LLM-powered triage model that cut manual review time by 40% for humanitarian mobility-aid requests.
  • Created LLM-driven labeling pipelines improving prioritization accuracy by 18% for partner NGOs.
  • Worked in distributed team settings with rapid iteration cycles under real-world deployment constraints.
Machine LearningData AnalysisData ScienceLarge Language Models (LLM)

Springboard

Data Scientist Fellow

Jan 2023 – Apr 2024 · 1 yr 3 mos · Remote

  • Developed XGBoost fraud detection models achieving 0.76 F1 via PCA grouping and adversarial validation.
  • Engineered 400+ feature representations across encoded transactional datasets.
  • Designed statistical A/B tests and uplift analysis for experimentation workflows.
Python (Programming Language)SQLCritical ThinkingMachine LearningData AnalysisFeature Engineering+10

Cognitus consulting

RPA Developer Intern

Jul 2021 – Aug 2021 · 1 mo · Miami, Florida, United States

  • Spearheaded the development of Gallop Intelligent Invoice Automation (GIIA), an AP invoice automation solution, for Cognitus Consulting's clientele.
  • This innovative solution facilitates the full automation of AP invoices into SAP S/4HANA, SAP Ariba, or any ERP system, achieving an impressive 98.5% accuracy rate.
  • Successfully deployed this solution to over 20 clients within a span of two years.
  • Implemented the solution using UiPath's Robotic Process Automation (RPA) capabilities.
  • Gained recognition by SAP, earning an endorsement as an application of distinction.
Programming LanguagesCritical ThinkingRobotic Process Automation (RPA)Feature EngineeringAttention to DetailUiPath+1

Deutsches forschungszentrum für künstliche intelligenz (dfki)

Research Intern in Underwater Robotics

Jun 2021 – Aug 2021 · 2 mos · Bremen, Bremen, Germany

  • Optimized a deep learning model for Autonomous Underwater Vehicles (AUV), achieving 98% accuracy in modeling kinematic relationships, which marked a significant advance in AUV control precision and navigational technology.
  • My role involved refining neural network architectures and enhancing performance through sophisticated data analysis and interpretation.
Data ScienceFine TuningPython (Programming Language)Critical ThinkingPattern RecognitionMachine Learning+12

Education

Jacobs University Bremen

Bachelor of Science | Minor in Industrial Engineering and Management — Robotics Technology/Technician

Springboard

Data Science Career Track | Machine Learning

Coral Reef Senior High School

High School/Secondary Diplomas and Certificates

Jan 2015 – Jan 2019

Miami Dade College

Stackforce found 100+ more professionals with Large Language Models (llm) & Data Annotation

Explore similar profiles based on matching skills and experience