Paul Iusztin

Co-Founder

Timişoara, Timiş, Romania7 yrs 1 mo experience
AI EnabledAI ML Practitioner

Key Highlights

  • Author of the bestseller LLM Engineer's Handbook.
  • Founder of Decoding AI, teaching AI Engineering.
  • Over 10 years of experience in AI and software development.
Stackforce AI infers this person is a highly skilled AI Engineer with extensive experience in Fintech and SaaS industries.

Contact

Skills

Core Skills

Artificial Intelligence (ai)MlopsMachine LearningDeep LearningData ScienceComputer VisionBusiness StrategySoftware DevelopmentProject Management

Other Skills

Large Language Models (LLM)PythonSQLInfrastructure as code (IaC)Amazon Web Services (AWS)AWSTerraformGenerative AITechnical WritingPyTorchFastAPIGraphQLGoogle Cloud Platform (GCP)Scikit-LearnDjango

About

I'm the author of the bestseller LLM Engineer's Handbook, lead instructor of the Agentic AI Engineering course, founding AI Engineer of a San Francisco start-up, and obsessed with making knowledge accessible through AI. With over 10 years of experience in AI and software and 20 apps shipped, I teach AI Engineering as I wanted to at the beginning of my career. End-to-end. From idea to production. From data collection to deploying, monitoring and evaluation. With a focus on AI principles, software patterns and infrastructure systems that will thrive in a future dominated by AI coding tools. My ultimate goal is to help other engineers escape PoC purgatory and 10x their AI Engineering skills. . If you want to become an AI Engineer pro, join Decoding AI for exclusive content on designing, building, and shipping AI software that works: 🗞️ https://www.decodingai.com See more: 🌐 Homepage: https://www.pauliusztin.ai 📚 LLM Engineer's Handbook: https://www.amazon.com/LLM-Engineers-Handbook-engineering-production/dp/1836200072/ 🤖 Agentic AI Engineering Course: https://academy.towardsai.net/courses/agent-engineering?ref=b3ab31 . ✉️ For collabs, contact me at: p.b.iusztin@gmail.com

Experience

7 yrs 1 mo
Total Experience
1 yr 2 mos
Average Tenure
1 yr 2 mos
Current Experience

Stealth ai startup

Founding Al Engineer

Mar 2025Present · 1 yr 2 mos · San Francisco, California, United States · Remote

  • 💼 Building end-to-end vertical AI agents for financial services with a deep focus on the AI, backend and infrastructure layers of the application.
  • → Leading the RAG ingestion, knowledge graph and AI evals components.
  • → Built the entire infrastructure, from shipping to AWS through Terraform, to implementing the CI/CD and monitoring components.
  • → Scaled the RAG ingestion pipeline from ingesting 1500 documents in one hour to 36000+
Artificial Intelligence (AI)Software DevelopmentMLOpsLarge Language Models (LLM)PythonSQL+2

Metaphysic.ai

2 roles

Senior Machine Learning Engineer

Jan 2024Sep 2024 · 8 mos · London Area, United Kingdom · Remote

  • 🟠 Computer vision & deep learning workflow engine: orchestrate & execute
  • The client, a leading GenAI platform, faced difficulties scaling and breaking down its complex ML jobs. Their manual processes and CLI-based tooling added significant friction in serving their clients. Thus, the client required a more intuitive and scalable tool that could be used across the company to train and serve its ML models.
  • → 𝗥𝗲𝗳𝗮𝗰𝘁𝗼𝗿𝗲𝗱 the ML worker architecture to follow DDD principles, making it more 𝗿𝗲𝗮𝗱𝗮𝗯𝗹𝗲, 𝗺𝗼𝗱𝘂𝗹𝗮𝗿, and 𝗲𝘅𝘁𝗲𝗻𝘀𝗶𝗯𝗹𝗲.
  • → 𝗗𝗲𝗰𝗿𝗲𝗮𝘀𝗲𝗱 the 𝗹𝗮𝘁𝗲𝗻𝗰𝘆 of the data curation DL models 𝗯𝘆 𝟴𝟮.𝟳%
  • → 𝗘𝘅𝘁𝗲𝗻𝗱𝗲𝗱 their clustering algorithm to accurately run on 𝗱𝗮𝘁𝗮𝘀𝗲𝘁𝘀 >𝟭𝟬𝗸 𝗶𝗺𝗮𝗴𝗲𝘀 and 𝗮𝘂𝘁𝗼𝗺𝗮𝘁𝗶𝗰𝗮𝗹𝗹𝘆 𝗱𝗲𝘁𝗲𝗰𝘁 the number of 𝗰𝗹𝘂𝘀𝘁𝗲𝗿𝘀 and 𝗼𝘂𝘁𝗹𝗶𝗲𝗿𝘀.
MLOpsMachine LearningDeep LearningGenerative AIComputer VisionSoftware Development+7

Senior Machine Learning Engineer

May 2023Dec 2023 · 7 mos · London Area, United Kingdom · Remote

  • 🔴 Scalable ML batch pipeline for continuous training & inference
  • The client, a leading GenAI platform, struggled with manual training and calling their ML models. As a tedious and unscalable process, they needed to automate their ML processes completely and integrate them with their current web platform.
  • → 𝗗𝗲𝘀𝗶𝗴𝗻𝗲𝗱 and 𝗶𝗺𝗽𝗹𝗲𝗺𝗲𝗻𝘁𝗲𝗱 a 𝗺𝗼𝗱𝘂𝗹𝗮𝗿 𝗣𝘆𝘁𝗵𝗼𝗻 𝗽𝗮𝗰𝗸𝗮𝗴𝗲 that orchestrates the ML steps into 3 fully automated batch pipelines: feature, training, and inference, 𝗲𝗹𝗶𝗺𝗶𝗻𝗮𝘁𝗶𝗻𝗴 𝗮 𝘁𝗲𝗱𝗶𝗼𝘂𝘀 𝟮-𝗵𝗼𝘂𝗿 𝗺𝗮𝗻𝘂𝗮𝗹 𝗽𝗿𝗼𝗰𝗲𝘀𝘀/𝘃𝗶𝗱𝗲𝗼.
  • → 𝗥𝗲𝗱𝘂𝗰𝗲𝗱 the 𝗽𝗿𝗼𝗰𝗲𝘀𝘀𝗶𝗻𝗴 𝘁𝗶𝗺𝗲 of the batch ML pipeline 𝗯𝘆 𝟲𝟮.𝟱% while preserving the same accuracy and quality.
  • → 𝗜𝗺𝗽𝗹𝗲𝗺𝗲𝗻𝘁𝗲𝗱 𝗠𝗟𝗢𝗽𝘀 𝗯𝗲𝘀𝘁 𝗽𝗿𝗮𝗰𝘁𝗶𝗰𝗲𝘀 such as IaC, CI/CD, monitoring, experiment tracking, and model registries, making the system reproducible, testable and trackable.
  • → 𝗗𝗲𝗽𝗹𝗼𝘆𝗲𝗱 a scalable & cost-effective 𝗮𝘀𝘆𝗻𝗰 𝗯𝗮𝘁𝗰𝗵 𝗮𝗿𝗰𝗵𝗶𝘁𝗲𝗰𝘁𝘂𝗿𝗲 on top of AWS ECS & SQS that scales in and out based on the number of jobs, 𝗿𝗲𝗱𝘂𝗰𝗶𝗻𝗴 their 𝗔𝗪𝗦 𝗰𝗼𝘀𝘁𝘀 𝗯𝘆 𝟱𝟮%.
  • 🔴 Big data ingestion & archiving pipeline
  • The client wanted to expand its platform features by securely ingesting, preprocessing, and storing TB of data into their platform.
  • → 𝐂𝐨𝐧𝐭𝐫𝐢𝐛𝐮𝐭𝐞𝐝 to the 𝐝𝐞𝐬𝐢𝐠𝐧 and 𝐢𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐚𝐭𝐢𝐨𝐧 of a 𝐝𝐚𝐭𝐚 𝐢𝐧𝐠𝐞𝐬𝐭𝐢𝐨𝐧 𝐩𝐢𝐩𝐞𝐥𝐢𝐧𝐞 that supports 𝐢𝐧𝐠𝐞𝐬𝐭𝐢𝐧𝐠 >𝟓 𝐓𝐁 of data in 𝐨𝐧𝐞 𝐠𝐨 in a reliable, fast, and secure way.
  • → 𝗜𝗺𝗽𝗹𝗲𝗺𝗲𝗻𝘁𝗲𝗱 a scalable 𝗱𝗮𝘁𝗮 𝗽𝗿𝗲𝗽𝗿𝗼𝗰𝗲𝘀𝘀𝗶𝗻𝗴 𝗽𝗶𝗽𝗲𝗹𝗶𝗻𝗲 that supports multiple media types (images, videos, audio, 3D objects), archiving the original files on an affordable storage solution, 𝗿𝗲𝗱𝘂𝗰𝗶𝗻𝗴 𝗰𝗼𝘀𝘁𝘀 𝗯𝘆 𝘂𝗽 𝘁𝗼 𝟴𝟭%.
MLOpsDeep LearningGenerative AIMachine LearningSoftware DevelopmentAmazon Web Services (AWS)+6

Decoding ai magazine

Founder

Jun 2023Present · 2 yrs 11 mos · Remote

  • 📰 Content on designing, building, and shipping AI software that works. Learn AI engineering, end-to-end, from idea to production. Every Tuesday.
Artificial Intelligence (AI)Generative AIMLOpsLarge Language Models (LLM)PythonDeep Learning+2

Coreai

3 roles

Machine Learning Tech Lead

Dec 2022Jun 2023 · 6 mos · Israel · Remote

  • 🟠 Real-time social media recommender system using machine learning & Kafka
  • The client, a social media start-up from Israel, needed a real-time dashboard and recommender system on top of their Kafka infrastructure using modern ML and MLOps solutions.
  • → 𝐋𝐞𝐝 the architecture system design of a modular real-time social media recommender system built on top of the client's Kafka nervous system that leverages streaming and ML technologies such as Kafka Streams, Kotlin, PyTorch, Python, and Docker.
  • → Quickly 𝐩𝐥𝐚𝐧𝐧𝐞𝐝 and 𝐝𝐞𝐯𝐞𝐥𝐨𝐩𝐞𝐝 a scalable recommender system based on heuristics, picked through A/B testing, using Kafka Streams and Kotlin. It served as a vital entry point and baseline for future algorithms.
  • → 𝐃𝐞𝐬𝐢𝐠𝐧𝐞𝐝 and 𝐢𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐞𝐝 a dashboard that computes KPIs in 𝐫𝐞𝐚𝐥-𝐭𝐢𝐦𝐞 𝐟𝐨𝐫 𝟏𝟎𝟎𝐤+ 𝐮𝐬𝐞𝐫𝐬 on the client's Kafka infrastructure using ksqlDB, SQL, Elasticsearch, and Apache Superset.
CI/CDDockerProject PlanningExtract, Transform, Load (ETL)Amazon Web Services (AWS)Python+7

Machine Learning Tech Lead

Apr 2022Dec 2022 · 8 mos · Israel · Remote

  • 🔴 Prototyping AI solutions for Demand Forecasting, Churn Prediction, and Customer Segmentation
  • The client, a machine learning start-up, wanted to expand their services portfolio by prototyping and gaining knowledge in various AI fields that leverage structured retail data.
  • → 𝐋𝐞𝐝 𝐚 𝐭𝐞𝐚𝐦 𝐨𝐟 𝟐 to develop an ML clustering solution that aggregates data from multiple sources and finds customer segments and insights for new potential clients.
  • → 𝐈𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐞𝐝 a prototype for churn and demand forecasting used as a proof of concept to close deals with various clients.
  • → 𝐒𝐚𝐯𝐞𝐝 𝟒𝟎% of 𝐞𝐧𝐠𝐢𝐧𝐞𝐞𝐫𝐢𝐧𝐠 𝐫𝐞𝐬𝐨𝐮𝐫𝐜𝐞𝐬 by building a reusable Python ML framework that can quickly be tailored to custom problems based on structured data.
CI/CDDockerExtract, Transform, Load (ETL)Amazon Web Services (AWS)PandasScikit-Learn+7

Machine Learning Tech Lead

Feb 2022May 2023 · 1 yr 3 mos · Israel · Remote

  • 🔴 coreControl - end-to-end MLOps infrastructure
  • The client, a machine learning service provider, required an end-to-end ML/MLOps infrastructure to scale their operations and save development resources across multiple AI projects.
  • → 𝐋𝐞𝐝 the 𝐝𝐞𝐬𝐢𝐠𝐧 and 𝐢𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐚𝐭𝐢𝐨𝐧 of coreControl by integrating the software with the client's products and collaborating with other ML engineers, developers, and graphic designers.
  • → 𝐑𝐞𝐝𝐮𝐜𝐞𝐝 the 𝐝𝐞𝐯𝐞𝐥𝐨𝐩𝐦𝐞𝐧𝐭 𝐢𝐦𝐞 across the client's machine learning projects 𝐛𝐲 𝟑𝟎% using coreControl.
  • → 𝐈𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐞𝐝 the data backbone of coreControl that syncs the data between multiple MLOps tools and exposes them to a UI through a RESTful API using Python, FastAPI, PostgreSQL, Docker, and AWS.
  • → 𝐃𝐞𝐯𝐞𝐥𝐨𝐩𝐞𝐝 a Python package that wraps various MLOps tools that standardizes the data scientists' work and centralizes all the ML artifacts to easily be analyzed in a single place using Python, DVC, and ClearML.
CI/CDDockerREST APIsAmazon Web Services (AWS)PythonMLOps+4

Politehnica university of timisoara

Teaching Assistant

Feb 2022Jun 2023 · 1 yr 4 mos · Timişoara, Timiş, Romania

  • 🟠 Teaching the foundations of 𝐚𝐫𝐭𝐢𝐟𝐢𝐜𝐢𝐚𝐥 𝐢𝐧𝐭𝐞𝐥𝐥𝐢𝐠𝐞𝐧𝐜𝐞 laboratory.
  • → Supervised and unsupervised learning
  • → Linear regression, SVMs, Decision trees
  • → Clustering, Dimensionality reduction
  • → Dense neural networks
  • → Convolutional neural networks
  • → Recurrent neural networks
  • → Transformers
Deep LearningMathematicsScikit-LearnPythonStatisticsMachine Learning+2

Everseen

Senior Machine Learning Engineer

Aug 2021Feb 2022 · 6 mos · Timişoara, Timiş, Romania

  • 🟠 Autonomous supply chain system using computer vision & deep learning
  • The client, a well-known player in the retail sector, needed to expand its deep learning and computer vision solutions by researching and building autonomous systems for its operations in the retail industry.
  • → 𝐓𝐫𝐚𝐢𝐧𝐞𝐝 a Mask-RCNN model for 2D object detection and segmentation on 100k proprietary retail images, with an 𝐦𝐀𝐏 𝐨𝐟 𝟖𝟓%.
  • → 𝐒𝐩𝐞𝐞𝐝 𝐮𝐩 the 𝐢𝐧𝐟𝐞𝐫𝐞𝐧𝐜𝐞 𝐭𝐢𝐦𝐞 of the Mask-RCNN model 𝐛𝐲 𝟓𝟎% by levering Docker, ONNX, and TensorRT to containerize and accelerate the models.
  • → 𝐓𝐫𝐚𝐢𝐧𝐞𝐝 a scalable classifier using an image retrieval model that can dynamically handle new classes without additional training.
  • → 𝐈𝐧𝐜𝐫𝐞𝐚𝐬𝐞𝐝 the 𝐫𝐞𝐩𝐫𝐨𝐝𝐮𝐜𝐢𝐛𝐢𝐥𝐢𝐭𝐲 of the 𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠 𝐩𝐫𝐨𝐜𝐞𝐬𝐬 by implementing MLOps solutions, such as DVC & MLFlow, that included data & model versioning, registries, and experiment tracking features.
Object DetectionOpenCVDockerDeep LearningResearchTensorRT+6

Continental

2 roles

Mid-Level Machine Learning Researcher

Promoted

Nov 2020Aug 2021 · 9 mos

  • 🟠 Autonomous driving system using computer vision & deep learning
  • The client, a well-known player in the automotive sector, needed to create an autonomous driving car prototype that could accurately detect traffic participants by leveraging DL and CV solutions.
  • → 𝐋𝐞𝐝 the 𝐝𝐞𝐯𝐞𝐥𝐨𝐩𝐦𝐞𝐧𝐭 of a prototype for real-time traffic participants detection by deploying a CenterNet model for 3D object detection on a test car using an Nvidia Drive AGX C++ platform.
  • → 𝐈𝐦𝐩𝐫𝐨𝐯𝐞𝐝 the reproducibility of the experimentation process by implementing MLOps solutions (DVC) that version unstructured big data.
  • → 𝐈𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐞𝐝 a replacement to the Hungarian algorithm that 𝐢𝐧𝐜𝐫𝐞𝐚𝐬𝐞𝐝 the 𝐬𝐩𝐞𝐞𝐝 by 𝟑𝟎%, making the system 𝐫𝐞𝐚𝐥-𝐭𝐢𝐦𝐞.
  • → 𝐒𝐚𝐯𝐞𝐝 𝟐 𝐝𝐚𝐲𝐬, in 𝐞𝐯𝐞𝐫𝐲 𝐦𝐨𝐧𝐭𝐡, of 𝐫𝐞𝐩𝐞𝐭𝐢𝐭𝐢𝐯𝐞 work by automating the steps for the creation of the 𝐦𝐨𝐧𝐭𝐡𝐥𝐲 demo.
Object DetectionOpenCVDeep LearningResearchTensorRTONNX+5

Junior Machine Learning Researcher

Nov 2019Oct 2020 · 11 mos

  • 🔴 Autonomous driving research using computer vision & deep learning
  • The client, a well-known player in the automotive sector, needed to expand its research into the autonomous driving field using novel deep learning and computer vision methods.
  • → 𝐑𝐞𝐬𝐞𝐚𝐫𝐜𝐡 in 3D object detection, depth estimation & completion, tracking, and sensor fusion algorithms using various sensors such as images, radar & LiDAR point clouds.
  • → 𝐑𝐞𝐝𝐮𝐜𝐞𝐝 𝐛𝐲 𝟒𝟎% the 𝐦𝐞𝐦𝐨𝐫𝐲 𝐮𝐬𝐚𝐠𝐞 of the 3D object detection code by introducing the Dynamic Voxelization algorithm into the preprocessing pipeline.
  • → 𝐓𝐫𝐚𝐢𝐧𝐞𝐝 state-of-the-art models to generate LiDAR-like dense point clouds from sparse Radar point clouds on various autonomous driving datasets, such as KITTI, nuScenes, DDAD, and A2D2.
OpenCVDeep LearningResearchMathematicsTensorFlowAmazon Web Services (AWS)+4

Elementum technologies

Co-Founder

Jun 2019Jun 2020 · 1 yr · Timişoara, Timiş, Romania

  • 🟠 Dorel - An Uber for handymen
  • 𝐃𝐨𝐫𝐞𝐥 is an app that connects handypersons with customers. It contains a review and portfolio system where any handyperson can build his profile. Based on that, a user can determine whether he fits the job. The app intends to bring, once again, trust in handypersons.
  • → 𝐏𝐢𝐭𝐜𝐡𝐞𝐝 the 𝐢𝐝𝐞𝐚 to various start-up incubators.
  • → 𝐌𝐚𝐧𝐚𝐠𝐞𝐝 the backend components of the software solution of Dorel using an Agile methodology.
  • → 𝐋𝐞𝐝 the 𝐝𝐞𝐬𝐢𝐠𝐧 and 𝐢𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐚𝐭𝐢𝐨𝐧 of the Django RESTful API server and the notifications microservice.
  • → 𝐃𝐞𝐬𝐢𝐠𝐧𝐞𝐝 and 𝐝𝐞𝐯𝐞𝐥𝐨𝐩𝐞𝐝 the Redux layer of the React Native mobile app.
Business StrategyDjangoDockerNode.jsAmazon Web Services (AWS)React.js+5

Safefleet telematics

2 roles

Mid-Level Software Developer

Promoted

Feb 2019Nov 2019 · 9 mos

  • 🟠 Smart parking management platform
  • The client, a local player in the urban parking payment systems, needed to scale its operations into the private sector by building a new parking platform.
  • → 𝐄𝐧𝐠𝐢𝐧𝐞𝐞𝐫𝐞𝐝 the Redux layer of the onboarding mechanism of the React Native mobile application.
  • → 𝐁𝐮𝐢𝐥𝐭 an autonomous system that could detect license numbers and automatically open the parking barrier for registered users using MQTT.
  • → 𝐃𝐞𝐯𝐞𝐥𝐨𝐩𝐞𝐝 the Python async notifications microservice for the Android & IOS mobile apps.
DjangoDockerNode.jsAmazon Web Services (AWS)React.jsMicroservices+5

Junior Backend Developer

Jul 2018Jan 2019 · 6 mos

  • 🔴 Smart parking management platform
  • The client, a local player in the urban parking payment systems, needed to scale its operations into the private sector by building a new parking platform.
  • → 𝐈𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐞𝐝 a suite of unit tests that 𝐝𝐞𝐜𝐫𝐞𝐚𝐬𝐞𝐝 the 𝐝𝐞𝐯𝐞𝐥𝐨𝐩𝐦𝐞𝐧𝐭 𝐭𝐢𝐦𝐞 by 𝟐𝟎%.
  • → 𝐃𝐞𝐯𝐞𝐥𝐨𝐩𝐞𝐝 various Django RESTful API server features, such as new endpoints, serializers, and CRUD operations.
  • → 𝐈𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐞𝐝 a real-time business metrics dashboard on the office TV using Grafana on a Raspberry Pi.
DjangoREST APIsAmazon Web Services (AWS)React.jsPythonLinux+3

Education

Politehnica University Timisoara

Master's degree — Machine Learning

Sep 2020Jun 2022

Politehnica University Timisoara

Bachelor's degree — Computer Software Engineering

Sep 2016Jun 2020

Stackforce found 100+ more professionals with Artificial Intelligence (ai) & Mlops

Explore similar profiles based on matching skills and experience