Joel Maria — Software Engineer
Senior Staff Engineer with 16+ years architecting high-availability, distributed systems operating at massive concurrency and enterprise scale (50M+ users). I specialize in designing AI-native production infrastructure by integrating Retrieval-Augmented Generation (RAG), embedding pipelines, vector search, and LLM orchestration into streaming-first, event-driven systems. My focus is not AI wrappers. I build cost-governed, fault-tolerant AI platforms that sustain real-world load.
Throughout my career, I’ve operated at the intersection of:
- High-scale distributed systems
- Real-time streaming architectures
- Generative AI infrastructure
- FinTech-grade reliability and security
At U.S. Bank, I co-defined the architecture for an AI-powered voice assistant, integrating secure mobile clients with ML inference services under strict banking constraints.
At Upwork, I led architectural design for creator and gaming platforms serving 50M+ users, re-architecting data access layers for 200% performance gains and building streaming-first telemetry contracts enabling AI-ready workloads.
Previously at Grax, I scaled a Kubernetes-native SaaS platform to 100M+ monthly API requests, halving end-to-end latency through migration to WebSocket streaming and event-driven communication.
Core Architecture Domains
AI-Native Infrastructure
- Retrieval-Augmented Generation (RAG)
- Embedding pipelines & sub-120ms vector similarity search
- LLM orchestration (guardrails, routing, deterministic fallbacks)
- Token governance & inference cost optimization (38% reduction)
- Production-grade AI observability & performance controls
Distributed Systems at Scale
- Event-Driven Architecture (Kafka / Kafka Streams)
- High-concurrency microservices (Node.js / Python)
- GraphQL federation & data-layer optimization
- Sub-second telemetry & streaming pipelines
- Multi-tenant Kubernetes environments
Cost & Performance Governance
- Reduced cloud operational spend by $67K/month via container rightsizing & caching
- Engineered burst-resilient systems under financial transaction workloads
- Designed scalable identity layers (OAuth2/JWT) across distributed domains
Stackforce AI infers this person is a SaaS and Fintech expert specializing in high-scale distributed systems and AI-native infrastructure.
Experience: 7 yrs 9 mos
Skills
- Distributed Systems
- Event-driven Architecture
- AI-native Infrastructure
- SaaS
- Microservices Architecture
- Mobile Development
- Enterprise Software Development
- Front-end Development
Career Highlights
- Architected AI-native platforms for 50M+ users.
- Achieved 200% performance gains in data access layers.
- Reduced operational costs by $67K/month through optimization.
Work Experience
Upwork
Senior Software Engineer, AI-Scale Platform Architecture (2 yrs 2 mos)
Logistics and Last-Mile AI Platform (2 yrs 4 mos)
U.S. Bank
Lead Engineer – AI Voice & Distributed Banking Systems (2 yrs 8 mos)
GRAX
Fullstack Engineer – High-Scale Distributed SaaS Platform (1 yr 2 mos)
CoStar
Senior Software Engineer – Distributed Identity & Microservices (2 yrs 9 mos)
Fidelity Investments
Fraud Detection Engineer – Streaming & Real-Time Systems (8 mos)
Verizon
Full-stack React.js Developer (1 yr 2 mos)
Bank of America
Senior Mobile Engineer (3 mos)
TD Ameritrade
CSS3 Lead Developer (1 yr 4 mos)
Elance
Freelance Back-End and Front-End Developer (3 yrs 8 mos)