Kollaikal Rupesh — Founder

I build real-time voice AI systems where model behavior, latency, and robustness matter as much as raw capability. As a Founding AI Engineer, I specialize in low-latency streaming inference architectures for conversational systems, optimizing throughput, analyzing latency–accuracy tradeoffs, and designing reliability primitives that prevent silent failure in production inference pipelines. My work spans: • Optimizing streaming speech pipelines (ASR → LLM → TTS) for sub-second response time under production-level load • Instrumenting fine-grained latency profiling (P50/P95) and diagnosing stage-level bottlenecks • Building evaluation frameworks measuring WER stability, partial transcript drift, and real-world inference behavior • Designing failover and liveness mechanisms that improve uptime consistency in distributed AI systems • Fixing silent degradations due to real-world constraints like sample-rate mismatches and back-end variability I’ve contributed to the open-source voice AI framework Pipecat, including: • Fixing Smart Turn v3 prediction failures at non-16 kHz sample rates via high-fidelity resampling (soxr VHQ), improving endpoint detection accuracy from 59% to 94% on telephony test sets • Correcting pipeline processor duplication logic to ensure frame-chain integrity • Designing heartbeat timeout detection and failover strategy abstractions to improve liveness and resilience • Reducing high-frequency audio logging overhead under streaming workloads • All changes merged into main with full test coverage I’m interested in speech modeling, inference robustness, and scalable real-time reasoning systems that work reliably in messy real-world environments.

Stackforce AI infers this person is a Backend-heavy AI Engineer specializing in real-time voice systems and scalable architectures.

Location: San Francisco, California, United States

Experience: 3 yrs 8 mos

Skills

Real-time Audio Processing
Backend Systems
Solution Architecture
Python
Llm Integration
Machine Learning
Data Pipelines
Data Management
Data Research
Data Analysis
Deep Learning

Career Highlights

Expert in real-time voice AI systems.
Proven track record in optimizing streaming inference architectures.
Significant contributions to open-source voice AI framework.

Work Experience

Wayline

Founding Engineer (4 mos)

smallest.ai

Forward Deployed Engineer (3 mos)

Spanda AI Inc.

AI Developer (4 mos)

SocioSquares

ML Engineer (Practicum) (8 mos)

University of California, Davis - Graduate School of Management

President, DSAC (10 mos)

ZoomInfo

Data Research & Enablement (9 mos)

Saveetha School of Engineering

Research Assistant (8 mos)

ORRC

Product & Data Analyst (1 yr)

Data & Impact Analyst (6 mos)

Education

Master's degree at University of California, Davis - Graduate School of Management

Bachelor's degree at Saveetha School of Engineering

Kollaikal Rupesh

Founder

San Francisco, California, United States3 yrs 8 mos experience

AI EnabledAI ML Practitioner

Key Highlights

Expert in real-time voice AI systems.
Proven track record in optimizing streaming inference architectures.
Significant contributions to open-source voice AI framework.

Stackforce AI infers this person is a Backend-heavy AI Engineer specializing in real-time voice systems and scalable architectures.

Contact

rupesh02k@gmail.com LinkedIn

Skills

Core Skills

Real-time Audio ProcessingBackend SystemsSolution ArchitecturePythonLlm IntegrationMachine LearningData PipelinesData ManagementData ResearchData AnalysisDeep Learning

Other Skills

Amazon Web Services (AWS)Distributed SystemsSystems DesignLLM EvaluationCI/CDML ObservabilityConversational AIAPI DesignForward Deployed EngineeringRapid PrototypingTechnical Discovery & Requirements GatheringCross-functional CollaborationsClient Demo DeliveryGitRetrieval-Augmented Generation (RAG)

About

Experience

3 yrs 8 mos

Total Experience

9 mos

Average Tenure

4 mos

Current Experience

Wayline

Founding Engineer

Jan 2026 – Present · 4 mos · San Francisco, California, United States · On-site

Amazon Web Services (AWS)Real-Time Audio ProcessingBackend SystemsDistributed SystemsSystems DesignLLM Evaluation+4

Smallest.ai

Forward Deployed Engineer

Oct 2025 – Jan 2026 · 3 mos · San Francisco, California, United States

Real-Time Audio ProcessingSolution ArchitectureForward Deployed EngineeringRapid PrototypingTechnical Discovery & Requirements GatheringCross-functional Collaborations+1

Spanda ai inc.

AI Developer

Jun 2025 – Oct 2025 · 4 mos · San Francisco, California, United States

Client Demo DeliveryPythonGitRetrieval-Augmented Generation (RAG)Solution EngineeringLLM Integration

Sociosquares

ML Engineer (Practicum)

Oct 2024 – Jun 2025 · 8 mos · San Francisco, California, United States

Prev. Data Engineer

Python (Programming Language)PythonMachine LearningData PipelinesSQLML Observability+2

University of california, davis - graduate school of management

President, DSAC

Sep 2024 – Jul 2025 · 10 mos · San Francisco, California, United States

Zoominfo

Data Research & Enablement

Oct 2023 – Jul 2024 · 9 mos · Chennai, Tamil Nadu, India

Data ResearchScalable Data PipelinesPythonData PipelinesData ManagementBusiness Strategy+3

Saveetha school of engineering

Research Assistant

Aug 2022 – Apr 2023 · 8 mos · On-site

Conducted research on enhancing the accuracy of text summarization, focusing on the comparison of LSTM, SVM, Logistic Regression, and K-Nearest Neighbors algorithms. This project, inspired by a significant shift from print to digital news consumption post-COVID, was conducted under the guidance of Dr. S. Christy Melwyn. The research aimed to address the challenges posed by the decline in newspaper readership and the rise of digital platforms, with LSTM demonstrating superior performance in accuracy.

Python (Programming Language)Machine LearningDeep LearningStatistical Data Analysis