K

Kollaikal Rupesh

Founder

San Francisco, California, United States3 yrs 6 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in real-time voice AI systems.
  • Proven track record in optimizing streaming inference architectures.
  • Significant contributions to open-source voice AI framework.
Stackforce AI infers this person is a Backend-heavy AI Engineer specializing in real-time voice systems and scalable architectures.

Contact

Skills

Core Skills

Real-time Audio ProcessingBackend SystemsSolution ArchitecturePythonLlm IntegrationMachine LearningData PipelinesData ManagementData ResearchData AnalysisDeep Learning

Other Skills

Amazon Web Services (AWS)Distributed SystemsSystems DesignLLM EvaluationCI/CDML ObservabilityConversational AIAPI DesignForward Deployed EngineeringRapid PrototypingTechnical Discovery & Requirements GatheringCross-functional CollaborationsClient Demo DeliveryGitRetrieval-Augmented Generation (RAG)

About

I build real-time voice AI systems where model behavior, latency, and robustness matter as much as raw capability. As a Founding AI Engineer, I specialize in low-latency streaming inference architectures for conversational systems, optimizing throughput, analyzing latency–accuracy tradeoffs, and designing reliability primitives that prevent silent failure in production inference pipelines. My work spans: • Optimizing streaming speech pipelines (ASR → LLM → TTS) for sub-second response time under production-level load • Instrumenting fine-grained latency profiling (P50/P95) and diagnosing stage-level bottlenecks • Building evaluation frameworks measuring WER stability, partial transcript drift, and real-world inference behavior • Designing failover and liveness mechanisms that improve uptime consistency in distributed AI systems • Fixing silent degradations due to real-world constraints like sample-rate mismatches and back-end variability I’ve contributed to the open-source voice AI framework Pipecat, including: • Fixing Smart Turn v3 prediction failures at non-16 kHz sample rates via high-fidelity resampling (soxr VHQ), improving endpoint detection accuracy from 59% to 94% on telephony test sets • Correcting pipeline processor duplication logic to ensure frame-chain integrity • Designing heartbeat timeout detection and failover strategy abstractions to improve liveness and resilience • Reducing high-frequency audio logging overhead under streaming workloads • All changes merged into main with full test coverage I’m interested in speech modeling, inference robustness, and scalable real-time reasoning systems that work reliably in messy real-world environments.

Experience

Wayline

Founding Engineer

Jan 2026Present · 2 mos · San Francisco, California, United States · On-site

Amazon Web Services (AWS)Real-Time Audio ProcessingBackend SystemsDistributed SystemsSystems DesignLLM Evaluation+4

Smallest.ai

Forward Deployed Engineer

Oct 2025Jan 2026 · 3 mos · San Francisco, California, United States

Real-Time Audio ProcessingSolution ArchitectureForward Deployed EngineeringRapid PrototypingTechnical Discovery & Requirements GatheringCross-functional Collaborations+1

Spanda ai inc.

AI Developer

Jun 2025Oct 2025 · 4 mos · San Francisco, California, United States

Client Demo DeliveryPythonGitRetrieval-Augmented Generation (RAG)Solution EngineeringLLM Integration

Sociosquares

ML Engineer (Practicum)

Oct 2024Jun 2025 · 8 mos · San Francisco, California, United States

  • Prev. Data Engineer
Python (Programming Language)PythonMachine LearningData PipelinesSQLML Observability+2

University of california, davis - graduate school of management

President, DSAC

Sep 2024Jul 2025 · 10 mos · San Francisco, California, United States

Zoominfo

Data Research & Enablement

Oct 2023Jul 2024 · 9 mos · Chennai, Tamil Nadu, India

Data ResearchScalable Data PipelinesPythonData PipelinesData ManagementBusiness Strategy+3

Saveetha school of engineering

Research Assistant

Aug 2022Apr 2023 · 8 mos · On-site

  • Conducted research on enhancing the accuracy of text summarization, focusing on the comparison of LSTM, SVM, Logistic Regression, and K-Nearest Neighbors algorithms. This project, inspired by a significant shift from print to digital news consumption post-COVID, was conducted under the guidance of Dr. S. Christy Melwyn. The research aimed to address the challenges posed by the decline in newspaper readership and the rise of digital platforms, with LSTM demonstrating superior performance in accuracy.
Python (Programming Language)Machine LearningDeep LearningStatistical Data Analysis

Orrc

2 roles

Product & Data Analyst

Promoted

Jun 2022Jun 2023 · 1 yr

Python

Data & Impact Analyst

Dec 2021Jun 2022 · 6 mos

Python (Programming Language)NumPyMachine LearningExtract, Transform, Load (ETL)SQLGo-to-Market Strategy+3

Education

University of California, Davis - Graduate School of Management

Master's degree — Business Analytics

Aug 2024Aug 2025

Saveetha School of Engineering

Bachelor's degree — Information Technology

Aug 2019Aug 2023

Stackforce found 100+ more professionals with Real-time Audio Processing & Backend Systems

Explore similar profiles based on matching skills and experience