About

Hi :wave:

I’m a software developer with 4+ years of experience building scalable systems across SaaS, real-time communications, and AI.

I joined FreJun in 2022 as a founding engineer and helped build and scale the platform from the ground up, growing it to process 500K+ calls daily with global reach and high availability. In 2025, I built and launched Teler, a voice infrastructure platform for AI Agents. Since November 2025, I’ve been working independently as a contractor, primarily building solutions in the Enterprise and Voice AI space.

Being an obsessive learner and problem solver, I’m always on the lookout for exciting problems to work on. Please feel free to get in touch with me at [email protected]

Projects

Enterprise RAG

A fully offline, advanced RAG system designed for secure, enterprise deployments.

It integrates hybrid retrieval that combines BM25 ranking with dense vector search, uses high-throughput vLLM inference for scalable LLM and embedding workloads, and applies advanced chunking strategies to preserve semantic and structural fidelity. It also supports asynchronous, distributed file indexing with robust status tracking, and natively ingests a broad range of enterprise document formats, including PDF, DOCX, PPTX, XLSX, CSV, and Markdown.
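To give a feel for what combining BM25 with dense vector search can look like, here is a minimal sketch using reciprocal rank fusion (RRF). This is one common way to merge two rankings and is an assumption for illustration, not necessarily how this project fuses results. The function name, the constant `k=60`, and the document IDs are all hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) per list it appears in, so items
    ranked highly by either retriever rise to the top. k=60 is a commonly
    used default, assumed here for illustration.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical result lists from a BM25 index and a dense vector index.
bm25_ranking = ["doc_a", "doc_b", "doc_c"]
dense_ranking = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([bm25_ranking, dense_ranking])
print(fused)
```

Rank-based fusion sidesteps the fact that BM25 scores and vector similarities live on different scales, which is why it is a popular starting point for hybrid retrieval.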

This project inspired my recent blog post Enterprise RAG: Production-grade RAG goes offline.

Teler

Voice Infrastructure for AI Agents.

Built on a distributed microservice architecture, Teler delivers advanced capabilities including SIP trunking, bidirectional media streaming, and real-time STT/TTS. Apart from building Teler’s core VoIP infrastructure, I also wrote the Python SDK, many reference implementations, and the Docusaurus-based developer documentation.

Kinto (Open-source)

A generic JSON document store with sharing and synchronisation capabilities. Used at Mozilla.

My contributions include implementing metrics for cache and failed authentications.

Conduit

Real-time messaging backend powering RhythmiqCX.

I’ve written more about Conduit in this blog post.

Local Voice

Locally running AI voice assistant for low-latency conversations, powered by OpenAI Whisper, Ollama, and Piper TTS.

Delivers < 1s round-trip latency on consumer hardware owing to a hybrid design combining multiprocessing and asyncio.
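As a rough illustration of mixing multiprocessing with asyncio, the sketch below keeps an asyncio event loop free for I/O while CPU-bound work runs in worker processes. It is a generic pattern under stated assumptions, not Local Voice's actual code: `cpu_bound_step` is a hypothetical stand-in for something like speech recognition, and `handle_chunks` is an illustrative name.

```python
import asyncio
from concurrent.futures import ProcessPoolExecutor


def cpu_bound_step(chunk: bytes) -> int:
    """Hypothetical stand-in for CPU-heavy work (e.g. transcribing audio)."""
    return sum(chunk)


async def handle_chunks(chunks: list[bytes]) -> list[int]:
    """Run the CPU-bound step in worker processes without blocking the loop."""
    loop = asyncio.get_running_loop()
    with ProcessPoolExecutor(max_workers=2) as pool:
        # run_in_executor returns awaitables, so the event loop stays
        # responsive while the worker processes do the heavy lifting.
        pending = [loop.run_in_executor(pool, cpu_bound_step, c) for c in chunks]
        return await asyncio.gather(*pending)


if __name__ == "__main__":
    print(asyncio.run(handle_chunks([b"\x01\x02", b"\x0a"])))
```

Separate processes avoid contention on the interpreter lock for compute-heavy stages, while asyncio handles the many small waits (audio I/O, streaming) cheaply in a single thread.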

VoIP Dialer

A Chrome extension that lets users make and receive calls directly from the browser.

Currently serves 5000+ daily active users. Powered by WebRTC and SIP.js.

© Robin. Some rights reserved.
