Rahul Soni — personal ledgerLedger · Selected work
Selected work · Shipped & in build

The proof — products owned end to end.

Enterprise work here is described by what it does. Clients and employer stay private; what I build independently, I'll name. Read it less as a portfolio than as one repeatable move: hand me a hard, expensive problem and it comes back live, accurate, and handed off clean.

A / Shipped for enterpriseIn production, in daily use

Systems that replace the old way — and pay for themselves.

The shape of the impact is consistent: thousands of human hours automated, six-figure ($) savings in the first months, and value that compounds into the millions as the system runs. These are the ones I can describe — there's more that isn't public yet.

01 Enterprise product · Shipped

An AI contract-intelligence platform

Extracts the clauses that matter, flags the risk, and runs the review work that used to be done by hand, across tens of thousands of documents.

Saving six figures ($) in costs from its first months, and still compounding.

Thousands
Of manual hours removed from the business. Analysts used to read every page.
02 Enterprise product · Flagship

An intelligent document-extraction platform

Turns the messiest real-world documents into clean, structured data. Multi-tenant, live, and accurate enough to trust without a human in the loop.

Months daysNew-client onboarding, through AI-assisted automated configuration.
MillionsOf documents the architecture is built to process.

Treated by leadership as an emerging revenue line.

97–98% accuracy
Up from the ~60% the legacy system ran at. Live today.
03 Enterprise product · Founder build

A multi-language media & social platform

Social-media management across platforms and languages, with a real media pipeline at its core: transcription, translation, and voice dubbing. My own venture, built end to end. Brief on purpose.

Built & run as founder
Multi-language
media
Cross-platform publishing on one side, a real dubbing pipeline on the other.
04 Enterprise · Shipped

A proposal-automation engine

Retrieval over a library of historical proposals plus semantic search, so responses to complex requests get drafted in context, grounded in prior work instead of a blank page.

05 Enterprise · Shipped

A high-throughput ingestion pipeline

A streaming, staged-concurrency redesign of a document-processing path: download, CPU work, and API calls pipelined instead of run in series. About 30–40× the throughput of what it replaced, on the same hardware.

I own the entire arc — problem, architecture, build, ship.Then I stay on as team lead and architect, taking each product into maintenance and scale. Every product on this page started as an empty repo and a hard question.
B / Built independentlyWhat I build when no one's asking

The instinct doesn't switch off when no one's paying.

Yantra: my own AI development environment, built from scratch to keep agents, projects, and context coherent across dozens of parallel workstreams.

A full, signed, notarised app: 600+ commits, 900+ tests, my daily driver for months. Not a plugin, not a fork. Other developers have asked for early access.

Stenograph: a private knowledge engine. Deep research grounded in your own context and projects, with an MCP layer that hands the answers straight to the agents I already run.

On track to process well over a million hours of media by late 2026. Research as callable infrastructure, not a portal you visit. Built solo.

03 Project · Real-time AI

A real-time voice-AI agent

A telephony agent that holds a live conversation: streaming speech-to-text into an LLM and back to voice, with voicemail detection and barge-in. It handles being interrupted, and it answers in under half a second.

<500ms
End-to-end. Fast enough to feel like a real call.
04Self-funded R&D · Compute at scale

A large-scale video-understanding pipeline

Self-funded research into understanding media at scale: ingesting tens of thousands of items, running vision-language models across my own GPU fleet, and benchmarking models head-to-head to find what actually works. The hardware involved is the kind most engineers only touch through an employer.

27K+Media items ingested · near-zero failures
9× H100GPU fleet, tuned for utilisation
250+ runsBenchmark experiments · 14 models
6–7×Latency cut by datacenter engineering

Want the detail behind any of these? Ask.

© 2026 Rahul Soni

This site is my own work, end to end — like everything it describes. · v1.3.0 · cf9ec25