GPU Poverty and the Escape to 'Framework-less'
Late last year I was building a multi-model agentic pipeline. Not a demo — something I wanted to actually run: audio in, Whisper for transcription, a small intent classifier, a RAG retrieval step, and finally Llama 3.1 8B for the response. Five models, one machine. The GPU I had was a single RTX 4090 with 24GB of VRAM. That should’ve been enough. Spoiler: the way existing inference stacks work, it wasn’t....
From 'Very Fast' to '~Fastest': Helping Rust Unleash Compiler Optimizations
diff-match-patch-rs
A few years back, while building HandyTrain, we decided to build a collaborative content-creation feature. Among other things, we needed a text-synchronization library: a WASM version for the client and a high-performance library for our Go and Rust services. After some research we landed on the fantastic diff-match-patch algorithms; the diff part is an implementation of this paper (often called Myers' diff algorithm) by Eugene Myers....
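For a flavor of what the diff part does, here is a minimal from-scratch sketch of the greedy forward pass of Myers' O(ND) algorithm, computing the length of the shortest edit script between two byte sequences. This is an illustration of the underlying idea, not the diff-match-patch-rs API:

```rust
/// Length of the shortest edit script (insertions + deletions) between
/// `a` and `b`, via the greedy forward pass of Myers' O(ND) algorithm.
fn myers_distance(a: &[u8], b: &[u8]) -> usize {
    let n = a.len() as isize;
    let m = b.len() as isize;
    let max = n + m;
    let offset = max + 1; // shift diagonal index k into non-negative range
    // v[offset + k] = furthest x reached on diagonal k (where k = x - y)
    let mut v = vec![0isize; (2 * max + 3) as usize];
    for d in 0..=max {
        let mut k = -d;
        while k <= d {
            // Decide whether this step extends from an insertion (move down
            // in b) or a deletion (move right in a).
            let mut x = if k == -d
                || (k != d && v[(offset + k - 1) as usize] < v[(offset + k + 1) as usize])
            {
                v[(offset + k + 1) as usize] // down: insert a character from b
            } else {
                v[(offset + k - 1) as usize] + 1 // right: delete a character from a
            };
            let mut y = x - k;
            // Follow the "snake": free diagonal moves while characters match.
            while x < n && y < m && a[x as usize] == b[y as usize] {
                x += 1;
                y += 1;
            }
            v[(offset + k) as usize] = x;
            if x >= n && y >= m {
                return d as usize; // reached the end of both sequences
            }
            k += 2;
        }
    }
    max as usize
}
```

On the paper's classic example, `myers_distance(b"ABCABBA", b"CBABAC")` yields 5. The real library additionally recovers the actual edit operations (by tracing the path back) and layers semantic cleanup and patching on top, but the diagonal/snake search above is the core.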
Desktop App for Document QA with RAG
A DIY-style, step-by-step guide to building your own cutting-edge GenAI-powered document QA desktop app with RAG.
WASM: The `What`, `When` and `How`
I’ve been using WebAssembly (aka WASM) in production for a while to do some incredible stuff in the browser, things that would otherwise be prohibitively slow. Here are some real use-cases I’ve used WASM for: Running statistical processing on 10 million rows (roughly 50 columns per row) of CSV data in the browser. This feature required us to create a temporary playground of reports generated for our clients, where they could run their own analysis without the need for permanent storage or costly servers....
Voice Assistant Desktop App with LLaMA3 and Whisper in Rust
Step-by-step tutorial on building a desktop app to interface with the LLaMA3 LLM using text and audio instructions, in Rust.