/ 5 min read / project
Your 4K Context Window Might Only Be 300 Bengali Words
My Bengali tokenizer benchmark found that a 4K window can mean 2,786 words or 299 words depending on the tokenizer.
Systems Engineer
I build tools for search, audio, and infrastructure, then write about what I learn along the way. Mostly Rust, Go, and Python.
My Bengali tokenizer benchmark found that a 4K window can mean 2,786 words or 299 words depending on the tokenizer.
I built a security scanner that maps findings to the OWASP MCP Top 10. Here's what it found.
A single git push cross-posts to Dev.to and Hashnode, generates social content for X, LinkedIn, and Reddit, and tracks everything in a content ledger.
After years of building in silence, I'm finally writing about what I build and what I learn.