Multilingual Embeddings for RAG: Why the Wrong Model Broke Our Hinglish Search
TL;DR — We built a RAG system over 109 podcast episodes (1.6 million words, mostly Hinglish). The first embedding model we tried, a popular English-first one, returned nonsense for queries…
