Abstract: Machine translation, through software, automates the translation between languages and is integral for global communication. Challenging issues include language ambiguities, homographs, ...
HomoRich is the first large-scale, sentence-level Persian homograph dataset designed for grapheme-to-phoneme (G2P) conversion tasks. It addresses the scarcity of balanced, contextually annotated ...