Zeerak Ahmed MDE ’18 runs Matnsaz, an initiative to better represent Urdu in technology. Building on his master’s thesis work on breakthrough Urdu keyboards for modern smartphones, Zeerak now runs a collaboration across continents and disciplines to build infrastructure for software developers around the world who want to support Urdu and other languages written in the Arabic script. In late 2019, the collaboration released Makhzan, an Urdu text corpus. A text corpus is the fundamental building block for training the artificial intelligence on which language processing capabilities are built. From autocorrect to search to linguistic analysis, Makhzan will support a diverse set of use cases with a high-quality, free-to-use data source. With the help of lessons learned from Makhzan, Zeerak is inching closer to a public beta of his Urdu keyboard.