Published on May 28, 2026
In Tajikistan, access to advanced technology has been limited, hindering the development of localized digital solutions. Traditional educational tools often fall short, leaving gaps in language and subject matter proficiency. The country has long sought a way to enhance learning and communication through technology.
Now, a significant shift is occurring with the launch of Soro, a family of Tajik-specialized conversational large language models. These models are trained exclusively on a vast 1.9-billion-token corpus of Tajik language materials, crafted to function effectively under the region’s constrained computing resources. This initiative marks a crucial step towards bridging the educational tech divide in Tajikistan.
Soro outperforms current models like Gemma 3 on newly established Tajik benchmarks focused on general knowledge and linguistic competence. Additionally, the project includes a newly developed suite of benchmarks to facilitate rigorous evaluations. These advancements not only boost the efficacy of educational resources but also ensure that Soro remains a competitive tool in broader applications, including English language tasks.
The ripple effects of Soro’s deployment are profound. Schools across Tajikistan are set to benefit from enhanced learning tools, fostering improved educational outcomes. Furthermore, its design enables low-memory deployment, making it a viable option for remote areas, there way for future expansions to meet nationwide educational needs.
Related News
- Google Unveils AI Pet Recognition, Sparking Privacy Concerns
- Data Centers Transform into AI Token Factories Amid Rising Demand
- Google Enhances Android with Caller ID Impersonation Alerts
- Claude-Share Launches: A New Era in Secure Code Sharing
- Adobe's $25 Billion Stock Buyback Amid AI-Fueled Market Concerns
- Goldman Sachs Predicts Surge of Mega IPOs by 2026