Published on May 28, 2026
In Tajikistan, access to advanced technology has been limited, hindering the development of localized digital solutions. Traditional educational tools often fall short, leaving gaps in language and subject matter proficiency. The country has long sought a way to enhance learning and communication through technology.
Now, a significant shift is occurring with the launch of Soro, a family of Tajik-specialized conversational large language models. These models are trained exclusively on a vast 1.9-billion-token corpus of Tajik language materials, crafted to function effectively under the region’s constrained computing resources. This initiative marks a crucial step towards bridging the educational tech divide in Tajikistan.
Soro outperforms current models like Gemma 3 on newly established Tajik benchmarks focused on general knowledge and linguistic competence. Additionally, the project includes a newly developed suite of benchmarks to facilitate rigorous evaluations. These advancements not only boost the efficacy of educational resources but also ensure that Soro remains a competitive tool in broader applications, including English language tasks.
The ripple effects of Soro’s deployment are profound. Schools across Tajikistan are set to benefit from enhanced learning tools, fostering improved educational outcomes. Furthermore, its design enables low-memory deployment, making it a viable option for remote areas, there way for future expansions to meet nationwide educational needs.
Related News
- Tech Update
- Uber Expands Its Influence with Delivery Hero Acquisition
- AI Revolution Demands Robust Data Infrastructure
- EU’s Ambitious AI Data Center Initiative Faces Major Setbacks
- Taiwan Investigates Alleged Smuggling of NVIDIA Chips to China
- Dive into the Shadows: Discovering Stephen King's Early Works