Our motto is our goal, "Every Word in Every Language". Completely fulfilling this mission is impossible, but it sets the target toward which we aim. We aspire to merge all that is known and all that is knowable about language, in one data system that is free to everyone.
More than 7000 different languages are spoken around the world. Some languages have hundreds of millions of speakers and many dialects. Hundreds of endangered languages, meanwhile, might disappear in a generation. Fully documenting every expression from every language variation would require thousands of researchers and decades of work.
At Kamusi, we concentrate on ways to collect as much data for as many languages as possible, arrange that data precisely, and put the information at the service of students, the public, and the technology we use to communicate. We have games and other tools for people worldwide to systematically share what they know about their language, and we have a global partner network for experts to contribute their knowledge. We also integrate existing datasets whenever possible, though most languages do not have any meaningful available digital resources.
The Kamusi architecture supports, in principle, a complete matrix of human expression across time and space. This is a mission to Mars, an ambition to be attempted by pulling together people and resources in a concerted international collaboration. It is a hard, hard road ahead. We seek every word in every language, but we cannot even really say what is a word or what is a language, much less how we can pay for it all. However, we have the core systems in place, or ready to release when we can afford to manage the data influx, and additional components that are fully specified and just need funds to implement.
Please join us in turning our Quixotic quest into a reality that creates and unites as much information as possible for the world's languages! We greatly appreciate your financial contributions, and eagerly anticipate your participation as we roll out our language games to reach for the stars, one word at a time, toward every word in every language.
These are the languages for which we have datasets that we are actively working toward putting online. Languages that are Active for you to search are marked with "A" in the list below.
•A = Active language, aligned and searchable
•c = Data 🔢 elicited through the Comparative African Word List
•d = Data from independent sources that Kamusi participants align playing 🐥📊 DUCKS
•e = Data from the 🎮 games you can play on 😂🌎🤖 EmojiWorldBot
•P = Pending language, data in queue for alignment
•w = Data from 🔠🕸 WordNet teams
We are actively creating new software for you to make use of and contribute to the 🎓 knowledge we are bringing together. Learn about software that is ready for you to download or in development, and the unique data systems we are putting in place for advanced language learning and technology:
Our biggest struggle is keeping Kamusi online and keeping it free. We cannot charge money for our services because that would block access to the very people we most want to benefit, the students and speakers of languages around the world that are almost always excluded from information technology. So, we ask, request, beseech, beg you, to please support our work by donating as generously as you can to help build and maintain this unique public resource.
Answers to general questions you might have about Kamusi services.
We are building this page around real questions from members of the Kamusi community. Send us a question that you think will help other visitors to the site, and frequently we will place the answer here.