What words belong in a dictionary? We want them all - every word, in every language. To reach anywhere close to that goal, we combine a number of strategies:
Aligned open 🔢 data. Terms for about 50 languages have been matched to the English version of 🔠🕸 WordNet. Each English 💭 idea has a number, and each language in WordNet has matched its terms to the same number. We've assembled all those words in Kamusi Here!, but that is just a starting point because (a) most languages don't cover the full 🔠🕸 WordNet, (b) WordNet is far from including all concepts or terms in English, (c) every language has its own indigenous concepts that are not included in WordNet, and (d) 🔠🕸 WordNet only includes basic canonical forms, not the range of shapes that a term can take. Additional data, such as images, that has been aligned to 🔠🕸 WordNet elsewhere can also be imported automatically.
Non-aligned open 🔢 data. Many collections of words are 🆓 freely available, but are not already matched to enumerated concepts. If we have a 🔢 data point that something is equivalent to l-i-g-h-t, is that ⚖ (not heavy), 💡 (not dark), or 😆 (not serious)? We have designed 🐥📊 DUCKS so that our visitors can match data they recognize to the data we already have. This both lines up equivalent concepts and reveals concepts we don't yet have, whether in English or indigenous to other languages.
Data 🔢 from 👪 people. Our list of concepts lets us know which 💭 ideas we do not have expressions for in any language. We display that missing data as gold boxes our visitors can fill in, and we can publish that information once it achieves consensus among a 👪🔊 speaker community. Over time, our crowd techniques aim to fill as many gaps as possible on the way to collecting every word in every language.
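As a rough illustration of how numbered concepts tie these strategies together, the Python sketch below stores terms under a shared concept number, looks up equivalents across languages, and lists the gaps that remain to be filled. It is not Kamusi's actual software: the concept numbers, language codes, sample terms, and function names are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical sample rows: (concept_number, language_code, term).
# The numbers are invented for illustration; they are not real WordNet
# or Kamusi identifiers.
ALIGNED_TERMS = [
    (1, "eng", "light"),   # not heavy
    (1, "fra", "léger"),
    (2, "eng", "light"),   # not dark
    (2, "fra", "clair"),
    (3, "eng", "light"),   # not serious; no French term recorded in this sample
]


def build_index(rows):
    """Group terms as index[concept_number][language_code] -> list of terms."""
    index = defaultdict(lambda: defaultdict(list))
    for concept, lang, term in rows:
        index[concept][lang].append(term)
    return index


def translate(index, term, source, target):
    """For every concept that `term` expresses in `source`, return the
    terms recorded for that same concept in `target` (possibly none)."""
    results = {}
    for concept, by_lang in index.items():
        if term in by_lang.get(source, []):
            results[concept] = list(by_lang.get(target, []))
    return results


def find_gaps(index, languages):
    """List (concept_number, language_code) pairs that have no term yet."""
    gaps = []
    for concept in sorted(index):
        for lang in languages:
            if not index[concept].get(lang):
                gaps.append((concept, lang))
    return gaps


index = build_index(ALIGNED_TERMS)
matches = translate(index, "light", "eng", "fra")
gaps = find_gaps(index, ["eng", "fra"])
```

Because every term hangs off a concept number rather than off a spelling, the three senses of "light" stay separate, and an empty slot for a concept in a given language is exactly the kind of gap a contributor could be asked to fill.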
These are the languages for which we have datasets that we are actively working toward putting online. Languages that are Active for you to search are marked with "A" in the list below.
• A = Active language, aligned and searchable
• c = Data 🔢 elicited through the Comparative African Word List
• d = Data from independent sources that Kamusi participants align playing 🐥📊 DUCKS
• e = Data from the 🎮 games you can play on 😂🌎🤖 EmojiWorldBot
• P = Pending language, data in queue for alignment
• w = Data from 🔠🕸 WordNet teams
We are actively creating new software for you to make use of and contribute to the 🎓 knowledge we are bringing together. Learn about software that is ready for you to download or in development, and the unique data systems we are putting in place for advanced language learning and technology:
Our biggest struggle is keeping Kamusi online and keeping it free. We cannot charge money for our services because that would block access to the very people we most want to benefit, the students and speakers of languages around the world that are almost always excluded from information technology. So we ask, request, beseech, and beg you to please support our work by donating as generously as you can to help build and maintain this unique public resource.
Answers to general questions you might have about Kamusi services.
We are building this page around real questions from members of the Kamusi community. Send us a question that you think will help other visitors to the site, and we will often post the answer here.