Zemberek is an open-source, platform-independent, general-purpose Natural Language Processing library and toolset designed for Turkic languages, especially Turkish.
Zemberek is officially used as the spell checker in the Turkish version of OpenOffice and in Pardus, the Turkish national Linux distribution. Google Code will host the Zemberek-2, Zemberek Corpus, and Wordnet projects. These projects are released under the Mozilla Public License.
by otis, 2009-07-24 09:41
turkish · language · analysis · search · tokenizer · stemming · NLP · library