links · people · groups · tags | My: links · tags · groups · watchlists · notes login · sign up now! | help · blog
Simpy simpy
 
Home / people / otis / groups / Lucene & Solr
1 - 10 of 55  next »  
 
by otis 2008-04-29 22:34 lucene · index · search · shard · grid · distributed search · hadoop · java · master · slave
http://katta.wiki.sourceforge.net/
by otis 2008-03-06 18:08 java · api · spelling
http://jaspell.sourceforge.net/
by benjamin 2008-02-25 14:09 scalability · scaling · lucene · search · hadoop · testimonial · solr · mapreduce · logging · logscan
http://www.highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data
geoLucene is an extension of Lucene that allows to effectively index and search documents that contain locational information (longitude/latitude). It uses R-tree as a spacial index.
by otis 2008-02-17 03:12 lucene · java · library · search · index · geolocation · geocode · query · spatial
https://sourceforge.net/projects/geolucene/
A lucene extension providing geographical based searching - boundary box and radius queries
by otis 2008-02-17 00:33 lucene · search · index · java · api · geocode · geolocation
http://www.nsshutdown.com/viewcvs/viewcvs.cgi/locallucene/
Aligned multilingual corpus JRC-ACQUIS . The dataset contains resources for the following languages: Bulgarian, Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Hungarian, Italian, Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Romanian, Slovak, Slovene, Swedish.
by otis 2008-01-27 00:45 corpus · language · NLP · multilingual
http://wt.jrc.it/lt/Acquis/
Semantic Vector indexes, created by applying a Random Projection algorithm to term-document matrices created using Apache Lucene. The package creates a WordSpace model, of the kind developed by Stanford University's Infomap Project and other researchers during the 1990s and early 2000s. Such models are designed to represent words and documents in terms of underlying concepts, and as such can be used for many semantic (concept-aware) matching tasks such as automatic thesaurus generation, knowledge representation, and concept matching. The Semantic Vectors package uses a Random Projection algorithm, a form of automatic semantic analysis, similar to Latent Semantic Analysis (LSA) and its variants like Probabilistic Latent Semantic Analysis (PLSA).
by otis 2008-01-14 02:06 semantic · LSA · PLSA · NLP · information retrieval · java · api
http://code.google.com/p/semanticvectors/
Java-based framework designed to support the development of applications for unsupervised machine learning tasks, with a particular focus on their application to text data
by otis 2008-01-12 11:35 java · api · cluster · library · NLP · information retrieval
http://mlg.ucd.ie/content/view/18/
LETOR is a benchmark dataset for research on learning to rank, released by Microsoft Research Asia.
by otis 2008-01-09 01:56 information retrieval · evaluation · tool · rank
http://research.microsoft.com/users/LETOR/
Chinese Segmentation Bases on Apache Lucene Analyzer
by otis 2008-01-05 22:15 java · api · lucene · chinese · analysis · segment · index · search
http://code.google.com/p/hickwall-analyzer/
1 - 10 of 55  next »