links · people · groups · tags | My: links · tags · groups · watchlists · notes login · sign up now! | help · blog
Simpy simpy
 
Home / people / otis / groups / Lucene & Solr
1 - 10 of 57  next »  
 
LSql is a command-line tool written in Java that allows sql-like queries to run against a Lucene database. It can be run in interactive mode, or can automatically parse a list of commands from a file.
by otis 2009-04-15 16:00 lucene · sql · software · java
http://code.google.com/p/lucene-sql/
by otis 2009-02-20 00:05 geolocation · geocode · geography · search · lucene · solr · latitude · longitude
http://www.gissearch.com/
by otis 2008-04-29 22:34 lucene · index · search · shard · grid · distributed search · hadoop · java · master · slave
http://katta.wiki.sourceforge.net/
by otis 2008-03-06 18:08 java · api · spell
http://jaspell.sourceforge.net/
by benjamin 2008-02-25 14:09 scalability · scaling · lucene · search · hadoop · testimonial · solr · mapreduce · logging · logscan
http://www.highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data
geoLucene is an extension of Lucene that allows to effectively index and search documents that contain locational information (longitude/latitude). It uses R-tree as a spacial index.
by otis 2008-02-17 03:12 lucene · java · library · search · index · geolocation · geocode · query · spatial
https://sourceforge.net/projects/geolucene/
A lucene extension providing geographical based searching - boundary box and radius queries
by otis 2008-02-17 00:33 lucene · search · index · java · api · geocode · geolocation
http://www.nsshutdown.com/viewcvs/viewcvs.cgi/locallucene/
Aligned multilingual corpus JRC-ACQUIS . The dataset contains resources for the following languages: Bulgarian, Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Hungarian, Italian, Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Romanian, Slovak, Slovene, Swedish.
by otis 2008-01-27 00:45 corpus · language · NLP · multilingual
http://wt.jrc.it/lt/Acquis/
Semantic Vector indexes, created by applying a Random Projection algorithm to term-document matrices created using Apache Lucene. The package creates a WordSpace model, of the kind developed by Stanford University's Infomap Project and other researchers during the 1990s and early 2000s. Such models are designed to represent words and documents in terms of underlying concepts, and as such can be used for many semantic (concept-aware) matching tasks such as automatic thesaurus generation, knowledge representation, and concept matching. The Semantic Vectors package uses a Random Projection algorithm, a form of automatic semantic analysis, similar to Latent Semantic Analysis (LSA) and its variants like Probabilistic Latent Semantic Analysis (PLSA).
by otis 2008-01-14 02:06 semantic · LSA · PLSA · NLP · information retrieval · java · api
http://code.google.com/p/semanticvectors/
Java-based framework designed to support the development of applications for unsupervised machine learning tasks, with a particular focus on their application to text data
by otis 2008-01-12 11:35 java · api · cluster · library · NLP · information retrieval
http://mlg.ucd.ie/content/view/18/
1 - 10 of 57  next »