links · people · groups · tags | My: links · tags · groups · watchlists · notes login · sign up now! | help · blog
Simpy simpy
 
Search Everyone: "search",

Top "search" experts: neardeath, beckymc21, macroron, barjacob, glebarr, tobassam,

Groups about "search": Searches, search engine, edinburgh_search, searchengines, Quality Search Engine Optimization, Google Search Clinic,

1 - 50 of 111 next »   Watch otis
 
Log aggregation, parsing, and indexing
by otis 2009-11-21 20:51 logging · log analysis · index · search · server · parse · software
http://code.google.com/p/logstash/ - cached - mail it - history
YouSeer is an open source search engine framework, which was built on top of other open source components. YouSeer utilizes Hereitrix as a crawler and solr as an indexing system. The framework provides software to ingest the documents harvested by Heritrix into solr. The ingesting software is very flexible and allows for user-specific data extraction implementations. Further, YouSeer provides a simple interface to query the index and another interface to retrieve cached versions of the documents.
by otis 2009-11-19 13:18 crawl · index · search · Heritrix · nutch · information retrieval
http://youseer.sourceforge.net/ - cached - mail it - history
by otis 2009-11-05 14:51 health care · medicine · search · visual · dictionary
http://www.curehunter.com/public/dictionary.do - cached - mail it - history
Common English misspellings from Wikipedia 4107 misspellings as of 2009-10-29
by otis 2009-10-29 12:20 wikipedia · spell · english · language · search · information retrieval · NLP
http://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines - cached - mail it - history
by otis 2009-10-29 12:06 ajax · solr · javascript · search · widget
http://evolvingweb.github.com/ajax-solr/ - cached - mail it - history
Wunder's progressive reranking explanation
by otis 2009-10-22 12:41 search · information retrieval · rank · score
http://wunderwood.org/most_casual_observer/2007/04/progressive_reranking.html - cached - mail it - history
Sen is the first opensource morphological analyzer written in pure Java.
by otis 2009-10-16 23:39 japanese · morphology · analysis · lucene · search · index · information retrieval · NLP · library
https://sen.dev.java.net/ - cached - mail it - history
OpenGrok is a fast and usable source code search and cross reference engine. It helps you search, cross-reference and navigate your source tree. It can understand various program file formats and version control histories like Mercurial, Git, SCCS, RCS, CVS, Subversion, Teamware, ClearCase, Perforce and Bazaar. In other words it lets you grok (profoundly understand) the open source, hence the name OpenGrok. It is written in Java.
by otis 2009-08-26 03:45 code · source code · search · browse · reference · java · subversion
http://www.opensolaris.org/os/project/opengrok/ - cached - mail it - history
by otis 2009-08-12 22:24 hbase · katta · solr · social media · shard · search · scalability
http://www.slideshare.net/lusciouspear/building-a-business-on-hadoop-hbase-and-open-source-distributed-computing?src=rel... - cached - mail it - history
Galago is a toolkit for experimenting with text search. It is based on small, pluggable components that are easy to replace and change, both during indexing and during retrieval. It includes TupleFlow, which is a distributed computation framework like MapReduce or Dryad. TupleFlow manages the difficult parts of processing text: serializing data, sorting it, and distributing processing. The IndexReader and IndexWriter classes manage storing key/value pairs like inverted lists. This makes it possible to make your own kinds of index structures without starting from scratch.
by otis 2009-08-12 16:01 java · software · search · library · information retrieval · distributed computing
http://www.galagosearch.org/ - cached - mail it - history
Ivory is a Hadoop toolkit for Web-scale information retrieval research that features a retrieval engine based on Markov Random Fields
by otis 2009-08-12 15:56 hadoop · MapReduce · information retrieval · search
http://www.umiacs.umd.edu/~jimmylin/ivory/docs/index.html - cached - mail it - history
by otis 2009-08-10 13:32 search · saas · lucene · AWS · ec2 · amazon
http://www.searchblox.com/searchbloxami.html - cached - mail it - history
Zemberek is an open source, platform independent, general purpose Natural Language Processing library and toolset designed for Turkic languages, especially Turkish. Zemberek is officially used as spell checker in Open Office Turkish version and Turkish national Linux Distribution Pardus. Google Code will host Zemberek-2, Zemberek Corpus and Wordnet projects. These projects has Mozilla Public License.
by otis 2009-07-24 09:41 turkish · language · analysis · search · tokenizer · stemming · NLP · library
http://code.google.com/p/zemberek/ - cached - mail it - history
by otis 2009-07-10 16:03 lucene · filesystem · index · search · desktop · desktop search
http://regain.sourceforge.net/ - cached - mail it - history
Default dictionary break iterator for Chinese, Japanese, Korean
by otis 2009-06-03 00:15 CJK · japan · chinese · korean · computational linguistics · NLP · information retrieval · search · analysis · word segmentation
http://bugs.icu-project.org/trac/ticket/2229 - cached - mail it - history
by otis 2009-05-28 23:40 chinese · dictionary · information retrieval · search
http://www.mdbg.net/chindict/chindict.php?page=cc-cedict - cached - mail it - history
by otis 2009-05-28 14:34 search · software · python · django · lucene · solr · information retrieval
http://haystacksearch.org/ - cached - mail it - history
by otis 2009-05-28 14:27 solr · ruby · ruby on rails · search · information retrieval
http://outoftime.github.com/sunspot/ - cached - mail it - history
by otis 2009-05-24 21:31 rdf · solr · software · java · search · information retrieval
http://fgiasson.com/blog/index.php/2009/04/29/rdf-aggregates-and-full-text-search-on-steroids-with-solr/ - cached - mail it - history
by otis 2009-05-14 17:13 django · solr · python · information retrieval · search
http://code.google.com/p/django-solr-search/ - cached - mail it - history
by otis 2009-05-14 15:07 taxonomy · ontology · facet · NLP · search
http://www.ideaeng.com/tabId/98/itemId/199/Whats-the-difference-between-Taxonomies-and-Ontol.aspx - cached - mail it - history
A WordPress plugin that interacts with an instance of the Solr search engine. This plugin allows you to index pages and posts, perform advanced queries and enable faceting on fields such as tags, categories, and author. Adds special template tags so you can create your own custom result pages to match your theme. Configuration options allow you to select pages to ignore, features to enable/disable, and what type of result information you want output.
by otis 2009-04-22 10:02 solr · wordpress · blog · search · plugin
https://launchpad.net/solr4wordpress - cached - mail it - history
REPLAY is an open source solution developed in java to manage the workflow of audiovisual lecture recordings from production in the classroom to distribution on various channels in an automated manner. In this, it also provides comprehensive functionalities for existing audiovisual archives, repositories or collections.
by otis 2009-03-24 12:57 audio · video · index · search · archive · free · software · java
http://www.replay.ethz.ch/ - cached - mail it - history
Sedna is a free native XML database which provides a full range of core database services - persistent storage, ACID transactions, security, indices, hot backup. Flexible XML processing facilities include W3C XQuery implementation, tight integration of XQuery with full-text search facilities and a node-level update language.
by otis 2009-03-14 16:46 xml · database · xquery · full-text · search
http://modis.ispras.ru/sedna/ - cached - mail it - history
WikiXMLDB provides a way of querying Wikipedia with XQuery.
by otis 2009-03-14 16:41 wikipedia · xml · xquery · search · knowledge · structure · NLP · data mining
http://wikixmldb.dyndns.org/ - cached - mail it - history
by otis 2009-03-08 00:36 lucene · search · query expansion · information retrieval
http://grasia.fdi.ucm.es/jose/query-expansion/ - cached - mail it - history
Lucas is a UIMA CAS consumer component which bridges the UIMA framework with the Lucene search engine library. Lucas maps CASes to lucene index documents according to a mapping file .
by otis 2009-02-27 12:57 java · UIMA · lucene · index · search · pipeline · software · information retrieval
https://www.coling.uni-jena.de/sites/lucas/index.html - cached - mail it - history
by otis 2009-02-20 00:05 geolocation · geocode · geography · search · lucene · solr · latitude · longitude
http://www.gissearch.com/ - cached - mail it - history
by otis 2009-02-17 02:55 .net · solr · client · software · search · library · information retrieval
http://code.google.com/p/solrnet/ - cached - mail it - history
Set Operation implementations for SortedIntegerSegments for inverted list caching in search engines. The implementations also include P4Delta compression algorithm based DocIdSet for iterating over DocIdSets in a compressed form.
by otis 2009-02-09 01:25 lucene · search · index · compress · information retrieval · set · java
http://code.google.com/p/lucene-ext/ - cached - mail it - history
by otis 2009-02-04 16:01 search · search results · ui · interface · design · usability
http://patterntap.com/tap/collection/search - cached - mail it - history
by otis 2009-01-08 17:46 Daniel Tunkelang · information retrieval · facet · navigate · results · search · endeca · set · presentation
http://yahoo.hosted.panopto.com/CourseCast/Viewer/Default.aspx?id=6d0a6847-be51-4d29-8c1c-f961274b5343 - cached - mail it - history
by otis 2008-12-23 14:10 collocations · term · summary · NLP · information retrieval · search · keywords · key phrases
http://www.extractor.com/ - cached - mail it - history
WebLA is a Java package for handling Web Graphs, implementing popular algorithms such as PageRank, HITS, CoCitation Similarity and SimRank. It is of particular interest for research in Information Retrieval, since it provides a set of APIs (Application Programming Interfaces) that allow one to easily experiment with such algorithms.
by otis 2008-12-21 01:54 information retrieval · search · algorithm · pagerank · graph · api · library · java
http://webla.sourceforge.net/ - cached - mail it - history
by otis 2008-12-19 12:20 search · search engine · log analysis · log · query
http://glinden.blogspot.com/2008/11/finding-task-boundaries-in-search-logs.html - cached - mail it - history
by otis 2008-12-05 00:46 pubsub · prospective search · paper · reference · research · publish · subscribe · query · search
http://www.seas.upenn.edu/~svilen/publications/subscribe.pdf - cached - mail it - history
Database->Lucene command-line indexing tool
by otis 2008-11-17 13:13 lucene · database · index · search · command line
http://lab.cisti-icist.nrc-cnrc.gc.ca/cistilabswiki/index.php/LuSql - cached - mail it - history
by otis 2008-10-09 13:14 maven · java · search · repository
http://mvnrepository.com/ - cached - mail it - history
Flax is a powerful enterprise search solution platform, open source licensed under the GPL.
by otis 2008-09-25 00:41 search · open source · software · library
http://www.flax.co.uk/index.shtml - cached - mail it - history
Presents files in a calendar view, supports search by name or content and filtering/narrowing by file type.
by otis 2008-09-25 00:37 filesystem · calendar · search · desktop
http://iola.dk/nemo/ - cached - mail it - history
Recoll is a personal full text search tool for Unix/Linux.
by otis 2008-09-25 00:29 desktop search · search · linux · xapian · software
http://www.lesbonscomptes.com/recoll/ - cached - mail it - history
by otis 2008-09-25 00:24 desktop search · search · linux · software
http://beagle-project.org/Main_Page - cached - mail it - history
Strigi is a daemon which uses a very fast and efficient crawler that can index data on your harddrive. Indexing operations are performed without hammering your system, this makes Strigi the fastest and smallest desktop searching program. Strigi can index different file formats, including the contents of the archive files.
by otis 2008-09-25 00:10 desktop search · search · linux · application
http://strigi.sourceforge.net/ - cached - mail it - history
by otis 2008-09-24 13:37 lucene · search · index · distributed search · facet · autocomplete · autosuggest · java · software · spell
http://www.statsbiblioteket.dk/summa/features-text-in-english - cached - mail it - history
by otis 2008-09-18 16:18 geo · search · lucene · solr
http://www.nsshutdown.com/projects/lucene/whitepaper/locallucene_v2.html - cached - mail it - history
Indexing package which makes it's best effort to abstract away which implementation of Indexer you are using by introducing the DocumentIndexer interface which don't use the propriatery lucene Document but instead uses java.util.Map.
by otis 2008-09-11 17:29 java · software · api · search · index · lucene · solr · cluster
http://dev.tailsweep.com/projects/haloe/ - cached - mail it - history
by otis 2008-08-18 13:30 search · search engine · information retrieval · vector space · linear algebra
http://mathdl.maa.org/mathDL/4/?pa=content&sa=viewDocument&nodeId=636&pf=1 - cached - mail it - history
Interviews with "Search Wizards" - people from the world of IR, NLP...
by otis 2008-06-11 12:06 search · people · interview · information retrieval · NLP
http://www.arnoldit.com/search-wizards-speak/ - cached - mail it - history
1 - 50 of 111 next »  
Related Tags
 
- exclude ~ optional + require
Add Dates