links · people · groups · tags | My: links · tags · groups · watchlists · notes login · sign up now! | help · blog
Simpy simpy
 
Search Everyone: "api",

Top "api" experts: toshiro, otis, jaydatema, robdefriesse, psychemedia, jagdot,

Groups about "api": Messaging, Algorithms, DevWeb ..., Web 2.0,

1 - 50 of 108 next »   Watch otis
 
A java library for removal of chrome / boilerplate from documents like web pages
by otis 2009-12-03 23:16 java · library · api · information extraction · software
http://code.google.com/p/boilerpipe/ - cached - mail it - history
by otis 2009-12-03 13:02 java · software · library · api · machine learning
http://java-ml.sourceforge.net/ - cached - mail it - history
Google's Java libraries
by otis 2009-11-11 20:56 java · library · api · collection · concurrent · google
http://code.google.com/p/guava-libraries/ - cached - mail it - history
C++ (but has Java API), GPL
by otis 2009-11-02 13:51 information retrieval · NLP · software · library · api
http://www.lsi.upc.edu/~nlp/freeling/ - cached - mail it - history
by otis 2009-11-02 13:33 NLP · information retrieval · computational linguistics · java · software · api · library
http://herd.ida.liu.se:8180/nlpfarm/ - cached - mail it - history
MaltParser is a system for data-driven dependency parsing, which can be used to induce a parsing model from treebank data and to parse new data using an induced model.
by otis 2009-10-29 16:20 machine learning · parse · computational linguistics · NLP · java · software · api · library
http://maltparser.org/ - cached - mail it - history
A collection of extremely large matrix decomposition algorithm implementations, in Java.
by otis 2009-09-11 16:20 matrix · library · api · java · software
http://code.google.com/p/decomposer/ - cached - mail it - history
Hazelcast is an open source clustering and highly scalable data distribution platform for Java.
by otis 2009-05-26 14:55 distributed filesystem · database · storage · java · api · library
http://www.hazelcast.com/ - cached - mail it - history
Distributed and persistent key-value database
by otis 2009-05-26 12:42 keys · values · database · hash table · server · api · distribute · storage
http://opensource.plurk.com/LightCloud/ - cached - mail it - history
Redis is a key-value database. It is similar to memcached but the dataset is not volatile, and values can be strings, exactly like in memcached, but also lists and sets with atomic operations to push/pop elements.
by otis 2009-05-26 12:40 keys · values · database · hash table · server · api
http://code.google.com/p/redis/ - cached - mail it - history
Tokyo Tyrant is a package of network interface to the DBM called Tokyo Cabinet. Though the DBM has high performance, you might bother in case that multiple processes share the same database, or remote processes access the database. Thus, Tokyo Tyrant is provided for concurrent and remote connections to Tokyo Cabinet. It is composed of the server process managing a database and its access library for client applications.
by otis 2009-05-26 12:37 keys · values · database · hash table · server · api
http://tokyocabinet.sourceforge.net/tyrantdoc/ - cached - mail it - history
WordnetAPI is a Java interface to the famous WordNet database of lexical relationships.
by otis 2009-05-15 10:07 wordnet · morphology · lexical · synonyms · NLP · information retrieval · library · api
http://code.google.com/p/wordnetapi/ - cached - mail it - history
AJAX Language API, you can translate and detect the language of blocks of text within a webpage using only Javascript. In addition, you can enable transliteration on any textfield or textarea in your web page.
by otis 2009-05-14 12:19 language · identification · detect · translation · ajax · api
http://code.google.com/apis/ajaxlanguage/ - cached - mail it - history
simple API to access Amazon's SQS, EC2, SimpleDB and DevPay LS web services. It uses the QUERY interfaces and patterns the methods after those available from Amazon in their SOAP client for these services.
by otis 2009-05-08 15:26 amazon · AWS · ec2 · api · software
http://code.google.com/p/typica/ - cached - mail it - history
by otis 2009-04-29 18:13 api · library · software · java · list
http://www.publicobject.com/glazedlists/ - cached - mail it - history
by otis 2009-03-20 22:23 wordnet · synonyms · api · library · java
http://lyle.smu.edu/~tspell/jaws/index.html - cached - mail it - history
Gson is a Java library that can be used to convert Java Objects into its JSON representation. It can also be used to convert a JSON string to an equivalent Java object. Gson can work with arbitrary Java objects including pre-existing objects that you do not have source-code of.
by otis 2009-03-09 15:27 json · java · library · api · convert
http://code.google.com/p/google-gson/ - cached - mail it - history
Word-aligned Compression library for java
by otis 2009-03-02 11:59 java · api · library · compress · information retrieval · encode
http://code.google.com/p/javaewah/ - cached - mail it - history
ClearTK is a toolkit for developing statistical natural language processing components in Java and is based on the Apache UIMA framework for text analysis.
by otis 2009-01-28 16:28 java · api · UIMA · NLP · statistics
http://code.google.com/p/cleartk/ - cached - mail it - history
Protocol Buffers are a way of encoding structured data in an efficient yet extensible format. Google uses Protocol Buffers for almost all of its internal RPC protocols and file formats.
by otis 2009-01-25 23:54 format · serialize · network · java · api · library
http://code.google.com/p/protobuf/ - cached - mail it - history
wsmo4j is an API and a reference implementation for building Semantic Web Services and Semantic Business Process applications
by otis 2009-01-22 15:23 semantic · ontology · rdf · java · api
http://wsmo4j.sourceforge.net/ - cached - mail it - history
JLangDetect is a pure Java implementation of a language detector. It provides a toolkit for training language recognition, and a simple implementation of a detector.
by otis 2009-01-17 00:41 language · java · api · NLP · identification
http://www.jroller.com/melix/entry/nlp_in_java_a_language - cached - mail it - history
This implements RLE compressed bitset for Java. The RLE compression scheme is Word-Aligned Hybrid compression scheme from LBNL. This package implements the Word-Aligned Hybrid (WAH) compression scheme from LBNL, which compresses the bitsets without sacrificing performance. The code is derived from the original authors' FastBit software and pseudo-codes in the paper.
by otis 2009-01-10 00:59 bitmap · index · api · java · compress
http://code.google.com/p/compressedbitset/ - cached - mail it - history
Java Simon is a simple monitoring API that allows you to follow and better understand your application. Monitors (familiarly called Simons) are placed directly into your code and you can choose whether you want to count something or measure time/duration. Access to Simons is provided via static SimonManager.
by otis 2008-12-26 12:10 java · api · monitor · application · performance · software · jmx
http://code.google.com/p/javasimon/ - cached - mail it - history
Evaluates arithmetic expressions represented as strings.
by otis 2008-12-21 01:56 java · api · library · math
http://code.google.com/p/arity/ - cached - mail it - history
WebLA is a Java package for handling Web Graphs, implementing popular algorithms such as PageRank, HITS, CoCitation Similarity and SimRank. It is of particular interest for research in Information Retrieval, since it provides a set of APIs (Application Programming Interfaces) that allow one to easily experiment with such algorithms.
by otis 2008-12-21 01:54 information retrieval · search · algorithm · pagerank · graph · api · library · java
http://webla.sourceforge.net/ - cached - mail it - history
by otis 2008-12-08 22:46 sentence detection · word segmentation · unicode · java · api · NLP · information retrieval · language
http://icu-project.org/userguide/boundaryAnalysis.html - cached - mail it - history
Anubis is a partition-aware tuple-space that can be used to share information reliably between hosts on a single site. It uses multicast IP and other techniques to share information across systems. Each host is notified of detected partition change events, as well as tuple insertion and removal.
by otis 2008-12-04 11:03 distributed computing · tuple space · java · api · communication · cluster · message
http://wiki.smartfrog.org/wiki/display/sf/Anubis - cached - mail it - history
by otis 2008-10-19 22:51 java · api · string · similar · metrics · computational linguistics · NLP · information retrieval · machine learning
http://www.dcs.shef.ac.uk/~sam/simmetrics.html - cached - mail it - history
Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
by otis 2008-10-17 13:41 machine learning · classification · java · api · software · information extraction · named entity extraction · cluster
http://mallet.cs.umass.edu/ - cached - mail it - history
Indexing package which makes it's best effort to abstract away which implementation of Indexer you are using by introducing the DocumentIndexer interface which don't use the propriatery lucene Document but instead uses java.util.Map.
by otis 2008-09-11 17:29 java · software · api · search · index · lucene · solr · cluster
http://dev.tailsweep.com/projects/haloe/ - cached - mail it - history
Parhely is an ORM system for HBase. It as well contains some utils such as AbstractShardedDao which can be used to shard MySQL db's for instance.
by otis 2008-09-11 17:28 java · software · api · storage · orm · hbase
http://dev.tailsweep.com/projects/parhely/ - cached - mail it - history
Mammatus's main focus is to deliver various cache and storage solutions which each addresses common challenges. All implementations follow the java.util.Map interface which makes it very easy to plugin.
by otis 2008-09-11 17:27 java · api · software · map · storage · cache
http://dev.tailsweep.com/projects/mammatus/ - cached - mail it - history
xSocket is an easy to use NIO-based library to build high performance, scalable network applications
by otis 2008-08-01 11:00 java · network · server · socket · api · NIO
http://xsocket.sourceforge.net/ - cached - mail it - history
OpenPipe/UIMA-like content processing pipeline software from Dieselpoint - open source
by otis 2008-06-19 18:23 pipe · pipeline · bus · dieselpoint · api · library · software · java
http://openpipeline.org/ - cached - mail it - history
by otis 2008-06-09 09:11 vietnam · word segmentation · language · NLP · java · api · library
http://jvnsegmenter.sourceforge.net/ - cached - mail it - history
by otis 2008-05-29 15:31 perl · module · library · api · NLP · information retrieval · ngram
http://ngram.sourceforge.net/ - cached - mail it - history
by otis 2008-05-20 11:35 histogram · library · java · api · software · chart
http://code.google.com/p/hist4j/ - cached - mail it - history
OAuth libraries
by otis 2008-03-14 17:23 oauth · api · software · library · client · sy
http://oauth.net/code/ - cached - mail it - history
by otis 2008-03-06 18:08 java · api · spell
http://jaspell.sourceforge.net/ - cached - mail it - history
Elefant (Efficient Learning, Large-scale Inference, and Optimization Toolkit) is an open source library for machine learning licensed under the Mozilla Public License (MPL). We aim at developing an open source machine learning platform which will become the platform of choice for prototyping and deploying machine learning algorithms.
by otis 2008-02-17 16:43 machine learning · api · library · software
http://elefant.developer.nicta.com.au/ - cached - mail it - history
A lucene extension providing geographical based searching - boundary box and radius queries
by otis 2008-02-17 00:33 lucene · search · index · java · api · geocode · geolocation
http://www.nsshutdown.com/viewcvs/viewcvs.cgi/locallucene/ - cached - mail it - history
open-source data mining solution
by otis 2008-02-01 01:26 data mining · java · api · library
http://rapid-i.com/content/blogcategory/10/69/lang,en/ - cached - mail it - history
A Clips-like Rule engine accessible from Java with the full power of Common Lisp
by otis 2008-01-21 12:23 java · api · rule engine · software
http://jlisa.sourceforge.net/ - cached - mail it - history
Semantic Vector indexes, created by applying a Random Projection algorithm to term-document matrices created using Apache Lucene. The package creates a WordSpace model, of the kind developed by Stanford University's Infomap Project and other researchers during the 1990s and early 2000s. Such models are designed to represent words and documents in terms of underlying concepts, and as such can be used for many semantic (concept-aware) matching tasks such as automatic thesaurus generation, knowledge representation, and concept matching. The Semantic Vectors package uses a Random Projection algorithm, a form of automatic semantic analysis, similar to Latent Semantic Analysis (LSA) and its variants like Probabilistic Latent Semantic Analysis (PLSA).
by otis 2008-01-14 02:06 semantic · LSA · PLSA · NLP · information retrieval · java · api
http://code.google.com/p/semanticvectors/ - cached - mail it - history
Java-based framework designed to support the development of applications for unsupervised machine learning tasks, with a particular focus on their application to text data
by otis 2008-01-12 11:35 java · api · cluster · library · NLP · information retrieval
http://mlg.ucd.ie/content/view/18/ - cached - mail it - history
Skynet is an open source Ruby implementation of Google’s Map/Reduce framework
by otis 2008-01-06 03:20 ruby · MapReduce · distributed computing · api · framework
http://skynet.rubyforge.org/ - cached - mail it - history
Chinese Segmentation Bases on Apache Lucene Analyzer
by otis 2008-01-05 22:15 java · api · lucene · chinese · analysis · segment · index · search
http://code.google.com/p/hickwall-analyzer/ - cached - mail it - history
OpenPipe is an open source scalable platform for manipulating a stream of documents. Pipelines are created from building bricks doing atomic operations on documents, like language detection, field manipulation, POS tagging, entity extraction or submitting the document to a search engine. OpenPipe was originally designed for indexing with the Solr full-text search engine, but could easily be adapted to any search engine or any other use case where documents need to be processed in various ways.
by otis 2007-12-26 18:09 java · library · api · solr · lucene · UIMA · document · process
http://openpipe.berlios.de/ - cached - mail it - history
by otis 2007-12-22 18:54 java · facebook · api · json
http://code.google.com/p/javabook/ - cached - mail it - history
1 - 50 of 108 next »  
Related Tags
 
- exclude ~ optional + require
Add Dates