links · people · groups · tags | My: links · tags · groups · watchlists · notes login · sign up now! | help · blog
Simpy simpy
 
Search Everyone: "java",

Top "java" experts: dapperdanman, grexe, cetoli, anjanb, sheepdog, geirarnesen,

Groups about "java": JAVA, java, Java, Java Developers, Web Design, programming,

1 - 50 of 249 next »   Watch otis
 
A java library for removal of chrome / boilerplate from documents like web pages
by otis 2009-12-03 23:16 java · library · api · information extraction · software
http://code.google.com/p/boilerpipe/ - cached - mail it - history
by otis 2009-12-03 13:02 java · software · library · api · machine learning
http://java-ml.sourceforge.net/ - cached - mail it - history
The purpose of this project is to develop a set of reusable Java components that implement functionality common to any web crawler. These components would benefit from collaboration among various existing web crawler projects, and reduce duplication of effort.
by otis 2009-12-01 22:33 crawl · robots.txt · fetch · java · software · information extraction
http://code.google.com/p/crawler-commons/ - cached - mail it - history
by otis 2009-11-27 23:05 performance · java · logging · statistics · chart · software · library
http://perf4j.codehaus.org/ - cached - mail it - history
List and Javadoc of all JE-BDB properties
by otis 2009-11-20 11:23 bdb · properties · configure · java · database · performance · tuning · software · javadoc
http://www.oracle.com/technology/documentation/berkeley-db/je/java/constant-values.html - cached - mail it - history
by otis 2009-11-18 12:26 hbase · architecture · java · hadoop
http://www.larsgeorge.com/2009/10/hbase-architecture-101-storage.html - cached - mail it - history
by otis 2009-11-11 21:27 open calais · screenshot · java · howto · tutorial · video
http://philippeadjiman.com/blog/2009/09/16/open-calais-from-java-with-eclipse-extract-entities-facts-and-events-in-4-min... - cached - mail it - history
by otis 2009-11-11 21:25 eclipse · java · ide · video · tips · shortcuts · howto · tutorial · screencast
http://philippeadjiman.com/blog/2009/10/11/5-video-tutorials-of-small-to-killer-eclipse-shortcuts/ - cached - mail it - history
Google's Java libraries
by otis 2009-11-11 20:56 java · library · api · collection · concurrent · google
http://code.google.com/p/guava-libraries/ - cached - mail it - history
by otis 2009-11-02 13:33 NLP · information retrieval · computational linguistics · java · software · api · library
http://herd.ida.liu.se:8180/nlpfarm/ - cached - mail it - history
log4j appender for Scribe
by otis 2009-10-31 01:31 scribe · logging · java · software
http://github.com/alexlod/scribe-log4j-appender - cached - mail it - history
MaltParser is a system for data-driven dependency parsing, which can be used to induce a parsing model from treebank data and to parse new data using an induced model.
by otis 2009-10-29 16:20 machine learning · parse · computational linguistics · NLP · java · software · api · library
http://maltparser.org/ - cached - mail it - history
by otis 2009-10-27 15:29 bdb · howto · tutorial · reference · database · java · library
http://www.oracle.com/technology/documentation/berkeley-db/je/GettingStartedGuide/backuprestore.html - cached - mail it - history
A collection of extremely large matrix decomposition algorithm implementations, in Java.
by otis 2009-09-11 16:20 matrix · library · api · java · software
http://code.google.com/p/decomposer/ - cached - mail it - history
OpenGrok is a fast and usable source code search and cross reference engine. It helps you search, cross-reference and navigate your source tree. It can understand various program file formats and version control histories like Mercurial, Git, SCCS, RCS, CVS, Subversion, Teamware, ClearCase, Perforce and Bazaar. In other words it lets you grok (profoundly understand) the open source, hence the name OpenGrok. It is written in Java.
by otis 2009-08-26 03:45 code · source code · search · browse · reference · java · subversion
http://www.opensolaris.org/os/project/opengrok/ - cached - mail it - history
A simple, asynchronous, single-threaded memcached client written in java.
by otis 2009-08-23 04:31 java · memcached · client
http://code.google.com/p/spymemcached/ - cached - mail it - history
by otis 2009-08-12 23:47 asf · apache · maven · repository · jar · java
https://repository.apache.org/index.html - cached - mail it - history
Galago is a toolkit for experimenting with text search. It is based on small, pluggable components that are easy to replace and change, both during indexing and during retrieval. It includes TupleFlow, which is a distributed computation framework like MapReduce or Dryad. TupleFlow manages the difficult parts of processing text: serializing data, sorting it, and distributing processing. The IndexReader and IndexWriter classes manage storing key/value pairs like inverted lists. This makes it possible to make your own kinds of index structures without starting from scratch.
by otis 2009-08-12 16:01 java · software · search · library · information retrieval · distributed computing
http://www.galagosearch.org/ - cached - mail it - history
by otis 2009-08-10 14:59 java · pattern · design · software · builder · object
http://forums.amd.com/devblog/blogpost.cfm?threadid=108340&catid=313 - cached - mail it - history
This article is focused on Distributed Cache implementations (i.e. Java-based implementations of in-memory Data-Structures – which are backed by a Data-Source such as a Relational Database)
by otis 2009-07-13 10:01 terracotta · cache · distribute · software · share · java
http://javamuse.blogspot.com/2009/06/so-you-want-terracotta-based.html - cached - mail it - history
by otis 2009-06-25 14:56 keys · hash table · client · java · library · software · database
http://code.google.com/p/tokyotyrant-java/ - cached - mail it - history
Hazelcast is an open source clustering and highly scalable data distribution platform for Java.
by otis 2009-05-26 14:55 distributed filesystem · database · storage · java · api · library
http://www.hazelcast.com/ - cached - mail it - history
by otis 2009-05-24 21:31 rdf · solr · software · java · search · information retrieval
http://fgiasson.com/blog/index.php/2009/04/29/rdf-aggregates-and-full-text-search-on-steroids-with-solr/ - cached - mail it - history
Bixo is an open source Java crawler that runs as a series of Cascading pipes. It is designed to be used as a tool for creating customized crawlers, thus each Cascading pipe implements a discrete operation. By building a customized Cascading pipe assembly, you can quickly create specialized crawlers that are optimized for a particular use case.
by otis 2009-05-24 00:38 crawl · fetch · spider · java · MapReduce · katta
http://bixo.101tec.com/ - cached - mail it - history
Hazelcast is a clustering and highly scalable data distribution platform for Java.
by otis 2009-05-05 00:18 java · software · distribute · discover · cluster · library
http://code.google.com/p/hazelcast/ - cached - mail it - history
The Eclipse Memory Analyzer is a fast and feature-rich Java heap analyzer that helps you find memory leaks and reduce memory consumption.
by otis 2009-05-04 15:03 heap · memory · jvm · analysis · eclipse · java
http://www.eclipse.org/mat/ - cached - mail it - history
Explanation of JVM parameters for parallel and concurrent garbage collectors, their goals and effects
by otis 2009-04-30 11:25 java · jvm · garbage collection
http://forums.amd.com/devblog/blogpost.cfm?threadid=108148&catid=313 - cached - mail it - history
Good summary of how to analyze and size young and tenured heap generations
by otis 2009-04-30 11:19 java · jvm · garbage collection · performance · memory · heap
http://forums.amd.com/devblog/blogpost.cfm?threadid=108618&catid=313 - cached - mail it - history
by otis 2009-04-29 18:13 api · library · software · java · list
http://www.publicobject.com/glazedlists/ - cached - mail it - history
by otis 2009-04-18 23:11 hadoop · database · java · software
http://www.cloudera.com/blog/2009/03/06/database-access-with-hadoop/ - cached - mail it - history
LSql is a command-line tool written in Java that allows sql-like queries to run against a Lucene database. It can be run in interactive mode, or can automatically parse a list of commands from a file.
by otis 2009-04-15 16:00 lucene · sql · software · java
http://code.google.com/p/lucene-sql/ - cached - mail it - history
by otis 2009-04-14 13:11 maven · grails · java · software · development
http://www.grails.org/Maven+Integration - cached - mail it - history
Maven Tools for Grails is a set of maven artifacts used to ease the development of Grails Application using maven 2.
by otis 2009-04-14 12:22 maven · grails · plugin · java · software · development
http://forge.octo.com/maven/sites/mtg/ - cached - mail it - history
Application Assembler Plugin is a Maven plugin for generating scripts for starting java applications. All dependencies and the artifact of the project itself are placed in a generated Maven repository in a defined assemble directory. All artifacts (dependencies + the artifact from the project) are added to the classpath in the generated bin scripts.
by otis 2009-03-27 16:22 java · application · startup · script · maven
http://mojo.codehaus.org/appassembler/appassembler-maven-plugin/ - cached - mail it - history
REPLAY is an open source solution developed in java to manage the workflow of audiovisual lecture recordings from production in the classroom to distribution on various channels in an automated manner. In this, it also provides comprehensive functionalities for existing audiovisual archives, repositories or collections.
by otis 2009-03-24 12:57 audio · video · index · search · archive · free · software · java
http://www.replay.ethz.ch/ - cached - mail it - history
by otis 2009-03-20 22:23 wordnet · synonyms · api · library · java
http://lyle.smu.edu/~tspell/jaws/index.html - cached - mail it - history
Davisor Offisor converts popular but hard-to-read non-XML document formats (Word, PowerPoint, Excel, and HTML) into XML and further
by otis 2009-03-20 10:41 java · library · parse · filter · convert · pdf · xml · xhtml · word
http://www.davisor.com/offisor/ - cached - mail it - history
wayback is an open source java implementation of the The Internet Archive Wayback Machine .
by otis 2009-03-10 19:27 java · archive · software
http://archive-access.sourceforge.net/projects/wayback/ - cached - mail it - history
Gson is a Java library that can be used to convert Java Objects into its JSON representation. It can also be used to convert a JSON string to an equivalent Java object. Gson can work with arbitrary Java objects including pre-existing objects that you do not have source-code of.
by otis 2009-03-09 15:27 json · java · library · api · convert
http://code.google.com/p/google-gson/ - cached - mail it - history
Word-aligned Compression library for java
by otis 2009-03-02 11:59 java · api · library · compress · information retrieval · encode
http://code.google.com/p/javaewah/ - cached - mail it - history
UIMA NLP Components
by otis 2009-02-27 12:59 java · UIMA · pipeline · NLP · information retrieval · software
http://www.julielab.de/Resources/Software/Tools.html - cached - mail it - history
Lucas is a UIMA CAS consumer component which bridges the UIMA framework with the Lucene search engine library. Lucas maps CASes to lucene index documents according to a mapping file .
by otis 2009-02-27 12:57 java · UIMA · lucene · index · search · pipeline · software · information retrieval
https://www.coling.uni-jena.de/sites/lucas/index.html - cached - mail it - history
Set Operation implementations for SortedIntegerSegments for inverted list caching in search engines. The implementations also include P4Delta compression algorithm based DocIdSet for iterating over DocIdSets in a compressed form.
by otis 2009-02-09 01:25 lucene · search · index · compress · information retrieval · set · java
http://code.google.com/p/lucene-ext/ - cached - mail it - history
ClearTK is a toolkit for developing statistical natural language processing components in Java and is based on the Apache UIMA framework for text analysis.
by otis 2009-01-28 16:28 java · api · UIMA · NLP · statistics
http://code.google.com/p/cleartk/ - cached - mail it - history
VisualVM is a visual tool integrating several commandline JDK tools and lightweight profiling capabilities. Designed for both production and development time use, it further enhances the capability of monitoring and performance analysis for the Java SE platform
by otis 2009-01-26 16:37 java · monitor · jvm · garbage collection · heap · profile · analysis
https://visualvm.dev.java.net/ - cached - mail it - history
Protocol Buffers are a way of encoding structured data in an efficient yet extensible format. Google uses Protocol Buffers for almost all of its internal RPC protocols and file formats.
by otis 2009-01-25 23:54 format · serialize · network · java · api · library
http://code.google.com/p/protobuf/ - cached - mail it - history
wsmo4j is an API and a reference implementation for building Semantic Web Services and Semantic Business Process applications
by otis 2009-01-22 15:23 semantic · ontology · rdf · java · api
http://wsmo4j.sourceforge.net/ - cached - mail it - history
JLangDetect is a pure Java implementation of a language detector. It provides a toolkit for training language recognition, and a simple implementation of a detector.
by otis 2009-01-17 00:41 language · java · api · NLP · identification
http://www.jroller.com/melix/entry/nlp_in_java_a_language - cached - mail it - history
This implements RLE compressed bitset for Java. The RLE compression scheme is Word-Aligned Hybrid compression scheme from LBNL. This package implements the Word-Aligned Hybrid (WAH) compression scheme from LBNL, which compresses the bitsets without sacrificing performance. The code is derived from the original authors' FastBit software and pseudo-codes in the paper.
by otis 2009-01-10 00:59 bitmap · index · api · java · compress
http://code.google.com/p/compressedbitset/ - cached - mail it - history
Java Simon is a simple monitoring API that allows you to follow and better understand your application. Monitors (familiarly called Simons) are placed directly into your code and you can choose whether you want to count something or measure time/duration. Access to Simons is provided via static SimonManager.
by otis 2008-12-26 12:10 java · api · monitor · application · performance · software · jmx
http://code.google.com/p/javasimon/ - cached - mail it - history
1 - 50 of 249 next »  
Related Tags
 
- exclude ~ optional + require
Add Dates