JScrape is a simple yet powerful Java API for scraping (also known as screen scraping) data from a web page using XQuery. The API makes it easy to pull data from other sources and to maintain that data in a simple way.
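To make the idea of query-based scraping concrete, here is a minimal sketch. It does not use JScrape's own API, which is not shown in this description. It also swaps XQuery for XPath, because the JDK ships an XPath engine (javax.xml.xpath) but no XQuery engine. The class name ScrapeSketch, the helper extract, and the sample markup are all hypothetical, and the page is assumed to be well-formed XHTML.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration only: this is NOT JScrape's API.
class ScrapeSketch {

    // Returns the trimmed text of every node matched by the XPath expression.
    // Assumes the input is well-formed XML/XHTML without namespaces.
    static List<String> extract(String xhtml, String expression)
            throws XPathExpressionException {
        XPath xpath = XPathFactory.newInstance().newXPath();
        NodeList nodes = (NodeList) xpath.evaluate(
                expression,
                new InputSource(new StringReader(xhtml)),
                XPathConstants.NODESET);

        List<String> values = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            values.add(nodes.item(i).getTextContent().trim());
        }
        return values;
    }

    public static void main(String[] args) throws Exception {
        // Made-up sample page standing in for a fetched web page.
        String page = "<html><body><ul>"
                + "<li class=\"price\">10</li>"
                + "<li class=\"price\">20</li>"
                + "<li>n/a</li>"
                + "</ul></body></html>";

        // Select only the list items marked as prices.
        System.out.println(extract(page, "//li[@class='price']"));
    }
}
```

Keeping the query as a plain string separate from the Java code is what makes this style easy to maintain: when the page layout changes, only the query needs updating. Real-world HTML is often not well-formed, so an actual scraper would typically need to tidy the markup before querying it.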
A Java framework for extracting and querying full-text content and metadata from various information systems (e.g. file systems, web sites, mailboxes) and from the file formats (e.g. documents, images) that occur in these systems.