links
·
people
·
groups
·
tags
| My:
links
·
tags
·
groups
·
watchlists
·
notes
login
·
sign up now!
|
help
·
blog
simpy
Search Everyone:
("metaindex") "datasets"
Sort by:
relevance
· freshness ·
popularity
Search:
multi
· tag ·
site
·
extension
1 - 20 of 116
next »
Public web crawler projects « Elastic Web Mining | Bixolabs
by
2 people
:
tsk
,
otis
... last on
2009-12-05
...
tags:
crawl
·
Ken
·
Krugler
·
blog article
·
dataset
·
web
http://bixolabs.com/2009/12/02/public-web-crawler-projects/
-
cached
-
mail it
-
history
» popurls.com - popular urls where everyone's flying to
by
277 people
:
seeker83
,
undergroundhiphop
,
groupwiseinc
... last on
2009-12-04
...
tags:
news
·
web2.0
·
social
·
links
·
aggregator
·
rss
·
web
·
bookmarks
·
imported
·
tools
http://popurls.com/
-
cached
-
mail it
-
history
reddit: what's new online
by
84 people
:
undergroundhiphop
,
ericl
,
jensilverman
... last on
2009-11-30
...
tags:
social
·
Bookmarks
·
news
·
links
·
web2.0
·
bookmarks
·
aggregator
·
blog
·
online
·
shortcut:zephyr
http://www.reddit.com/
-
cached
-
mail it
-
history
Drew Curtis' FARK.com
by
280 people
:
berokas
,
charme
,
jjsacramento
... last on
2009-11-29
...
tags:
news
·
fark
·
humor
·
bookmarks
·
daily
·
curtis
·
drew
·
funny
·
blog
·
fun
http://www.fark.com/
-
cached
-
mail it
-
history
digg
by
427 people
:
charme
,
junander
,
marco
... last on
2009-11-20
...
tags:
news
·
technology
·
digg
·
bookmarks
·
blog
·
tech
·
toolbar
·
daily
·
folder
·
social
http://www.digg.com/
-
cached
-
mail it
-
history
Web as Corpus ToolKit - Home Page
by
otis
... last on
2009-11-02
...
tags:
NLP
·
corpus
·
dataset
·
information retrieval
·
perl
·
software
·
text mining
http://www.drni.de/wac-tk/
-
cached
-
mail it
-
history
Infochimps.org: Free Redistributable Rich Data Sets
by
4 people
:
ognjen
,
otis
,
avatar
... last on
2009-10-29
...
tags:
data
·
data mining
·
dataset
·
datasets
·
repository
·
42
·
free
·
freeCulture
·
machine learning
·
reference
http://infochimps.org/
-
cached
-
mail it
-
history
Rpmfind mirror
by
13 people
:
fivezoom
,
anthony68
,
tuxsoul
... last on
2009-10-27
...
tags:
linux
·
net
·
rpmfind
·
redhat
·
search
·
LinuxDownload
·
rpm
·
software
·
speakeasy
·
Bookmarks
http://www.rpmfind.net/
-
cached
-
mail it
-
history
freshmeat.net: Welcome to freshmeat.net
by
8 people
:
aeryn
,
believer773
,
hbo
... last on
2009-10-24
...
tags:
linux
·
software
·
freshmeat
·
net
·
bookmarks
·
computers
·
fm
·
imported
·
news
·
toolbar
http://www.freshmeat.net/
-
cached
-
mail it
-
history
(theinfo)
by
ippisl
... last on
2009-10-03
...
tags:
analysis
·
crawlers
·
data
·
dataset
·
datasets
·
tools
http://theinfo.org
-
cached
-
mail it
-
history
Untitled
by
174 people
:
ericl
,
ognjen
,
egonerwin
... last on
2009-09-24
...
tags:
technology
·
culture
·
hin
·
kuro
·
news
·
org
·
trenches
·
blog
·
tech
·
bookmarks
http://www.kuro5hin.org/
-
cached
-
mail it
-
history
Ars Technica - The PC Enthusiast's Resource
by
114 people
:
ericl
,
onionknight
,
ppmartin
... last on
2009-08-31
...
tags:
news
·
technology
·
tech
·
hardware
·
computers
·
bookmarks
·
computer
·
daily
·
pc
·
toolbar
http://arstechnica.com/index.ars
-
cached
-
mail it
-
history
http://www.fdsapi.com
by
sven
... last on
2009-08-24
...
tags:
api
·
dataset
·
formatted
·
java
·
jsp
·
programming
http://www.fdsapi.com
-
cached
-
mail it
-
history
wiki.dbpedia.org : About
by
7 people
:
dranorter
,
praguebob
,
waster
... last on
2009-06-22
...
tags:
semantic
·
data
·
wikipedia
·
database
·
datamining
·
dataset
·
semanticweb
·
analysis
·
api
·
conversion
http://dbpedia.org/About
-
cached
-
mail it
-
history
System One - Wikipedia3
by
6 people
:
dranorter
,
mshook
,
toxi
... last on
2009-06-22
...
tags:
rdf
·
wikipedia
·
database
·
dataset
·
conversion
·
english
·
fromdelicious
·
one
·
opendata
·
readme
http://labs.systemone.at/wikipedia3
-
cached
-
mail it
-
history
Some Datasets Available on the Web � Data Wrangling Blog
by
2 people
:
denilw
,
jeekepaule
... last on
2009-06-02
...
tags:
Available
·
Blog
·
Data
·
Datasets
·
Some
·
Web
·
Wrangling
·
data
·
database
·
datasets
http://www.datawrangling.com/some-datasets-available-on-the-web
-
cached
-
mail it
-
history
Main Page - OVISWiki
by
2 people
:
otis
,
voipsipguru
... last on
2009-06-01
...
tags:
cluster
·
dataset
·
distributed computing
·
monitor
·
performance
·
statistics
·
status
·
system:unfiled
https://ovis.ca.sandia.gov/mediawiki/index.php/Main_Page
-
cached
-
mail it
-
history
Untitled
by
2 people
:
kjgillis
,
dapperdanman
... last on
2009-05-28
...
tags:
Data
·
Data Extraction Tool
·
Data Gadget
·
Data Sets
·
Data Tool
·
Data Widget
·
Datasets
·
Government 2.0
·
Government Data Sets
·
Government Datasets
http://www.data.gov/
-
cached
-
mail it
-
history
Web as Corpus
by
otis
... last on
2009-05-17
...
tags:
NLP
·
computational linguistics
·
corpus
·
crawl
·
dataset
·
index
·
information retrieval
·
linguistics
·
web
http://webascorpus.org/
-
cached
-
mail it
-
history
Cutting Edge: DataSets vs. Collections -- MSDN Magazine, August 2005
by
3 people
:
d4ljoyn
,
ikhwanhayat
,
pfeilbr
... last on
2009-03-30
...
tags:
architecture
·
.net
·
@dev
·
database
·
dataset
·
dotnet
·
engineering
·
object
·
orm
·
programming
http://msdn.microsoft.com/msdnmag/issues/05/08/CuttingEdge/default.aspx
-
cached
-
mail it
-
history
1 - 20 of 116
next »
Related Users
16
avatar
13
ecajun
12
rkendle
7
malzoid
7
otis
7
phile
7
reubgr
6
egonerwin
5
jdrsantos
5
johnfromberkeley
Related Tags
668
news
290
bookmarks
234
technology
169
daily
164
blog
164
social
160
links
157
toolbar
148
tech
145
web2.0
140
folder
130
imported
116
web
109
blogs
106
digg
102
fark
97
culture
79
aggregator
74
rss
71
humor