links · people · groups · tags | My: links · tags · groups · watchlists · notes login · sign up now! | help · blog
Simpy simpy
 
era, member since Jun 19, 2006
.
Search Everyone: "data",

Top "data" experts: socrtwo, dataentryoutsorcing, outsourcingdata, bobodod, shane_john, jackiege,

Groups about "data": data on the net, Data recovery, Data Recovery Software, Linux Data Recovery RDS, Data Recovery Software, Novell Data Recovery Software,

1 - 9 of 9   Watch era
 
Human-readable data serialization language for Perl, Python, Ruby, Java, etc.
by era 2006-06-19 01:25 03a · algorithm · data · format · module · perl · programming · rubric · 20060619-0123
http://www.yaml.org/ - cached - mail it - history
Ogg Vorbis listening samples side by side with WMA, MP3, RA, et al.
by era 2006-06-19 01:25 audio · data · download · opensource · testing · 20060619-0123
http://www.xiph.org/ogg/vorbis/listen.html - cached - mail it - history
Compound Text is a format for multiple character set data, such as multi-lingual text Images missing, get the PDF from http://www.x.org/X11R6.8.1/docs/CTEXT/ctext.pdf or the sources from http://cvs.freedesktop.org/xorg/xc/doc/specs/CTEXT/ctext.tbl.ms?rev=HEAD&view=markup
by era 2006-06-19 01:25 03a · character · data · reference · standard · unicode · x11 · 20060619-0123
http://www.xfree86.org/current/ctext.html - cached - mail it - history
Just for a quick review, here is the dump I got by scraping the site the other night (and/or day) This is temporary, and will disappear. Soon. It's just for you to comment on before I do the next round (tentatively, I'd do this weekly). Please review the details of the YAML formatting in particular, if you think you know how it should look. *** This is a 770,167-byte file on a slow server [now up to 1,3 Mb]. Don't click if you're impatient. I deleted entry #247 altogether. Other than that, the data is as I got it off the server. I obtained the data by going through the /entry/nnn for each nnn up to number 2519 [now 4259], which for some reason I believe was the highest entry number at the time. Then I extracted the actual data out of the web page for each entry, and converted it to (some sort of) YAML. The /entry/ page doesn't show the modification time, so I don't have that. I have reserved a field for it in the format nevertheless. I also have this in something resembling XML, of course without any formal DTD or anything. Let me know if you think getting that would be more useful (instead, or as well). For the context impaired, the "and/or day" refers to time zone confusion.
by era 2006-06-19 01:24 blog · data · delirioussiteblog · delirioussitedump · erablog · 20060619-0123
http://www.iki.fi/era/tmp/entries.yml - cached - mail it - history
This (Windows!) program managed to salvage almost all of my trashed ext3 filesystem
by era 2006-06-19 01:24 03a · data · linux · recovery · 20060619-0123
http://www.data-recovery-software.net/Linux_Recovery_Download.shtml - cached - mail it - history
by era 2006-06-19 01:23 03a · data · download · module · python · statistics · tool · visualization · 20060619-0123
http://matplotlib.sourceforge.net/screenshots.html - cached - mail it - history
The PU1 PU2 PU3 PUA and older Ling-Spam email corpora
by era 2006-06-19 01:23 03a · benchmark · collection · corpus · data · download · mail · reference · spam · 20060619-0123
http://iit.demokritos.gr/skel/i-config/downloads/ - cached - mail it - history
This Monday's dump is now available, at the same location as last week The old data from last week is in entries-2005-04-07.yml. The file is now 1,343,896 bytes, and the server is still slow. I've scraped up through entry #4259, for a total of 4049 undeleted entries. (Hmm, I guess I'm falling behind pretty badly already.) And the location is still very temporary. Still, here's the URL again: http://www.iki.fi/era/tmp/entries.yml The scrape took several hours, and involves a separate TCP connection for each page fetched. I guess I ought to make that slightly less resource-intensive, but I'm not sure how exactly to do it. The current system was made that way because I wanted to be able to specify an arbitrary pause between fetches (currently 4 seconds). Will I be tying up resources if the fetch keeps the connection open but idle for several seconds between each fetch?
by era 2006-06-19 01:23 blog · data · delirioussiteblog · delirioussitedump · erablog · 20060619-0123
http://de.lirio.us/rubric/entry/3479 - cached - mail it - history
Troff source for ctext spec at x.org (via ViewCVS at Freedesktop.org)
by era 2006-06-19 01:23 character · cvs · data · linux · reference · standard · unicode · unix · x11 · 20060619-0123
http://cvs.freedesktop.org/xorg/xc/doc/specs/CTEXT/ctext.tbl.ms?rev=HEAD&view=markup - cached - mail it - history
1 - 9 of 9  
Related Tags
 
- exclude ~ optional + require
Add Dates