Just for a quick review, here is the dump I got by scraping the site the other night (and/or day)
This is temporary, and will disappear. Soon. It's just for you to comment on before I do the next round (tentatively, I'd do this weekly). Please review the details of the YAML formatting in particular, if you think you know how it should look.
*** This is a 770,167-byte file on a slow server [now up to 1,3 Mb]. Don't click if you're impatient.
I deleted entry #247 altogether. Other than that, the data is as I got it off the server.
I obtained the data by going through the /entry/nnn for each nnn up to number 2519 [now 4259], which for some reason I believe was the highest entry number at the time. Then I extracted the actual data out of the web page for each entry, and converted it to (some sort of) YAML.
The /entry/ page doesn't show the modification time, so I don't have that. I have reserved a field for it in the format nevertheless.
I also have this in something resembling XML, of course without any formal DTD or anything. Let me know if you think getting that would be more useful (instead, or as well).
For the context impaired, the "and/or day" refers to time zone confusion.
by
era
2006-06-19 01:24
blog
·
data
·
delirioussiteblog
·
delirioussitedump
·
erablog
·
20060619-0123