[sf-lug] Google Reader data saving w/ ArchiveTeam; users, programmers and tech company employees needed; Deadline July 1st

Will guacamolepandemonium at gmail.com
Tue Jun 25 11:55:45 PDT 2013


By the way, the resulting data will be available at the Internet Archive
( https://archive.org ) as large archive files in the WARC format, here:
http://archive.org/details/archiveteam_greader
The operation is part of ArchiveTeam, a volunteer organization that
quickly downloads sites that are about to shut down.

On Tue, Jun 25, 2013 at 11:32 AM, Will <guacamolepandemonium at gmail.com> wrote:

> ArchiveTeam needs YOUR HELP in saving Google Reader's cached data[0].
> Right now we mainly need ***URLs of feeds (RSS, Atom, etc.)***. These come
> from crawling sites[1] to build newline-separated lists of usernames and
> topics, and from anyone with access to crawled data, including anyone at
> Google or other companies/organizations that would have web-crawl data.
>
> Lists of URLs can be submitted[2], as can OPML files[2] and lists of
> keywords that are important to people (please put "querylist" in the titles
> of those files)[2].
>
> Sites looking to replace Google Reader should have an interest in the
> cached data since they can then offer that to their users as a feature.
>
> Please tell anyone you think would be interested in the ArchiveTeam Google
> Reader effort!!
>
> More information in #donereading on EFnet[3]
>
>
> [0] http://archiveteam.org/index.php?title=Google_Reader
>
> [1]
> http://archiveteam.org/index.php?title=Google_Reader#Crawl_websites_to_discover_blogs_and_usernames
>
> [2] http://allyourfeed.ludios.org:8080/
>
> [3] http://chat.efnet.org:9090/?nick=&channels=%23donereading&Login=Login
>
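
For anyone sitting on an OPML export, the newline-separated URL list asked
for above can probably be produced with something like the sketch below. It
assumes the usual OPML convention of storing each feed address in the xmlUrl
attribute of an <outline> element; the function names and the sample
document are made up for illustration and are not part of any ArchiveTeam
tooling.

```python
import xml.etree.ElementTree as ET


def feed_urls_from_opml(opml_text):
    """Return the unique feed URLs found in an OPML document, in order."""
    root = ET.fromstring(opml_text)
    seen = set()
    urls = []
    # Assumption: feeds are <outline> elements carrying an xmlUrl attribute.
    # Outlines can be nested (folders), so walk the whole tree.
    for outline in root.iter("outline"):
        url = (outline.get("xmlUrl") or "").strip()
        if url and url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


def to_newline_separated(urls):
    """Join URLs one per line; an empty list gives an empty string."""
    return "\n".join(urls) + "\n" if urls else ""


if __name__ == "__main__":
    # Hypothetical sample data, not real subscriptions.
    sample = (
        '<opml version="1.0"><body>'
        '<outline text="Tech">'
        '<outline text="A" type="rss" xmlUrl="http://example.com/a/feed.xml"/>'
        '<outline text="B" type="rss" xmlUrl="http://example.org/b/atom.xml"/>'
        '</outline>'
        '<outline text="A again" type="rss" xmlUrl="http://example.com/a/feed.xml"/>'
        '</body></opml>'
    )
    print(to_newline_separated(feed_urls_from_opml(sample)), end="")
```

Duplicates are dropped and first-seen order is kept, so the output can be
saved to a plain text file and submitted as a URL list.
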