Sitescooper Latest Changes

These are the latest changes to sitescooper and its site files. Note that you can download each file independently from here.
Who / When What
akkana site_samples/science/new_scientist_news.site
2005-11-10 Use RSS instead of html feed, because of stories that weirdly don't show up in Plucker
   
akkana site_samples/
2005-11-10 opinion/pulpit.site, opinion/slate.site, palmsized/the_register_rss.site, tech/slashdot_top.site: Updates for sites that have changed
   
akkana site_samples/
2005-11-10 business/lazarus_at_large.site, opinion/alanmiller.site, tech/paulgraham.site: New sites from me
   
akkana site_samples/tech/pcmag_firstlooks.site
2005-11-10 New site from Goh BoonNam
   
akkana lib/Sitescooper/URLProcessor.pm
2005-11-10 Add application/*xml to allowed types, for newer RSS sites
   
barrygonzaga site_samples/bsd/openbsd_journal.site
2005-08-08 update url, remove postprocess magic, update email
   
barrygonzaga site_samples/regional_philippines/ctc-movies-metro.site
2005-08-08 Add clickthecity.com Metro Manila Movie Guide; note: huge site
   
barrygonzaga site_samples/palmsized/inq7-mobile.site
2005-08-08 3 level inq7.net site
   
barrygonzaga site_samples/regional_philippines/
2005-08-08 inq7.site, pdi.site: replace pdi.site with inq7.site
   
barrygonzaga site_samples/linux/gwn.site
2005-08-08 add logo image url; update author email
   
barrygonzaga site_samples/business/businessweek.site
2005-08-08 Reflect web site title, update author email
   
barrygonzaga site_samples/palmsized/
2005-08-08 ny_times.site, salon.site: remove nonworking sites
   
akkana lib/Sitescooper/Main.pm
2005-07-06 Add < < ^^ > > links at end of story as well as beginning
   
akkana site_samples/
2005-07-06 lib/layouts.site, humor/jon_carroll.site, news/wired_news/wired_news_politics.site, opinion/salon.site, science/new_scientist_news.site, tech/newsforge.site: Some updates for sites that have changed.
   
akkana site_samples/regional_boston/bostonglobe.site
2005-07-06 New site: Boston Globe City & Region sections. From Bruce Zohn
   
akkana site_samples/
2005-07-06 news/USNews.site, news/newsweek_intl.site, tech/pcmag_images.site: Updates from BoonNam Goh
   
akkana site_samples/science/new_scientist_news.site
2005-01-26 Changes to track the recent site changes
   
akkana site_samples/regional_israel/
2005-01-17 haaretz.site, jpost-columns.site, jpost-international.site, jpost-israel.site, jpost-me.site, jpost-opinion.site: David Resnick: Jerusalem Post and Haaretz site files
   
akkana site_samples/regional_uk/bbc_news_sci_tech.site
2005-01-17 Add ContentsDiff
   
akkana site_samples/sport/GSR/
2005-01-17 GSR_Appearance_Mods.site, GSR_Bike.site, GSR_General_Disc.site, GSR_Owners.site, GSR_Performance_Mods.site, GSR_Stories.site, GSR_Technical.site, GSR_Tips-n-Tricks.site: Delmer Wells: GSR motorcycle information sites
   
akkana site_samples/opinion/slate.site
2005-01-05 Anthony Foglia: New site, Slate
   
akkana site_samples/linux/slashdot.site
2005-01-05 B. M. Sleight: minor changes to pick up ask.slashdot.org, it.slashdot.org
   
akkana site_samples/weblog/kevin_sites.site
2005-01-05 New site from Delmer Wells: Kevin's War Blog
   
akkana site_samples/tech/pcmag_images.site
2005-01-05 Goh Boon Nam: Update to track site changes and grab images better
   
akkana site_samples/business/the_economist.site
2005-01-05 Goh Boon Nam: Remove subscription-only pages, which cause problems for Plucker
   
akkana site_samples/
2005-01-05 humor/dave_barry.site, linux/debian_weekly_news.site, news/wired_news/wired_news_tech.site, tech/newsforge.site, tech/the_register.site, weblog/riverbend.site: Updates to track changes in the web sites
   
akkana site_samples/weblog/riverbend.site
2004-06-22 Fixed StoryStart
   
akkana site_samples/linux/
2004-06-03 kc_debian_hurd.site, kc_gimp.site: Remove no longer extant debian, hurd and gimp kernel cousins
   
akkana site_samples/regional_australia/yourmovies_canberra.site
2004-05-20 YourMovies, Canberra: from Ken Russell
   
akkana site_samples/news/USNews.site
2004-05-14 Update from Goh Boon Nam
   
akkana site_samples/
2004-05-14 science/archaeology_org.site, science/grahamhancock.site, tech/slyck.site: New sites from Ken Russell
   
akkana site_samples/palmsized/the_register_rss.site
2004-05-14 New palmsized register from Ken Russell
   
akkana site_samples/palmsized/
2004-05-14 the_register.site, the_register_rss.site: Rename palmsized The Register to The Register RSS, so as not to conflict with the non-palmsized Register
   
akkana site_samples/
2004-05-14 news/atlantic.site, tech/slashdot_top.site: New sites
   
akkana site_samples/opinion/salon.site
2004-05-14 Comment out StoryToPrintableSub -- it was causing errors
   
akkana site_samples/
2004-04-27 linux/desktoplinux.site, science/smithsonian.site, tech/joelonsoftware.site, tech/newsforge.site, weblog/riverbend.site, weblog/where_is_raed.site: New sites, from me
   
akkana site_samples/lib/layouts.site
2004-04-27 Fix BBC news information
   
akkana site_samples/
2004-04-27 linux/kernel_traffic.site, opinion/i_cringely.site, tech/the_register.site: Update URL, content start, and other minor fixes
   
akkana site_samples/news/yahoo/
2004-04-26 yahoo_business.site, yahoo_entertainment.site, yahoo_politics.site, yahoo_tech.site, yahoo_top_stories.site: Re-adding yahoo sites, fixed thanks to Jonathan Becker
   
akkana site_samples/comics/
2004-04-26 boondocks.site, doonesbury.site, tedrall.site: New comics from Ignatz Sol
   
akkana site_samples/
2004-04-25 news/newsweek_intl.site, tech/pcmag_images.site: Updates from Goh Boon Nam
   
akkana site_samples/humor/dave_barry.site
2004-04-25 Update from Alan Hoyle: fix story start, end, headline
   
cwerner site_samples/opinion/pulpit.site
2004-04-23 New site for Bob Cringely's weekly column: The Pulpit. This is the same site scooped by i_cringely.site, except that the old i_cringely site did a 2 level scoop that attempted to get a set of columns, whereas the new one gets a single column and only on Fridays. The old one can probably be removed, but I didn't want to mess with it in case someone is relying on it.
   
cwerner default_isilox.ixl, sitescooper.cf, doc/site_params.html, lib/Sitescooper/Main.pm, lib/Sitescooper/SCF.pm
2004-03-22 Improved support for isiloXC:
1. Added a new param to sitescooper.cf, "ISiloDefaultIxlFile", that points to an .ixl file in the file system. This means that users can change the iSiloX options by using the iSiloX GUI tool to create a new document, change all the options, then save as a .ixl file. Two tags of the document are stripped and replaced by sitescooper, but the rest is used for generating the isilox pdb. More details are given in the comments in sitescooper.cf. The most common likely use for this is to allow the users of -isilox to specify global settings for things like image depth, color, inclusion, dithering etc., and perhaps for category too.
2. Added a new site param called "ExtraISiloIxlTags", to allow ixl settings specific to a site. Updated doc/site_params.html, so see this for more details. This is a little different in that the user has to specify a set of top-level tags for the .ixl file. These get appended to the generated file, thus overriding the defaults (or overriding the global options if the new config param is used). This takes advantage of the fact that isilox tolerates the tags appearing more than once by simply taking the last tag and ignoring earlier copies (or at least its xml parser does).
So you can set general options in your .ixl file and override specific options in the .site files. The fact that you have to override a whole tag means that you can't override, say, bitdepth separately from dithering, but it's still pretty powerful. And simpler and more durable (i.e. resistant to changes in isilox) than adding a bunch of new site params.
Modified files: sitescooper/sitescooper.cf, sitescooper/doc/site_params.html, sitescooper/lib/Sitescooper/Main.pm, sitescooper/lib/Sitescooper/SCF.pm
Added files: sitescooper/default_isilox.ixl
   
jmason lib/Sitescooper/
2004-02-19 Robot.pm, StoryURLProcessor.pm: some glitches in RSS output fixed; now does not search for sub-stories after html_to_text conversion
   
jmason site_samples/science/new_scientist_news.site
2004-02-18 New Scientist News site updated
   
akkana site_samples/
2004-02-16 cinema/ebert_1min.site, cinema/roger_ebert.site, humor/dave_barry.site: Contributions from Alan Hoyle, alanh at email.unc.edu
   
jmason lib/Sitescooper/
2004-02-13 Main.pm, SCF.pm: added patch from Robert Fuhge, robert.fuhge.at.epost.de, assign categories to Plucker documents using the Category: line in the site file
   
jmason site_samples/tech/risks.site
2004-02-13 updated risks.site to use new 'mobile device' rendering
   
akkana site_samples/business/the_economist.site
2004-02-11 The Economist, from BoonNam Goh
   
akkana site_samples/news/
2004-02-11 newsweek.site, newsweek_intl.site: Newsweek updates from BoonNam Goh
   
jmason site_samples/security/
2004-02-07 crypto_gram.site, crypto_gram.site: cryptogram site fixed
   
jmason lib/Sitescooper/Robot.pm
2004-01-31 handle undef headlines
   
jmason lib/Sitescooper/Robot.pm
2004-01-31 oops; RSS output headline was not being HTML-encoded correctly
   
akkana site_samples/
2003-11-15 tech/computer_world.site, news/newsweek_intl.site: Contributions from BoonNam Goh
   
barrygonzaga site_samples/linux/gwn.site
2003-11-04 add Gentoo Weekly News
   
akkana site_samples/
2003-10-31 news/Newsweek.site, news/NewsweekIntl.site, regional_israel/jpost.site: Remove inconsistently named files
   
akkana site_samples/news/
2003-10-31 newsweek.site, newsweek_intl.site: Newsweek, from Goh Boon Nam
   
akkana site_samples/regional_israel/jerusalem_post.site
2003-10-31 Jerusalem Post, from David Resnick
   
akkana site_samples/tech/wiredmag.site
2003-10-31 Previous commit only got one specific date. So I've substituted my own Wired site file, which doesn't get entire stories yet, but it does get Wired every day.
   
akkana site_samples/tech/wiredmag.site
2003-10-31 One issue of Wired Magazine, from richard_html2pdb at yahoo dot com
   
akkana site_samples/tech/pcmag_images.site
2003-10-31 Update from Goh Boon Nam: Get full-sized images
   
akkana site_samples/news/
2003-10-31 Newsweek.site, NewsweekIntl.site: Newsweek updates (US and Intl) from BoonNam Goh
   
akkana site_samples/regional_israel/jpost.site
2003-10-31 Jerusalem Post, from David Resnick
   
akkana site_samples/news/
2003-10-29 Newsweek.site, USNews.site: New sites contributed by BoonNam Goh
   
hubidubi site_samples/regional_hungary/linuxonline_hu.site
2003-09-17 new site file for linuxonline.hu
   
jmason lib/Sitescooper/StoryURLProcessor.pm
2003-07-08 fixed some weirdness in error messages
   
jmason site_samples/science/new_scientist.site
2003-06-25 fixed NS site
   
jmason lib/Sitescooper/Main.pm
2003-06-25 bug fixed
   
jmason sitescooper.pl, lib/Sitescooper/Main.pm, site_samples/science/new_scientist.site
2003-06-25 bug on win32, noted by Robert P. Nix
   
jmason site_samples/culture/world_new_york.site
2003-06-11 fixed and re-added World New York site
   
jmason site_samples/science/new_scientist.site
2003-06-11 added headline support for New Scientist
   
jmason site_samples/
2003-06-11 business/economist.site, business/stocksmart.site, business/wsj.site, cinema/coaxialnews.site, cinema/coolnews.site, cinema/forcenet.site, comics/girls_and_sports.site, comics/horrorscope.site, comics/i_need_help.site, comics/new_breed.site, comics/pops_place.site, comics/wildwood.site, culture/plastic.site, games/bluesnews.site, humor/alexei_sayle.site, humor/dave_barry.site, humor/ditherati.site, linux/linuxtoday.site, linux/linuxworld.site, linux/mandrakeforum.site, linux/mysql_newsletter.site, linux/weekly_news.site, news/gallup_poll.site, news/world_new_york.site, news/wired_news/wired_news_top_stories.site, news/yahoo/yahoo_business.site, news/yahoo/yahoo_entertainment.site, news/yahoo/yahoo_health.site, news/yahoo/yahoo_oddly_enough.site, news/yahoo/yahoo_politics.site, news/yahoo/yahoo_public_opinion.site, news/yahoo/yahoo_science.site, news/yahoo/yahoo_sports.site, news/yahoo/yahoo_technology.site, news/yahoo/yahoo_top_stories.site, news/yahoo/yahoo_world.site, odd/morbid_fact_du_jour.site, odd/snopes.site, opinion/salon_archives.site, opinion/tbtf.site, opinion/tbtf_log.site, opinion/unblinking.site, palm/memoware.site, palm/palmguru.site, palm/palminfocenter.site, palm/pdalive.site, palm/pencomputing.site, palmsized/beyond2000-pda.site, regional_australia/abc_news_online.site, regional_australia/fairfax_it.site, regional_california/mercury_center.site, regional_california/la_times/latimes_local.site, regional_california/la_times/latimes_nat.site, regional_california/la_times/latimes_oc.site, regional_california/la_times/latimes_science.site, regional_california/la_times/latimes_tech.site, regional_california/la_times/latimes_world.site, regional_croatia/KSET_monthly.site, regional_francais/libe_portrait_du_jour.site, regional_francais/libe_rebonds.site, regional_francais/sia_fr.site, regional_germany/bundesregierung.site, regional_germany/de_excite.site, regional_germany/de_heute.site, regional_germany/de_zdnet.site, regional_germany/de_zeit/de_zeit_media.site, regional_hungary/hirek.site, regional_israel/jerusalem_post.site, regional_north_carolina/cats_cradle.site, regional_north_carolina/charlotte_observer.site, regional_north_carolina/news_observer.site, regional_philadelphia/phillynews.site, regional_seattle/seattletimes.site, regional_spain/es_zdnet.site, regional_spain/marca_soccer.site, regional_spain/marca_sports.site, regional_uk/digiguide_tv_listings.site, regional_uk/times_britain.site, regional_uk/times_world.site, science/cosmiverse.site, science/nasa2go.site, science/sciam.site, security/securityportal.site, sport/fox_sports.site, tech/beyond2000.site, tech/mit_tech_review.site, weblog/joel_on_software.site, weblog/tsluts.site: removed all sites that now give HTTP errors when used
   
jmason sitescooper.pl, lib/Sitescooper/LWPHTTPClient.pm, lib/Sitescooper/Main.pm, site_samples/web/alistapart.site, site_samples/web/asktog.site, site_samples/web/webmonkey.site
2003-06-11 added -timeout parameter
   
jmason rss-to-site.pl
2003-06-11 another patch from Adrian Colley
   
jmason site_samples/
2003-06-11 cinema/ebert_answer_man.site, cinema/ebert_features.site, cinema/ebert_great_movies.site, cinema/roger_ebert.site, opinion/nro.site: updated sites from John Straw
   
jmason site_samples/regional_germany/
2003-06-10 de_cert.site, de_cyberkino.site, de_gazette.site, de_heise_mobil.site, de_heise_tp.site, de_heute.site, de_pdassi_news.site, de_pdassi_software.site, de_spiegel.site, de_stern.site, de_tagesschau.site, de_teltarif.site, de_tvspielfilm.site, mobile2day.site, palmfaq_de.site, pda_debitel_net.site, windows2000faq.site, zdnet_news.site, bundesregierung.site: a whole lot of new regional_germany sites from Stefan Schwingeler
   
jmason lib/Sitescooper/Main.pm, site_samples/comics/thismodernworld.site, site_samples/security/crypto_gram.site
2003-06-10 patch for Plucker; now able to handle big images. also added thismodernworld site. patch from Adrian Colley
   
jmason lib/Sitescooper/
2003-06-09 Main.pm, StoryURLProcessor.pm: remove non-required hashing
   
jmason sitescooper.pl, lib/Sitescooper/Main.pm, lib/Sitescooper/Robot.pm
2003-06-09 description now encoded; RSS 1.0 the default
   
jmason lib/Sitescooper/StoryURLProcessor.pm
2003-06-06 added SubStoryPermalink conf setting so that permalinks are picked up
   
jmason lib/Sitescooper/
2003-06-06 Robot.pm, SCF.pm: added SubStoryId conf setting so that permalinks are picked up
   
jmason lib/Sitescooper/URLProcessor.pm
2003-06-05 relative links became relative to ; fixed
   
jmason site_samples/tech/pcmag_images.site
2003-06-05 updated PC Magazine site from Goh Boon Nam
   
jmason lib/Sitescooper/Robot.pm
2003-06-05 oops, forgot escaping in description tags
   
jmason lib/Sitescooper/Main.pm
2003-06-04 guid fix; use the real URL as much as possible
   
jmason lib/Sitescooper/LinksURLProcessor.pm
2003-06-04 remove HTML comments before looking for links
   
jmason lib/Sitescooper/StoryURLProcessor.pm
2003-06-04 added -maxstories support for substory mode
   
jmason lib/Sitescooper/StoryURLProcessor.pm
2003-06-04 lib/Sitescooper/
   
jmason lib/Sitescooper/Main.pm
2003-06-03 fixed invalid RSS
   
jmason lib/Sitescooper/Robot.pm
2003-06-03 rss with -dump works
   
jmason lib/Sitescooper/
2003-06-03 Main.pm, Robot.pm, SCF.pm, StoryURLProcessor.pm, URLProcessor.pm: can now extract 'sub-stories' from within a story page
   
jmason lib/Sitescooper/
2003-05-31 Main.pm, Robot.pm: added -rss switch for RSS output
   
hubidubi site_samples/regional_hungary/hirek.site
2003-05-28 Site URL update
   
jmason lib/Sitescooper/Main.pm
2003-04-29 fix for plucker from Rik Wehbring
   
barrygonzaga site_samples/regional_philippines/pdi.site
2003-03-31 update and clean
   
jmason site_samples/regional_australia/abc_news_online.site
2003-03-03 added ABC News Online site from Wayne Osborn
   
hubidubi site_samples/linux/mysql_newsletter.site
2003-02-26 Site file for MySQL monthly newsletter
   
hubidubi site_samples/
2003-02-06 regional_hungary/freebsd_hu.site, regional_hungary/hup_hu.site, regional_hungary/linuxforum_hu.site, linux/footnotes.site: some site logo improvements
   
jmason site_samples/tech/pcmag_images.site
2003-01-22 added pcmag_images.site from Goh Boon Nam
   
jmason site_samples/weblog/eckes.site
2003-01-15 added eckes.site
   
jmason lib/Sitescooper/CacheSingleton.pm, lib/Sitescooper/DirCacheFactory.pm, lib/Sitescooper/PerSiteDirCache.pm, site_samples/regional_hungary/freebsd_hu.site
2003-01-15 freebsd_hu.site from Hubidubi
   
jmason lib/Sitescooper/Main.pm, lib/Sitescooper/SCF.pm, site_samples/languages/php_net.site, site_samples/linux/debian_weekly_news.site, site_samples/linux/footnotes.site, site_samples/regional_hungary/hirek.site, site_samples/regional_hungary/hup_hu.site, site_samples/regional_hungary/linux_hu.site, site_samples/regional_hungary/linuxforum_hu.site, site_samples/regional_hungary/metro_hu.site, site_samples/regional_hungary/pdamania_hu.site
2002-11-15 many site updates from Hubidubi
   
barrygonzaga site_samples/regional_philippines/pdi.site
2002-11-03 - fix "letters" story url - fix "business" story url - fix "business" stories
   
barrygonzaga site_samples/humor/
2002-10-29 bofh-2k+1.site, bofh-2k.site: add description, clean up bad bold/italic markups, replaced with ..
   
barrygonzaga site_samples/humor/
2002-10-28 bofh-2k+1.site, bofh-2k.site: add bofh 2k and 2k+1
   
jmason site_samples/sport/cnn_sports.site
2002-09-03 added cnn_sports site
   
jmason site_samples/linux/weekly_news.site
2002-09-03 updated weekly_news.site
   
jmason lib/Sitescooper/
2002-07-15 Main.pm, URLProcessor.pm: applied bugfix from Bernd Rellermeyer
   
barrygonzaga site_samples/sport/mobilebikes.site
2002-05-06 cycling newsletter
   
barrygonzaga site_samples/palmsized/the_register.site
2002-05-06 cleanup
   
barrygonzaga site_samples/
2002-05-06 bsd/openbsd_journal.site, news/gallup_poll.site, palm/palminfocenter.site, palm/pdalive.site, palmsized/salon.site, business/businessweek.site: obscured email address
   
barrygonzaga site_samples/palmsized/ny_times_handheld.site
2002-05-06 site restricted
   
barrygonzaga site_samples/regional_philippines/pdi.site
2002-05-06 - obscured email address - cleanups
   
barrygonzaga site_samples/palmsized/ny_times_handheld.site
2002-05-06 obscured email address
   
jmason site_samples/lib/layouts.site
2002-01-25 updated BBC layout from Akkana's site
   
jmason site_samples/regional_uk/digiguide_tv_listings.site
2002-01-22 Digiguide site re-submitted from Andy Carlson
   
jmason site_samples/linux/linuxtoday.site
2002-01-22 updated linuxtoday
   
jmason site_samples/
2002-01-22 science/new_scientist_news.site, security/hacker_news_network.site: hackernews gone
   
jmason site_samples/comics/
2002-01-21 better_half.site, between_friends.site, crock.site, curtis.site, dinette_set.site, edge_city.site, girls_and_sports.site, grin_and_bear_it.site, horrorscope.site, i_need_help.site, katzenjammer_kids.site, lockhorns.site, mallard_fillmore.site, moose_and_molly.site, new_breed.site, piranha_club.site, pops_place.site, redeye.site, rhymes_with_orange.site, safe_havens.site, sam_and_silo.site, six_chix.site, theyll_do_it_every_time.site, trudy.site, tumbleweeds.site, zippy_the_pinhead.site: re-added fixed comics from Yoon Fui Thean
   
jmason site_samples/
2002-01-19 admin/sitescooper_archive.site, bsd/oreillynet_bsd.site, business/cnn_financial.site, business/cnnfn.site, cinema/filmink-online.site, palmsized/cnn.site, regional_seattle/seattle_p_i.site, weblog/tim_oreilly.site: fixed some redirected links; removing duplicate CNN sites
   
jmason site_samples/
2002-01-19 business/hottips.site, linux/linuxplaza.site, opinion/feed.site, regional_germany/de_spiegel.site, regional_north_carolina/weather24_raleigh.site: more dead sites pruned
   
jmason site_samples/
2002-01-18 languages/aspwire.site, languages/news_perl_org.site, languages/perlmonth.site, languages/sqlwire.site, languages/vbwire.site, opinion/simson_garfinkel.site, tech/sendmail_net.site: removed lots of dead sites
   
jmason site_samples/
2002-01-18 business/financial_times.site, business/fox_market_wire.site, business/the_standard.site, business/the_street.site, cinema/cinescape.site, comics/better_half.site, comics/between_friends.site, comics/crock.site, comics/curtis.site, comics/dinette_set.site, comics/girls_and_sports.site, comics/grin_and_bear_it.site, comics/horrorscope.site, comics/i_need_help.site, comics/katzenjammer_kids.site, comics/lockhorns.site, comics/mallard_fillmore.site, comics/moose_and_molly.site, comics/new_breed.site, comics/piranha_club.site, comics/pops_place.site, comics/redeye.site, comics/rhymes_with_orange.site, comics/safe_havens.site, comics/sam_and_silo.site, comics/six_chix.site, comics/theyll_do_it_every_time.site, comics/trudy.site, comics/tumbleweeds.site, comics/zippy_the_pinhead.site, games/oswalds_6th_floor.site, humor/modern_humorist.site, languages/perlnews.site, linux/mandrake_pda.site, news/csmonitor.site, news/my_excite.site, palm/palmgear.site, palmsized/mercury_center_mobile.site, palmsized/the_standard.site, regional_chicago/chicago_tribune_arts_and_entertainment.site, regional_chicago/chicago_tribune_books.site, regional_chicago/chicago_tribune_cars.site, regional_chicago/chicago_tribune_commentary.site, regional_chicago/chicago_tribune_editorials.site, regional_chicago/chicago_tribune_friday.site, regional_chicago/chicago_tribune_good_eating.site, regional_chicago/chicago_tribune_health_and_family.site, regional_chicago/chicago_tribune_home_and_garden.site, regional_chicago/chicago_tribune_jobs.site, regional_chicago/chicago_tribune_kidnews.site, regional_chicago/chicago_tribune_magazine.site, regional_chicago/chicago_tribune_metro_chicago.site, regional_chicago/chicago_tribune_metro_dupage.site, regional_chicago/chicago_tribune_metro_lake.site, regional_chicago/chicago_tribune_metro_mchenry.site, regional_chicago/chicago_tribune_metro_northwest.site, regional_chicago/chicago_tribune_metro_southwest.site, regional_chicago/chicago_tribune_new_homes.site, regional_chicago/chicago_tribune_real_estate.site, regional_chicago/chicago_tribune_tempo.site, regional_chicago/chicago_tribune_transportation.site, regional_chicago/chicago_tribune_travel.site, regional_chicago/chicago_tribune_tv_week.site, regional_chicago/chicago_tribune_woman_news.site, regional_chicago/chicago_tribune_your_money.site, regional_chicago/chicago_tribune_your_place.site, regional_croatia/DHMZ_Hrvatska_danas.site, regional_croatia/DHMZ_Hrvatska_sutra.site, regional_croatia/DHMZ_Jadran.site, regional_croatia/DHMZ_Zagreb_danas.site, regional_croatia/DHMZ_Zagreb_sutra.site, regional_denmark/politiken.site, regional_denmark/valutakurser.site, regional_francais/01_informatique.site, regional_francais/afp.site, regional_francais/cinenouba.site, regional_germany/de_br_news.site, regional_germany/de_dwelle.site, regional_germany/de_kalenderblatt.site, regional_ireland/irish_times.site, regional_north_carolina/wral-tv.site, regional_philippines/manila_bulletin.site, regional_spain/telebasket_nba_spanish.site, regional_spain/telebasket_spain.site, regional_toronto/globe_and_mail_business.site, regional_uk/digiguide_tv_listings.site, security/securityfocus.site, sport/EurosportTV.site, sport/cnn_sports.site, sport/telebasket_nba.site, sport/thatsracin.site, sport/uk_sports_com.site, tech/cnet.site, tech/geeknews.site, tech/techweb.site: removed sites which now give HTTP 404s
   
jmason site_samples/
2002-01-18 regional_california/ocregister.site, science/hotair_features.site, sport/sportingnews.site: moved broken sites to 'broken' dir
   
jmason site_samples/
2002-01-18 linux/kde-dev-news.site, linux/rhad_rumor_mill.site, news/gallup_poll.site, opinion/idler.site, opinion/jaundiced_eye.site, opinion/slate_todays_papers.site, regional_denmark/sslug-nyheder.site, regional_francais/libe_q.site, science/spaceref.site, tech/rcfoc.site, web/webreference_experts.site: thoroughly outdated dead sites removed
   
jmason site_samples/bsd/daemonnews.site
2002-01-18 removed broken site
   
jmason site_samples/humor/jon_carroll.site
2002-01-18 added jon_carroll.site from Jan Lund Thomsen
   
jmason site_samples/regional_germany/de_tecchannel.site
2002-01-14 added de_tecchannel.site from Michael Schubart
   
jmason site_samples/
2002-01-14 cinema/imdb_studio_briefing.site, weblog/jason_pettus.site: added sites from Jan Lund Thomsen
   
jmason site_samples/regional_denmark/geekculture.site
2002-01-07 added sites from Jan Lund Thomsen
   
jmason lib/Sitescooper/Main.pm
2002-01-07 committed patch from Akkana to silence Plucker warning
   
jmason site_samples/regional_japan/jp_daily_yomiuri_english.site
2002-01-04 added jp_daily_yomiuri_english.site from Michael Schubart
   
jmason site_samples/regional_japan/jp_japan_times/
2002-01-02 jp_japan_times_business.site, jp_japan_times_news.site: added jp_japan_times sites from Michael Schubart
   
jmason lib/Sitescooper/Main.pm
2001-12-30 added fix for iSilo on win2k
   
jmason site_samples/comics/calvin_and_hobbes.site, t/html/newstories/index.html, t/html/newstories/1/page1_1.html, t/html/newstories/2/page2.html
2001-12-16 updated calvin and hobbes site from Gary Paulson
   
jmason site_samples/regional_denmark/politiken_daily_summary.site, t/html/scdiff2.html
2001-12-04 added politiken_daily_summary.site from Jan Lund Thomsen
   
jmason site_samples/humor/alexei_sayle.site, t/html/scdiff2.html
2001-12-04 added Alexei Sayle site from Jan Lund Thomsen
   
jmason lib/Sitescooper/CacheSingleton.pm, lib/Sitescooper/PerSiteDirCache.pm, t/html/http_redirect/front/currentdate/index.html, t/html/newstories/index.html, t/html/newstories/1/page1_1.html, t/html/newstories/2/page2.html
2001-12-04 backed out prev change; already fixed in CVS
   
jmason lib/PDA/PilotInstall.pm, lib/Sitescooper/CacheSingleton.pm, t/html/http_redirect/front/currentdate/index.html
2001-12-04 added fixes for problems reported by Andy Carlson
   
alastair site_samples/regional_australia/fairfax_it.site
2001-12-03 Fixed to work with the latest Fairfax site changes.
   
alastair site_samples/tech/zzz.site
2001-12-03 Updated site to include ContentsDiff (d'oh!)
   
jmason t/html/newstoriesdiff/index.html
2001-12-02 ..
   
jmason sitescooper.pl, lib/Sitescooper/Main.pm, t/html/newstoriesdiff/index.html
2001-12-02 added Torsten Uhlmann's isilo-X support patch
   
jmason lib/Sitescooper/Robot.pm, site_samples/regional_denmark/politiken.site
2001-11-25 updated politiken, from Claus Hindsgaul
   
jmason site_samples/linux/kc_kde.site
2001-11-12 updated kc_kde from Torsten Uhlmann
   
jmason site_samples/comics/family_circus.site
2001-10-31 family_circus.site from Thean Yoon Fui
   
barrygonzaga site_samples/palm/palminfocenter.site
2001-10-26 removed advertisement from contents; fixed (P|p)olls.asp link in contents
   
jmason default_templates.html, lib/Sitescooper/Main.pm, lib/Sitescooper/PerSiteDirCache.pm, lib/Sitescooper/URLProcessor.pm, site_samples/palmsized/the_guardian_palmsized.site
2001-10-06 fixed bug using -fromcache with shared cache
   
jmason site_samples/
2001-10-02 regional_uk/the_guardian.site, science/new_scientist.site: a few site updates
   
jmason site_samples/science/new_scientist.site
2001-10-02 updated newscientist site
   
jmason sitescooper.cf, lib/Sitescooper/DirCacheFactory.pm, lib/Sitescooper/Main.pm
2001-10-02 added __OUTPUTFORMAT__ support
   
jmason site_samples/science/sciam.site
2001-10-02 updated sciam site to honor caching
   
jmason lib/PDA/PilotInstall.pm
2001-09-27 fixed PDA::PilotInstall to work with later palm desktops and activeperls
   
jmason lib/PDA/PilotInstall.pm
2001-09-25 fixes from Tim Steele
   
barrygonzaga site_samples/palmsized/the_register.site
2001-09-21 used rss file, palm-friendly site is/was not updated regularly.
   
barrygonzaga site_samples/palmsized/beyond2000-pda.site
2001-09-21 used by2k's palm edition.
   
barrygonzaga site_samples/regional_philippines/pdi.site
2001-09-20 fixed site's erroneous links
   
barrygonzaga site_samples/palmsized/cnn.site
2001-09-19 added contents logo, removed duplicate 's on stories.
   
barrygonzaga site_samples/regional_philippines/
2001-09-19 manila_bulletin.site, pdi.site: renamed/moved category, regional_philippines *not* regional_phillipines.
   
jmason site_samples/
2001-09-17 bsd/openbsd_journal.site, palm/palminfocenter.site, palmsized/cnn.site, palmsized/ny_times_handheld.site, palmsized/the_register.site: site files from Barry Dexter A. Gonzaga
   
jmason site_samples/palmsized/the_guardian_palmsized.site
2001-09-14 Guardian site updated by Stewart C. Russell (stewart /at/ ref.collins.co.uk)
   
jmason site_samples/business/businessweek.site
2001-09-06 oops, forgot busweek
   
jmason site_samples/
2001-09-05 palm/pdalive.site, palmsized/ny_times.site, palmsized/salon.site, news/gallup_poll.site, palm/palminfocenter.site: added sites from Barry Dexter A. Gonzaga
   
jmason site_samples/regional_denmark/politiken.site
2001-08-27 added Politiken site from Claus Hindsgaul
   
jmason lib/Sitescooper/UserAgent.pm
2001-08-20 fixed http auth support
   
jmason site_samples/regional_toronto/
2001-08-18 globe_and_mail_columnists.site, globe_and_mail_national.site, globe_and_mail_thearts.site, globe_and_mail_toronto.site: globe+mail sites updated by Michael Graham (magog@the-wire.com)
   
jmason site_samples/regional_california/
2001-08-17 la_times.site, latimes_nat.site, latimes_oc.site, la_times/la_times_frontpage.site, la_times/latimes_local.site, la_times/latimes_nat.site, la_times/latimes_oc.site, la_times/latimes_science.site, la_times/latimes_tech.site, la_times/latimes_world.site: added new LA Times sites from Mark Beckman (mbeckman at jps.net), and reorged them into a directory
   
jmason site_samples/comics/
2001-08-16 flash_gordon.site, prince_valiant.site: Yoon Fui Thean: comics update
   
jmason site_samples/
2001-06-28 business/cnn_financial.site, news/cnn_mobile.site, science/sciam.site, sport/cnn_sports.site: added SciAm site from Marko, and some CNN sites from David's PODS system translated by Marko
   
jmason lib/Sitescooper/Main.pm
2001-06-28 added support for escaped-hashes in site files from Jeff Hecker
   
jmason site_samples/opinion/unblinking.site
2001-06-21 fixed typo
   
jmason TODO
2001-06-20 added Manila Bulletin site from Eric Pareja
   
jmason sitescooper.cf, doc/running.html
2001-06-19 fixed doco a little
   
jmason lib/Sitescooper/LWPHTTPClient.pm, lib/Sitescooper/Main.pm, lib/Sitescooper/SCF.pm, lib/Sitescooper/URLProcessor.pm, lib/Sitescooper/Util.pm, site_samples/tech/firstmonday.site
2001-06-16 added First Monday site, and worked around webserver bug
   
jmason Makefile
2001-06-11 fixed MANDIR in sitescooper make install
   
jmason site_samples/tech/the_register.site
2001-06-08 added sites
   
jmason site_samples/regional_germany/
2001-06-08 de_heise.site, de_sueddeutsche.site, de_sz/de_sz.site, de_sz/de_sz_drei.site, de_sz/de_sz_politik.site, de_sz/de_sz_sport.site, de_sz/de_sz_wissen.site, de_zeit/de_zeit.site, de_zeit/de_zeit_alternate.site, de_zeit/de_zeit_kultur.site, de_zeit/de_zeit_leben.site, de_zeit/de_zeit_media.site, de_zeit/de_zeit_politik.site, de_zeit/de_zeit_reisen.site, de_zeit/de_zeit_wirtschaft.site, de_zeit/de_zeit_wissen.site: new de_sz, de_zeit and de_heise sites from Peter Marschall
   
jmason site_samples/regional_germany/de_sz/
2001-06-08 de_sz.site, de_sz.site-halbwegs-ok, de_sz_bay.site, de_sz_bayern.site, de_sz_berlin.site, de_sz_beruf.site, de_sz_drei.site, de_sz_feuill.site, de_sz_feuilleton.site, de_sz_hochschule.site, de_sz_immobilien.site, de_sz_kultur.site, de_sz_literatur.site, de_sz_medien.site, de_sz_meinung.site, de_sz_muenchen.site, de_sz_nche.site, de_sz_pano.site, de_sz_panorama.site, de_sz_politik.site, de_sz_reise.site, de_sz_sonder.site, de_sz_sonderbeilage.site, de_sz_sport.site, de_sz_streifl.site, de_sz_streiflicht.site, de_sz_verkehr.site, de_sz_verm.site, de_sz_vier.site, de_sz_wirt.site, de_sz_wirtschaft.site, de_sz_wissen.site, de_sz_wochenende.site: new de_sz and de_zeit sites from Peter Marschall
   
jmason sitescooper.pl, lib/Sitescooper/Main.pm, lib/Sitescooper/PerSiteDirCache.pm, site_samples/languages/use_perl.site
2001-06-05 added mod to not copy up .cvsignore
   
