An Introduction to Compassionate Screen Scraping

08/10/2008 python (48) screen-scraping (3)

Page Summary

One of the most common quickie projects on the web is to screenscrape a website and play around with its data. These projects are a lot of fun and can lead to inventive mashups, but the screenscraping scripts often put unnecessary load on the site's servers because of inconsiderate technique. This is an introduction to the art of compassionate screenscraping.

Page Statistics

An Introduction to Compassionate Screen Scraping has received 43933 pageviews (an average of 25 views per day since publication).

Pageviews for Recent Days

Show daily pageviews for trailing window.

Date        Views
05/24/2013  23
05/23/2013  26
05/22/2013  16
05/21/2013  25
05/20/2013  26
05/19/2013  34
05/18/2013  25
05/17/2013  21
05/16/2013  19
05/15/2013  21
05/14/2013  18
05/13/2013  32
05/12/2013  20
05/11/2013  30
05/10/2013  18
05/09/2013  21
05/08/2013  35
05/07/2013  33
05/06/2013  21
05/05/2013  37
05/04/2013  27
05/03/2013  29
05/02/2013  28
05/01/2013  38
04/30/2013  31
04/29/2013  24
04/28/2013  20
04/27/2013  27
04/26/2013  19
04/25/2013  39
04/24/2013  23

Page Referrers

Top referrers for this page. Show up to 50 referrers with at least 10 pageviews.

Referrer                        Views
direct                          15272
imported from google analytics  10839
news.ycombinator.com            4772
www.google.com                  3982
www.poynter.org                 1779
www.stumbleupon.com             1360
lethain.com                     1203
www.reddit.com                  1073
paradox1x.org                   1032
stackoverflow.com               354
www.quora.com                   298
dev.lethain.com                 262
www.delicious.com               95
www.mozenda.com                 92
webscraping.com                 85
news.ycombinator.org            85
sitescraper.net                 75
schoolofdata.org                67
www.codeproject.com             53
hackerne.ws                     41
twitter.com                     36
searchyc.com                    32
www.instapaper.com              31
mail.python.org                 31
wiki.greasespot.net             30
realtime.michaelhart.me         30
hckrnews.com                    25
www.hnsearch.com                22
hackurls.com                    20
www.netvibes.com                18
www.jimmyr.com                  18

Will Larson

Your delightful host.
Email: lethain[at]gmail
Develop at SocialCode.
Used to Digg, and Y!.

 

All Rights Reserved, Will Larson 2007 - 2013.