You are writing a comment about An Introduction to Compassionate Screen Scraping. Here is a quick summary:
One of the most common quickie projects on the web is to screenscrape a website and play around with its data. These projects are a lot of fun, and can allow for inventive mashups, but often the screenscraping scripts cause unnecessary load on the site's servers due to inconsiderate technique. This is an introduction to the art of compassionate screenscraping.
You are responding to this comment written by Andreas Krohn on August 11th 2008, 05:57.
Great post as always. Why are you using Python to do webscraping instead of openkapow.com, dappit.com or some tool like that? Writing screenscraping code is fine when the site has valid HTML and no serious JavaScript, but otherwise it quickly gets very complex.