Pages Similar to Storing Bounded Timeboxes in Redis

An Introduction to Compassionate Screen Scraping

One of the most common quickie projects on the web is to screenscrape a website and play around with its data. These projects are a lot of fun and can allow for inventive mashups, but the screenscraping scripts often put unnecessary load on the site's servers through inconsiderate technique. This is an introduction to the art of compassionate screenscraping.

Deploying Django with Fabric

A look at how to manage the complexity of deploying Django using Fabric. Something of a continuation of yesterday's post.

Python Datastructures Backed by Redis


Part of my day's experiment was to play with implementing Python datastructures on top of Redis. Here we take a look at dictionaries and lists, but it should be straightforward to extend this idea to sets as well.

An Epic Introduction to PyObjC and Cocoa

PyObjC is one of the most helpful projects I have ever used, but a number of people have had trouble getting started with PyObjC on Leopard because the documentation is in a bit of disarray. In particular, there didn't seem to be a comprehensive, completely up-to-date tutorial that could introduce a newcomer to all the important aspects of PyObjC. Here is my attempt to fill that void. With a vengeance.

Notes on Redis Memory Usage


Yesterday I spent some time running memory usage tests on Redis. Specifically, I was interested in storing roughly comparable data in the different data structures and comparing the costs. This article contains the numbers and notes from that process.

Huge CSV and XML Files in Python


Quick walkthrough of my code for converting a very large CSV file into a very large XML file using the Python standard libraries. Despite a few issues along the way, it was a very pleasant experience.
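
To give a flavor of the idea, here is a minimal sketch of a streaming CSV-to-XML conversion using only the standard library. It is not the code from the article: the function name `csv_to_xml` and the `rows`/`row` tag names are made up for illustration, and it assumes the CSV header names are valid XML element names and that every row has the same number of fields as the header. Rows are read and written one at a time, so memory use stays flat however large the file is.

```python
import csv
import io
from xml.sax.saxutils import XMLGenerator


def csv_to_xml(csv_file, xml_file, root_tag="rows", row_tag="row"):
    """Stream rows from csv_file into xml_file; return the row count.

    Hypothetical helper: assumes header names are valid XML tag names.
    """
    reader = csv.DictReader(csv_file)
    writer = XMLGenerator(xml_file, encoding="utf-8")
    writer.startDocument()
    writer.startElement(root_tag, {})
    count = 0
    for record in reader:  # one row in memory at a time
        writer.startElement(row_tag, {})
        for column, value in record.items():
            writer.startElement(column, {})
            writer.characters(value or "")  # characters() escapes &, <, >
            writer.endElement(column)
        writer.endElement(row_tag)
        count += 1
    writer.endElement(root_tag)
    writer.endDocument()
    return count


if __name__ == "__main__":
    source = io.StringIO("name,age\nAda,36\n")
    target = io.StringIO()
    csv_to_xml(source, target)
    print(target.getvalue())
```

With real files you would pass objects from `open(...)` rather than `io.StringIO`; for the CSV side, open it with `newline=""` as the `csv` module documentation recommends.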

Using Optional Parameters in Django Urls


A simple but helpful trick for using optional parameters in Django views to allow one view to serve multiple urls with varying parameters.

Using Threadpools in Python


This article takes a look at creating a threadpool in Python. Specifically, it takes a stab at iteratively processing CSV and XML files and farming out the parsed data for processing by a threadpool. The Python logging, csv and ElementTree modules make cameo appearances.
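
As a rough illustration of the pattern (not the article's implementation), here is a small threadpool built from `threading` and `queue`. The names `run_pool` and `handler` are hypothetical. Rows are pulled lazily from any iterable, such as a `csv.DictReader`, and handed to worker threads; results come back in no particular order.

```python
import csv
import io
import logging
import queue
import threading

log = logging.getLogger("pool")


def run_pool(rows, handler, num_workers=4):
    """Feed rows to num_workers threads; return handler results (unordered)."""
    tasks = queue.Queue()
    results = []
    lock = threading.Lock()

    def worker():
        while True:
            row = tasks.get()
            if row is None:  # sentinel: no more work for this thread
                break
            try:
                value = handler(row)
                with lock:
                    results.append(value)
            except Exception:
                # Log and keep going so one bad row doesn't kill the worker.
                log.exception("failed to process %r", row)

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for row in rows:  # the input is consumed one row at a time
        tasks.put(row)
    for _ in threads:  # one sentinel per worker
        tasks.put(None)
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    data = io.StringIO("a,b\n1,2\n3,4\n")
    totals = run_pool(csv.DictReader(data), lambda r: int(r["a"]) + int(r["b"]))
    print(sorted(totals))
```

These days `concurrent.futures.ThreadPoolExecutor` covers the same ground with less code; the hand-rolled version is shown only because it makes the moving parts visible.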

Tailing in Python


A quick and pointless look at implementing tail in Python. Something of a koan.
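
For the curious, one very small way to get the last few lines of a file is a bounded `collections.deque`. This is only a sketch and not necessarily what the article does: the function name `tail` and its signature are my own, and it walks the whole input from the start rather than seeking backwards from the end.

```python
from collections import deque


def tail(lines, n=10):
    """Return the last n items of an iterable of lines (e.g. an open file)."""
    # A deque with maxlen=n silently drops the oldest entry on overflow,
    # so only n lines are ever held in memory.
    return list(deque(lines, maxlen=n))


if __name__ == "__main__":
    sample = ["one\n", "two\n", "three\n", "four\n"]
    for line in tail(sample, n=2):
        print(line, end="")
```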

Using PyFacebook without the Facebook middleware

I've been working on a Facebook application with a couple of friends recently. We decided to use the PyFacebook library, but there was a brief period of intense confusion on my part about how to use it without the included middleware. I worked through it, though, and this article has some advice on how you can do the same.

All Rights Reserved, Will Larson 2007 - 2014.