April 2009
11 posts
The ElementTree iterparse Function →
You want to parse a huge (>15Gb) xml file. You want to use xml.etree.cElementTree.iterparse
oddCMS - Index →
Looks ideal for displaying static pages
MWDumper : converting Wikipedia xml files to sql →
Wikipedia:Database download →
2 tags
Setting up cron jobs in Django
In my Django project, I needed to crawl a page every hour for data and import it to my database. The best/cleanest way I found to do this is using the django-command-extensions extension. This is a great extension that includes great job management, and many more features.
Install django-command-extensions found here.
Read...
conText: a Wikipedia-powered content tagger →
Collaborative collective classificiation - BBC... →
The Wire Bible →
Pop-Up: Yelle →
je veux
json-template - Reference for the language →