April 2009
11 posts
The ElementTree iterparse Function →
You want to parse a huge (>15Gb) xml file. You want to use xml.etree.cElementTree.iterparse
Apr 30th
oddCMS - Index →
Looks ideal for displaying static pages
Apr 29th
MWDumper : converting Wikipedia xml files to sql →
Apr 25th
Wikipedia:Database download  →
Apr 25th
2 tags
Setting up cron jobs in Django
In my Django project, I needed to crawl a page every hour for data and import it to my database. The best/cleanest way I found to do this is using the django-command-extensions extension. This is a great extension that includes great job management, and many more features. Install django-command-extensions found here. Read...
Apr 24th
conText: a Wikipedia-powered content tagger →
Apr 21st
Collaborative collective classificiation - BBC... →
Apr 21st
The Wire Bible →
Apr 17th
Apr 9th
Pop-Up: Yelle →
je veux
Apr 3rd
json-template - Reference for the language →
Apr 2nd