Inspired by the web services madness at BlogShares and Technorati, I've whipped up a quick XML-RPC interface to our own NITLE crawl database. You can get the language, authoring tool, and number of incoming and outgoing blog links for any blog URL we have listed (110K blogs and daily growin'). The micro-documentation is available right here, or you can grab the source code for a tiny Perl demo client. No keys or other such nonsense - just don't melt our server.
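For anyone who would rather not read Perl, here is a rough Python sketch of what a client call might look like. It is an illustration only: the endpoint URL, the method name `census.lookup`, and the field names in the sample reply are all made up for this example, so check the micro-documentation for the real ones. The sketch builds the XML-RPC request body and decodes a reply, using only the standard library.

```python
import xmlrpc.client

# ASSUMPTIONS: both values below are placeholders invented for this sketch.
# The real endpoint and method name are in the micro-documentation.
ENDPOINT = "http://example.org/RPC2"
METHOD = "census.lookup"


def build_request(blog_url):
    """Serialize an XML-RPC call that passes one blog URL as its argument."""
    return xmlrpc.client.dumps((blog_url,), methodname=METHOD)


def parse_response(xml_text):
    """Decode an XML-RPC response body and return its first value."""
    params, _method = xmlrpc.client.loads(xml_text)
    return params[0]


def lookup(blog_url):
    """Make the call over the network. Needs a real endpoint to work."""
    proxy = xmlrpc.client.ServerProxy(ENDPOINT)
    # getattr is used because the hypothetical method name contains a dot.
    return getattr(proxy, METHOD)(blog_url)
```

Calling `build_request("http://example.com/blog/")` gives you the XML that would go over the wire, which is handy for checking a request by eye before pointing `lookup` at the live server.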
For the curious, I've also slapped together a description of our methodology, such as it is, explaining how we run the crawl. It's something that I hope to flesh out as time goes by - let me know what I left out, or send in suggestions for a better way to do it.
I think we're going to call this project the 'NITLE Blog Census', by the way. That makes it sound nice and official.
Your Host
Maciej Cegłowski
maciej @ ceglowski.com
Please ask permission before reprinting full-text posts or I will crush you.