scrap pad

feed parsing issues

I've been using MagpieRSS to aggregate some feeds (new design/content on circuitous.org). It's straightforward, has caching built in, and handles RSS 1.0, 2.0, and Atom feeds. But isn't UTF8-compatible. Somewhere after fetching the feed, in the process of morphing it into an object, characters get lost.

The source RSS is already in UTF8, Snoopy (the HTTP client included with MagpieRSS) picks it up as UTF8, and MagpieRSS already has an extremely painful looking UTF8 fix... what gives?

A possible solution is to use the "Universal Feed Parser," which would involve writing some Python. Or, I could dig around in MagpieRSS itself.

Archives

who I am