Parsing RSS 1.0 with XML::LibXML
I needed an RSS parser that could handle multiple dc:subject fields for a project. And I’m starting to think about what the next generation of XML::RSS should look like. (the current thinking is to break it into separate parser and write elements, and hopefully delegate all the hard work to great modules which have developed since XML::RSS was originally written. Also I’m not convinced that parsing the RSS to a data structure vs. an object is still a good design decision. Hopefully discussions of this will happen on Perl RSS Workers list.) I started to hack out something quick using XML::LibXML. A couple of hours of work turned out something that was described as “kind of like XML::Atom, but, um, with bugs.” So I spent the flight Seattle -> Oakland copying XML::Atom::Feed, and I’ve got the first hints of a RSS 1.0 parser based on libxml2. I don’t like the way I’m handling namespaces, it was just quick and easy, but writing this has given me a few ideas. I’ll write about them soon. (though maybe not until the flight back to Seattle next weekend)
In the meantime, RSS::Feed 0.001a.