telophase: (Cannot brain I have the dumb)
telophase ([personal profile] telophase) wrote2008-09-24 10:37 am
Entry tags:

aaaarrgghhh!

Still working on that RSS parser. My current problem is although the feed contains HTML tags like links and code for italics, the parser I've got only barfs up the text, minus formatting and links. And I can't find anything online talking about this, or how to combat it. It ought to be possible to do it! Aaaarggh!

(I think it's the
strDesc = objItem.selectSingleNode("description").Text
line that's stripping the formatting tags, but so far I've failed to find anything online about that. It doesn't help that I'm currently sleep-deprived.)

[identity profile] forvrin.livejournal.com 2008-09-24 05:15 pm (UTC)(link)
What's it written in?

[identity profile] telophase.livejournal.com 2008-09-24 05:19 pm (UTC)(link)
VBscript.

I've managed to run across a reference to CDATA, which appears in the source code of the feed produces by Wordpress, which tells me that it means it's character data only and doesn't parse the tags. After lunch, that's going to be my next angle of attack on this problem - to research CDATA and figure out what-all's up with it and see if I can change it.

[identity profile] forvrin.livejournal.com 2008-09-24 05:26 pm (UTC)(link)
CDATA would have to do with the XML file, not so much with the stuff in the VBscript object.

are you reading RSS and trying to parse it, or produce RSS feeds?


[identity profile] telophase.livejournal.com 2008-09-24 06:23 pm (UTC)(link)
Reading and trying to parse it. This has the RSS and parser code in it (http://www.magatsu.net/download/files.zip), so I don't need to attempt to explain what it's doing. :)