
perlkid
stranger
Oct 25, 2000, 12:15 AM
Post #1 of 2
XML Help, Kanji... Anyone?
There is a file I'm trying to parse. It contains news headlines and articles. For each news article, the XML looks like this:

code:

    <article id="_11403615">
      <url>http://c.moreover.com/click/here.pl?x11403613</url>
      <headline_text>Multiprocessing Modules</headline_text>
      <source>CNET</source>
      <media_type>text</media_type>
      <cluster>Web developer news</cluster>
      <tagline />
      <document_url>http://home.cnet.com/webbuilding/0-3880-6-7253.html?tag=st.bl.3880.also.3880-6-7253</document_url>
      <harvest_time>Oct 24 2000 9:23PM</harvest_time>
      <access_registration />
      <access_status />
    </article>

I'm trying to print all of the news headlines, but I'm having no success at all. I don't have XML::Parser either. Can anyone help me with this? I've tried all kinds of regexes, but they just don't treat the XML like text.

Thanks a lot,
perlkid
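For anyone following along, here is a minimal sketch of the kind of regex extraction described above, using only core Perl and no XML::Parser. It rests on several assumptions: the whole feed fits in memory as one string, headline_text elements are never nested or wrapped in CDATA, and entities such as &amp;amp; can be left undecoded. The sub name extract_headlines is hypothetical, and the sample data is just the article from the post.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: return the text of every <headline_text> element
# found in an XML string. This is a plain regex scan, not a real XML parser.
sub extract_headlines {
    my ($xml) = @_;
    my @headlines;
    # .*? is non-greedy so each match stops at the nearest closing tag;
    # /s lets "." cross newlines, /g walks through every occurrence.
    while ($xml =~ m{<headline_text>(.*?)</headline_text>}gs) {
        my $text = $1;
        $text =~ s/^\s+|\s+$//g;    # trim surrounding whitespace
        push @headlines, $text;
    }
    return @headlines;
}

# Sample input taken from the article shown in the post.
my $sample = <<'XML';
<article id="_11403615">
  <url>http://c.moreover.com/click/here.pl?x11403613</url>
  <headline_text>Multiprocessing Modules</headline_text>
  <source>CNET</source>
</article>
XML

print "$_\n" for extract_headlines($sample);
```

Matching against the whole document as a single string, rather than line by line, means it should not matter whether the feed arrives on one long line or is split across many. A proper parser is still the safer choice if the feed ever uses CDATA sections or attributes on headline_text.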