Possible bug in HTML::Parser
- From: "Mark" <nospam@xxxxxxxxxxxxxxxx>
- Date: Tue, 15 Nov 2005 17:05:19 -0800
Hello.
I am using the HTML::Parser module to parse a list of bookmarks
exported from the Firefox browser. Firefox exports bookmarks to an
HTML file containing nested definition lists.
I have discovered that when the parser encounters a bookmark
whose name ends in a closing parenthesis, the closing parenthesis
is stripped. (Bookmark names are coded as definition terms, using
the <dt> tag.)
A sample of the code being parsed looks like this:
<DT><A HREF="http://www.google.com" ADD_DATE="1101144594"
ID="rdf:#$.GjDP">Google (search engine)</A>
The decoded text passed to the handler by HTML::Parser
would be "Google (search engine".
Any ideas whether this is a bug in HTML::Parser, or should I
take another look at my code?
Thanks
-Mark
.
- Follow-Ups:
- SOLVED: Possible bug in HTML::Parser
- From: Mark
- Re: Possible bug in HTML::Parser
- From: Mark
- Re: Possible bug in HTML::Parser
- From: Bart Lateur
- SOLVED: Possible bug in HTML::Parser
- Prev by Date: Re: XML::Parser examples for the novice
- Next by Date: BioPerl
- Previous by thread: Perl Related Question
- Next by thread: Re: Possible bug in HTML::Parser
- Index(es):
Relevant Pages
|
|