Re: Possible bug in HTML::Parser



"Bart Lateur" <bart.lateur@xxxxxxxxxx> wrote:
>> Mark wrote:
>>
>> <DT><A HREF="http://www.google.com"; ADD_DATE="1101144594"
>> ID="rdf:#$.GjDP">Google (search engine)</A>
>>
>> The decoded text passed to the handler by HTML::Parser
>> would be "Google (search engine".
>
> I've tried it with HTML::TokeParser::Simple, which is built on top of
> HTML::Parser, and it comes out well:
>

Ok, I've replicated your example using HTML::TokeParser::Simple.
But I would sure hate to scrap the hours I just spent learning
HTML::Parser, and re-write with TokeParser. After all, TokeParser
was supposedly written to save people from having to learn
HTML::Parser!

Can anyone here identify the problem with HTML::Parser, or
perhaps my (mis)use of this module? If TokeParser is based on
HTML::Parser, then it seems odd that it does not encounter
the same problem (unless it works around it somehow.)

Thanks
-Mark




.