You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ReadNumericEntityNode function handles the readings different way than ReadNamedEntityNode. Numeric entity is read as TEXT_NODE and named entities are read as ENTITY_REFERENCE_NODE, also different events are triggered which causes HTMLParser to handle them in separate ways, which may cause problems when parsing HTML. I.e. /</; and /&/#60/; are handled in separate ways.
You guys know if this is intended functionality or not? Does HTML parsing spec state that these has to be parsed on different ways or something?
I can also provide PR for fixing this if needed.
The text was updated successfully, but these errors were encountered:
Hello.
I am not the original author.
One of the original authors: @smsisko, but as far as I understand, he does not develop the library on GitHub (only in sourceforge, but last version is very old).
Hi, I haven't used that library in a long time. Back then I made a couple of changes with the original author for a problem a ran into, but that was about it. There wasn't a lot of activiy, so I was added as a maintainer. Nowadays I don't often have an occasion to use Delphi (or Free Pascal), and I really haven't kept up to date with the HTML standard.
Sorry I can't be more helpful.
HTMLp/HtmlReader.pas
Line 497 in 4883f99
ReadNumericEntityNode function handles the readings different way than ReadNamedEntityNode. Numeric entity is read as TEXT_NODE and named entities are read as ENTITY_REFERENCE_NODE, also different events are triggered which causes HTMLParser to handle them in separate ways, which may cause problems when parsing HTML. I.e. /</; and /&/#60/; are handled in separate ways.
You guys know if this is intended functionality or not? Does HTML parsing spec state that these has to be parsed on different ways or something?
I can also provide PR for fixing this if needed.
The text was updated successfully, but these errors were encountered: