HTMLReader handles numeric and named entity references differently #1

https://github.com/ange007/HTMLp/blob/4883f9902d88aee29570867530d804649ee15c79/HtmlReader.pas#L497

The ReadNumericEntityNode function handles its input differently from ReadNamedEntityNode. A numeric entity is read as a TEXT_NODE, while a named entity is read as an ENTITY_REFERENCE_NODE. Different events are also triggered, so HTMLParser handles the two cases separately, which may cause problems when parsing HTML. For example, `&lt;` and `&#60;` are handled differently even though both stand for the same character.
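To illustrate the behavior I would expect, here is a minimal sketch using Python's standard `html.parser` module as a point of comparison. It is not HTMLp code, and the `Collector` class and `parsed_text` helper are names made up for this example. With `convert_charrefs=True`, that parser decodes both kinds of reference into ordinary text before reporting it, so the two inputs produce the same result.

```python
from html.parser import HTMLParser


class Collector(HTMLParser):
    """Hypothetical helper: gathers the text chunks reported by the parser."""

    def __init__(self):
        # convert_charrefs=True makes the parser decode named and numeric
        # character references and deliver them through handle_data().
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def parsed_text(markup):
    """Parse the markup and return all text content joined together."""
    parser = Collector()
    parser.feed(markup)
    parser.close()
    return "".join(parser.chunks)


named = parsed_text("<p>a &lt; b</p>")     # named reference
numeric = parsed_text("<p>a &#60; b</p>")  # numeric reference

# Both references decode to the same character and arrive as plain text.
print(named == numeric)
```

In this comparison there is no separate event or node type for either reference kind, which is roughly what I would expect HTMLReader to do for both as well.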

Do you know whether this is intended behavior? Does the HTML parsing spec state that these have to be parsed differently?

I can also provide a PR to fix this if needed.
