The RSS feed is not properly handling special characters in its content.
For example, when parsing information from this URL:
Example: Supercoppa e non solo, nel super gennaio di Conceiçao c'è di tutto. Le tre vie per ripartire
Expected: Supercoppa e non solo, nel super gennaio di Conceição c'è di tutto. Le tre vie per ripartire
Observed: Special characters remain as escaped entities (&#x...;).
I investigated the issue and found that the RSS feed from the above URL does not appear to wrap the <description> field in <![CDATA[...]]>. As a result, the parser used by Glance may depend too heavily on the feed's format and may fail to decode entities when the CDATA wrapper is missing.
Steps to Reproduce
Fetch the RSS feed from https://www.gazzetta.it/dynamic-feed/rss/section/Calcio.xml.
Parse the feed and examine the <description> fields for escaped special characters.
Special characters like ç and é remain in their escaped form (&#x...;) instead of being decoded into readable text.
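The check in the steps above can be sketched offline as follows. This is a minimal sketch in Python rather than Glance's own code: `SAMPLE_FEED` is a made-up snippet (not copied from the Gazzetta feed), and `find_undecoded` and `ENTITY_RE` are hypothetical names introduced only for illustration. It parses the feed with a standard XML parser and lists every <description> whose text still contains entity-like sequences after parsing.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample, NOT copied from the real feed. The first item uses
# ordinary numeric character references; the second is double-escaped, so
# the references survive XML parsing as literal text.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Sample</title>
    <item><description>Concei&#xE7;&#xE3;o c'&#xE8; di tutto</description></item>
    <item><description>Concei&amp;#xE7;&amp;#xE3;o c'&amp;#xE8; di tutto</description></item>
  </channel>
</rss>"""

# Matches hex references (&#xE7;), decimal references (&#231;),
# and named entities (&eacute;).
ENTITY_RE = re.compile(r"&(#x[0-9A-Fa-f]+|#[0-9]+|[A-Za-z][A-Za-z0-9]*);")


def find_undecoded(feed_xml: str) -> list[str]:
    """Return the <description> texts that still contain entity-like sequences."""
    root = ET.fromstring(feed_xml)
    return [
        d.text
        for d in root.iter("description")
        if d.text and ENTITY_RE.search(d.text)
    ]


for text in find_undecoded(SAMPLE_FEED):
    print("still escaped:", text)
```

To run the same check against the real feed, the XML could be downloaded (for example with `urllib.request.urlopen`) and passed to `find_undecoded`. Only items that reach the application with literal `&#x...;` text would be reported.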
Expected Behavior
Special characters should be handled properly, with escaped entities correctly decoded into readable text.
Possible Cause
The feed does not wrap the <description> fields in CDATA blocks, which feeds commonly use to carry text that should not be interpreted as markup.
The parser might assume CDATA-wrapped descriptions and leave the entities undecoded when the wrapper is absent.
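One way to test this hypothesis is to compare how a standard XML parser treats the same text with and without CDATA. The sketch below is in Python, not Glance's code, and all three fragments are made-up inputs; `description_text` and `decode_leftover_entities` are hypothetical helper names. It suggests that numeric references outside CDATA are decoded by the parser itself, while references inside CDATA or double-escaped ones come through as literal text, which an extra unescape pass could then decode.

```python
import html
import xml.etree.ElementTree as ET


def description_text(fragment: str) -> str:
    """Parse a single <description> element and return its text content."""
    return ET.fromstring(fragment).text or ""


def decode_leftover_entities(text: str) -> str:
    """Decode entity sequences that survived XML parsing as literal text."""
    return html.unescape(text)


# Made-up fragments for illustration, not taken from the feed.
plain_refs = "<description>Concei&#xE7;&#xE3;o</description>"
cdata_refs = "<description><![CDATA[Concei&#xE7;&#xE3;o]]></description>"
double_escaped = "<description>Concei&amp;#xE7;&amp;#xE3;o</description>"

for name, fragment in [
    ("plain references", plain_refs),
    ("references inside CDATA", cdata_refs),
    ("double-escaped references", double_escaped),
]:
    parsed = description_text(fragment)
    print(f"{name}: parsed={parsed!r} unescaped={decode_leftover_entities(parsed)!r}")
```

If the real feed turns out to match the second or third case, a post-processing unescape step of this kind might be enough. The trade-off is that it would also decode entity-like sequences that an author meant to display literally.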
marb08 changed the title from "Improper Special Character Escaping in RSS Feed" to "Improper special character escaping in RSS Feed" on Jan 3, 2025