I see that some rss
on xml
have strange strings.
For example, ...
is …
.
What is this? And why? It开发者_C百科s about CDATA
?
On my web application (C#), reading rss, I usually do myString.Replace("…", "...")
: not sure if it's the best strategy
These are numeric character references.
A numeric character reference (NCR) is a common markup construct used in SGML and other SGML-related markup languages such as HTML and XML. It consists of a short sequence of characters that, in turn, represent a single character from the Universal Character Set (UCS) of Unicode.
In this case, the numeric character reference for the ellipsis character - …
.
From this reference page it's horizontal ellipsis.
精彩评论