Non-English characters incorrect with the RSS module?
Posted: Sun Dec 17, 2006 7:50 pm
Hi, I've been trying to use the RSS module to display my blog (Blogger) on my site, and I've found that the Japanese/Chinese characters show up as "?" after the XML is parsed. I looked at the code and the source XML, and both seem to use UTF-8 encoding. Is there any known problem with the RSS module?
Btw, I'm using PHP5, so the XML parser *should* not be a problem, right?
For example, try to get the Engadget Japanese RSS:
http://japanese.engadget.com/rss.xml
Or the Chinese RSS:
http://chinese.engadget.com/rss.xml
Everything should come out as question marks.
*edit*
Ok, I discovered that the descriptions in both of these feeds are fine. But the titles still come out as question marks, weird.
*edit again*
It seems the engadget.com feeds are not the best example. Their descriptions encode the UTF-8 characters as HTML character entities, which is why they survive. The problem arises when the characters appear as literal UTF-8 text in the feed.
This feed would illustrate the problem:
http://blog.cnblog.org/atom.xml
I just realized that many of you probably don't have East Asian text support installed (in XP); you will need to install it to see the characters properly.
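
To show why entity-encoded descriptions would survive while literal titles turn into "?", here is a minimal sketch in Python (not the RSS module's actual PHP code). It assumes, as one possible cause and nothing more, that somewhere in the pipeline the text is converted to a single-byte charset such as ISO-8859-1. The sample title and the helper name `lossy_to_latin1` are made up for the illustration.

```python
import html

# Literal East Asian text, as it would appear hardcoded in a feed title.
raw_title = "日本語のタイトル"

# The same text written as numeric character references (pure ASCII),
# similar to how the engadget descriptions are encoded.
escaped_desc = "".join(f"&#{ord(c)};" for c in raw_title)


def lossy_to_latin1(text: str) -> str:
    """Hypothetical lossy step: force text through ISO-8859-1.

    Characters the charset cannot represent are replaced with '?'.
    """
    return text.encode("iso-8859-1", errors="replace").decode("iso-8859-1")


# Literal characters are lost in the conversion.
title_out = lossy_to_latin1(raw_title)

# The references are plain ASCII, so they pass through untouched and can
# still be decoded back to the original characters afterwards.
desc_out = html.unescape(lossy_to_latin1(escaped_desc))

print(title_out)
print(desc_out)
```

If this is what is happening, the thing to check would be which output encoding the parser and the page are actually using, rather than the feed itself.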