None-english characters incorrect with the RSS module?

For questions and problems with the CMS core. This board is NOT for any 3rd party modules, addons, PHP scripts or anything NOT distributed with the CMS made simple package itself.
Locked
sinkpoint

None-english characters incorrect with the RSS module?

Post by sinkpoint »

Hi, I've trying to use the RSS module to display my blog (Blogger ) on my site, and I've found that the Japanese/Chinese characters show up as "?" after the xml is parsed. I looked at the code and the source xml, both seemed to use UTF-8 encoding. Is there any known problem with the RSS module?

Btw I'm using PHP5, so the xml parser *should* be no problem right?

For example, try to get the Engadget Japanese RSS:
http://japanese.engadget.com/rss.xml

Or the Chinese RSS:
http://chinese.engadget.com/rss.xml

Everything should come out as question marks.

*edit*

Ok, I discovered that the descriptions on both of these links are fine. But the title still comes out as question marks, weird.

*edit again*

Seems that the engadget.com feeds are not the best example. Their descriptions encodes the utf-8 characters as special HTML chars. The problem arises when the characters are hardcoded text.

This feed would illustrate the problem:
http://blog.cnblog.org/atom.xml

I just realized that many of you probably don't have East Asian text support installed ( in XP ), you will need to install them to see the characters properly.
Last edited by sinkpoint on Sun Dec 17, 2006 8:17 pm, edited 1 time in total.
sinkpoint

Re: None-english characters incorrect with the RSS module?

Post by sinkpoint »

Solved!

Ok, I've discovered the problem.

In file: modules/RSS/rss_fetch.inc line# 358

Change "MAGPIE_OUTPUT_ENCODING" from "ISO-8859-1" to "UTF-8" solves the problem.
Locked

Return to “CMSMS Core”