Page 1 of 1

None-english characters incorrect with the RSS module?

Posted: Sun Dec 17, 2006 7:50 pm
by sinkpoint
Hi, I've trying to use the RSS module to display my blog (Blogger ) on my site, and I've found that the Japanese/Chinese characters show up as "?" after the xml is parsed. I looked at the code and the source xml, both seemed to use UTF-8 encoding. Is there any known problem with the RSS module?

Btw I'm using PHP5, so the xml parser *should* be no problem right?

For example, try to get the Engadget Japanese RSS:
http://japanese.engadget.com/rss.xml

Or the Chinese RSS:
http://chinese.engadget.com/rss.xml

Everything should come out as question marks.

*edit*

Ok, I discovered that the descriptions on both of these links are fine. But the title still comes out as question marks, weird.

*edit again*

Seems that the engadget.com feeds are not the best example. Their descriptions encodes the utf-8 characters as special HTML chars. The problem arises when the characters are hardcoded text.

This feed would illustrate the problem:
http://blog.cnblog.org/atom.xml

I just realized that many of you probably don't have East Asian text support installed ( in XP ), you will need to install them to see the characters properly.

Re: None-english characters incorrect with the RSS module?

Posted: Sun Dec 17, 2006 8:43 pm
by sinkpoint
Solved!

Ok, I've discovered the problem.

In file: modules/RSS/rss_fetch.inc line# 358

Change "MAGPIE_OUTPUT_ENCODING" from "ISO-8859-1" to "UTF-8" solves the problem.