CMSMS 1.02 search module and special characters

For questions and problems with the CMS core. This board is NOT for any 3rd party modules, addons, PHP scripts or anything NOT distributed with the CMS made simple package itself.
Post Reply
kode_fi
Forum Members
Forum Members
Posts: 21
Joined: Mon Oct 17, 2005 7:13 am

CMSMS 1.02 search module and special characters

Post by kode_fi »

Made fresh 1.02 installation on Centos 4.4 using php 4.3.9 Apache 2.0.52 mysql 4.1.20.

Cmsms and mysql database uses UTF-8 encoding.
There was a bug on 1.01 (http://dev.cmsmadesimple.org/tracker/in ... 6&atid=101) that prevented search module to search and index umlauts properly. That's corrected now on 1.02.

On CMSMS 1.02, if Title or Menu Text contains nordic characters (öäå) they get stored to database without umlaut encoding and search module doesn't find these strings.
If Content contains nordic special characters without umlaut encoding, search module doesn't find these words either.

Picture is content_props table content field. If content contains special characters with and without umlauts, only umlaut encoded strings are found

Image

Searching word mäki1 finds string but mäki2 finds nothing.

Indexing seems to work OK at both cases

Image

Using only umlaut encoded characters on content is fine for me, but search module being unable to search Menu Texts or Titles containing special characters is big problem.

Something I should try??

Teemu
Piratos

Re: CMSMS 1.02 search module and special characters

Post by Piratos »

Same here

german üöäß in content ok but not in title search never found something.
Ted
Power Poster
Power Poster
Posts: 3329
Joined: Fri Jun 11, 2004 6:58 pm
Location: Fairless Hills, Pa USA

Re: CMSMS 1.02 search module and special characters

Post by Ted »

That's just bizarre...  Ok, I'll look into it.
ganeshXL

Re: CMSMS 1.02 search module and special characters

Post by ganeshXL »

Yes, there's a serious bug with Umlauts.

No matter if I use UTF-8 for everything (DB, HTML-templates, CMSMC-settings) or ISO-8859-1, the TITLES and MENU TEXTS are always broken. I have to *manually* re-write those in phpMyAdmin. After I do that, the titles/menu-texts get displayed correctly on the published (public) site, BUT it's then broken in the admin-section.

I've tried various hacks*, but couldn't find a real workaround.

* e.g. entering ö (ö) etc. in the title-fields didn't work.

As long as you have only english content, that's probaly never a problem, but as soon as you use it for french or german etc., it becomes a bit of a hassle... I don't really want to tell my client to alter everything separately in phpMyAdmin... imagine the chaos he could break loose there :|

btw, I'm using 1.0.4 "Lanai". This happens on both a *nix server + local dev machine on Win2k.
Last edited by ganeshXL on Sat Mar 03, 2007 3:33 am, edited 1 time in total.
Post Reply

Return to “CMSMS Core”