Page 1 of 1

Google indexing news module

Posted: Tue Aug 05, 2008 7:14 pm
by CWebguy
I'm getting a second index for each page in Google for the news module.  Any way to prevent this (like a robot.txt or something)?  Thanks.

Note:  URL looks like this .php?mact=News,cntnt01,detail,0&cntnt01articleid=1&cntnt01origid=61&cntnt01returnid=61

Thanks.

Re: Google indexing news module

Posted: Wed Aug 06, 2008 6:20 pm
by CWebguy
Well, robots won't let me take out a string parameter (only folders).  Any ideas?

I guess I'll just have to disable it or try to hack it somehow.

Thanks!
CWebguy

Re: Google indexing news module

Posted: Wed Aug 06, 2008 7:56 pm
by Dr.CSS
Try turning on pretty URLs and hierarchy in config, use mod_rewrite to take out the index.php? then you may be able to tell robot not to follow the news URL...

Re: Google indexing news module

Posted: Wed Aug 06, 2008 8:24 pm
by viebig
This is an awful problem of the cmsms modules in general.

They can generate different urls for the same page. In this case the return ID is the problem.

This is horrible for SEO and acessibility.

Imagine the scenario, this simple scenario thta by the way is the one of the cmsms standard install: A template that show the news module foreach page.

Every page will have a different URL, because the returnid will be different.

I think we shoud really patch a solution in the next release of news modules. This problem in totally controversial on CMSMS principples on SEO and Acessibility.

Re: Google indexing news module

Posted: Wed Aug 06, 2008 9:20 pm
by CWebguy
I agree with viebig, a fix would be nice :)

Mark, I have tried turning on pretty urls (internal config) with no success.  I don't feel like messing the apache mod_rewrite right now (just cause I have so much else to do! haha :) so for the time being I've just disabled it but I will look into the mod_rewrite in the future and take that into consideration!  (Glad I was able to point this problem out :)

Thanks!
Cwebguy