Page 1 of 2

Indexation on Google (for Azurael)

Posted: 06 Oct 2004 10:17 pm
by Duvel78
Hey,

You was wondering why the indexation on Google so good was for the Volvo300mania forum, well you've got mail!

The keywords are: no ID sessions and url rewritting for the static urls :)

Posted: 07 Oct 2004 04:40 pm
by Azurael
Thanks very much!

I already did the no IDs for anonymous sessions mod, but I really want to do the static page mod :)

Posted: 07 Oct 2004 04:46 pm
by Duvel78
You're welcome!

Something really exciting is...

http://www.google.com/search?hl=en&ie=U ... 0mania.com

It shows how many pages are indexed by Google for Volvo 300 mania: 2520 at this moment... :D

Posted: 07 Oct 2004 11:57 pm
by Azurael

Posted: 07 Oct 2004 11:58 pm
by Duvel78
sm54 5 results! sm4 sm40 sm56

Posted: 08 Oct 2004 09:47 pm
by Duvel78
Oh no..... There's a problem... The indexation is... bad!! Cause when we look to the results, a lot of pages are only known by Google but not indexed, which is a very big difference....................... (results with an URL and not a title)

That's a NEW problem caused by the url rewritting, the rules on Google become really severe, it's considered as a duplicate content in this case :cry: I've to change that! sm68

Posted: 09 Oct 2004 01:08 pm
by Azurael
Well, there are certain entries in robots.txt that can reduce that issue (remember reading about it in one of the howtos) - I have a major problem - I have to get my host to install mod_rewrite :D That could take them years...

Posted: 09 Oct 2004 01:57 pm
by Duvel78
Azurael wrote:Well, there are certain entries in robots.txt that can reduce that issue (remember reading about it in one of the howtos)
I know that but there was wrong entries in that file on the french Phpbb community, so I forfgot that file but now it seems to be a bad idea. There're big debates about that problem of indexing on Google since months, nobody knows really why it happens...
Azurael wrote: - I have a major problem - I have to get my host to install mod_rewrite :D That could take them years...
The mod isn't installed? Strange! Is it a free host then?

Posted: 09 Oct 2004 02:41 pm
by Azurael
Nope, just a useless one :D

They only have installed what comes with ensim :(

Posted: 09 Oct 2004 09:34 pm
by Duvel78
I've analysed the problem, it's much more complicated as I thought...

Pages with and without url rewritting are affected. No problem with the website but the forum is very bad indexed by Google. (-75%!!!!!) There're a lot of complains about that problem (sandbox effect?) on the PhpBB community but no real solutions! I don't know what to do cause the concept of 'duplicate content' can be very large! (same signatures, avatars and so on...)

Posted: 12 Oct 2004 04:37 am
by Azurael
I think that I'm doing something wrong, in hindsight - I just looked at phpInfo, and sure enough:

Loaded Modules mod_fastcgi, mod_jk, httpd_defines, httpdmon, mod_perl, mod_php4, mod_frontpage, mod_ssl, mod_setenvif, mod_so, mod_usertrack, mod_headers, mod_expires, mod_digest, mod_auth_db, mod_auth_anon, mod_auth, mod_access, mod_rewrite, mod_alias, mod_proxy, mod_userdir, mod_actions, mod_imap, mod_asis, mod_cgi, mod_dir, mod_autoindex, mod_include, mod_info, mod_status, mod_negotiation, mod_mime, mod_log_referer, mod_log_agent, mod_log_config, mod_env, http_core

We have rewrite! And just about every other Apache mod written :D I think it might be a permissions issue, so I'm going to check that everything is correctly chmodded.

I guess at the end of the day, the easiest way to get in Google's 'good books' is to leave the forum at it's defaults and archive it for indexing instead. That way you can strip signatures, etc out and it's much 'cleaner' - I've noticed lots of other sites do it. I think that doing it constantly would constitute a lot of server load though.

Posted: 12 Oct 2004 05:31 am
by Azurael
Done it! :)

The problems was that follow symlinks wasn't turned on ;)

Posted: 12 Oct 2004 07:32 pm
by Duvel78
Azurael wrote:
I guess at the end of the day, the easiest way to get in Google's 'good books' is to leave the forum at it's defaults and archive it for indexing instead. That way you can strip signatures, etc out and it's much 'cleaner' - I've noticed lots of other sites do it.
I should remove the signatures? That's maybe bad for the "duplicate content" problem... I think I'll stop the url rewriting and put the latest version (2.0.10) without changes.

Posted: 12 Oct 2004 08:09 pm
by Azurael
2.0.10 seems mostly to have fixes for php5 in, so I don't see the benefit doing most of the upgrade... My forum is half 2.0.8 half 2.0.10... Plus all the other stuff that makes it do what I want to ;)

URL rewriting is really the only solution to indexing unless you want to frequently archive the whole forum. The mod I used only rewrites the URLs for viewtopic and viewforum... Which gets round the whole multiple content problem for the most part.

Posted: 14 Oct 2004 10:06 am
by Chris_C
Duvel, as you seem knowledgable about the phpBB forums, I wonder if you could shed some light for me. I'm trying to attach a php header to a phpBB forum, using a simple "include" call. I have included this call in the template, the page_header.php and the index.php!!!!! All result in errors of one form or another. How did you get the header on this forum?