| Author |
myphpnuke not search engine friendly |
|
trini0
MPN Developer Joined: 18-Jul-2001 Posts: 5096
From: FL USA
|
Posted: 2002-05-13 13:36
I found this bit of info at zend.com at
http://www.zend.com/tips/tips.php?id=98&single=1
After reading MikeB's comment on portability, I'll be hesitant to jump into that.
When I have some time I'll run a couple of tests on IIS and Apache and see what the outcome is.
The bottom line is that we have to stay as portable as possible... ----------------- trini0... mPN2 Test Bed
|
|
Newt-Dog
MPN Developer Joined: 07-May-2001 Posts: 266
From: So. California, USA
|
Posted: 2002-05-13 13:52
There was a thread in MPN discussing robots.txt, and Gareth gave a pretty good "disallow" list for the robots to read. I changed my robots.txt accordingly and now I have several spiders crawling my site daily. One that I checked on did a pretty good job of cataloging the "Quotes" section of my site, but nothing else.
Here is the Robot Text thread.
Maybe Eman could take a look at the disallow list and see if it could be modified to allow more spider access without letting sensitive info out.
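To make the allow/disallow idea concrete, here is a minimal sketch in Python (not PHP) that checks which pages a disallow list would leave open to spiders, using the standard library's robots.txt parser. The rules and page names below are hypothetical and are not the list from the thread mentioned above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: these paths are illustrative only and are NOT
# the disallow list that was posted in the other thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin.php
Disallow: /config.php
Disallow: /user.php
"""


def allowed_paths(robots_txt, paths, agent="ExampleBot"):
    """Return the subset of paths that the given rules let a spider fetch."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [path for path in paths if parser.can_fetch(agent, path)]


# Hypothetical pages on a site; only the non-sensitive ones should remain.
candidates = ["/index.php", "/admin.php", "/article.php?sid=42", "/user.php"]
crawlable = allowed_paths(ROBOTS_TXT, candidates)
print(crawlable)
```

Running a check like this against the real disallow list would show whether content pages are being blocked along with the sensitive ones.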
Newt-dog ----------------- Hey! visit PhantomCow.com sometime and Chat!
|
|
Emancipator
Not too shy to talk Joined: 08-May-2002 Posts: 21
|
Posted: 2002-05-13 13:53
I like all the feedback. There is no better way to learn than to read what everyone is saying.
Mr Browning:
What do you suggest I do to make my URLs more search engine friendly? As for your thoughts about it being php info, yes, that is what I was referring to. There has to be a way to do this.
Trini0
I agree portability is definitely an issue. We don't want to turn this project into a complicated nightmare. However, if you can't use the site, i.e. if the spiders just ignore your content, the site is useless from my standpoint. I am sure that with all the smart PHP people here we can figure out a doable way of making this work.
The link to the article I read was
http://www.zend.com/zend/spotlight/searchengine.php
Looking forward to even more feedback.
Eman
|
|
Emancipator
Not too shy to talk Joined: 08-May-2002 Posts: 21
|
Posted: 2002-05-13 13:58
Quote:
|
Here is the Robot Text thread.
Maybe Eman could take a look at the disallow list and see if it could be modified to allow more spider access without letting sensitive info out.
|
|
I am certainly more than willing to do my part to help the community. Spider scripts are great, but a robots.txt file can only do so much. If the PHP is not allowing proper access, it still does nothing. But since I am still new to PHP, I will learn as we go along.
I will look at this robots.txt file, see what I think of it, and possibly post some feedback. I sure am pleased to see so many people interacting with my topic. It is great to have feedback, especially since I feel this is a big issue, and solving it will improve the value of myphpnuke even more.
Eman
|
|
trini0
MPN Developer Joined: 18-Jul-2001 Posts: 5096
From: FL USA
|
Posted: 2002-05-13 13:59
Look at the user comments below that article. They confirm what MikeB was talking about. So I may not venture into this because of the lack of portability. ----------------- trini0... mPN2 Test Bed
|
|
trini0
MPN Developer Joined: 18-Jul-2001 Posts: 5096
From: FL USA
|
Posted: 2002-05-13 14:01
Here is another thought.
In mPN2, I started to modify backend.php to do more than syndicate news articles. Maybe someone can write an XML parser that would display the links to articles, etc., and have the spider crawl over that page to get to the actual articles.
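A rough sketch of that idea, written in Python rather than PHP: parse a syndication feed and turn each item into a plain HTML link that a spider can follow. The sample feed is made up, and it assumes backend.php emits RSS-style `<item>` elements with `<title>` and `<link>` children.

```python
import xml.etree.ElementTree as ET
from html import escape

# Hypothetical feed; assumed to resemble what backend.php syndicates.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="0.91">
  <channel>
    <title>Example Site</title>
    <item>
      <title>First article</title>
      <link>http://example.com/article.php?sid=1</link>
    </item>
    <item>
      <title>Tips &amp; tricks</title>
      <link>http://example.com/article.php?sid=2</link>
    </item>
  </channel>
</rss>"""


def feed_to_link_list(feed_xml):
    """Build an HTML list of links from the <item> elements of a feed."""
    root = ET.fromstring(feed_xml)
    lines = ["<ul>"]
    for item in root.iter("item"):
        title = item.findtext("title", default="").strip()
        link = item.findtext("link", default="").strip()
        if title and link:  # skip incomplete items
            lines.append(
                '  <li><a href="%s">%s</a></li>'
                % (escape(link, quote=True), escape(title))
            )
    lines.append("</ul>")
    return "\n".join(lines)


link_page = feed_to_link_list(SAMPLE_FEED)
print(link_page)
```

The resulting list could be served as a static page, giving spiders an entry point to every syndicated article.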
Just a thought... ----------------- trini0... mPN2 Test Bed
|
|
Emancipator
Not too shy to talk Joined: 08-May-2002 Posts: 21
|
Posted: 2002-05-13 14:16
Trini0
I will have to read the article you are reading. I have not looked at that one. But then again, not being a PHP wiz, I am not sure it will mean much to me.
As far as the XML goes, that may be a good idea. I wish I could help you on this one, but alas I am only a search engine/graphic design lad.
Eman
|
|
Kiril
MPN Developer Joined: 10-Nov-2001 Posts: 4181
|
Posted: 2002-05-13 20:19
How about looking at this from a different angle.
All spiders/bots read meta data with no problems. How about using PHP to create a daily meta file of the pages you want indexed by search engines? In essence, each page you want indexed would get its own meta data for the spider/bot, created directly from the database.
This way the data would still be dynamic, and all of the spiders could read it. I think this, along with the appropriate allow/disallow list, would work nicely.
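As a rough illustration of generating meta data per page from the database, here is a sketch in Python with an in-memory SQLite table standing in for the real one. The table name, column names, and the 150-character description limit are all assumptions made for the example, not the actual myphpnuke schema.

```python
import sqlite3
from html import escape


def build_meta_tags(title, summary, keywords):
    """Return title/description/keywords markup for one page."""
    # Collapse whitespace and trim the summary (150 is an arbitrary limit).
    description = " ".join(summary.split())[:150]
    return "\n".join([
        "<title>%s</title>" % escape(title),
        '<meta name="description" content="%s">' % escape(description, quote=True),
        '<meta name="keywords" content="%s">' % escape(", ".join(keywords), quote=True),
    ])


# Hypothetical stand-in for the site's article table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE stories (sid INTEGER PRIMARY KEY, title TEXT, hometext TEXT, topic TEXT)"
)
conn.execute(
    "INSERT INTO stories VALUES (?, ?, ?, ?)",
    (1, "Robots and spiders", 'How crawlers   read a "dynamic" site.', "search"),
)

# One entry per page to be indexed, regenerated whenever the job runs.
meta_by_page = {}
for sid, title, text, topic in conn.execute(
    "SELECT sid, title, hometext, topic FROM stories"
):
    meta_by_page[sid] = build_meta_tags(title, text, [topic, "myphpnuke"])

print(meta_by_page[1])
```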
|
|
Emancipator
Not too shy to talk Joined: 08-May-2002 Posts: 21
|
Posted: 2002-05-14 06:14
Kiril.
Thanks for your notes. I am sure many people will not agree with me, but I tend to believe I know what I'm doing, since my results are always very high. Meta content is not as important as it used to be for your engine results. The most important aspect is content that is properly formatted to meet the requirements of the major engines.
To be honest, the only engines I really worry about are Google and Yahoo, which use the same backbone but different algorithms. They put very little emphasis on meta content because of abuse in the past. With them, content is the key item that gets a good ranking.
I think that Kiril definitely has a great idea, but I am just not sure whether this would be enough to help rankings. So far it seems the only solution is doing what Newt Dog suggested: making HTML newsletters which link to other content pages so the engines can spider them.
The only thing we need to do is pass the GET parameters with something other than "?" or a similar operator, since search engines do not treat that as a proper URL and ignore it.
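One way to picture this is a path-style URL, where the parameters move out of the query string and into the path. The sketch below is in Python rather than PHP, and the function names and the key/value segment layout are assumptions for illustration, not anything myphpnuke provides. Whether a server hands the extra path segments to the script is exactly the portability question raised earlier in the thread.

```python
from urllib.parse import parse_qsl, quote, unquote, urlsplit


def to_path_style(url):
    """Rewrite script.php?key=value&... as script.php/key/value/..."""
    parts = urlsplit(url)
    segments = []
    for key, value in parse_qsl(parts.query):  # blank values are dropped
        segments += [quote(key, safe=""), quote(value, safe="")]
    path = parts.path
    if segments:
        path = path.rstrip("/") + "/" + "/".join(segments)
    prefix = "%s://%s" % (parts.scheme, parts.netloc) if parts.scheme else ""
    return prefix + path


def from_path_info(path_info):
    """Recover the parameters from the trailing /key/value/... segments."""
    segments = [unquote(s) for s in path_info.strip("/").split("/") if s]
    return dict(zip(segments[0::2], segments[1::2]))


friendly = to_path_style("http://example.com/article.php?sid=42&mode=full")
params = from_path_info("/sid/42/mode/full")
print(friendly)
print(params)
```

The script would then read its parameters from the trailing path instead of the query string, so the links a spider sees contain no "?" at all.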
Thanks everyone for the great feedback, This is why I use myphpnuke... hope we can continue to discuss this and find a solution.
Eman
|
|
trini0
MPN Developer Joined: 18-Jul-2001 Posts: 5096
From: FL USA
|
Posted: 2002-05-14 06:21
We do not control the ? get parameter...
The script gods in the sky do... ----------------- trini0... mPN2 Test Bed
|