PageRank Distribution and Navigation Systems

Ok, this post is aimed at SEOs rather than a general audience, and is something I’ve been mulling over for quite some time without getting round to posting about it. I got the nudge to finally do so from a post on Search Engine Land – PageRank Sculpting Leaves NoFollowed Tags Behind .

In that article the author essentially discusses PageRank Sculpting, and I should immediately say that in an article-length piece as opposed to something more thesis-length there is a limit to how much can be discussed, so we both have to keep things simpler than we might prefer. We also have to disregard for the purposes of discussion that since you can’t measure real PageRank (as opposed to the hopelessly inaccurate Toolbar PageRank) it’s extremely difficult to know whether your efforts at sculpting have worked.

That said, as I mentioned in my comments to that article there are some fundamental assumptions and questions inherent in trying to decide on a real-world strategy based on this type of sculpting by navigation.

All links are equal but some are more equal than others

The first one is that all links are created equal, but are they? There has been much written about the idea that editorial links, that is links surrounded by body text, are more important than navigational links in the menu system. We don’t know how this works of course – it could for instance be part of the relevancy aspect of the algoritmh that boosts the body links or it could be part of a PageRank algorithm that reduces the value of menu links.

Shooting yourself in the foot(er)

Similarly with footers and other items that can be thought of as boilerplate content. There has been some suggestion that such boilerplate content gets “ignored” for content purposes. Does that mean the links also get ignored or downgraded? And again is that to do with context or PageRank value?

Navigating  – by the seat of your pants?

Then there is the thorny question of sitemaps. Not in this case the Google/XML kind (they are thorny as well but for entirely different reasons which I’ll leave for another post) but the old-fashioned HTML sitemap. Many SEOs (and many amateurs) trot out the mantra all too easily of “you must have a sitemap”, “it helps the spiders find your pages”. But what are the implications of a sitemap for how the PageRank flows in your site? This has always seemed to me to be a crucial consideration which has usually been ignored.

I well recall spending ages playing about with and discussing the results of the tool at
http://www.webworkshop.net/pagerank_calculator.php?pgs=
when it first appeared back around 2002. Again it’s a very theoretical tool and we don’t know how much the PageRank implementation has changed since then so I’m wary of reading too much into the results these days, but a bit of experimentation with it suggests some pretty major changes in PageRank distribution by using different methods of navigation.

Some people may suggest that a smaller site should use a “get anywhere from anywhere” type of navigation and there is often much benefit to the users in that approach. Some may suggest a more siloed method where different aspect of the site are largely kept separate from each other. Testing the effects of these different scenarios using this tool suggest very different PageRank distribution – sometimes concentrating most of it on the home page and sometimes distributing it almost evenly throughout the site. I’ll leave you to play around with it and see these effects for yourselves, but it may explain some of the odd effects with certain sites where the home page seems to be the one that ranks for everything and the deeper pages don’t seem to get the rankings their content deserves.

My point here is that if you simulate a site and observe the PageRank distribution, and then add a simulated sitemap, even if it’s only linked to from the homepage, then the distribution changes a great deal. Now whether this is matched in a real world situation is another matter, but it bears some thought before blindly following the advice to add a sitemap.

And of course if there is a negative bias against footer links and your sitemap is linked from there… Complicated isn’t it?

Targeting effectively

And it doesn’t stop there. Generally it’s the higher level pages that target the more generic terms and thus need more PageRank to succeed in a competitive market. But generic rankings often don’t convert well, so does that suggest that we should be distributing PageRank more evenly in order to get more of it to the lower level pages where the better-converting specific terms are being targeted by product pages that will actually produce profits?

Good job we don’t have to include clients with their own agendas into this mix or it would get really complicated… Oh! We do.

Now I don’t have the answers to these questions, and maybe no-one outside Google does, but I think they are important questions to consider and discuss – which is the point of this post. Have you had experience of trying to sculpt PageRank and had unusual ranking results. Have you any pages that have loads of good content but seem dead in ranking and traffic terms despite seemingly normal navigation to them? Have you any ideas about the questions I’ve posed here. Maybe we can all gain some insights into a very complex area, but at least I hope I’ve encouraged you to think about it.

Google site command oddities

For some time I’ve been noticing irregularities in the results of the information that Google provides through the various commands and through the Webmaster Tools facility. Of course we all know the link command has been hopelessly broken for many years and frankly I don’t know why it’s still there, but the site command was always fairly reliable until a few months ago. The other fact about it was that it didn’t matter which version of Google you used, which seemed to confirm what we were told about the same information being stored internationally and simply filtered at run-time.

Like many other people I often use the shortcuts available in excellent tools such as the SearchStatus extension for Firefox in order to get indexing and link data from the “big 3″ search engines but do spot checks manually to make sure there are no anomalies. Today, when checking results for a client who sells children’s bedroom furniture and accessories I noticed a large difference in the results of a site command in Google.com (which is what SearchStatus points to) and the same query run manually in Google.co.uk

I wonder what that tells us about how Google stores information and returns it for different queries? Various possibilities come to mind – one that would be nice is if the .com results are the pages in the main index and the others are in the supplementary index, but this clearly needs investigation and thought.

Whatever the reasons behind this discrepancy I’ll certainly be keeping a close eye on it with all my sites.

Who’s feeding the spammers with information?

Spam. We all get it, but for a long time I’ve been nursing a wee suspicion that it’s a bit too well targeted. Over the years I’ve seen a number of odd “coincidences” – spam from people with names very similar to friends of mine, subject lines extremely similar to various subjects I have interests in that are very unlikely to have been selected at random.

But in the last few days I’ve noticed something else and recalled a couple of other earlier similar spams. A couple of days ago I sent a package via an international carrier and within a day I received a message purporting to be from that company trying to tell me that delivery had failed but as usual containing a disguised file that will undoubtedly have been a virus of some sort. I remembered that once before I had sent a another package via another carrier and had later received messages disguised as being from them. Yesterday I paid my phone and broadband bill and today I received a spoof message apparently from the company with a link to a fake login to fix the “failed payment”. Fortunately I’m both experienced and wary, but it would have been so very simple for someone to have clicked on something very similar to a message they were expecting.

Now I don’t believe much in coincidence, and this is all getting to be far too much not to have some sort of causative basis behind it. I increasingly suspect that somehow the spammers are being fed with or are intercepting, more detailed information on all our communications than should be possible. And that, if true is a very worrying development.I may have to dig out my old copy of PGP, but sadly not that many people use it.

Anyone else seeing this sort of pattern in their spam?  Come and join me on the grassy knoll, conspiracy theories rule ;-)

Be careful out there, and don’t forget your tinfoil hats.

Constant change needs constant rethinks

The last few months have been a whirl of business development, networking, new clients and juggling time constraints – not to mention trying to keep up with Google’s ever-changing activities.  So maybe it shouldn’t come as a surprise to read my own main website and find loads of things that I need to rewrite and re-evaluate.

In some ways it’s the old “cobbler’s children” situation; always working on other people’s sites leaves little time or perspective for your own. In other ways it’s the nature of the SEO business – I often say that if you go on holiday for a week then you have to spend 2 weeks catching up after you get back. And there’s also a bit of change now that I’m fully independent rather than making decisions with others. But the longer I spend in this business the more often I realise that we’re constantly learning and adapting to changing market conditions and that ideas and tactics that were gospel 6 months ago are now totally different. You have to step back and remember that increasingly often, lest you find yourself saying something you no longer believe.

So I’ll be doing a fair bit of thinking about how I describe my approach to web optimisation and marketing and much changing of the site will ensue to reflect my current strategies.

There’ll be lots of rethinking being done at the moment following the major changes Google has been rolling out recently – particularly the new personalised search change which could potentially revolutionise the search market. I’ll be writing about that and the effects it could have in the very near future, but for now it’s off to do some strategic thinking, editing and site analysis.

More Google Geolocation oddities

Sometimes Google really puzzles me. I just did a search from the UK, using Google.co.uk, selecting UK-only results, for “chatsworth bed”. (I was checking to see how Chatsworth bed was doing against the competition.) The Local results that Google inserted into the serps after the third natural result were:

Local business results for Bed near Chatsworth, CA, USA

Say Huh??? Why would they possibly think that would be relevant to me when I asked for UK-only results?

I was using Firefox. Strangely when I repeated the search using Safari there were no local results at all. Very odd.

Google through the looking glass

My last post discussed changes in the search landscape but it seems Google is busy changing both the business landscape and the real landscape.

A couple of weeks ago we had SEORoundtable reporting on them producing some very strange results in their search engine – searches in Google for “Google Ireland” were producing Google New Zealand as the number one result. Other people soon chimed in with similar results in South Africa, and South America. All good fun and most of the examples I saw then have been sorted out by no-doubt embarrased Google engineers.

This morning I was reading a somewhat mind-boggling business analysis by Bill Gurley of their maps business and the effects it’s having on GPS companies in the US. If Bill’s right then the effects on mobile phone companies as well as GPS ones could be considerable. The sort of “Less Than Free” business model he describes, if expanded to netbooks and other places their operating system and maps technology could have application, could put them in a position to topple Microsoft from their software domination and put them in an impregnable position to serve adverts to just about everywhere.

But what are we to make of the third item which was highlighted by SearchEngineLand today? It appears that Google Maps has made up a fictitious town in England called Argleton - amusingly described by nearby resident Roy Bailey. Is this a “trap location”, a careless mistake by their mapping partners, or a wicked English attempt to usurp the position of Brigadoon as the world’s favourite disappearing village? ;-)

Still feeling confident about following those GPS directions? Think I’ll stick to good old Ordnance Survey!