Your Weekly Step Forth into the World of Search Engines

» Visit the StepForth News Home Page

StepForth Search Engine Placement and OptimizationSEO News From StepForth Search Engine Placement Inc.
Wednesday, April 6th 2005

Dear valued subscribers,

Welcome to StepForth's weekly SEO update.

» If you wish more information then please view our news section.
» View StepForth's latest search engine optimization and search engine placement services
» Images not loading? This could be a result of your Outlook settings. View the online version.
» StepForth now contributes articles to both Search Engine Guide and WebProNews
» Do you want to hear about the news as it comes? The SEO Blog is our daily events post.
» Do you want to get the other side of the story? Get news direct from the search engines.

Highlights of the Week: Google United - Google Patent Examined

Google UnitedThoughts on Google's patent...
"Information retrieval based on historical data"

Google's newest patent application is lengthy. It is interesting in some places and enigmatic in others. Less colourful than most end user license agreements, the patent covers an enormous range of ranking analysis techniques Google wants to ensure are kept under their control. Some of the ideas and concepts covered in the document are almost certainly worked into the current algorithm running Google. Some are being worked in as this article is being written. Some may never see the blue-light of electrons but are pretty good ideas so it might have been considered wise to patent them. Google's not saying which is which. While not exactly War and Peace, it's a pretty complex document that gives readers a glimpse inside the minds of Google engineers. What it doesn't give is a 100% clear overview of how Google operates now and how the various ideas covered in the patent application will be integrated into Google's algorithms. One interesting section seems to confirm what SEOs have been saying for almost a year, Google does have a "sandbox" where it stores new links or sites for about a month before evaluation.

Google is in the midst of sweeping changes to the way it operates as a search engine. As a matter of fact, it isn't really a search engine in the fine sense of the word anymore. It isn't really a portal either. It is more of an institution, the ultimate private-public partnership. Calling itself a media-company, Google is now a multi-faceted information and multi-media delivery system that is accessed primarily through its well-known interface found at www.google.com .

Google is known for its from-the-hip style of innovation. While the face is familiar, the brains behind it are growing and changing rapidly. Four major factors (technology, revenue, user demand and competition) influence and drive these changes. Where Microsoft dithers and .dll's over its software for years before introduction, Google encourages its staff to spend up to 20% of their time tripping their way up the stairs of invention. Sometimes they produce ideas that didn't work out as they expected, as was the case with Orkut, and sometimes they produce spectacular results as with Google News. The sum total of what works and what doesn't work has served to inform Google what its users want in a search engine. After all, where the users go, the advertising dollars must follow. Such is the way of the Internet.

In its recent SEC filing, the first it has produced since going public in August 2004, Google said it was going to spend a lot of money to continue outpacing its rivals. This year they figure they will spend about $500 million to develop or enhance newer technologies. In 2004 and 2003, Google spent $319 million and $177 million respectively. The increase in innovation-spending corresponds with a doubling of Google's staff headcount which has jumped from 1628 employees in 2003 to 3021 by the end of 2004.

Over the past five years Google has produced a number of features that have proven popular enough to be included among its public-search offerings. On their front page, these features include Image Search, Google Groups, Google News, Froogle, Google Local, and Google Desktop. There are dozens of other features which can be accessed by clicking on the "more" button near the upper right of the screen. We believe that Google is working to tie all these features together to present its users with search options that are, for want of a better phrase, more relevant than those offered by its competitors. As the Internet and technologies available for users advances, different types of files become searchable and therefore relevant to users. Take Google Video as an example. Now Google (and some of its competitors) can find and read text from closed captioning scripts. As well quotes from recent episodes of virtually any TV show are searchable and can be served back to users along side the clip where the quote originated. Now, imagine a merging of video, textual, graphical and audio files in organic search results. This is, in our opinion, the true intent of the ideas contained in the patent document.

The patent document relates primarily to sorting and cataloging organic search results. As we know them today, organic search results at Google are influenced by a number of factors, many of which involve an evaluation of incoming links. Google needs to ensure its users and advertisers that it is capable of taking action against the darker facets of the search engine optimization sector. Recent stories in the mainstream press have left many with the impression that dark-art SEO and link-spamming is the surest way to get top placements. Google engineers take pride in their work and the popularity of their organic search results is the bedrock on which their profitable business models are built. They can't afford to allow link-spam and deceptive SEO techniques to dominate their organic listings, especially as these listings are about to address and catalog a much more robust and complicated Internet.

Over the past ten months, SEOs have complained and questioned the phenomena known as the Google Sandbox. The sandbox theory explains the time-lag between link-acquisition for a site and link-recognition and reward by Google. A few key sections of the patent document fill in the blanks for SEOs on what Google is examining when a finely crafted link-building campaign falls into the sandbox. The biggest influencer is links and Google is finding new and improved ways to evaluate them.

Google's core algorithm is based on measuring links coming into a page. Because of this, link-building is part of any good search engine optimization campaign. In the span of a month, incoming links to one or more pages of a website might jump by hundreds or thousands. Some of those links might be useful in Google's eyes and some might be useless. The question is, how does it sort which is which?

Google collects a lot of data when it examines a page and the links directed on to or off of that page. When Google mentions they are using "historic data" to determine the value of links directed to your page, they are referring to a number of factors. It knows how long the page has been online, or at least when it first became aware of said page. It also knows how long pages linked to have been online. It knows how often links get clicked and also knows which computer, (and in many cases, exactly who) is clicking the link and where that clicking is coming from.

For an example, check out the following sections from the patent document:

  1. A method for scoring a document, comprising: identifying a document; obtaining one or more types of history data associated with the document; and generating a score for the document based on the one or more types of history data.
  2. The method of claim 1, wherein the one or more types of history data includes information relating to an inception date; and wherein the generating a score includes: determining an inception date corresponding to the document, and scoring the document based, at least in part, on the inception date corresponding to the document.
  3. The method of claim 2, wherein the document includes a plurality of documents; and wherein the scoring the document includes: determining an age of each of the documents based on the inception dates corresponding to the documents, determining an average age of the documents based on the ages of the documents, and scoring the documents based, at least in part, on a difference between the ages of the documents and the average age.
  4. The method of claim 2, wherein the generating a score for the document includes scoring the document based, at least in part, on an elapsed time measured from the inception date corresponding to the document.

By the time a reader gets to item 63, the document has covered dozens of page, site, link and URL related factors that may or may not be included in the current working algorithm.

Here is a quick breakdown of "history factors" we think are relevant to Google's algorithm today. Please note, each item might refer to a specific page and at the same time, also refer to all other pages associated with it.

  • How long a domain or URL has has been registered.
  • Has ownership of a domain changed after previous registrations expired?
  • Has the physical location of the registrant changed?
  • How lengthy is the URL itself? Was it registered to game the index?
  • How many pages are included in the website? (A one document or page website is not considered a highly relevant source of information.)
  • Freshness and age of document.
  • Use of anchor text (both on site and in links directed to site).
  • "Trust Factors" regarding sites or pages outbound links refer to, and inbound links are found on.
  • The "discovery date" of a particular link and the history of changes involving that link.
  • Rate of growth for new links. A sudden burst of growth likely indicates some form of link-spam.
  • Variations in anchor text used to phrase links directed to a page being evaluated. If the same anchor text is used in every inbound link, are they phrased that way for branding purposes or spamming purposes?
  • Number of searches for keyword phrase associated with the anchor text used in links.
  • Number of times Google users click on Google results by entering keyword phrases used in anchor text of incoming links. Does the page being evaluated receive visitors for that keyword phrase on Google's search engine?
  • How do users actually behave while on the page, site or document being evaluated?

Jim HedgerThere is a lot more to find in this document. Thus far, the more we explain, the more questions we have. One thing we are very sure about, the intent of the ideas covered in the patents extends beyond the search tool we know now. We expect to publish a white paper on our analysis of the patent and its implications early next week. Until then, we advise our clients to stay the course. We have long preached a very conservative approach to Google based on relevant link building (which can be slow going but very effective), highly stratified content that is relevant only to the topic addressed by the site, and clear paths based on multiple keyword phrases for spiders to follow.

by Jim Hedger, News Editor
Important ©Copyright Note: readers are welcome to republish the content from StepForth Weekly newsletters
but we do require credit in the format that follows: "Article by <author>, StepForth Search Engine Placement Inc."
Major Player Updates: PPC Lawsuit Initiated & Google Satellite Maps

Pay Per Click Class Action Lawsuit Initiated

It was only a matter of time before someone took the problems associated with click-fraud to court. In February, a group of advertisers quietly filed a lawsuit against Google, Yahoo, Time Warner (AOL), Ask Jeeves, Disney, Lycos, LookSmart and FindWhat.

Led by Texarkana Arkansas Retailer Lane's Gifts and Collectibles LLC, the plaintiffs contend that the search engines knowingly charged for fraudulent clicks, at an average rate of $0.50 per click. The group hopes to have their suit certified as a class-action lawsuit which would allow other advertisers to join.

Under a pay per click advertising program, advertisers generally pay a search engine a small fee each time a user clicks on their ad. Those willing to pay more than their competitors receive the highest placements. One of the main factors driving the growth of paid advertising is contextual distribution programs in which a search engine allows private webmasters to display topically relevant ads on their web pages with revenues from click-throughs divided equally between the search engine and the webmaster. Google's famous AdSense program is the most prominent of such distribution models. Being able to make some money every time a site visitor clicks on a paid-link on your site is so tempting to some unscrupulous webmasters that they hire legions of ad-clickers or develop software to simulate human link-clicking activity.

Google, Yahoo and other search firms offering paid advertising are working to develop tools to detect and deter click fraud. They are also opening up by allowing developers access to Application Programming Interfaces which provide a work-lab for the creation of tools to better track a paid-advertising campaign. Nevertheless, click fraud represents a growing fear in the minds of advertisers, one that might be quantified if the suit produces hard numbers.


Google MapsGoogle Satellite Maps - Looking Through Keyholes in the Sky

Google has incorporated satellite images into Google Maps . The new feature offers the ultimate in bird's eye views for searchers using Google Maps by replacing the standard 2-dimentional maps with images taken by satellite. On the upper right hand side of the Google Maps page is a small text-link that toggles between standard map and satellite image. The satellite images are currently only available for North American addresses but will be introduced for other regions as the year progresses.

Google uses its Map tool in Local search results, helping users plot the fastest route from point A to points B, C, D and E, a feature that translates well from flat-map to satellite map. To get a better sense of the full functionality of Google Maps, request a route from Seattle WA to Miami Beach FL. Note how every time you need to alter course or change highways, Google places a "pin" in the map. Now that Keyhole satellite images are incorporated in their map feature, an image of the intersection or highway interchange is displayed.

Google's newest (and neatest) toy is the stuff of spy novels. Speculation on the introduction of an orbital mind control laser feature or an obstacle removing photon ray assistant is brewing however Google spokespersons are not commenting about (or even giggling about) these rumours.

by Jim Hedger, News Editor
Work With StepForth
Get StepForth Working For You
Resell SEO Services Give Your Clients The Search Engine Placements They Need.
Take the StepForth Review Find out how search engine friendly your website is today... (free!)
StepForth Client Spotlight: Island Quest Realty Ltd

Island Quest Realty Ltd - Salt Spring IslandSalt Spring Island Real Estate
Island Quest Realty Ltd., are experienced to help you make lifestyle choices. Donna Regen, Kerry Chalmers or Kelly Regen will guide you through the complexities of buying a rural property in a professional but caring manner. They offer more than 30 years of award winning local real estate experience. They have extensive legal and business backgrounds, strong negotiating skills, and remain proactive in staying up to date with the ongoing changes in our real estate industry.

The Net Reality: Yahoo Gets Aggressively Retentive around CEO

yahoo!Yahoo has taken steps to ensure its CEO Terry Semel sticks around for a few more years. Semel was viewed by many as a likely successor to outgoing Disney CEO Michael Eisner.

To make sure Semel remained interested in Yahoo, where his salary was pegged at a lowly $600,000 per year, Yahoo’s board gave Semel the option of purchasing 2 million shares of company stocks with an accelerated vestment schedule that would allow Semel to sell about ½ of the shares at the end of 2005 if he reaches specific performance criteria. He also received 250,000 restricted shares which will be available for sale in three years time. Last year, Semel exercised $230 million worth of options and continues to hold almost $325 million in shares.

"Mr. Semel's unique skills, experience spanning the Internet and media industries, and repeated past success make him an attractive candidate to competing organizations…", Yahoo reported in a proxy statement filed at the SEC on Monday. For options worth $690 million at today’s share prices, Yahoo and its repeated success must seem pretty unique to Semel.

by Jim Hedger, News Editor


Visit the SEO BLOG Regularly for Daily SEO Tips & Updates
SEO Blog - SEO Tips

If you have any questions please do not hesitate to call the StepForth staff:
Toll-Free: 1-877-385-5526 | Local: 385-1190
http://www.stepforth.com


To unsubscribe from this weekly newsletter simply reply to news@stepforth.com and include "unsubscribe" as the subject