Over the last two and a half years I’ve become convinced that Google performs rich categorization of sites and queries. The signs are plain as day and the impact is substantial. Categorization allows Google to apply algorithmic changes by category as well as deliver more relevant and diverse search results.
Site Categorization
In late 2008 I wrote about how taxonomic search could be used to create diversity in search engine results. Later, In May 2009, Google launched search options that included the ability to filter your results by Forums and Reviews.
Rich Snippets stole the show during this launch but it should have been crystal clear that Google was able to distinguish site or page type. How else would it deliver results in these filters?
Google made this even more transparent when they tested the ability to see more or fewer shopping sites. Obviously Google was able to tell (with relative accuracy) which sites were eCommerce enabled.
You may also come across search results for Q&A sites or forums that include a pseudo-rich snippet which includes the number of posts and authors.
This is not structured markup. This isn’t RDFa or microdata. Instead, Google has identified patterns in these sites, extracting the relevant information and creating their own rich snippet. More recently, Google demonstrated that it could determine if a page contained search results.
These are interesting applications but it’s what is at the heart of it all that is really remarkable – the ability to algorithmically identify and categorize a page or site.
Query Categorization
If it were a snake it would have bit you already. Of course Google categorizes queries! You see this when local results are triggered, or when the Onebox is presented. Enter a query that is product based and the Shopping Onebox is likely to appear. Enter a query that is health related and the Health Onebox may appear.
It’s not a matter of if, but how deeply Google categorizes queries.
Matching Queries to Categories
Matching queries to categories can help deliver the right information to users. You can already see this happening as Google changes the left hand search options based on the query.
You won’t get a Recipes option if you search for ‘baked new jersey’.
You won’t get a Books option if you search for ‘crying of lot 23’.
These examples could be delivered by simply accessing relevant databases of recipes or books, but I’m guessing the relation is far more dynamic.
Matching Documents to Queries
The question then becomes how Google matches web pages (aka documents) to queries. What documents (or results) does Google return based on the query categorization?
A recent patent dissected by Bill Slawski shows how Google might think about matching documents, queries, keywords and categories. It indicates that what Google thinks your site is about could influence how and when it is returned for certain queries.
If you were to take all of that information that Google provides for your site, and try to guess at a category or categories that Google might assign for your site, could you?
That is an interesting question.
Google Ad Planner
One of the unconventional places I’ve looked at recently to give me a sense of what Google might think a site is about is Google Ad Planner.
The content categories are usually a mixture of accurate and bizarre, particularly around geography. Now, I’m not saying that the content categories here influence Google’s categorization. But I have found it illuminating to look at Ad Planner results when comparing competitors.
Beyond the content categories are affinity scores for sites also visited and audience interests. The latter is another interesting data point when thinking about how Google might categorize sites.
The good news is Google has a compelling reason for this data to be accurate (it’s attached to advertising) and the data seems to be updated frequently. As an example, the Ad Planner thumbnail for this blog shows my most recent blog post.
Why Google Ad Planner thinks I’m in Canada is a bit of a mystery. I did do a blog posts about Hockey Memories and The Flyers 1987 Stanley Cup Playoffs, but I’m not in Canada (as lovely a country as it may be.)
After verifying site ownership you can change your categories, which reveals a fairly robust taxonomy. Is this the same taxonomy Google uses in their algorithm? Probably not. But it might help inform or update the one used for search. So, I’ve gone ahead and changed my categories and description.
It’s in my can’t-hurt-and-might-help category.
TL;DR
Google continues to work on matching categories to queries and queries to web pages to improve search quality. Understanding how your site might be perceived by Google is an important new step in search engine optimization.
The Next Post: Optimize The Google +1 Button
The Previous Post: Doppelgangers
1 trackbacks/pingbacks
Comments About What Does Google Think Your Site Is About?
// 6 comments so far.
Holly // August 16th 2011
Great post – and Pynchon reference!
AJ Kohn // August 24th 2011
Thank you Holly! It’s always nice to have my literary references acknowledged.
Heather Wood // October 10th 2011
Interesting Article for sure! I tried to input my site into google ad planner so i could see and edit my category and I get the following…
Ad Planner could not locate sufficient data to display for domain: ravenousravendesign.com
I also tried a few other client sites only getting the same result. WHAT GIVES?
I’v yet to be able to use ad planner and would really like to :p
AJ Kohn // October 14th 2011
Heather,
My guess is that the sites in question don’t receive enough traffic. One of my other sites is in Ad Planner but has absolutely no category data. Now, if the sites your trying have substantial traffic then that’s another story entirely.
Richard Robbins // April 12th 2012
Do you know of any tool available to Webmasters get some idea of how Google has categorized a website? I am trying to make a recommendation for a client of mine regarding what content to add to a site to refresh it. It would help if I knew for sure in which category(ies) Google has placed the website. I’ve checked Google trends, Webmaster Tools, and other places, but I can’t find anything that gives a clear answer.
A couple things mentioned in this article lead me to believe that there has to be some way to get this information:
“the ability to algorithmically identify and categorize a page or site”
“If you were to take all of that information that Google provides for your site, and
try to guess at a category or categories that Google might assign for your site,
could you?”
Even if there were a list available of the kinds of categories Google uses, that would be helpful.
Thanks,
Richard
Richard Robbins // April 12th 2012
Sorry about my last question. I apparently skimmed over the Ad Planner section. I just logged in to Ad Planner, and I got all the information I needed from Google about my client’s website. I highly recommend using the tool. Understanding how to get your website categorized correctly (or at least how your website is categorized) is a great foundation for wording your page titles and doing other optimization efforts. It’s a way to play to your website’s strengths.
Sorry, comments for this entry are closed at this time.
You can follow any responses to this entry via its RSS comments feed.