Google's next search engine: What's the difference?

By Scott M. Fulton, III | Published August 11, 2009, 1:47 PM

Yesterday, without much explanation or instructions, Google opened the floodgates on what it's describing as the next generation of its search engine, most likely to test its efficiency and performance using real-world traffic. Testers are being invited to sample the new engine that Google is calling "Caffeine," although perhaps intentionally, it isn't yet explaining just what the differences are.

In Betanews' initial tests Tuesday morning comparing Caffeine to Google's current stable release, we noticed that for nearly every simple and complex search query we tried, the top three non-paid search results were always the same. But the order of results starting as high as #4, sometimes #6, changed. Usually Caffeine retrieved the same pages as the stable version, but shuffled them in a different order.

For instance, with a query that used to stump search engines that couldn't make sense of special punctuation, "virtual function" C++ C#, we would expect to find entries that compared the use of virtual functions in two classes of programming: traditional C++ and Microsoft's C#. For this query, the first four results retrieved were the same for Caffeine as for the stable version (which I'll call "Stable" for short). But Caffeine swapped the order of entries #5 and #6: an independent article by Jordan Leverington from devarticles.com, and an MSDN article by Microsoft support engineer Rakki Muthukumar, respectively. Caffeine placed the independent article higher.

Caffeine also included a separate grouping of entries taken from Google Groups (posts on Usenet forums), which Stable omitted. Caffeine then rated the text of a discussion forum on the subject much higher -- #7 rather than #10.

Unless you believe in conspiracy theories (and I tend not to), the logic of this organizational shuffling isn't yet self-evident. So for our next trial, we tried an intentionally vague query related to something that's been in the news lately: folks who show up at political town hall meetings and try to out-shout the speaker. Without any punctuation and without much specificity, our test query is shouting town hall.

For both engines, the retrieved Google News entries at the top were identical -- evidently Caffeine's new algorithms do not extend to Google News (or to any other Google department). And again, the top three entries retrieved by Caffeine and Stable were identical, with a Fox News story showing up as #1, and a CBS News story as #3.

The #4 items were different, although they came from the same source: the political blog TalkingPointsMemo.com. Both were about the strange trend of congressmen being shouted down by onlookers at public events. But surprisingly, of the two stories the engines pulled up, it was Caffeine that pulled up the older story (August 3 versus August 5 from Stable); and it was the Stable version that included an expansion box enabling more results from the same blog.

While the remainder of Caffeine's Page 1 entries included a YouTube video that didn't appear on Stable's Page 1, Stable pulled up this story from a St. Louis Fox affiliate of a shouted-down town hall meeting from last week, conducted by Rep. Russ Carnahan (D - Mo.) headlined "Carnahan Town Hall Turns Into Political Shouting Match" -- certainly fitting the criteria -- rating it #8; while Caffeine rated the same story #32.

This one is something of a puzzle, because all three of our search query words appear squarely in the story's headline (although "shouting" was not in the URL). And since it fit the subject matter, Caffeine should have had good reason to rate it high. In an effort to discover why, we mixed the order of the terms to town hall shouting, so that Google's interpreter would pair the first two terms rather than the second and third. As expected, we received different results. The second time around, with Caffeine, a different TalkingPointsMemo.com story (but still August 3) appeared as #4, and the one that had been #4 just a few minutes ago had been bumped down to #7. The order of stories appearing from #5 on had changed. Meanwhile, the same reorganized query in the Stable version bumped the Carnahan story down one spot to #9, whereas Caffeine bumped it down to #39.

In other words, our promoting the pairing of "town hall" in the query demoted a story that should ring alarm bells for that very context, more so for the test version of the search engine than for the stable version.

Next, the tie-breaker: How both engines handle a misspelled query…

1 | 2 | Next Page →

Comments

View comments by with a score of at least

Hate to say it but..... bit of a boring story here?

Score: 2

|

It seems to drag out a point... well... pointlessly.

I don't really care that knitting weekly has slipped two places in the rankings with the new search engine. The important stuff is still at the top. We get that after one example search. You can state that more were tried and the same thing happened, then we don't have to read the same thing twice or more times.

Score: -1

|

If anyone can beat Google at searching is Google itself. but the lack of differences tells me two things: a search engine can't be better than Google's. it reached its limits. or the creativity at Google is at its peak.
another way to see it is that the search engine is Google's masterpiece. they wouldn't dare to make a drastic change to something that had lead the company to the top.

Score: 2

|

Latest Firefox 3.6 beta fixes 133 bugs, promises faster page load times

A once-sluggish beta testing process has kicked into overdrive, with astonishing success at finding serious bugs. Will Mozilla be able to fix all the others in time?

Apple invokes DMCA, claims Psystar is 'trafficking in circumvention devices'

In trying to close the book on possibly the last attempt at a Mac clone, Apple cites from its own landmark case...but may actually be misinterpreting it.

The fallacy of Facebook privacy

Carmi Levy | Wide Angle Zoom: If an insurance company learns something interesting about its client through the Internet, is that snooping?

Microsoft 'worked with Apple' for Silverlight on iPhone, says Goldfarb

By not making such a big deal out of trying to stream video to the iPhone, Microsoft got a big deal out of it, revealed the Silverlight product manager.

Confirmed: Office 2010 to ship in June

Two weeks after Microsoft had been expected to draw a clearer roadmap for its principal applications suite, it's finally ready to commit to the end of H1.

New EU antitrust commissioner will oversee Microsoft, Oracle+Sun, Intel issues

As one of Europe's most prominent politicians shifts positions in January, her replacement remains a question mark over technology's biggest issues.

Without its own 'iTablet' yet, is Apple missing the boat?

Steve Jobs is on record as dissing "single-purpose" devices like e-readers. But given their recent popularity, was that a mistake?

Not-so-mobile battery life: Time to force the issue

Carmi Levy | Wide Angle Zoom: If power efficiency is important when you buy a car or even a motorcycle, why shouldn't it matter for a smartphone?

Clicker.com cuts through the Web video chaos

In a world where homemade video and Hollywood movies travel the same pipeline, it's good to have a real search engine to cut through the clutter.

Microsoft's Ray Ozzie: 'Nobody's going to be 100% open'

The mobile apps ecosystems of the world may converge over time, led by apps being ported over across platforms, according to the Chief Software Architect.

A case study in improving software: What Office 2010 can learn from Notion 3

A music composition product gambles with a complete overhaul, in an effort to make headway against two well-known competitors in a tough market.