«  Marketwatch: "5 reasons Cuil Won't Kill Google" Main What is more popular, John McCain or a rash?  »


I spent most of yesterday playing around with Google's newest competitor, cuil.jpg.

I did a series of searches on Cuil and compared them with Google. The results, when I could get past the server crashes cause by the rush of others trying it out on its debut day, were very different. Some were better for me on Cuil. Others were better for me on Google. But the experiences (and results) were so different that it's clear to me that the very familiarity and comfort with the Google experience has a major effect on users -- even me. This is beyond "network effect."

I have to admit that I am thrilled that Cuil has come along now. Its design and features help me explain much of what I have been writing about here and in the book. I hope it survives and establishes some market share, if for no other reason to demonstrate that there is more than one way to search the messy collection of documents that we call the Web.

Cuil does three things differently from Google:

1) Cuil offers a very different interface and layout of search results. Instead of the top-to-bottom ranking of results that often bear no relation to each other, Cuil offers two or three columns of entries that fit much better on the horizontal screens on which we all read. There is no clear hierarchy on each page of results -- no numbers or anything. Cuil excerpts seemingly random chunks of text from each result page. The snippets play a bigger part in the user experience than they do in Google. Most significantly, Cuil places a box of related subject tabs on the right side of the page. This can guide users to common idea areas or specific items they might have failed to recall when doing the initial search. One of my biggest problems with Google Web Search is that it presents the illusion of quality via rankings. That linearity undermines the important yet frustrating aspects of research: you don't always know what you are looking for or know what is best for you.

2) Cuil bases its searches on semantic relations and patterns, not user history or incoming links. This is a major break from the "tyranny of the majority" that is the central operational feature of PageRank. Certainly, Google has been working on folding in semantic measures of quality and relevance into PageRank for some time. But the folks at Cuil seem to believe in its potential as a replacement for the PageRank principle. This is refreshing. It means searches produce very different results on both engines. For instance, here is a common Google search for "Siva.":

SivaGooglesearch.jpg


Note the prominence of yours truly. This is one of the absurd things about Google and PageRank. There are almost a billion Hindus in the world. Don't you think that a search for the name of a Hindu god should be dominated by references to that God? Sure, I am one important Hindu! Don't get me wrong. But I am not as important as any god, Hindu or not. Oh, searches for "Thor" on both services yield links to motocross and Marvel Comics, as there are very few Thor worshipers left in the world. It's troubling that the once-relevant band Smashing Pumpkins and I get high billing in this Google search. That's because Google's PageRank privileges the cyberactive and motivated. So groups that tend to produce links -- like Smashing Pumpkins fans and Sivacracy friends, tend to have their results show up high, regardless of relative relevance in the real world.

Here is the same search on Cuil:


SivaCuilSearch.jpg

As you can see, this is a somewhat better (i.e. relevant for most of the world) search. It has no references to me or my blogs. But instead of the chief Smashing Pumpkins fan site (called Siva), it has a guitar chord and tab site about the song "Siva." However, Cuil fails to offer Wikipedia sites as high hits in many searches, including this one. This is a shame, as Wikipedia is a very helpful place to start reading about almost any subject (although it also has a cyberbias). Google, for more than a year, has placed Wikipedia pages very high in most search results. That's a very good thing. Overall, comparative searches are sometimes better on Cuil, sometimes better on Google, but very different almost always.

3) As a result of the linguistic turn, Cuil has no need to keep a dossier on us to improve search. This is one of Cuil's major selling points. And it certainly should make Cuil the search engine of choice for holligans, misfits, pottymouths, and n'erdowells of all stripes. But seriously, Cuil might be able to demonstrate that quality search need not depend on massive data collection and exploitation. If it fails, however, Google's triumph will once again generate calls for stronger global privacy regulation. So watch this contest closely.

These three factors raise all sorts of interesting questions.

• How does Cuil "learn?" One of the few benefits of massive personal data collection is that Google can train its searches over time to conform to our clicking choices (even if they are initially limited by PageRank in the first place).

• How will Cuil demonstrate its alleged superiority in search? This will be tough, as there is no clear or universal standard of quality.

• How can Cuil deal with other languages and scripts?

• How can Cuil fold in images, video, maps, etc. to their search results?

• Why would an advertiser go with Cuil over Google if it does not offer targeted ads based on massive data collection and mining?

• Will Google do as it always does -- welcome competition and swallow the best of its innovations?

• How can I raise my Cuil rank so I can remain "Internet famous" on every major search engine?

arrow

Comments (5)

Thanks for this review. I was wondering what you would have to say about Cuil. I did some playing with it as well, plus I read a bunch of reviews and their comment streams. Your post, I think, was more informative and incisive than anything else I looked at yesterday. So, thanks :^).

Jardinero1 on July 30, 2008 12:20 PM:

One thing I don't like about Google Search is its placement of Wikipedia at the top all the time. Yes, Wikipedia is a great place to start, but it's not very efficacious to tell searchers that fact anymore. Everyone, even my seventy something parents, check Wikipedia. Isn't it stating the obvious to place Wikipedia at the top of the rankings? Cuil's omission of Wikipedia is a welcome improvement in my opinion.

By the way, you stated a concern about "the tyranny of the majority". Well said, does this mean you are a "republican", little "r"? At least intellectually?

I did my homework too but although I hate monopolies I have to admit I'll stay with google.

I cuiled and googled the word "España"...cuil sent me to totally irrelevant links such as Hyundai in Spain...google showed me a map of Spain, pictures of Spain, the official touristic web-site from Spain, etc...

I don't care about the ranks or the graphic designs I care the information I am looking for should be relevant...so...Google it it...

Cuil does a counterproductively lousy job at searching for me. As someone who has been writing and speaking on tech-related topics for over a decade, it is easy to use Google to find out who I am, where I live, and what I've been up to for the last decade.

In stark contrast, of the 11 hits (why not 12?) on the first page of CUIL results, only 1 is actually useful. 6 contain the same tired and obsolete text for a series of articles I wrote in 97-8 for Java Developer's Journal. The others are all lame mashups, combining various quotes. It isn't until you get to about the 20th hit that you can find something from my current employer or some of my photo galleries.

http://www.cuil.com/search?q=%22Jay+Heiser%22

Google is exponentially more useful, returning mostly legitimate and useful hits, and not only weeding out all the dupes, but correctly identifying what I believe to be the canonical version of the series of 10-year old articles.

Granted, if I hadn't Cuiled myself, I wouldn't have known that the company which owns the JDJ IP apparently has a huge number of virtual domains with shared content and is encouraging search worms to arrive at the mistaken impression that they are pasting multiple copies of this aging content on dozens of sites. Why would the readers of Yacht Owners mag want to read a decade-old article about Java security?

henry delion on August 7, 2008 5:34 PM:

it's quite amazing that Cuil seems to not take Wikipedia seriously, considering they blatantly copy its category structure (!)

http://en.wikipedia.org/wiki/Wikipedia_talk:GFDL_Compliance#Cuil

Post a comment

We had to crank up the spam filter so it may take a little while to appear. Thanks.

(If you haven't left a comment here before, you may need to be approved by the site owner before your comment will appear. Until then, it won't appear on the entry. Thanks for waiting.)

A book in progress by

Siva Vaidhyanathan

Siva Vaidhyanathan

This blog, the result of a collaboration between myself and the Institute for the Future of the Book, is dedicated to exploring the process of writing a critical interpretation of the actions and intentions behind the cultural behemoth that is Google, Inc. The book will answer three key questions: What does the world look like through the lens of Google?; How is Google's ubiquity affecting the production and dissemination of knowledge?; and how has the corporation altered the rules and practices that govern other companies, institutions, and states? [more]

» Send links, questions and ideas:
siva [at] googlizationofeverything [dot] com

» To reach me for a press query, please write to SIVAMEDIA ut POBOX dut COM

» To reach me for a speaking invitation, please write to SIVASPEAK ut POBOX dut COM

» Visit my main blog: SIVACRACY.NET

» More about me

Topics

Like the Mind of God (57 posts)

All the World's Information (75 posts)

What If Big Ads Don't Work (20 posts)

Don't Be Evil (16 posts)

Is Google a Library? (84 posts)

Challenging Big Media (46 posts)

The Dossier (49 posts)

Global Google (26 posts)

Google Earth (6 posts)

A Public Utility? (37 posts)

About this Book (28 posts)

RSS Feed icon  RSS Feed


Powered by Movable Type 3.35