[Search-l] Interesting Article
Gérard Dupont
ger.dupont at gmail.com
Tue Jul 29 07:12:24 UTC 2008
I don't see the point "relevance ... based on the contents, not its
popularity"... That's what most of the system do. The most basic search
engine make a keyword index which is based on content. The point in page
rank is that it is much more wide because it needs an analysis of links
between lot of pages and maintain the graph of links.
Better the big index claimed or the "relevance based on content", I prefer
to talk about some minor feature (much less interesting for marketing of
course). The new result layout which looks like a journal is appealing to
me. At least that is a change from the classic ranked list. It contains
extended snippet (or it looks like that the snippet are bigger... whatever)
and they always include a small picture. IMHO that's relevant but do anyone
know where are those pictures from (that not thumbnail of the result neither
a picture from the result...). The category tool is also something good (but
not new) which comes from enterprise search tools where the category of
documents is much more important. They also have an interactive query
suggestion tool which is quite relevant for a one day old engine.
Classically, query suggestion is based on past queries... Either they
already have existing data (or the one day old data) either they use some
other approach.
Finally, I did not try the engine enough (neither made any benchmark) to
claim that is a good or a bad engine. But do anyone made such study ?
gdupont
2008/7/29 Aerik Sylvan <aerik at thesylvans.com>
> This was an interesting piece I saw (from the Mercury News):
>
> *The latest search engine* with ambitions to distinguish itself from
> Google and win a bit of mindshare is called Cuil <http://www.cuil.com/>(pronounced "cool," and BTW, can any of you marketing folks explain the
> wisdom of choosing a company name that needs a pronouncer?). Cuil, started
> by some ex-Googlers, emerged from stealth mode this morning, and at the
> moment, its only ambition is to survive a first day in which it was
> overwhelmed by both traffic and lukewarm<http://www.readwriteweb.com/archives/cuil_good_but_not_good_enough.php>-to-bad
> reviews.
>
> Among Cuil's selling points <http://www.cuil.com/info/news_press/> are an
> index of 120 billion Web pages -- three times the size of Google's, it
> claims -- and relevance that it says is based on the contents of a page, not
> its popularity.
>
>
> The size of the index is interesting, as well as "relevance ... based on
> the contents, not its popularity" ... wow, what a concept. Pagerank was
> brilliant and all, but (my soapbox again) I do not always want the most
> popular (entrenched) results.
>
>
> Best,
>
> Aerik
>
>
> On Sun, Jul 27, 2008 at 8:56 AM, Dennis Kubes <kubes at apache.org> wrote:
>
>> I found it interesting in that we recently passed 1 billion urls that we
>> know about.
>>
>> Dennis
>>
>> Jimmy Wales wrote:
>> > Dennis Kubes wrote:
>> >> http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html
>> >
>> > Weird.
>> >
>> > 1 trillion unique web pages. I am skeptical. That's 166 pages per
>> > person on earth. Or, if we assume there are 1 billion people online,
>> > that's 1,000 pages for every person online. I don't know about you, but
>> > I haven't written 1,000 web pages yet.
>> >
>> > If they are data-driven pages, that's interesting and all, but
>> > "counting" pages from a data-driven site is a bit silly. Even the blog
>> > post acknowledges this, by talking about how a calendar site has,
>> > theoretically, an infinite number of pages.
>> >
>> > --Jimbo
>> >
>> > _______________________________________________
>> > Wikia Search mailing list
>> > http://re.search.wikia.com/
>> > Change options or unsubscribe:
>> http://lists.wikia.com/mailman/options/search-l
>> _______________________________________________
>> Wikia Search mailing list
>> http://re.search.wikia.com/
>> Change options or unsubscribe:
>> http://lists.wikia.com/mailman/options/search-l
>>
>
>
>
> --
> http://www.wikidweb.com - the Wiki Directory of the Web
> http://tagthis.info - Hosted Tagging for your website!
>
> _______________________________________________
> Wikia Search mailing list
> http://re.search.wikia.com/
> Change options or unsubscribe:
> http://lists.wikia.com/mailman/options/search-l
>
--
Gérard Dupont
Information Processing Competence Center (IPCC) - EADS DS
http://weblab-project.org
Perception & Machine Learning team - LITIS Laboratory
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.wikia.com/pipermail/search-l/attachments/20080729/068e8238/attachment.html
More information about the Search-l
mailing list