[Search-l] Related concepts, function words, content words
Jimmy Wales
jwales at wikia.com
Thu Aug 7 18:13:22 UTC 2008
That's totally fascinating.
This is one reason I am such a skeptic for the world seeing any major
improvements in machine-generated search results anytime soon, whether
using semantic technology or whatever. Frankly, machines are still
pretty stupid, and stupid even in cases where there is pretty obviously
a HUGE amount of money to be made from having the machines "understand"
things just a little bit.
In your example case, pretty clearly what was needed was ads for books,
videos, or seminars about coaching - inspiring team members to show up
on time, etc. A human knows that immediately.
--Jimbo
Linas Vepstas wrote:
> I just noticed something curious about google's "related topics" function.
>
> I'd been reading gmail using the web browser, and there's always a list
> of ads that seem to be keyed off of keywords in the email. Today, none
> of the ads were keyed off of keywords ... instead, they were keyed off of
> broad sentiment.
>
> The actual email was from my rowing coach, bitching about how people
> failed to show up for practice, and how that makes everyone late on the
> water and changes the planned workout, etc. The ads were all about
> employee-employer relations -- how to fire employees, how to file
> workplace greivances, negotiating with unions, etc. Now, nowhere
> in the email did it use the words "employee", "union", "grievance",
> "fire", "discharge" -- but somehow google perceived the overall negative
> tone, and that it had to do with personal relationships. Its mistake was
> to assume its job-related. And yet -- none of the ads were for marriage
> counseling or spousal abuse -- so it could tell that this was a more formal
> setting -- it did not mistake it for a lover showing up late for a romantic
> dinner (no "buy her flowers" ads), or missing out on camping with your
> buddies (no "how to make friends" ads).
>
> Particularly of note is that it missed the obvious sports nature of the email:
> content words like "rowing" "water", "workout", "practice" and "boat" were in
> the email, and should have given a strong positive .. and yet these were
> overlooked, in favour of the much more vague non-content, functional
> phrases like "failing to show up".
>
> --linas
> _______________________________________________
> Wikia Search mailing list
> http://re.search.wikia.com/
> Change options or unsubscribe: http://lists.wikia.com/mailman/options/search-l
>
>
More information about the Search-l
mailing list