[Search-l] Parsed text samples

Linas Vepstas linasvepstas at gmail.com
Thu Jul 17 22:06:38 UTC 2008


2008/7/17 Jeremie Miller <jeremie at jabber.org>:
> It's definitely sexy hot, thanks Linas (and RelEx folks)!
>
> I'm looking forward to deploying this both in our map-reduce cluster on a
> large set of the top pages in the index (and posting the resulting data of
> course), as well as figuring out how we could better integrate this with
> Grub as a platform, I think the promise here to have rich tagged content is
> very very exciting :)

Well, someone still has to do the actual work of wiring all
of this stuff up. The example I gave is just the tip of the
iceberg: the simplest case, one where a query can be
performed at high speed using more or less the existing
query mechanisms.  There are certainly much fancier things
one can do (and these are the things I'm working on, but
for general use, not just search).

So we still need someone to stand up and say "gee,
I understood that last bit, I'll hook the data in, and see
how it works" -- measure and experiment, tune performance
(I know of some weak spots), and if it all works, try
out some of the fancier ideas.  I can brainstorm, guide,
and provide advice, but at least right now
I can't do the actual search-engine part of the work.
I am very much interested in the results, because at
least a part of what I need to accomplish requires a
fast search-like ability to find related concepts,
and this is the first step on that road.

--linas

(Linas wonders what Rich Jones plans to do for the
rest of gsoc...)


