[Search-l] Search Team Update: July 8, 2008
Dan Lewis
dan at wikia-inc.com
Tue Jul 8 20:40:59 UTC 2008
Here's what the Search team did last week:
Nutch:
1) Finished LinkRank algorithm
2) Finished LinkLoop idetifier tool
3) Finished LinkDumper tool
4) Finished NodeDumper tool
5) Finished NodeReader tool
6) Finished WebGraph Tool
7) Started working on 302 redirect errors in results
This is pretty exciting in and of itself. It means that Nutch, and
therefore Wikia Search, now has a stable link analysis algorithm in place
that handles reciprocal links, link loops, most link farms and tight knit
communities. The algorithm, therefore, will be able to consider not only
the content of a page, but also the links incoming, to determine the
relevancy and strength of each page on a keyword-by-keyword basis.
The link analysis suite is not yet deployed -- give it a few days.
We'll have more on the link analysis tool later this week.
Operations:
1) Put some time in on more Grub backend tools
2) More work on re-indexing and scoring updates -- this is for the link
analysis tools
3) Worked with the ISC to try and resolve some networking and storage issues
Search Tools:
1) Added support for multiple keywords in a single KT call.
2) Metadata is not returned with all KT calls.
3) Started addition of url table and sort / fetch KT changes by url.
4) Worked on Firefox Toolbar
5) Began work on Advanced URL Add Functionality
6) Worked on spelling suggestions tool
Community:
1) Launched Wikia Search blog
2) Began work on People You May Know tool
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.wikia.com/pipermail/search-l/attachments/20080708/0ff209ea/attachment.html
More information about the Search-l
mailing list