From kubes at apache.org Tue Apr 1 03:08:34 2008 From: kubes at apache.org (Dennis Kubes) Date: Mon, 31 Mar 2008 22:08:34 -0500 Subject: [Search-l] Sorry to do this but its coming, yes a rant :-( In-Reply-To: References: <47F038AE.2000503@apache.org> Message-ID: <47F1A732.705@apache.org> Mark (Markie) wrote: > hmmm sent one reply to this, but forgot to comment :-p > > so, comments below > > mark > > On Mon, Mar 31, 2008 at 2:04 AM, Dennis Kubes > wrote: > > Hi Markie, > > First let me say that if anything has been missed, or promised and then > not delivered, it was not intentional. > > > okay so maybe there not intentional, but any chance of them being > sorted? :-p Absolutely. :) > > > Second, I would agree with you > that while we have been working to make changes to improve the accuracy > of the search results, we have not been doing a very good job of keeping > the community informed about those or other changes and that is > something we need to work on. > > For my part I will attempt to communicate more of what we are working on > in terms of the search engine internals, starting now. > > > excellent :-D many thanks > > > Probably the > biggest improvement we have seen in terms of relevancy is changing how > inbound link text is index. > > Inbound link text the text of anchors pointing to a page. We currently > index that text along with a given page. So for example if page x links > to page y and the anchor text reads "hotels" that text will get put into > the index under page y. The problem we were having was we would index > the first N number of links pointing to a page without regard for what > were the best links. That provided for some weird results when we > launched, for instance google.com would come up > in a search for dallas > hotels because it had one inbound link that said "dallas" and another > that said "hotels". To fix this we started looking and inbound links > according to the score of their parent (pointing from) page. The idea > behind this was that higher scoring pages would have better outbound > links. In our current index we first determine what the *best* links > are by their parent pages score and then index the first N best links. > And what we have seen as a result is a big increase in the relevancy of > the search results. > > > excellent, as this has been one of our major problems, so im glad to > hear that work is being done to sort the problem Funny thing is we didn't know the extent to which this change would help. Once we deployed it we saw a noticeable improvement. One of the problems I see is being able to determine if a change actually improves search results (besides just looking at the results). Any ideas on how to determine this? > > > > Here is a list of the things I see that could help improve search > relevancy going forward: > > - Being able to score elements of web pages. For example determine if a > piece of text is a h1, h2, div, etc. Currently our web pages parsers > don't support that. > > > are these codes available anywhere in wikia's svn? Many of the improvements are being made to Nutch directly. Any changes to Nutch are submitted back to the Apache JIRA and if the Nutch community oks them to the Apache SVN repository. A lot of the discussion for Nutch internals would happen on the Nutch user and dev mailing lists. Dennis > > > > > - Better integration of the star system into the rankings and better > ability for the community to tag pages as spam. This is part of the KT > stuff Jer has been working on. > > > :-D > > > > > - Overall improvement in the search algorithm. Currently the algorithm > is based on nutch's OPIC implementation. Long story short this > algorithm is unstable after a few iterations because web page score keep > increasing exponentially. This is more of a Nutch problem and has > already been discussed on the Nutch lists but essentially we need a new > process for scoring and probably a new algorithm that is more > pagerank-like and has some type of convergence. > > There are other items as well but I think these things would help show a > dramatic improvement in search quality. > > Last let me say that anybody should feel free to email me at any time. > If something isn't being done fast enough or something seems to be > getting left out. Give me a nudge. :) > > > /me adds you to contacts :-p > > > > Dennis > > > Mark (Markie) wrote: > > re sending in case it was missed, from 4/5 days ago, maybe the people > > copied in (wikia staff/founders) would be willing to give a small > amount > > of time to reply?!? > > > > mark > > > > ---------- Forwarded message ---------- > > From: *Mark (Markie)* > > >> > > Date: Wed, Mar 26, 2008 at 11:13 PM > > Subject: Sorry to do this but its coming, yes a rant :-( > > To: Mailing list for Search Wikia > > >>, Search > Wiki > > >>, > Jimmy Wales > > >>, jer > > > >>, > dennis at igfoo.com > > > > > > > Right, im afraid the time has come once again where i have been > > wondering to my self again, and i feel that things need to be > said, so > > here they are. > > > > *Whats happening with the project. AFAIK overall (and i know > somethings > > have happened) but *very* little seems to have happened since the > > launch. Now i know that things are probably happening with the team, > > but any chance of actually telling the users about this, cos its not > > looking good from here atm. > > > > Ive copied in the so called pillars of search > > > > 1. *Transparency* - riiiiiiight :-( > > 2. *Community* - hmmm contribute to stale projects? > > 3. *Quality* - well.... > > 4. *Privacy * - > hmm yes > > that seems to have been done to an extent ( by the > community mind) > > > > > > Ive been on the project since dec 2006, and so have been waiting > along > > time for this to happen, so its not purely a case of i want > everything > > to happen NOW, i just want it to look like SOMETHING will happen > SOON. > > > > *This brings me onto the next topic of where is the project going??? > > There has been practically no progress, and frankly i cant see much > > being done from my point. The launch has happened, many people were > > interested, contributed but have now left, because NOTHING has > happened. > > so overall the net gain of launching the project?? bad press and > a few > > (relative to the web) minis. > > > > *Many things have been promised by various people, which havent > > happened. Most specifically this has come from a certain member of > > staff, one specifically, that has said that they will do many things, > > but even the most basic of tasks seem to have not happened. so > > Broken/missed promises. Well iirc (name here) said he would make sure > > that the about pages etc were created, hmm... > > (http://alpha.search.wikia.com/about.html in case you forgot > where those > > were). This is a wikia project, any chance of getting ANY > > involvement/input/co-ordination from the team who, ultimately, > want us > > to make them more successfull and a profit (if were being frank). > > > > Now i know i havent been that active recently on the wiki, but i have > > been reading the mailing lists and talking in irc, but the main > reason > > for me not being active on the wiki, is mainly the fact that i > just dont > > have the motivation to do anything because of the above. Frankly atm > > its a stale project, but hopefully this rant (which i hate doing) > will > > mean that the project will hopefully become better. > > > > If i have offended anyone above then i am sorry, but i feel that > certain > > things need to be said right now, in order to make the project > better, > > which is my aim. > > > > Many thanks and look forward to the responses to this, especially > from > > wikia staff > > > > Regards > > > > mark > > > > (user:Markie) > > > > > > > ------------------------------------------------------------------------ > > > > _______________________________________________ > > Wikia Search mailing list > > http://alpha.search.wikia.com/ > > Change options or unsubscribe: > http://lists.wikia.com/mailman/options/search-l > _______________________________________________ > Wikia Search mailing list > http://alpha.search.wikia.com/ > Change options or unsubscribe: > http://lists.wikia.com/mailman/options/search-l > > From aerik at thesylvans.com Tue Apr 1 03:48:23 2008 From: aerik at thesylvans.com (Aerik Sylvan) Date: Mon, 31 Mar 2008 20:48:23 -0700 Subject: [Search-l] Sorry to do this but its coming, yes a rant :-( In-Reply-To: <47F1A732.705@apache.org> References: <47F038AE.2000503@apache.org> <47F1A732.705@apache.org> Message-ID: <355a36af0803312048y553f2f14i499320b124ce73ae@mail.gmail.com> On Mon, Mar 31, 2008 at 8:08 PM, Dennis Kubes wrote: > . One of the > problems I see is being able to determine if a change actually improves > search results (besides just looking at the results). Any ideas on how > to determine this? > At the risk of being dense/overly stubborn: direct feedback coupled with indirect feedback? The direct feedback would be a score given by willing participants. Indirect feedback is clickthroughs, stuff like that. I really think the area where wikia search has a chance to shine is in how we engage participants in the process. I'm *sure* google and the other big guys collect indirect feedback. They've got an army of scientists. We probably won't beat them at that game, but we shouldn't ignore indirect feedback and pagerank style algorithms either.. BUT, where we can be diiferent - maybe even better - is in engaging the direct involvement of the community. I don't mean picking the entrenched results for a given search, as Mahalo is doing (BORING! I *still* say it's "just" a spin on the dmoz approach - not that it's bad, it just isn't new or impressive any more) but instead engage the community to continually provide feedback on results and keep the results *fresh*. Incorporate ideas like Alexas movers and shakers. I have always like how netfilx asks you how you liked a movie, after you've returned it. Maybe we can think of better methods. The trick is going to be volume. If you have 10 million searches, and 1 percent of the searches have active feedback, that's 100,000 data points. Given an assumed very wide spread (that 100,000 points will have a very small head and a very long tail), it is hard to draw statistically significant conclusions. So, you need either a) more searches or b)a higher percentage of feedback. Not sure how to tackle that, but if any project ever was going to do it, it's this one. Best Regards, Aerik -- http://www.wikidweb.com - the Wiki Directory of the Web http://tagthis.info - Hosted Tagging for your website! -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.wikia.com/pipermail/search-l/attachments/20080331/bf632754/attachment-0001.html From corinne.keller at unisg.ch Tue Apr 1 08:17:37 2008 From: corinne.keller at unisg.ch (Corinne Keller) Date: Tue, 1 Apr 2008 10:17:37 +0200 Subject: [Search-l] Corinne Keller ist ausser Haus. Message-ID: Ich werde ab 28.03.2008 nicht im B?ro sein. Ich kehre zur?ck am 07.04.2008. Ich werde Ihre Nachricht gerne nach meiner R?ckkehr beantworten. F?r dringende Fragen zu EDIS wenden Sie sich bitte an Frau Manuela Wagner (manue-la.wagner at unisg.ch). Freundliche Gr?sse, Corinne Keller ---------- I will answer to your e-mail as soon as I return. For urgent questions concerning EDIS, please contact Ms Manuela Wagner (manuela.wagner at unisg.ch) Kind regards, Corinne Keller -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.wikia.com/pipermail/search-l/attachments/20080401/7c49f9e5/attachment.html From marcnaweb at gmail.com Wed Apr 2 19:28:49 2008 From: marcnaweb at gmail.com (Marc .) Date: Wed, 2 Apr 2008 16:28:49 -0300 Subject: [Search-l] Humanitarian Search - business model Message-ID: <5f2640d0804021228x17a954e7udfc3871a8b8e3cfb@mail.gmail.com> Hello all, Reading this french article I realized that a "humanitarian search business model" for Wikia can fit perfectly with our concept and can strongly help to find resources for the search engine development. The idea is simple: when clicking on an ad during the search, a part --or full-- revenue is given to NGOs. The NGOs is chosen the users. Here is the article links and some site that use the idea (both French, sorry): http://www.lemonde.fr/web/chat/0,46-0 2-3234,55-1027667 51-893669,0.html www.doona.fr www.veosearch.com Please, let me know what you think about it. BR Marc Rosenfeld From jeremie at jabber.org Thu Apr 3 06:05:54 2008 From: jeremie at jabber.org (jer) Date: Thu, 3 Apr 2008 01:05:54 -0500 Subject: [Search-l] for the curious Message-ID: <80E9B95F-EB63-437C-8CFB-3C18DA282BAB@jabber.org> If you'd like to "watch" the iterative progress on a new version of the search results page, you can play with it live here: http://re.search.wikia.com/BEWARE/ (as the name implies, it's only barely working and quite a bit is missing/buggy) And also use subversion to fetch the newest copies and check the changelog for notes on the progress via: svn co http://svn.swlabs.org/re.search (the result html/js/css is in the "cool" subdirectory) There's a lot of features stubbed out and it will likely be evolving hourly and daily for the next few weeks as some dust settles. I'll be sure to send out another notice here once it starts to stabilize a little as well. Any development related discussions please direct to the search- ui at wikia.com mailing list. Jer From marcnaweb at gmail.com Sat Apr 5 03:41:20 2008 From: marcnaweb at gmail.com (Marc .) Date: Sat, 5 Apr 2008 00:41:20 -0300 Subject: [Search-l] Fwd: Humanitarian Search - business model In-Reply-To: <5f2640d0804021228x17a954e7udfc3871a8b8e3cfb@mail.gmail.com> References: <5f2640d0804021228x17a954e7udfc3871a8b8e3cfb@mail.gmail.com> Message-ID: <5f2640d0804042041r646a6b25obd13c8b9a530597b@mail.gmail.com> Hello all, Reading this french article I realized that a "humanitarian search business model" for Wikia can fit perfectly with our concept and can strongly help to find resources for the search engine development. The idea is simple: when clicking on an ad during the search, a part --or full-- revenue is given to NGOs. The NGOs is chosen the users. Here is the article links and some site that use the idea (both French, sorry): http://www.lemonde.fr/web/chat/0,46-0 2-3234,55-1027667 51-893669,0.html www.doona.fr www.veosearch.com Please, let me know what you think about it. BR Marc Rosenfeld From chrisdesouza at yahoo.com Sat Apr 5 06:18:51 2008 From: chrisdesouza at yahoo.com (Chris Desouza) Date: Fri, 4 Apr 2008 23:18:51 -0700 (PDT) Subject: [Search-l] Fwd: Humanitarian Search - business model In-Reply-To: <5f2640d0804042041r646a6b25obd13c8b9a530597b@mail.gmail.com> Message-ID: <122363.53170.qm@web54108.mail.re2.yahoo.com> If good need be done, do it quietly. If ad clicks will be shared with or donated to NGO's, then do it quietly. There is no purpose in elevating charitable discourse in public to compromise with user experience. Too many advertisers are paying through their nose for undeserving clicks and all major search engines are complicit in this. Fraud, copyright infringement is no longer the domain of folks like me and you. The foundation of many a success on the web is built upon stolen goods. If I am paying for my ads, I'd care less which charity benefits, when my ad dollars are being held ransom to causes and fancy unrelated to my enterprise and more so, when my ad dollars are used as bait to extract income for altruistic purposes. A better way to actualize charitable goals is to ask folks to buy something from advertisers. For far too long, advertisers are sitting ducks in the Internet pond. I am not against charity. But, just not at the expense of advertisers. Not when they have no say in this. If you would rather not agree, try opening an Adwords account for your campaign. You will know! Chris "Marc ." wrote: Hello all, Reading this french article I realized that a "humanitarian search business model" for Wikia can fit perfectly with our concept and can strongly help to find resources for the search engine development. The idea is simple: when clicking on an ad during the search, a part --or full-- revenue is given to NGOs. The NGOs is chosen the users. Here is the article links and some site that use the idea (both French, sorry): http://www.lemonde.fr/web/chat/0,46-0 at 2-3234,55-1027667 at 51-893669,0.html www.doona.fr www.veosearch.com Please, let me know what you think about it. BR Marc Rosenfeld _______________________________________________ Wikia Search mailing list http://alpha.search.wikia.com/ Change options or unsubscribe: http://lists.wikia.com/mailman/options/search-l --------------------------------- You rock. That's why Blockbuster's offering you one month of Blockbuster Total Access, No Cost. -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.wikia.com/pipermail/search-l/attachments/20080404/44028ae8/attachment.html From jeremie at jabber.org Fri Apr 11 04:38:39 2008 From: jeremie at jabber.org (jer) Date: Thu, 10 Apr 2008 23:38:39 -0500 Subject: [Search-l] thought this might be interesting for the group Message-ID: <0672E568-A472-44A3-8AEF-2F46D6D0E5FE@jabber.org> http://www.eurekalert.org/pub_releases/2008-04/ps-rcw041008.php Researchers classify Web searches Although millions of people use Web search engines, researchers show that ? by using relatively simple methods ? most queries submitted can be classified into one of three categories. Jim Jansen, assistant professor in Penn State's College of Information Sciences and Technology, worked with IST undergraduate Danielle Booth and Amanda Spink, Queensland University of Technology, to find that Web search engine users are doing primarily informational, navigational or transactional searching. Informational searching involves looking for a specific fact or topic, navigational searching seeks to locate a specific Web site and transactional searching looks for information related to buying a particular product or service. The research was the first published work of its kind done using actual searching data, with the aim of real-time classification. Researchers analyzed more than 1.5 million queries from hundreds of thousands of search engines users. Findings showed that about 80 percent of queries are informational and about 10 percent each are for navigational and transactional purposes. Jansen and his colleagues arrived at those results by selecting random samples of records and analyzing query length, the order of the query in the session and the search results. These fields helped the team develop an algorithm that classified the searches with a 74-percent accuracy rate. "Other results have classified comparatively much smaller sets of queries, usually manually," Jansen said. "This research aimed to classify queries automatically. "Our findings have broad implications for search engines and e- commerce if they can classify the user intent of queries in real time. This is why we wanted a computational undemanding algorithm," Jansen continued. "It proves the 80/20 rule that 80 percent of the cases can be achieved with these clear-cut methods." The paper "Determining the informational, navigational and transactional intent of Web queries" will appear in the May 2008 issue of Information Processing & Management. The article is currently available online. The Penn State researcher said he plans to continue this research using a more complex algorithm that will hopefully yield a 90-percent accuracy rate using similar searching criteria. From jeremie at jabber.org Sat Apr 12 01:24:00 2008 From: jeremie at jabber.org (jer) Date: Fri, 11 Apr 2008 20:24:00 -0500 Subject: [Search-l] index update and request for seeds Message-ID: <78CC33A6-FB79-40BC-9804-31E9AC70E3DA@jabber.org> The index was refreshed again this week and rolled out yesterday (thanks Seth and Dennis!), it was a crawl of the top ~25M pages based on a source of over 800M URLs. Result quality has some minor improvements, but more significantly this update enabled language detection (see http://wiki.apache.org/nutch/ LanguageIdentifierPlugin). The results page doesn't take advantage of this for any automatic handling yet, but you can add a lang:?? to any search as a filter. For the fun of it, I did a search of all the ISO two-letter language codes and got the total results for each, it's attached below. Seeds: the index is built and served with Nutch and we're preparing a compressed copy to put up for download via bittorrent, it should be about 290GB. If anyone has that much space and a decent connection and would be willing to help seed this just email me directly and I'll get you going with a copy. Also, besides lang:, here's a list of the other filters this index supports: url: (in the url) anchor: (in the incoming anchor text) content: (specifically just in the content) title: (title of the page) host: (just in the hostname) Jer AA 7 AB 46 AF 626 AM 21 AR 15265 AS 3 AY 0 AZ 212 BA 31 BE 283 BG 4868 BH 0 BI 0 BN 50 BO 4 BR 223 CA 14187 CO 7 CS 74257 CY 311 DA 149332 DE 1439097 DZ 1 EL 13071 EN 9602332 EO 1640 ES 611212 ET 1112 EU 917 FA 4239 FI 54938 FJ 0 FO 46 FR 791189 FY 9 GA 97 GD 111 GL 1053 GN 5 GU 35 HA 2 HE 10609 HI 896 HR 3591 HU 43842 HY 96 IA 13 IE 14 IK 1 ID 326 IS 45142 IT 595428 IU 0 JA 265533 JV 1 KA 43 KK 7 KL 1 KM 8 KN 27 KO 3070 KS 0 KU 58 KY 1 LA 184 LN 0 LO 6 LT 4633 LV 1519 MG 6 MI 234 MK 151 ML 10 MN 15 MO 1 MR 9 MS 72 MT 12 MY 8 NA 11 NE 116 NL 390734 NO 131595 OC 25 OM 4 OR 0 PA 12 PL 351849 PS 4 PT 180741 QU 1 RM 14 RN 0 RO 11080 RU 302048 RW 0 SA 63 SD 0 SG 9 SH 36 SI 153 SK 8127 SL 1486 SM 2 SN 0 SO 12 SQ 133 SR 1190 SS 1 ST 3 SU 0 SV 102920 SW 48 TA 91 TE 12 TG 1 TH 10203 TI 1 TK 3 TL 22 TN 0 TO 3 TR 32021 TS 1 TT 13 TW 1664 UG 0 UK 2394 UR 74 UZ 4 VI 341 VO 0 WO 0 XH 0 YI 7 YO 0 ZA 17 ZH 36847 ZU 37 From jeremie at jabber.org Mon Apr 14 15:21:53 2008 From: jeremie at jabber.org (jer) Date: Mon, 14 Apr 2008 10:21:53 -0500 Subject: [Search-l] making progress on the new results Message-ID: <82363877-FE6C-419A-A407-6073E9CC7063@jabber.org> It's still very experimental and nowhere near any kind of finished/ polished state, but I'd like to encourage anyone interested in seeing where this is heading to check out our progress. Right now the results page allows anyone to add, delete, and edit any result, and it saves and loads with those changes for everyone. There's also a "spotlight" feature where any single result item can be given special attention. The star ratings work, and there's a super rudimentary global changes page. Finally, there's also a very simple "related searches" feature where anyone can add other searches related to the current one (and it saves them for everyone). It's only been tested in Safari 3 and Firefox 3 so far, there's also a few little (and very experimental) surprises that may or may not work :) http://re.search.wikia.com/BEWARE/ You can also follow the progress daily via subversion at http://svn.swlabs.org/kt/cool and there will be a trac set up to help track bugs/etc as soon as it starts to settle down. Enjoy! Jer From aerik at thesylvans.com Mon Apr 14 16:31:05 2008 From: aerik at thesylvans.com (Aerik Sylvan) Date: Mon, 14 Apr 2008 09:31:05 -0700 Subject: [Search-l] making progress on the new results In-Reply-To: <82363877-FE6C-419A-A407-6073E9CC7063@jabber.org> References: <82363877-FE6C-419A-A407-6073E9CC7063@jabber.org> Message-ID: <355a36af0804140931j6dab889frb647fa4cc60f7e7e@mail.gmail.com> On Mon, Apr 14, 2008 at 8:21 AM, jer wrote: > It's still very experimental and nowhere near any kind of finished/ > polished state, but I'd like to encourage anyone interested in seeing > where this is heading to check out our progress. > > Right now the results page allows anyone to add, delete, and edit any > result, and it saves and loads with those changes for everyone. > There's also a "spotlight" feature where any single result item can be > given special attention. The star ratings work, and there's a super > rudimentary global changes page. Finally, there's also a very simple > "related searches" feature where anyone can add other searches related > to the current one (and it saves them for everyone). > Very cool, Jer. It's really starting to look like something, and the (not quite finished) features are interesting and different from other stuff out there. You can see the vision taking shape. Cool :-) Aerik -- http://www.wikidweb.com - the Wiki Directory of the Web http://tagthis.info - Hosted Tagging for your website! -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.wikia.com/pipermail/search-l/attachments/20080414/0966b040/attachment.html From natanael.l at gmail.com Tue Apr 15 06:39:58 2008 From: natanael.l at gmail.com (Natanael) Date: Tue, 15 Apr 2008 08:39:58 +0200 Subject: [Search-l] making progress on the new results In-Reply-To: <355a36af0804140931j6dab889frb647fa4cc60f7e7e@mail.gmail.com> References: <82363877-FE6C-419A-A407-6073E9CC7063@jabber.org> <355a36af0804140931j6dab889frb647fa4cc60f7e7e@mail.gmail.com> Message-ID: I really like this! This is what I wanted to see when testing this search engine! On 4/14/08, Aerik Sylvan wrote: > > > > On Mon, Apr 14, 2008 at 8:21 AM, jer wrote: > > > It's still very experimental and nowhere near any kind of finished/ > > polished state, but I'd like to encourage anyone interested in seeing > > where this is heading to check out our progress. > > > > Right now the results page allows anyone to add, delete, and edit any > > result, and it saves and loads with those changes for everyone. > > There's also a "spotlight" feature where any single result item can be > > given special attention. The star ratings work, and there's a super > > rudimentary global changes page. Finally, there's also a very simple > > "related searches" feature where anyone can add other searches related > > to the current one (and it saves them for everyone). > > > > Very cool, Jer. It's really starting to look like something, and the (not > quite finished) features are interesting and different from other stuff out > there. You can see the vision taking shape. > > Cool :-) > > Aerik > > > > -- > http://www.wikidweb.com - the Wiki Directory of the Web > http://tagthis.info - Hosted Tagging for your website! > _______________________________________________ > Wikia Search mailing list > http://alpha.search.wikia.com/ > Change options or unsubscribe: > http://lists.wikia.com/mailman/options/search-l > -- If everybody are thinking alike, then somebody aren't thinking || Stupidity is a renewable resource -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.wikia.com/pipermail/search-l/attachments/20080415/cb4529c0/attachment.html From jeremie at jabber.org Thu Apr 17 18:20:42 2008 From: jeremie at jabber.org (jer) Date: Thu, 17 Apr 2008 13:20:42 -0500 Subject: [Search-l] article: "20 (Rare) Questions for Google Search Guru Udi Manber" Message-ID: <508F5168-AAF9-4942-9620-63EE4ED27E71@jabber.org> I found it to be a great summary of the current state of Google's attitude towards search, figured everyone on this list would probably be interested as well :) http://www.popularmechanics.com/blogs/technology_news/4259137.html?nav=RSS2 Jer From jason at calacanis.com Thu Apr 17 18:22:19 2008 From: jason at calacanis.com (=?utf-8?B?SmFzb24gTWNDYWJlIENhbGFjYW5pcw==?=) Date: Thu, 17 Apr 2008 18:22:19 +0000 Subject: [Search-l] article: "20 (Rare) Questions for Google Search Guru UdiManber" In-Reply-To: <508F5168-AAF9-4942-9620-63EE4ED27E71@jabber.org> References: <508F5168-AAF9-4942-9620-63EE4ED27E71@jabber.org> Message-ID: <110944579-1208456565-cardhu_decombobulator_blackberry.rim.net-899527592-@bxe008.bisx.prod.on.blackberry> Good read Udi is smart --------------- Jason at Calacanis.com | Mobile: 310-456-4900 http://www.calacanis.com | http://www.mahalo.com Executive Assistant: admin at calacanis.com -----Original Message----- From: jer Date: Thu, 17 Apr 2008 13:20:42 To:Mailing list for Search Wikia Subject: [Search-l] article: "20 (Rare) Questions for Google Search Guru Udi Manber" I found it to be a great summary of the current state of Google's attitude towards search, figured everyone on this list would probably be interested as well :) http://www.popularmechanics.com/blogs/technology_news/4259137.html?nav=RSS2 Jer _______________________________________________ Wikia Search mailing list http://alpha.search.wikia.com/ Change options or unsubscribe: http://lists.wikia.com/mailman/options/search-l From thindil2 at gmail.com Sat Apr 19 16:34:03 2008 From: thindil2 at gmail.com (Bartek Jasicki) Date: Sat, 19 Apr 2008 18:34:03 +0200 Subject: [Search-l] Informations about work progress Message-ID: <20080419183403.18baea44@vallheru.vallheru> Hi If someone is interested in work progress about Wikia Search - I set few thigs, which can help trace newest changes in project Subversion repositories. 1. On IRC channel #searchwikia is bot CIA-# which send informations about newest changes in repositories. 2. http://people.swlabs.org/~bartek/watcher - simple web RSS aggregator - all informations in one place Few informations: All informations are provided via http://cia.vc site. For now may be a little delay between changes and information about (around 15 minutes, but sometimes it can be larger). For now are tracked: - GrubNG - http://cia.vc/stats/project/grubng - SearchWikia Search Interface - http://cia.vc/stats/project/wikiasearch-searchui - Nutch - http://cia.vc/stats/project/nutch - HBase - http://cia.vc/stats/project/hbase If someone want add any other project to tracked list, simply send me info which and I add ;) Bartek From thomasasta at gmx.net Sun Apr 20 11:50:53 2008 From: thomasasta at gmx.net (thomasasta at gmx.net) Date: Sun, 20 Apr 2008 13:50:53 +0200 Subject: [Search-l] BiblioteQ - Library Search & Indexing Management Tool for Windows released Message-ID: <20080420115053.97080@gmx.net> BiblioteQ - Library Search & Indexing Management Tool for Windows released http://biblioteq.sourceforge.net/ The first windows version of the Library database tool BiblioteQ has been released, please evaluate and test it with your local library databases... Thanks -- Psst! Geheimtipp: Online Games kostenlos spielen bei den GMX Free Games! http://games.entertainment.gmx.net/de/entertainment/games/free From newsmarkie at googlemail.com Mon Apr 21 16:50:34 2008 From: newsmarkie at googlemail.com (Mark (Markie)) Date: Mon, 21 Apr 2008 17:50:34 +0100 Subject: [Search-l] Google image labeler Message-ID: hmm interesting google are roping people in to label images with tags, to "help improve the quality of Google's image search results" more at http://images.google.com/imagelabeler/ regards mark -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.wikia.com/pipermail/search-l/attachments/20080421/665867a9/attachment.html From borboleta at gmail.com Mon Apr 21 17:30:00 2008 From: borboleta at gmail.com (Bani) Date: Mon, 21 Apr 2008 14:30:00 -0300 Subject: [Search-l] Google image labeler In-Reply-To: References: Message-ID: <35b94d690804211030w72ad6ebct6b26c23e9978fd54@mail.gmail.com> Apparently they pay attention to what the people who go present lectures there say: http://video.google.com/videoplay?docid=-8246463980976635143&hl=en Bani On Mon, Apr 21, 2008 at 1:50 PM, Mark (Markie) wrote: > hmm interesting > > google are roping people in to label images with tags, to "help improve the > quality of Google's image search results" > > more at http://images.google.com/imagelabeler/ > > regards > > mark > > _______________________________________________ > Wikia Search mailing list > http://alpha.search.wikia.com/ > Change options or unsubscribe: > http://lists.wikia.com/mailman/options/search-l >