[Search-l] wrong questions

Jeremie Miller jeremie at jabber.org
Mon May 12 06:33:53 UTC 2008


Excellent meaty thread Paul, cool :)

As asked I won't touch on the meta-topics, but on the tech side I at  
least want to draw an analogy: I'm seeing this process of building an  
open search platform evolve as more of a frankenstein, combining  
protocols, code, piles of data, lots of experiments, and gobs of  
zelektrisity!  I also think we're a ways away yet from seeing the  
vision really start to come together, I didn't expect this to be easy,  
*g*.

/me goes to tend to his pile of half working code and formats...

Jer

On May 11, 2008, at 5:53 PM, Paul Vixie wrote:

> various folks here, as well as sethf in his blogs, have asked plenty  
> of hard
> questions about wikia.com's search project -- who owns it, is it  
> truly open,
> what is wikia's agenda, and so on.  these are the wrong questions,  
> since they
> focus on the company who sponsors the work, and not on the work  
> itself.  i've
> been trying to think of the right questions.  here's an  
> approximation of what
> critics of this project ought to be investigating.
>
> 1. why is ISC's the only backend?  jer's vision is backend  
> syndication, so,
> if his XML schema is stable and if there's at least one f/l/oss  
> implementation
> of crawling and of indexing, then, why aren't there more crawlers  
> and more
> indexers, conforming to jer's XML, possibly flooding data between  
> each other
> and possibly dividing up the workload so that all crawlers don't  
> have to
> crawl all sites?  ISC ought to have peers, and we ought to be able  
> to have
> gentlemen's agreements, like, "we'll do [a-l].com, you do [m- 
> z].com", etc.
>
> 2. why is Wikia's the only frontend?  again referring to the  
> syndication
> model, and knowing that there are other "social search engines",  
> when will
> we see someone other than wikia use ISC's backend, or any other  
> backend whose
> data can be reached using jer's XML?
>
> 3. who is driving the syndication model?  it's clear that ISC knows  
> how to
> provide network and power, and that jer knows how to design the  
> system and
> build various parts of it, but who is the champion for jer's vision  
> -- who
> will drive us to better answers for #1 and #2 above?  who ought to  
> be in here
> answering critics and beating the drum, which is a distraction to  
> jer (and
> candidly he's too busy to do this part well unless he drops other  
> stuff
> that's already late)?  remembering that jimbo keeps this issue alive  
> in the
> press, the overall project still lacks a day to day "programme  
> manager".
>
> 4. what else is jer working on?  has wikia dedicated him to this  
> project or
> does he also handle day to day fire fighting on wikia's existing  
> services to
> justify his paycheck?  and while we're on that topic, what other  
> personnel
> has wikia dedicated to this -- how seriously are they really taking  
> it, in
> terms of cash on the barrel head?
>
> 5. who else is working on this, outside of wikia?  what outside  
> volunteers
> or wikia competitor's employees have commit access to the source  
> pool for
> the crawler, or indexer, or front end, or have root access to the  
> donated
> back end machines hosted by ISC?  if the answer is nobody, then is  
> that due
> to lack of outreach (see #3 above) or is it wikia's preference that  
> outsiders
> contribute content rather than code and sysops?  (is that written  
> anywhere?)
>
> 6. where are the mini-articles stored?  if outside volunteers are  
> mostly
> contributing data, is that data stored on wikia's front end?  if so,  
> what are
> the redistribution terms -- would wikia flood this data to competing  
> front
> end operators, and accept incoming floods of similar data from  
> competitors?
> or, is this the "secret sauce", there's no way to get access to  
> contributed
> data of this kind except one article at a time, inside wikia's  
> advertising
> system?
>
> 7. given that the idea of "taking on google" is silly, given their  
> size and
> focus and ambition and brand strength and so on, and that what we can
> actually hope to achieve with this project is to change the game and  
> make
> search part of the internet infrastructure, where are the white  
> papers,
> journal articles, and outreach glossies explaining what the new  
> world of
> internet search could look like, and what effect this change will  
> have on
> google, microsoft, yahoo, and the current market hierarchy, and the  
> rest of
> the "social search" scene?
>
> 8. has anybody reached out to yahoo and microsoft to see if they'd  
> like to
> join this effort or at least sponsor it, since as #2 and #3 in  
> internet
> search today, they're the ones with the most to gain if we change  
> the game.
> and if nobody's doing this now, and i did it, what would wikia say  
> about
> sharing the sponsorship burden with other players, perhaps larger  
> players?
>
> this list of questions isn't meant to be exhaustive.  but as in my own
> controversial efforts over the years, i find the quality of  
> criticism here
> somewhat low.  forget about jimbo and his tv news girlfriend and the  
> instant
> messenger chat logs.  forget about wikia's corporate interests, or  
> whether
> wikipedia was a once in a lifetime event, or whether wikipedia's  
> admins are
> running amok, or what the wikimedia foundation board is up to.   
> there are
> plenty of excellent questions about the wikia-sponsored search  
> project whose
> backend is hosted at ISC, which are not salacious or even delicate.   
> the
> above list is meant to show the kind of questions i mean.
>
> for the record, i'd like it if jer did not jump in and answer any of  
> these,
> because he's got more important stuff to do, and because i'd like to  
> see
> wikia provide another face, another name, another voice, to this  
> forum.
>
> also for the record, ISC's hosting of this project has been a cash  
> neutral
> event for us, which is important since we don't have cash for this  
> kind of
> thing.  the 15-ton air handler wikia bought feeds a room that has  
> other
> projects in it too, and our network is a fixed cost, and wikia has  
> agreed
> to pay for the power we use for search, and the servers were all  
> donated,
> and that donation was targetted for this project, and we got a lot  
> more
> servers than we needed, and we've been passing the excess along to  
> other
> f/l/oss and internet security projects.  so no matter whether this  
> project
> changes the world, ISC is already winning.
> -- 
> Paul Vixie
> _______________________________________________
> Wikia Search mailing list
> http://alpha.search.wikia.com/
> Change options or unsubscribe: http://lists.wikia.com/mailman/options/search-l
>




More information about the Search-l mailing list