[Grub-dev] Taking on Grub Java

Bartek Jasicki thindil2 at gmail.com
Thu Jan 8 09:41:10 UTC 2009


On 2009-01-07, at 12:59:55
"Yousef Ourabi" <yourabi at zero-analog.com> wrote:

> Feel free to take the ball and run with it.
> 
> I'm still waiting for the server and crawl data to be open-sourced
> since that was originally the supposed goal of this project.
> 
> I've been waiting for a year and still waiting.
> 
> -Yousef
> 

A little off-topic:

The current Grub servers are nothing more than a few Perl scripts and an
Apache server. Probably all of them have been available in the SVN
repository for some time:

http://svn.swlabs.org/grubng/trunk/perl/robo.pl - checking robots.txt
files

http://svn.swlabs.org/grubng/trunk/perl/workunit.pl - generating new
workunits

http://svn.swlabs.org/grubng/trunk/perl/dispatch.cgi - dispatch server

http://svn.swlabs.org/grubng/trunk/perl/webdb.pm and 
http://svn.swlabs.org/grubng/trunk/perl/put.cgi - upload server.
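
For anyone who has not opened robo.pl: the snippet below is NOT the Perl
code from the repository, only a minimal Python sketch of what a
robots.txt check generally looks like. The function name is_allowed, the
sample rules and the "grub-client" user agent string are all made up for
illustration.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # Parse rules from a string, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt content, used only for this example.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    for url in ("http://example.com/index.html",
                "http://example.com/private/a.html"):
        print(url, is_allowed(SAMPLE_ROBOTS, "grub-client", url))
```

A real checker would also have to download and cache each site's
robots.txt, which this sketch leaves out.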

AFAIK only one part of the system is missing: the scripts which upload
crawled data to the database. SOON (TM, copyrighted, patented ;) ) the
upload server will be replaced by a standalone server written in C#
(which is still largely untested and under heavy development):

http://svn.swlabs.org/grubng/trunk/csharp/grub-upload/

About crawled data:

Recently crawled .arc files are available at:

http://soap.grub.org/arcs/

Part of the database is available at:

http://search.isc.org/download/

But if you are interested in the database and index, IMO you should check
the Internet Open Index project (all data from Grub goes there):

http://search.isc.org/

Bartek

-- 
Grub Next Generation: http://grub.org
Mailing List: grub-dev at wikia.com
IRC: #wikia-search at irc.freenode.net
Jabber: thindil at jabberpl.org

