[Grub-dev] Taking on Grub Java
Bartek Jasicki
thindil2 at gmail.com
Thu Jan 8 09:41:10 UTC 2009
On 2009-01-07, at. 12:59:55
"Yousef Ourabi" <yourabi at zero-analog.com> wrote:
> Feel free to take the ball on run with it.
>
> I'm still waiting for the server and crawl data to be open-sourced
> since that was originally the supposed goal of this project.
>
> I've been waiting for a year and still waiting.
>
> -Yousef
>
Little offtopic:
Current Grub servers are nothing more than few Perl scripts and Apache
server. Probably all are available at SVN repository for some time:
http://svn.swlabs.org/grubng/trunk/perl/robo.pl - checking robots.txt
files
http://svn.swlabs.org/grubng/trunk/perl/workunit.pl - generating new
workunits
http://svn.swlabs.org/grubng/trunk/perl/dispatch.cgi - dispatch server
http://svn.swlabs.org/grubng/trunk/perl/webdb.pm and
http://svn.swlabs.org/grubng/trunk/perl/put.cgi - upload server.
AFAIK there is missing only one part of system - scripts which upload
crawled data to database. SOON (TM, copyrighted, patented ;) ) upload
server been replaced by standalone server written in C# (which is
still little not tested and under heavy development):
http://svn.swlabs.org/grubng/trunk/csharp/grub-upload/
About crawled data:
Recently crawled .arc files available at
http://soap.grub.org/arcs/
Part of database are available at:
http://search.isc.org/download/
But if you interested in database and index IMO you should check
Internet Open Index project (there go all data from Grub):
http://search.isc.org/
Bartek
--
Grub Next Generation: http://grub.org
Mailing List: grub-dev at wikia.com
IRC: #wikia-search at irc.freenode.net
Jabber: thindil at jabberpl.org
More information about the Grub-dev
mailing list