[Grub-dev] Wikis url extraction

Yousef Ourabi yourabi at zero-analog.com
Wed Jan 16 00:20:05 UTC 2008


I don't understand what you are asking.

You used grubng and got some Wikipedia URLs?

Wouldn't this already be rendered HTML, so the wiki syntax is irrelevant?

Sorry, I just don't understand.

Thanks,
Yousef

On 1/15/08, Balinny <balinny at gmail.com> wrote:
>
> From some URLs I got, you seem to have been extracting URLs from wiki
> sources. Please note that any trailing ] should be removed, and you
> should skip any URLs containing braces: a URL with {{ won't be valid and
> thus doesn't need to be crawled (but the URLs generated via that template
> do, so the best way is to use the externallinks table).
>
>
> BTW: Which page parser is used?
>
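
For reference, the cleanup rule described in the quoted message (strip any trailing ], skip URLs that contain {{) could be sketched in Python roughly as follows. This is only an illustration under assumptions: the names clean_wiki_url and clean_wiki_urls are hypothetical and are not part of grub, and the sketch does not replace the externallinks table approach suggested above.

```python
from typing import Iterable, List, Optional


def clean_wiki_url(url: str) -> Optional[str]:
    """Return a crawlable URL, or None if the URL should be skipped.

    Hypothetical helper (not part of grub): it applies only the two
    rules mentioned in the thread.
    """
    # Unexpanded template syntax means the URL is not valid as written.
    if "{{" in url:
        return None
    # Drop any trailing ']' left over from [url label] wiki link syntax.
    return url.rstrip("]")


def clean_wiki_urls(urls: Iterable[str]) -> List[str]:
    """Clean a batch of URLs, dropping skipped and empty results."""
    cleaned = (clean_wiki_url(u) for u in urls)
    return [u for u in cleaned if u]
```

For example, clean_wiki_url("http://example.org/page]") gives "http://example.org/page", while a URL such as "http://example.org/{{PAGENAME}}" is skipped.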