[Grub-dev] Wikis url extraction
Yousef Ourabi
yourabi at zero-analog.com
Wed Jan 16 00:20:05 UTC 2008
I don't understand what you are asking.
You used grubng and got some wikipedia URLs?
Wouldn't this be already rendered HTML so the wiki syntax is irrelevant.
Sorry I just don't understand?
thanks.
Yousef
On 1/15/08, Balinny <balinny at gmail.com> wrote:
>
> From some urls i got, you seem to have been extracting URLs from wiki
> sources.
> Please note that any trailing ] should be removed and you should skip
> any urls
> containing braces {{ won't be valid and thus doesn't need to be crawled
> (but the
> urls generated via that template do, so the best way is using
> externallinks table).
>
>
> BTW: Which page parser is used?
>
> _______________________________________________
> Grub-dev mailing list
> Grub-dev at wikia.com
> http://lists.wikia.com/mailman/listinfo/grub-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.wikia.com/pipermail/grub-dev/attachments/20080115/dfd39b0f/attachment.html
More information about the Grub-dev
mailing list