[Search-l] Parsed text samples

Linas Vepstas linasvepstas at gmail.com
Thu Jul 17 01:06:16 UTC 2008


Hi,

A minor announcement: I've uploaded some pre-parsed
files onto

http://relex.swlabs.org/~linas/data/ and in particular, to
http://relex.swlabs.org/~linas/data/gutenberg/
and
http://relex.swlabs.org/~linas/data/voa/

Look for the *.xml.gz files.

These files contain text that has been parsed by the link-grammar
english-language parser, and marked up with dependency relations
relex. The file format is described at

http://opencog.org/wiki/RelEx_compact_output

(and Relex itself at http://opencog.org/wiki/RelEx)

These files are quite large and verbose.

You may ask yourself "what the heck is this stuff for?" -- and that's
a good question.  Start up a new email thread, and I'll happily
brainstorm some of the practical and not-so-practical applications.

--linas



More information about the Search-l mailing list