[Search-l] Parsed text samples
Linas Vepstas
linasvepstas at gmail.com
Thu Jul 17 01:06:16 UTC 2008
Hi,
A minor announcement: I've uploaded some pre-parsed
files onto
http://relex.swlabs.org/~linas/data/ and in particular, to
http://relex.swlabs.org/~linas/data/gutenberg/
and
http://relex.swlabs.org/~linas/data/voa/
Look for the *.xml.gz files.
These files contain text that has been parsed by the link-grammar
english-language parser, and marked up with dependency relations
relex. The file format is described at
http://opencog.org/wiki/RelEx_compact_output
(and Relex itself at http://opencog.org/wiki/RelEx)
These files are quite large and verbose.
You may ask yourself "what the heck is this stuff for?" -- and that's
a good question. Start up a new email thread, and I'll happily
brainstorm some of the practical and not-so-practical applications.
--linas
More information about the Search-l
mailing list