Corpus
LibreOffice and TEI Stylesheets for file conversion
10/17/12 23:47 Filed in: Corpus Linguistics
If you want to batch convert a lot of files to some more accessible format (for example ODT or DOCX to HTML or TEI XML), you can use first of all LibreOffice.
Here is a brief introduction how to batch convert files to some LibreOffice output format or TEI XML.
Read More...
Here is a brief introduction how to batch convert files to some LibreOffice output format or TEI XML.
Read More...
The LINGUIST List corpus
04/03/12 06:06 Filed in: Corpus Linguistics
The LINGUIST List corpora can be found here:
http://ltl.emich.edu/llc/
You can find in there the LINGUIST List mailings converted to TEI P5 XML. The linguistically annotated version will be available in an extended interface.
See the previous blog for instructions on how to use Philologic…
Read More...
http://ltl.emich.edu/llc/
You can find in there the LINGUIST List mailings converted to TEI P5 XML. The linguistically annotated version will be available in an extended interface.
See the previous blog for instructions on how to use Philologic…
Read More...
Working with the Philologic interface on the LTL corpora
03/26/12 21:21 Filed in: Corpus Linguistics
Here is a brief first introduction to the Philologic interface for the LTL corpora and the LINGUIST List corpus;
Read More...
Read More...
The LTL corpus
03/08/12 12:39 Filed in: Corpus Linguistics
The first version of the small LTL corpus with a couple of million tokens is online. It contains TEI P5 XML encoded books from the public domain. See here…
Read More...
Read More...
Using Antconc: Notes 1
02/02/12 20:44 Filed in: Info