WSU talk: info on corpora and tech that will be discussed

I’ll give a talk on corpora and relevant technologies at Wayne State University in Detroit on the 19th of October at 11 AM. Here are some links, papers and slides that might be interesting for colleagues and students to follow and post process:

Read More...

Text analyzed and parsed to TEI XML wrapper

I set up a simple testing page for a wrapper of raw text to TEI XML. It uses in this version just the Stanford CoreNLP tools to tokenize, recognize sentences, part of speech annotate and lemmatize the input. Just paste a paragraph of text in there. In the next version this will be expanded with NLP tools for a couple of more languages, as well as other analysis components and tools for English.

Read More...

Stanford-CoreNLP corenlp.sh script on Mac OS X Lion

To make the Stanford CoreNLP tools work on your Mac OS X 10.7.x (Lion) distribution with the included bash script do this...

Read More...