Python 3 for Linguists at the LSA Summer Institute 2013 Course Material

The course material for the LSA Summer Institute 2013 course Python 3 for Linguists will be made available at:

Python for Linguists Wiki (LTL, EMU)
Python 3 for Linguists (Dropbox)

There is a (currently not so full) Github repository Py3L with the (future) source code.

We are using the Komodo Edit 8.x (the free editor) and Python 3.3 in the course. We will be able to help you installing the necessary software components.
Read More...

Moving projects and code to GitHub

I am moving code and project folders to GitHub. I don’t know, whether this is a good idea, it just turns out to be easier to use… Happy

This port includes the SNLTK code, all kinds of Python 3 projects, Java code, some of the C(++) code for FSTs and some NLP tasks, corpus and TEI XML utils. Some of that I limited to pull-only and push-access exclusively for collaborators. If you were involved in some of that, let me know, send me your GitHub-ID is and I can add you to the collaborators group of the particular repos.

In particular, my course material will be migrated to GitHub completely. For example, the course material for the LSA Summer Institute course in summer 2013 will be placed there:
Python 3 for Linguists

Read More...

Tokenization, frequency profiles and N-gram models in Python 3

This is a brief description about how to use the Python 3 scripts to generate N-gram models for word tokens and characters from text. I expect you to have a Python 3 interpreter installed on your system.

Read More...