Scripts

A repository of NLP-related scripts that may be of interest to fellow researchers and developers.

The code is provided without warranty: we cannot guarantee that the scripts will function properly, and we cannot be held responsible if something goes wrong. We may not always be able to provide support, but in most cases the author(s) of a script will be happy to hear your comments, feedback, or questions.

Script | Version | Language | Short description
art.py | 3.0.2 | python | A script to test the significance of recall, precision and f-score differences between 2 machine learning systems with approximate randomization testing.
confusionmatrix.py | 2.2.0 | python | Obtain evaluation statistics from TiMBL, Maxent and SVMLight files
rtest.py | 1.1.1 | python | This script can be used to test the significance of a Pearson correlation coefficient and to test the difference between two correlation coefficients.
template | 1.0 | python | A template showing how scripts in the repository are documented.
textgain.py | 1.0 | python | Script to compute information gain (IG) and gain ratio (GR) for words in documents that belong to different classes.
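
To give a feel for the two measures that textgain.py reports, here is a minimal sketch of information gain (IG) and gain ratio (GR) for a single word. It is not the code of textgain.py: the function names, the input format (a list of token lists plus a parallel list of class labels) and the choice to treat each word as a binary present/absent feature are all assumptions made for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def information_gain(docs, labels, word):
    """Return (IG, GR) for the binary feature 'word occurs in the document'.

    docs   -- list of token lists (hypothetical input format)
    labels -- list of class labels, one per document
    """
    present = [lab for doc, lab in zip(docs, labels) if word in doc]
    absent = [lab for doc, lab in zip(docs, labels) if word not in doc]
    total = len(labels)

    # Expected class entropy after splitting on presence/absence of the word.
    remainder = sum(
        (len(part) / total) * entropy(part)
        for part in (present, absent)
        if part
    )
    ig = entropy(labels) - remainder

    # Split information: entropy of the feature's own value distribution.
    split_info = entropy(["present"] * len(present) + ["absent"] * len(absent))
    gr = ig / split_info if split_info > 0 else 0.0
    return ig, gr


# Toy data, invented for this example.
docs = [["good", "film"], ["good", "plot"], ["bad", "film"], ["bad", "plot"]]
labels = ["pos", "pos", "neg", "neg"]

print(information_gain(docs, labels, "good"))  # (1.0, 1.0): separates the classes perfectly
print(information_gain(docs, labels, "film"))  # (0.0, 0.0): tells us nothing about the class
```

Gain ratio divides IG by the entropy of the feature itself, which penalizes features that split the data into many small groups; with a binary feature, as assumed here, the effect is mild.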