NLI corpora (Stehwien & Padó 2015)
- Type: Corpus
- Author: Sebastian Padó, Sabrina Stehwien
- Description: This page contains the data for the paper "Generalization in Native Language Identification -- Learners versus Scientists" (Stehwien & Padó, CLiC 2015).
- Downloads:
  - nli-corpus_documentation.txt: README
  - ACL-NLI.tgz: ACL NLI corpus as used (preprocessed files, 3 MB)
  - acl_ids.txt: ACL NLI corpus as used (document IDs)
  - icle_ids.txt: ICLE corpus as used (document IDs only, for copyright reasons)
  - lang8_ids.txt: Lang8 corpus as used (document IDs only, for copyright reasons)
  - Lang8-Scripts.tgz: Lang8 scraper to recreate the Lang8 corpus
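
As a rough starting point for working with the downloads, the sketch below unpacks one of the `.tgz` archives and reads one of the ID lists using only the Python standard library. It is not part of the distributed material. The helper names `read_ids` and `extract_archive` are hypothetical, and the code assumes that each ID file holds one document ID per line, which the README (nli-corpus_documentation.txt) should confirm or correct.

```python
import tarfile
from pathlib import Path


def read_ids(path):
    """Return the document IDs in an ID file.

    Assumption: one ID per line; blank lines are skipped.
    Check the README for the actual file layout.
    """
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def extract_archive(archive_path, target_dir):
    """Unpack a gzipped tar archive into target_dir.

    Returns the member names so the caller can inspect what was unpacked.
    """
    Path(target_dir).mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as archive:
        names = archive.getnames()
        archive.extractall(target_dir)
    return names
```

With the files downloaded to the working directory, `extract_archive("ACL-NLI.tgz", "ACL-NLI")` would unpack the preprocessed ACL files and `read_ids("acl_ids.txt")` would return the list of document IDs, provided the one-ID-per-line assumption holds.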
Prof. Dr. Sebastian Padó
Chair of Theoretical Computational Linguistics, Managing Director of the IMS