Linguistics tools




This page lists a variety of tools I have found to be useful and interesting in my own research and teaching.



Tools for text analysis

Name of tool | Developer | Availability
Concordancer for Windows 3.0 (aka WConcord) | Linglit, TU Darmstadt | free
Wordsmith Tools | Mike Scott | commercial
IMS Corpus Workbench | IMS Stuttgart | free
Simple Concordance Program | Alan Reed | free
TextSTAT - Simple Text Analysis Tool | Matthias Hüning, FU Berlin | free
AntConc | Laurence Anthony, Waseda University, Japan | free

Tools for linguistic annotation

Name of tool | Developer | Availability
Trigrams'n'Tags (T'n'T) | Thorsten Brants | non-free
Stanford Part-of-Speech-Tagger | Stanford NLP Group | free
Decision Tree Tagger | IMS Stuttgart | free
Brill tagger | Eric Brill | free
Stanford Parser | Stanford NLP Group | free
Shalmaneser: A Shallow Semantic Parser | Erk, Pado, Uni Saarbrücken | free
A General Tool for Anaphora Resolution - GuiTAR (v1.1) | Mijail Alexandrov Kabadjov | free
EXMARaLDA (Extensible Markup Language for Discourse Annotation) | SFB 538 Mehrsprachigkeit, Universität Hamburg |
MMAX2 | EML (European Medialab), Heidelberg | Open Source project
Systemic Coder | Mick O'Donnell, Wagsoft.com | free
RST Tool | Mick O'Donnell, Wagsoft.com | free
UAM Corpus Tool | Mick O'Donnell, Wagsoft.com | free
Dexter Coder | Gregory Garretson | free

Linguistic fonts

Name of tool | Developer | Availability
The IPA-SAM phonetic fonts | Department of Phonetics and Linguistics, University College London | TrueType fonts (not Unicode) suitable for Windows and MacOS that include all current IPA symbols
Doulos SIL Font (Unicode) | SIL (Summer Institute of Linguistics) | Open Font License (OFL)
Charis SIL Font (Unicode) | SIL (Summer Institute of Linguistics) | Open Font License (OFL)

Linguistic data

Name of tool | Developer | Availability
WordNet - a lexical database for the English language | Cognitive Science Lab, Princeton | free
FrameNet | ICSI, Berkeley, Calif. |

Linguistics Toolkits and Development Frameworks

Name of tool | Description | Availability
Natural Language ToolKit (NLTK) | Python-based Natural Language Processing Toolkit | open source
Stanford NLP Tools | Stanford NLP Group | free
GATE, A General Architecture for Text Engineering | ... the Eclipse of Natural Language Engineering | open source
Protégé | a free, open source ontology editor and knowledge-base framework | open source

Corpus Statistics

Name of tool | Description | Availability
The R Project for Statistical Computing | well, the title is slightly suggestive ... | open source

Other useful stuff for linguists

Name of tool | Description | Availability
UltraEdit | Versatile editor with syntax highlighting | commercial, but worth it
Notepad++ | Versatile editor with syntax highlighting | free
Emacs | Eight megabytes and constantly swapping ;-) | GPL
XMLSpy | XML editor | commercial
Treebeard | An Open Source XSLT IDE | open source
Phase 5 | Great HTML editor | free
Eclipse | Software development framework | open source
Perl | Dynamic programming language originally developed by Larry Wall | open source
Python | Dynamic, OO programming language | open source
selfhtml | HTML, XML, etc. tutorial and reference |