Linguistics Computing Resources on the Internet
A topically organized list of resources on the Internet that pertain to linguistics computing.
General Information on Computing and Linguistics
- Using Computers in Linguistics: A Practical Guide, ed. by John M. Lawler and Helen Aristar Dry
- Notes for the computer assisted language worker, by Nick Thieberger
- Linguistic Data Consortium
- Linguistic Annotation describes tools and formats for creating and managing linguistic annotations.
- Linguistic Exploration describes resources for language documentation and linguistic exploration.
Commercial Research and Development Sites
- LinguistX Platform is a fast, comprehensive suite of multilingual text services.
- Lingsoft, linguistic software company in Helsinki, Finland
- Conexor's analysers for English, taggers and parsers
- NLP at Microsoft Research
Software Archives
- AI Repository at CMU
- IMMD8 Computational Linguistics Software Archive
- Natural Language Software Registry
- Software via LINGUIST
- UMichigan Linguistics Archive
- Language Software Helpdesk FAQ
Note: The Helpdesk has been suspended, but the FAQ still has useful advice.
- Linguistics directory at GARBO (Finland)
Software Tools
Fonts and Multilingual Resources
- Computers and Writing Systems
- SIL Fonts, Font Utilities, and Keyboard Utilities
- Charis, a serif, proportionally spaced font optimized for readability in long printed documents
- Doulos SIL, an Extended Latin, Cyrillic, and International Phonetic Alphabet (IPA) Unicode font for Windows and Macintosh
- Gentium, an Extended Latin, Greek, and IPA Unicode font for Windows and Macintosh
- Legacy (non-Unicode) IPA fonts
- SIL Encore IPA fonts for Windows and Macintosh
- IPAPhon for Windows and Macintosh
- PalPhon IPA font based on Palatino for Macintosh [Download 451K, 18-Feb-92]
- TechPhonetic IPA font based on GoudyOldStyle for Macintosh [Download 305K, 10-May-91]
- Links to IPA fonts via the IPA
- Fonts in CyberSpace, SIL's guide to finding fonts on the Internet
- Linguist's Software, fonts for the whole world
- Arboreal, a font for creating syntactic trees, Cascadilla Press. Arboreal is available for Macs and Windows
- LogicTimes, Times with Greek letters and logic symbols (Mac)
- MtScript, the Multext multilingual text editor (Sun Sparc)
- Circle Noetic Services, linguistic products (hyphenation, spell checker, word lists, etc.)
Data Management
- askSam, freeform information manager for MS-DOS and Windows.
- HyperCard for Macintosh
- Shoebox (Windows, Macintosh)
- Toolbox, a Unicode-compatible upgrade to Shoebox (Windows)
Speech Analysis and Phonetics
- Annotation Graph Toolkit, a suite of software components for building tools that annotate linguistic signals, i.e. time-series data (e.g., audio, video) documenting any kind of linguistic behavior. Intended audience: developers
- Praat: doing phonetics by computer
- Sable standard for speech synthesis markup
- Signalyze signal processing software for Macintosh
- SpeechLab facilities via the Max Planck Institute for Psycholinguistics
Phonology and Morphology
- Alvey Natural Language Tools
- Computational Morphology and Phonology (SIL)
- Phono version 4.1, a Windows software tool for developing and testing models of regular historical sound change
- Special Interest Group on Computational Phonology (SIGPHON)
- SIL software:
Grammar, Syntax, Semantics
- Alvey Natural Language Tools
- Apple Pie Parser, a bottom-up probabilistic chart parser
- Ergo Linguistic Technologies, English parsing software
- Gramglos, a glossary of generative grammar (MS-DOS)
Download file: gramglos.zip [129K, 11-Sep-94]
- Grammar and Trees, Macintosh HyperCard stack to draw trees of phrase structure grammars
Download file: grammarandtrees1.0.hqx [132K, 17-Jan-94]
- Grammar Laboratories for the Macintosh
Download files:
- CG Laboratory [475K, 24-Mar-95]
- PATR Laboratory [488K, 24-Mar-95]
- PSG Laboratory [469K, 17-Dec-94]
- DCG Laboratory [493K, 24-Mar-95]
- Link Grammar
- PC-PATR, a unification-based syntactic parser for MS-DOS, Windows, Macintosh, and UNIX
- System to write and test Unification-based Grammar
Download file: gfulab15.zip [131K, 16-Nov-93]
Lexical Tools
See also Lexical Resources on the Internet.
- ARIES Natural Language Tools, a lexical platform for the Spanish language
- Good Language Software
- Lexical FreeNet - finite relation expression network
- WordNet, an online lexical reference system
Text Analysis and Corpus Linguistics
See also Text Resources on the Internet.
- Concordance programs
- CONC, a concordance generator for Macintosh
- Concorder, a concordance package for the Macintosh
- Corpus Wizard, concordancer for Windows
- FreeText concordance program for Macintosh
Download file: free-text-103.hqx [235K]
- Oxford Concordance Program (OCP) and Micro-OCP
- MicroConcord for multilingual corpora (MS-DOS)
- MultiConcord: the Lingua Multilingual Parallel Concordancer for Windows
- WordCruncher information and WordCruncher QuickGuide
- Corpus Linguistics at University of Birmingham
- Emdros, a text database engine for analyzed or annotated text
- Hyperbase (in French)
- IT, Interlinear Text Processor for MS-DOS and Macintosh
- Multext, Multilingual Text Tools and Corpora
- Paai's text utilities for UNIX
- TACT, Text Analysis Computing Tools, a text-analysis and retrieval system for MS-DOS. See also TACT Overview.
- Text Analysis Tools and Techniques via Oxford's CTI Centre for Textual Studies
- TEXTPACK: a system for computer-aided content analysis
- Texts and Tools at LETRS
- WLIST, word list generator with statistics (MS-DOS)
Download file: wlist11.zip [28K, 30-Oct-93]
Also available here: wlist11.zip [29K]
- Word Count, Freq, Syllables, Readability
Download file: wc14.zip [12K, 17-Jan-94]
- Xerox POS tagger
Translation
- AltaVista Translation Service using Systran Translation Software
- Ergane, multilingual translation dictionary for Windows
- Natural Language Translation Specialist Group, British Computer Society
- Translation Experts Ltd., interactive translation software
Historical and Comparative Linguistics, Dialectology
- Algorithm for identifying related words across languages
Download file: cognate.zip [18K, 9-Dec-91]
- PHONO, tool for developing and testing models of regular sound change (MS-DOS)
Download file: phono.zip [110K, 7-May-96]
- Software tools for lexicostatistics
Download file: glotto02.zip [169K, 25-May-94]
- Wordsurv, analyzes and compares dialect word lists (Windows, Palm)
- A World of Words, Macintosh HyperCard stacks about history of Indo-European
Download file: aworldofwords.cpt.hqx [628K, 19-Jan-94]
Text Processing Languages and Utilities
- AWK, a pattern-scanning text processing language
- BiTrans for MS-DOS will translate a text written in any transliteration system (e.g. Chinese written in pinyin) into almost any other system (e.g. the Wade-Giles system), provided that it is given the necessary translation rules.
Download file: bitrans2.zip [70K, 24-Oct-95]
- Emacs, programmable text editor
- Icon, a high-level, general-purpose programming language with a large repertoire of features for processing data structures and character strings.
- SNOBOL4 and SPITBOL Information
- SPITBOL, for non-numeric computing
