Adelheid Tagger-Lemmatizer |
|
Enriching Data |
part of speech tagging, lemmatisation, tokenisation |
web application |
Linguistics |
general linguistics, Syntax, historical linguistics |
Language independent |
text/plain, text/xml |
released |
CLARIN-NL |
MPI for Psycholinguistics |
Frog |
v0.15 |
Enriching Data |
dependency parsing, shallow parsing, lemmatisation, morphological analysis, named entity recognition, part of speech tagging, sentence splitting, tokenisation |
web application |
Linguistics |
general linguistics, Syntax |
Dutch |
application/pdf, application/msword, text/folia+xml, text/plain |
published |
CLARIN-NL, CLARIAH-CORE |
none yet |
Nederlab |
2 |
Browsing and Searching, Data analysis |
coreference resolution, corpus searching, corpus processing, corpus workbench, lemmatisation, part of speech tagging, dependency parsing, tokenisation |
web application |
Linguistics, History, Cultural Sciences |
historical linguistics |
Dutch |
|
published |
CLARIN-NL, CLARIAH-CORE |
Meertens/HuC |
OpenConvert |
|
Enriching Data |
corpus processing, format conversion, text conversion, tokenisation, part of speech tagging |
local desktop |
Linguistics, Religion Studies, Communication and Media Studies, Cultural Sciences, History, Literary Studies, Philosophy, Political Studies |
|
Language independent |
text/plain, application/msword, text/html, text/xml, application/epub+zip, application/zip |
released |
CLARIN-NL, CLARIAH-CORE |
Dutch Language Institute |
TTNWW |
|
Enriching Data |
grammatical relation assignment, coreference resolution, corpus processing, dependency parsing, lemmatisation, multiword unit identification, named entity recognition, orthographic normalisation, part of speech tagging, semantic role labeling, chunking, parsing, speech recognition, speech transcription, tokenisation, up/down sampling |
web application |
Linguistics, Communication and Media Studies, History, Oral History |
discourse analysis, Orthography, Semantics, Syntax |
Dutch |
text/plain, audio/wav |
withdrawn |
CLARIN-NL |
Meertens/HuC |
Alpino (CLST web service and application) |
unknown |
Enriching Data |
parsing, dependency parsing, lemmatisation, morphological analysis, named entity recognition, part of speech tagging, sentence splitting, tokenisation |
web application |
Linguistics |
general linguistics, Syntax |
Dutch |
text/plain |
published |
CLARIAH-CORE |
none yet |
Alpino |
unknown |
Enriching Data |
parsing, dependency parsing, lemmatisation, morphological analysis, named entity recognition, part of speech tagging, sentence splitting, tokenisation |
web application |
Linguistics |
general linguistics, Syntax |
Dutch |
text/plain |
published |
CLARIAH-CORE |
none yet |
PICCL |
v0.6.4 |
Enriching Data |
optical character recognition, orthographic normalisation, sentence splitting, tokenisation, dependency parsing, shallow parsing, lemmatisation, morphological analysis, named entity recognition, part of speech tagging |
local desktop |
Linguistics, Philosophy, Literary Studies, Religion Studies, History |
general linguistics, Orthography, Morphology, Syntax |
Dutch, Swedish, Russian, Spanish, Portuguese, English, German, French, Italian, Finnish, Modern Greek, Classical Greek, Icelandic, German (Fraktur), Latin, Romanian |
application/pdf, image/tiff, text/plain, text/folia+xml, image/vnd.djvu |
published |
CLARIN-NL, CLARIAH-CORE |
none yet |
Ucto Engine |
v0.13 |
Enriching Data |
sentence splitting, tokenisation |
local desktop |
Linguistics |
general linguistics, Syntax |
Language independent |
application/pdf, application/msword, text/folia+xml, text/plain |
published |
CLARIN-NL, CLARIAH-CORE |
none yet |
Ucto |
v0.13 |
Enriching Data |
sentence splitting, tokenisation |
local desktop |
Linguistics |
general linguistics, Syntax |
Dutch, Swedish, Russian, Spanish, Portuguese, English, German, French, Italian |
application/pdf, application/msword, text/folia+xml, text/plain |
published |
CLARIN-NL, CLARIAH-CORE |
none yet |