Adelheid Tagger-Lemmatizer |
|
Enriching Data |
part of speech tagging, lemmatisation, tokenisation |
web application |
Linguistics |
general linguistics, Syntax, historical linguistics |
Language independent |
text/plain, text/xml |
released |
CLARIN-NL |
MPI for Psycholinguistics |
Frog |
v0.15 |
Enriching Data |
dependency parsing, shallow parsing, lemmatisation, morphological analysis, named entity recognition, part of speech tagging, sentence splitting, tokenisation |
web application |
Linguistics |
general linguistics, Syntax |
Dutch |
application/pdf, application/msword, text/folia+xml, text/plain |
published |
CLARIN-NL, CLARIAH-CORE |
none yet |
OpenConvert |
|
Enriching Data |
corpus processing, format conversion, text conversion, tokenisation, part of speech tagging |
local desktop |
Linguistics, Religion Studies, Communication and Media Studies, Cultural Sciences, History, Literary Studies, Philosophy, Political Studies |
|
Language independent |
text/plain, application/msword, text/html, text/xml, application/epub+zip, application/zip |
released |
CLARIN-NL, CLARIAH-CORE |
Dutch Language Institute |
TTNWW |
|
Enriching Data |
grammatical relation assignment, coreference resolution, corpus processing, dependency parsing, lemmatisation, multiword unit identification, named entity recognition, orthographic normalisation, part of speech tagging, semantic role labeling, chunking, parsing, speech recognition, speech transcription, tokenisation, up/down sampling |
web application |
Linguistics, Communication and Media Studies, History, Oral History |
discourse analysis, Orthography, Semantics, Syntax |
Dutch |
text/plain, audio/wav |
withdrawn |
CLARIN-NL |
Meertens/HuC |
PICCL |
v0.6.4 |
Enriching Data |
optical character recognition, orthographic normalisation, sentence splitting, tokenisation, dependency parsing, shallow parsing, lemmatisation, morphological analysis, named entity recognition, part of speech tagging |
local desktop |
Linguistics, Philosophy, Literary Studies, Religion Studies, History |
general linguistics, Orthography, Morphology, Syntax |
Dutch, Swedish, Russian, Spanish, Portuguese, English, German, French, Italian, Finnish, Modern Greek, Classical Greek, Icelandic, German (Fraktur), Latin, Romanian |
application/pdf, image/tiff, text/plain, text/folia+xml, image/vnd.djvu |
published |
CLARIN-NL, CLARIAH-CORE |
none yet |
Ucto Engine |
v0.13 |
Enriching Data |
sentence splitting, tokenisation |
local desktop |
Linguistics |
general linguistics, Syntax |
Language independent |
application/pdf, application/msword, text/folia+xml, text/plain |
published |
CLARIN-NL, CLARIAH-CORE |
none yet |
Ucto |
v0.13 |
Enriching Data |
sentence splitting, tokenisation |
local desktop |
Linguistics |
general linguistics, Syntax |
Dutch, Swedish, Russian, Spanish, Portuguese, English, German, French, Italian |
application/pdf, application/msword, text/folia+xml, text/plain |
published |
CLARIN-NL, CLARIAH-CORE |
none yet |