Cornetto: Combinatorial and Relational Network as Toolkit for Dutch Language Technology
SummaryThe data used in the Cornetto lexical resource (92K lemma's and almost 120K word meanings) are not available anymore. The best alternative data set is the Open Dutch WordNet. This data set consists of 116.992 synsets, form which 95.356 from WordNet 3.0.
Background
Open Source Dutch WordNet is a Dutch lexical semantic database. It was created by removing the proprietary content from Cornetto (http://dev.clarin.nl/node/1944) , and by using open source resources to replace this proprietary content. Open Source Dutch WordNet contains 116,992 synsets, of which 95,356 originate from WordNet 3.0 and 21,636 synsets are new synsets. The number of English synsets without Dutch synonyms is 60,743, which means that 34,613 WordNet 3.0 synsets have been filled with at least one Dutch synonym.
This project has been co-funded by the Nederlandse Taalunie (http://taalunie.org/).The Nederlandse TaalUnie and the Free University of Amsterdam share the ownership of Open Source Dutch WordNet.
- Project leader: Prof. dr. Piek Vossen (VU University Amsterdam)
- CLARIN center: Institute for Dutch Lexicology
- Help contact :
- Web-sites: http://wordpress.let.vupr.nl/odwn/
- User scenario's (use-cases): http://wordpress.let.vupr.nl/odwn/demo/
- Manual:
- Tool/Service link: http://wordpress.let.vupr.nl/odwn/
- Data link (VLO): http://catalog.clarin.eu/vlo/record?fq=collection:TST-Centrale&docId=htt...
- Data link: (Open Dutch Wordnet) http://wordpress.let.vupr.nl/odwn/data/
- Publications:
- Vossen, P., I. Maks, R. Seegers and H. van der Vliet (2008). Integrating Lexical Units, Synsets, and Ontology in the Cornetto Database. In Proceedings of LREC-2008, Marrakech, Morocco.
- Vossen, P., I. maks, R. Segers, H. van der Vliet, M.F. Moens, K. Hofmann, E. Tjong Kim Sang, M. de Rijke (2013), Corntto: a lexical semantic database for Dutch, Chapter in: P. Spyns & J. Odijk (eds): Essential Speech and Language Technology for Dutch, Results by the STEVIN-programme, Publ. Springer series Theory and Applications of Natural Language Processing, ISBN 978-3-642-30909-0.