Project Title and Abstract
![]() |
Project Title
Combinatorial and Relational Network as Toolkit for Dutch Language Technology. Acronym
Cornetto |
Abstract
Cornetto will build a lexical semantic database for Dutch, covering 40K entries, including the most generic and central part of the language. The database will go beyond the structure and content of Wordnet and FrameNet. It will contain both vertical and horizontal semantic relations and combinatorial lexical constraints such as multiword expressions, idioms and collocations on the one hand, and lexical functions and frames on the other. The concepts will be aligned with the English Wordnet so that ontologies and domain labels can be imported. The semantic layer will be validated with a formal ontology, to make it usable in Semantic Web environments.
In addition, Cornetto will develop a toolkit for the acquisition of new concepts and relations and the tuning and extraction of a domain specific sub-lexicon from a compiled corpus. Such a sub-lexicon will be extracted for the domain of financial law. The lexical database will be evaluated by integration in IR and QA applications and the sub-lexicon will be evaluated by a user-group of language technology companies. The Cornetto goals fit the resources priority for Electronic lexicons and the research priority for Semantic analysis. In the area of applications it is related to:
- Monolingual and multilingual Information extraction
- Semantic web
- Dialogue and QA solutions
- Automatic summarization and text generation applications
- Machine translation
- Educational systems
Project Proposal
News
Workshop slides available online
Thank you all for attending the Cornetto workshop and making it a success! All workshop presentations can now be downloaded from our Project Meetings page.
Cornetto Demo
Try the Cornetto client yourself and explore the content of the Cornetto database with our online Demo! [Read more]
Sponsor
Nederlandse Taalunie
Last update: 22 October, 2008, p.vossen(at)let.vu.nl

