Discovering hidden collocations in a bilingual Spanish–English dictionary
-
1
Universidade da Coruña
info
- Iztok Kosem (ed. lit.)
- Miloš Jakubíček (ed. lit.)
- Jelena Kallas (ed. lit.)
- Simon Krek (ed. lit.)
Publisher: Lexical Computing ; Trojina, Institute for Applied Slovene Studies
ISBN: 978-961-93594-3-3
Year of publication: 2015
Pages: 170-185
Congress: eLEX : Electronic lexicography in the 21st century (4. 2015. Herstmonceux)
Type: Conference paper
Abstract
This paper addresses the problem of how to exploit the collocational information included in an online Spanish–English dictionary. Even though collocations are not identified as such in this dictionary, abundant collocational information is used as a means of distinguishing senses. Given that this information is structured in XML markup, the conversion into a bilingual collocation database seems viable in order to obtain the germ of a first Spanish–English collocation dictionary. The concept of collocation used here comes from the Explanatory and Combinatorial Lexicology (Mel’čuk, 2012). In this framework, collocations are understood as recurrent phrases composed of two lexical units, one of which, the base, is selected according to its meaning, while the selection of the other, the collocate, is determined by the base. The methodology I propose consists of reorganizing the links between words in such a way that the bilingual collocational correspondence is included in the entry for the base. The lexical tool obtained as a result of this reorganization could be exploited for different applications in natural language processing, ranging from machine translation to computer assisted language learning systems.