Multilingual semantic resources and parallel corpora in the biomedical domain: The CLEF-ER challenge

Rebholz-Schuhmann, Dietrich; Clematide, Simon; Rinaldi, Fabio; Kafkas, Senay; Van Mulligen, Erik M.; Bui, Chinh; Hellrich, Johannes; Lewin, Ian; Milward, David; Poprat, Michael; Jimeno-Yepes, Antonio; Hahn, Udo; Kors, Jan

D. Rebholz-Schuhmann (Dietrich), S. Clematide (Simon), F. Rinaldi (Fabio), S. Kafkas (Senay), E.M. Van Mulligen (Erik M.), C. Bui (Chinh), J. Hellrich (Johannes), I. Lewin (Ian), D. Milward (David), M. Poprat (Michael), et al. J.A. Kors (Jan)

2013

Multilingual semantic resources and parallel corpora in the biomedical domain: The CLEF-ER challenge

Presented at the 2013 Cross Language Evaluation Forum Conference, CLEF 2013 (September 2013), Valencia

Multilingual terminological resources can be drawn from parallel corpora in the languages of interest, possibly exploiting machine translation solutions for term identification. This main objective of the CLEF-ER challenge involves parallel corpora in English and other languages. The challenge organisers have gathered and normalized documents from the biomedical domain: titles from scientific articles, drug labels from the European Medicines Agency, and patent texts from the European Patent Office. The parallel units have been identified, marked-up and formatted for future use. The three different corpora show comparable sizes. In preparation of the CLEF-ER challenge, the documents have been annotated with terminologies in English and non-English languages (de, fr, es, and nl) and the pre-existing terminological resource has been optimized for the entity recognition task in CLEF-ER. Finally a silver standard corpus for entity annotations and their identifiers has been produced on the English documents for the evaluation of challenge contributions.

Additional Metadata
Persistent URL	hdl.handle.net/1765/90842
Conference	2013 Cross Language Evaluation Forum Conference, CLEF 2013
Organisation	Department of Medical Informatics
Citation APA APA Style APA-ALL Style AAA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Rebholz-Schuhmann, D., Clematide, S., Rinaldi, F., Kafkas, S., Van Mulligen, E. M., Bui, C., Hellrich, J., Lewin, I., Milward, D., Poprat, M., Jimeno-Yepes, A., Hahn, U.& Kors, J. (2013, January). Multilingual semantic resources and parallel corpora in the biomedical domain: The CLEF-ER challenge. 2013 Cross Language Evaluation Forum Conference, CLEF 2013, Valencia, September 2013.http://hdl.handle.net/1765/90842

Multilingual semantic resources and parallel corpora in the biomedical domain: The CLEF-ER challenge

Publication

Publication

About

Multilingual semantic resources and parallel corpora in the biomedical domain: The CLEF-ER challenge

Publication

Publication

Workflow

Workflow

Add Content