Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
KCL-Health-NLP@CLEF eHealth 2018 Task 1: ICD-10 coding of French and Italian death certificates with character-level convolutional neural networks
Show others and affiliations
2018 (English)In: CEUR Workshop Proceedings, CEUR-WS , 2018, Vol. 2125Conference paper, Published paper (Refereed)
Abstract [en]

In this paper we describe the participation of the KCL-Health-NLP team in the CLEF eHealth 2018 lab, specifically Task 1: Multilingual Information Extraction-ICD10 coding. The task involves the automatic coding of causes of death in death certificates in French, Italian and Hungarian according to the ICD-10 taxonomy. Choosing to work on the two Romance languages, we treated the task as a sequence-to-sequence prediction problem. Our system has an encoder-decoder architecture, with convolutional neural networks based on character em-beddings as encoders and recurrent neural network decoders. Our hypothesis was that a character-level representation would allow our model to generalise across two genealogically related languages. Results obtained by pre-training our Italian model on the French data set confirmed this intuition. We also explored the impact of character-level features extracted from dictionary-matched ICD codes. We obtained F-measures of 0.72/0.64 and 0.78 on the French aligned/raw and Italian raw internal test data, respectively. On the blind test set released by the task organisers, our top results were 0.65/0.52 and 0.69 F-measure, respectively.

Place, publisher, year, edition, pages
CEUR-WS , 2018. Vol. 2125
Series
CEUR Workshop Proceedings, ISSN 1613-0073
Keywords [en]
Convolutional neural networks, Encoder-decoder architecture, Recurrent neural networks
National Category
Other Computer and Information Science
Identifiers
URN: urn:nbn:se:kth:diva-234052Scopus ID: 2-s2.0-85051060301OAI: oai:DiVA.org:kth-234052DiVA, id: diva2:1245933
Conference
19th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2018, Avignon, France, 10 September 2018 through 14 September 2018
Note

QC 20180906

Available from: 2018-09-06 Created: 2018-09-06 Last updated: 2019-12-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Authority records BETA

Velupillai, Sumithra

Search in DiVA

By author/editor
Velupillai, Sumithra
By organisation
Theoretical Computer Science, TCS
Other Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 139 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf