Open this publication in new window or tab >>2021 (English)In: Proceedings of the Fifth Workshop on Universal Dependencies (UDW, SyntaxFest 2021), Association for Computational Linguistics (ACL) , 2021, p. 96-108Conference paper, Published paper (Refereed)
Abstract [en]
Many downstream applications are using dependency trees, and are thus relying on dependencyparsers producing correct, or at least consistent, output. However, dependency parsers are trainedusing machine learning, and are therefore susceptible to unwanted inconsistencies due to biasesin the training data. This paper explores the effects of such biases in four languages – English,Swedish, Russian, and Ukrainian – though an experiment where we study the effect of replacingnumerals in sentences. We show that such seemingly insignificant changes in the input can causelarge differences in the output, and suggest that data augmentation can remedy the problems.
Place, publisher, year, edition, pages
Association for Computational Linguistics (ACL), 2021
National Category
Natural Language Processing
Research subject
Computer Science
Identifiers
urn:nbn:se:kth:diva-326888 (URN)2-s2.0-85138675937 (Scopus ID)
Conference
UDW 2021 - 5th Workshop on Universal Dependencies, Proceedings - To be held as part of SyntaxFest 2021, Sofia, 21-25 March 2021
Funder
Vinnova, 2019-02997
Note
Part of proceedings ISBN 978-195591717-9
QC 20230515
2023-05-152023-05-152025-02-07Bibliographically approved