kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Recommendation System for Product Test Failures Using BERT
KTH. Ericsson AB, Stockholm, Sweden.
Ericsson AB, Stockholm, Sweden.
Ericsson AB, Stockholm, Sweden.
Ericsson AB, Stockholm, Sweden.
Show others and affiliations
2023 (English)In: 15th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2023 as part of IC3K 2023 - Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, INSTICC , 2023, p. 206-213Conference paper, Published paper (Refereed)
Abstract [en]

Historical failure records can provide insights to investigate if a similar situation occurred during the troubleshooting process in software. However, in the era of information explosion, massive amounts of data make it unrealistic to rely solely on manual inspection of root causes, not to mention mapping similar records. With the ongoing development and breakthroughs of Natural Language Processing (NLP), we propose an end-to-end recommendation system that can instantly generate a list of similar records given a new raw failure record. The system consists of three stages: 1) general and tailored pre-processing of raw failure records; 2) information retrieval; 3) information re-ranking. In the process of model selection, we undertake a thorough exploration of both frequency-based models and language models. To mitigate issues stemming from imbalances in the available labeled data, we propose an updated Recall@K metric that utilizes an adaptive K. We also develop a multi-stage training pipeline to deal with limited labeled data and investigate how different strategies affect performance. Our comprehensive experiments demonstrate that our two-stage BERT model, fine-tuned on extra domain data, achieves the best score over the baseline models.

Place, publisher, year, edition, pages
INSTICC , 2023. p. 206-213
Keywords [en]
Information Retrieval, Language Models, Recommendation System
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-341693DOI: 10.5220/0012160800003598Scopus ID: 2-s2.0-85179760636OAI: oai:DiVA.org:kth-341693DiVA, id: diva2:1823050
Conference
15th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2023 as part of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2023, Hybrid, Rome, Italy, Nov 13 2023 - Nov 15 2023
Note

Part of ISBN 9789897586712

QC 20231229

Available from: 2023-12-29 Created: 2023-12-29 Last updated: 2023-12-29Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Sun, Xiaolong
By organisation
KTH
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 224 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf