Medical vocabulary mining using distributional semantics on Japanese patient blogs
2014 (English)
In: SMBM 2014 - Proceedings of the 6th International Symposium on Semantic Mining in Biomedicine, University of Aveiro, 2014, p. 57-62
Conference paper, Published paper (Refereed)
Abstract [en]
Random indexing has previously been used successfully for medical vocabulary expansion in Germanic languages. In this study, we used this approach to extract medical terms from a Japanese patient blog corpus. The corpus was segmented into semantic units by a semantic role labeller, and different pre-processing and parameter settings were then evaluated. The evaluation showed that settings similar to those used for previously explored Germanic languages are also suitable for Japanese, and that distributional semantics is as useful for semi-automatic expansion of Japanese medical vocabularies as it is for medical vocabularies in Germanic languages.
Place, publisher, year, edition, pages
University of Aveiro, 2014. p. 57-62
Keywords [en]
Blogs, Data mining, Expansion, Distributional semantics, Parameter setting, Pre-processing, Random indexing, Semantic roles, Semantic units, Semi-automatics, Vocabulary expansions, Semantics
National Category
Information Systems
Identifiers
URN: urn:nbn:se:kth:diva-302250
Scopus ID: 2-s2.0-85072894192
OAI: oai:DiVA.org:kth-302250
DiVA, id: diva2:1595673
Conference
6th International Symposium on Semantic Mining in Biomedicine, SMBM 2014, 6 October 2014 through 7 October 2014
Note
QC 20210920
2021-09-20, 2021-09-20, 2022-06-25
Bibliographically approved