Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
What types of translations hide in wikipedia?
Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 060, Japan..
KTH, School of Computer Science and Communication (CSC).
Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 060, Japan..
2008 (English)In: LARGE-SCALE KNOWLEDGE RESOURCES: CONSTRUCTION AND APPLICATION / [ed] Tokunaga, T Ortega, A, SPRINGER-VERLAG BERLIN , 2008, p. 59-+Conference paper, Published paper (Refereed)
Abstract [en]

We extend an automatically generated bilingual Japanese-Swedish dictionary with new translations, automatically discovered from the multi-lingual online encyclopedia Wikipedia. Over 50,000 translations, most of which are not present in the original dictionary, are generated, with very high translation quality. We analyze what types of translations can be generated by this simple method. The majority of the words are proper nouns, and other types of (usually) uninteresting translations are also generated. Not counting the less interesting words, about 15,000 new translations are still found. Checking against logs of search queries from the old dictionary shows that the new translations would significantly reduce the number of searches with no matching translation.

Place, publisher, year, edition, pages
SPRINGER-VERLAG BERLIN , 2008. p. 59-+
Series
Lecture Notes in Artificial Intelligence, ISSN 0302-9743 ; 4938
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:kth:diva-242922DOI: 10.1007/978-3-540-78159-2_6ISI: 000253958000006Scopus ID: 2-s2.0-40549096702ISBN: 978-3-540-78158-5 (print)OAI: oai:DiVA.org:kth-242922DiVA, id: diva2:1287896
Conference
3rd International Conference on Large-Scale Knowledge Resources, MAR 03-05, 2008, Tokyo Inst Technol, Tokyo, JAPAN
Note

QC 20190212

Available from: 2019-02-12 Created: 2019-02-12 Last updated: 2019-02-12Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Sjobergh, Olof
By organisation
School of Computer Science and Communication (CSC)
Electrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf