Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Automatic Keyword Extraction Using Domain Knowledge
SICS.ORCID iD: 0000-0003-4042-4919
Show others and affiliations
2008 (English)In: Computational Linguistics and Intelligent Text Processing, Berlin / Heidelberg: Springer , 2008, 1Chapter in book (Refereed)
Abstract [en]

Documents can be assigned keywords by frequency analysis of the terms found in the document text, which arguably is the primary source of knowledge about the document itself. By including a hierarchi- cally organised domain specific thesaurus as a second knowledge source the quality of such keywords was improved considerably, as measured by match to previously manually assigned keywords. In the presented ex- periment, the combination of the evidence from frequency analysis and the hierarchically organised thesaurus was done using inductive logic programming.

Place, publisher, year, edition, pages
Berlin / Heidelberg: Springer , 2008, 1.
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:kth:diva-221506ISBN: 978-3-540-41687-6 (print)OAI: oai:DiVA.org:kth-221506DiVA, id: diva2:1175284
Note

QC 20180123

Available from: 2016-10-31 Created: 2018-01-17 Last updated: 2018-01-23Bibliographically approved

Open Access in DiVA

No full text in DiVA

Search in DiVA

By author/editor
Karlgren, JussiBoström, Henrik
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 2 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf