Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Information Retrieval and Visualization for the Historical Domain
Saarland University, Saarbrücken, Germany.
2011 (English)In: Language Technology for Cultural Heritage: Selected Papers from the LaTeCH Workshop Series / [ed] Sporleder, Caroline; van den Bosch, Antal; Zervanou, Kalliopi, Berlin, Heidelberg: Springer Berlin/Heidelberg, 2011, 197-212 p.Chapter in book (Other academic)
Abstract [en]

Working with large and unstructured collections of historical documents is a challenging task for historians. Despite the recent growth in the volume of digitized historical data, available collections are rarely accompanied by computational tools that significantly facilitate this task.We address this shortage by proposing a visualization method for document collections that focuses on graphical representation of similarities between documents. The strength of the similarities is measured according to the overlap of historically significant information such as named entities,or the overlap of general vocabulary. Similarity strengths are then encoded in the edges of a graph.The graph provides visual structure, revealing interpretable clusters and links between documents that are otherwise difficult to establish. We implement the idea of similarity graphs within an information retrieval system supported by an interactive graphical user interface. The system allows querying the database, visualizing the results and browsing the collection in an effective and intuitive way. Our approach can be easy adapted and extended to collections of documents in other domains.

Place, publisher, year, edition, pages
Berlin, Heidelberg: Springer Berlin/Heidelberg, 2011. 197-212 p.
Keyword [en]
historical collections, information retrieval, graph visualization clustering, recommender system
National Category
Language Technology (Computational Linguistics)
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-211102DOI: 10.1007/978-3-642-20227-8_11OAI: oai:DiVA.org:kth-211102DiVA: diva2:1127537
Note

QC 20170731

Available from: 2017-07-16 Created: 2017-07-16 Last updated: 2017-08-03Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full text

Search in DiVA

By author/editor
Shore, Todd
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 32 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf