Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Information Retrieval and Visualization for the Historical Domain
2011 (English)In: Language Technology for Cultural Heritage: Selected Papers from the LaTeCH Workshop Series / [ed] Sporleder, Caroline; van den Bosch, Antal; Zervanou, Kalliopi, Berlin, Heidelberg: Springer Berlin/Heidelberg, 2011, 197-212 p.Chapter in book (Other academic)
Abstract [en]

Working with large and unstructured collections of historical documents is a challenging task for historians. Despite the recent growth in the volume of digitized historical data, available collections are rarely accompanied by computational tools that significantly facilitate this task.We address this shortage by proposing a visualization method for document collections that focuses on graphical representation of similarities between documents. The strength of the similarities is measured according to the overlap of historically significant information such as named entities,or the overlap of general vocabulary. Similarity strengths are then encoded in the edges of a graph.The graph provides visual structure, revealing interpretable clusters and links between documents that are otherwise difficult to establish. We implement the idea of similarity graphs within an information retrieval system supported by an interactive graphical user interface. The system allows querying the database, visualizing the results and browsing the collection in an effective and intuitive way. Our aproach can be easy adapted and extended to collections of documents in other domains.

Place, publisher, year, edition, pages
Berlin, Heidelberg: Springer Berlin/Heidelberg, 2011. 197-212 p.
Keyword [en]
historical collections, information retrieval, graph visualization clustering, recommender system
National Category
Language Technology (Computational Linguistics)
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:kth:diva-211102DOI: 10.1007/978-3-642-20227-8_11OAI: oai:DiVA.org:kth-211102DiVA: diva2:1127537
Available from: 2017-07-16 Created: 2017-07-16 Last updated: 2017-07-16

Open Access in DiVA

No full text

Other links

Publisher's full text
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

Altmetric score

Total: 9 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf