Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Revealing Relations between Open and Closed Answers in Questionnaires through Text Clustering Evaluation
KTH, School of Computer Science and Communication (CSC), Numerical Analysis and Computer Science, NADA.
KTH, School of Computer Science and Communication (CSC), Numerical Analysis and Computer Science, NADA.ORCID iD: 0000-0002-4178-2980
2008 (English)In: Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), 2008, 1-7 p.Conference paper, Published paper (Refereed)
Abstract [en]

Open answers in questionnaires contain valuable information that is very time-consuming to analyze manually. We present a method forhypothesis generation from questionnaires based on text clustering. Text clustering is used interactively on the open answers, and the usercan explore the cluster contents. The exploration is guided by automatic evaluation of the clusters against a closed answer regarded as acategorization. This simplifies the process of selecting interesting clusters. The user formulates a hypothesis from the relation betweenthe cluster content and the closed answer categorization. We have applied our method on an open answer regarding occupation comparedto a closed answer on smoking habits. With no prior knowledge of smoking habits in different occupation groups we have generated thehypothesis that farmers smoke less than the average. The hypothesis is supported by several separate surveys. Closed answers are easyto analyze automatically but are restricted and may miss valuable aspects. Open answers, on the other hand, fully capture the dynamicsand diversity of possible outcomes. With our method the process of analyzing open answers becomes feasible.

Place, publisher, year, edition, pages
2008. 1-7 p.
National Category
Computer and Information Science
Identifiers
URN: urn:nbn:se:kth:diva-10123ISI: 000324028901137OAI: oai:DiVA.org:kth-10123DiVA: diva2:209206
Conference
LREC´08, 28-30 May, Marrakesh, Marocco
Note

QC 20100806

Available from: 2009-03-24 Created: 2009-03-24 Last updated: 2014-09-29Bibliographically approved

Open Access in DiVA

No full text

Other links

LREC

Authority records BETA

Velupillai, Sumithra

Search in DiVA

By author/editor
Rosell, MagnusVelupillai, Sumithra
By organisation
Numerical Analysis and Computer Science, NADA
Computer and Information Science

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 20 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf