kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Grade levels for multiple-choice tests
KTH, School of Electrical Engineering (EES), Electromagnetic Engineering. (RCAM)ORCID iD: 0000-0002-2462-8340
KTH, School of Electrical Engineering (EES), Electromagnetic Engineering. (RCAM)ORCID iD: 0000-0002-2964-7233
Svenska kraftnät.
2014 (English)Conference paper, Poster (with or without abstract) (Refereed)
Abstract [en]

In education there is nearly always an evaluation of how much of the course syllabus the student has learnt. The multiple-choice tests are a type of written examination that is easy to use, since the correction is quickly done. The problem with these test is to find a good grading scale, since the students can get more points than what their knowledge indicate, as they can guess.

There are different approaches to analyze the results of multiple-choice tests. However, the articles that have been found do not discuss the number of grades that a multiple-choice test should be able to differentiate. Also in those articles there are no figures, where the effect of guessing is shown.

The purpose is to design a multiple-choice test. The inputs are the grade scale and the number of alternatives and the output is the number of questions required for a given level of errors in grading. As an example we consider a multiple-choice test with 40 questions with 4 alternatives. We study the distribution of four students that knows nothing, 25 \%, 50 \% and 75 \%  of the questions. They guess the rest of the questions.

It can be observed that the distribution narrows when going from the left to the right. This is due to a smaller variance, since the number of guesses decreases. With a smaller variance, the height of the peak has to increase, in order to get the area below the curve. The overlapping tails are bigger to the left, as the variance is bigger. Thus the risk of putting a student in the wrong category is bigger for the lower grades. The rightmost peak is the least symmetric, with a steeper left-hand side. Then the binomial distribution is not approximated by a normal distribution so well, since the number of trials is lower. With 60 questions instead of 40, the intersection between the curves is at a lower level of probability, which means that the risk of giving a student a wrong grade is lower.

Place, publisher, year, edition, pages
Åbo, 2014.
Keywords [en]
Binomial distribution, normal approximation, multiple-choice tests
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Electrical Engineering
Identifiers
URN: urn:nbn:se:kth:diva-166107OAI: oai:DiVA.org:kth-166107DiVA, id: diva2:809274
Conference
Nordstat 2014, Turku, Finland, 02-06 Jun 2014
Funder
SweGRIDS - Swedish Centre for Smart Grids and Energy Storage
Note

QC 20150507

Available from: 2015-04-30 Created: 2015-04-30 Last updated: 2024-03-18Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Westerlund, PerHilber, Patrik

Search in DiVA

By author/editor
Westerlund, PerHilber, Patrik
By organisation
Electromagnetic Engineering
Electrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 234 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf