Change search

Cite
Citation style
• apa
• harvard1
• ieee
• modern-language-association-8th-edition
• vancouver
• Other style
More styles
Language
• de-DE
• en-GB
• en-US
• fi-FI
• nn-NO
• nn-NB
• sv-SE
• Other locale
More languages
Output format
• html
• text
• asciidoc
• rtf
Grade levels for multiple-choice tests
KTH, School of Electrical Engineering (EES), Electromagnetic Engineering. (RCAM)ORCID iD: 0000-0002-2462-8340
KTH, School of Electrical Engineering (EES), Electromagnetic Engineering. (RCAM)ORCID iD: 0000-0002-2964-7233
Svenska kraftnät.
2014 (English)Conference paper, Poster (with or without abstract) (Refereed)
##### Abstract [en]

In education there is nearly always an evaluation of how much of the course syllabus the student has learnt. The multiple-choice tests are a type of written examination that is easy to use, since the correction is quickly done. The problem with these test is to find a good grading scale, since the students can get more points than what their knowledge indicate, as they can guess.

There are different approaches to analyze the results of multiple-choice tests. However, the articles that have been found do not discuss the number of grades that a multiple-choice test should be able to differentiate. Also in those articles there are no figures, where the effect of guessing is shown.

The purpose is to design a multiple-choice test. The inputs are the grade scale and the number of alternatives and the output is the number of questions required for a given level of errors in grading. As an example we consider a multiple-choice test with 40 questions with 4 alternatives. We study the distribution of four students that knows nothing, 25 \%, 50 \% and 75 \%  of the questions. They guess the rest of the questions.

It can be observed that the distribution narrows when going from the left to the right. This is due to a smaller variance, since the number of guesses decreases. With a smaller variance, the height of the peak has to increase, in order to get the area below the curve. The overlapping tails are bigger to the left, as the variance is bigger. Thus the risk of putting a student in the wrong category is bigger for the lower grades. The rightmost peak is the least symmetric, with a steeper left-hand side. Then the binomial distribution is not approximated by a normal distribution so well, since the number of trials is lower. With 60 questions instead of 40, the intersection between the curves is at a lower level of probability, which means that the risk of giving a student a wrong grade is lower.

Åbo, 2014.
##### Keywords [en]
Binomial distribution, normal approximation, multiple-choice tests
##### National Category
Electrical Engineering, Electronic Engineering, Information Engineering
##### Research subject
Electrical Engineering
##### Identifiers
OAI: oai:DiVA.org:kth-166107DiVA, id: diva2:809274
##### Conference
Nordstat 2014, Turku, Finland, 02-06 Jun 2014
##### Funder
SweGRIDS - Swedish Centre for Smart Grids and Energy Storage
##### Note

QC 20150507

Available from: 2015-04-30 Created: 2015-04-30 Last updated: 2015-05-07Bibliographically approved

#### Open Access in DiVA

No full text in DiVA

Westerlund, Per

#### Search in DiVA

##### By author/editor
Westerlund, PerHilber, Patrik
##### By organisation
Electromagnetic Engineering
##### On the subject
Electrical Engineering, Electronic Engineering, Information Engineering

urn-nbn

#### Altmetric score

urn-nbn
Total: 89 hits

Cite
Citation style
• apa
• harvard1
• ieee
• modern-language-association-8th-edition
• vancouver
• Other style
More styles
Language
• de-DE
• en-GB
• en-US
• fi-FI
• nn-NO
• nn-NB
• sv-SE
• Other locale
More languages
Output format
• html
• text
• asciidoc
• rtf
v. 2.35.4
| | | |