Due to the lockdown where I am students have taken exams at home. This obviously opens the easy possibility of cheating. While marking, I noticed two scripts had solutions and methods that were remarkably similar if not the exact same. I am thinking of how to gather statistical evidence of such cheating. My question will be: how can we use such evidence to be able to make a quantifiable judgement of potential cheating?
Consider the exam question: Prove that an odd number multiplied by an odd number equals an odd number. Let's say that many solutions followed such a method/template:
- Let a and b be some odd numbers,
- then a=2n+1 and b=2m+1 for some n and m
- then some algebra
- then the conclusion that we have an expression of the form 2p+1 which is some odd c.
While marking I would have some students freely use a,b,c,n,m,p as their parameters, others who would use x,y,z,p,q,r, others using a,b,c,x,y,z etc. Some might assume the two numbers need to be the same. Some might skip the question completely. Crucially, there is variation in the choices available to a student at this position in the exam.
Consider another test question: Find the inverse function of $f(x)=\frac{2x+1}{3-2x}$ and the solution could be of the form:
- Express with $y=$
- use a few algebraic steps
- swap x and y
- express as $f^{-1}(x)=$
Other students decided to swap first, or do the algebra steps in different ways, or to make particular algebraic errors, or to express the solution as $y=$. So again we have choices and variation.
This can be done for every question, one can collect the number of choices students actually made in their solutions at a particular position. So if the exam has 15 questions I would have potentially 15 (or more) different positions where there is variation (or perhaps none if they all happen to use the same choice). Let's say that the number of variations are $n_1,\ldots,n_{15}$.
Now it turns out that two students happened to have the exact same choice in say 12 of the 15 positions.
So then my question: What would be the best way to use this evidence of $n_1,\ldots,n_{15}$ choices/variations and that in the 12 positions {1,2,3,4,6,7,9,10,12,13,14,15} there was a match and in positions {5,8,11} there was no match, to support the claim of cheating?
I hope the question is on topic, appropriate and clear. Incidentally, when I mentioned this to my bosses, they simply shrugged their shoulders and said that we cannot prove cheating so there's nothing we can do. I want to challenge that statement.