We're a little crazy, about science!

Stereotype means girls should expect poorer physics grades

Imagine that you are a female student and give the exact same answer to a physics exam question as one of your male classmates, but you receive a significantly poorer grade. This is precisely what happens on a regular basis, as concluded in a study by Sarah Hofer, a researcher in the group led by ETH professor Elsbeth Stern.

Hofer asked secondary school physics teachers in an online test to grade an exam answer. She presented 780 participants from Switzerland, Germany and Austria with the same question from the field of classical mechanics and the exact same fictitious – and only partially correct – student answer.

The only thing that the ETH scientist varied in the experiment was a short, introductory written statement: half of the trial participants were led to believe that they had to grade an answer from a “male student”, the other half “a female student”. Hofer left the participants in the dark about the purpose of her study, and instead pretended it related to a cross-comparison of two different methods for correcting exams.

The participants graded the physics task differently. In her analysis, Hofer compared the range of grades of the supposed female students with those of the supposed male students. The good news: for teachers who had taught for at least ten years, the gender of the student had no influence on the grade. The bad news: teachers in Switzerland and Austria who had taught for fewer than ten years gave the girls a significantly poorer grade than the boys. As an example: teachers with five or fewer years of professional experience discriminated against girls by an average of 0.7 grade points (Switzerland) and 0.9 grade points (Austria).

“Teachers with less teaching experience are possibly more guided by the bias that girls are worse in physics than boys when grading,” says Hofer.

Earlier studies have already provided evidence that girls have to work harder for the same grades in science-related subjects, but most of those studies looked at the field of mathematics. The present study is the most comprehensive and most recent one for the field of physics and the German-speaking countries.

It is known that biases and stereotypes have an impact on grading when the evaluator does not have enough information or is extremely stressed or overwhelmed.

“Teachers with less experience are apparently more influenced by contextual information such as gender,” says Hofer.

The results of the new study are curious for German secondary school teachers with fewer than ten years of teaching experience: the male teachers graded the girls and boys the same, while the female teachers behaved like their Swiss and Austrian colleagues and graded the girls more poorly. German female teachers with five or fewer years of experience discriminated against the girls by an average of 0.9 grade points.

One possible explanation is that German male teachers are more sensitised than their colleagues in the other countries studied due to promotion programmes for girls in the STEM fields (science, technology, engineering and mathematics). However, as Hofer points out, such programmes exist in all three countries.

In addition to gender, the researcher also varied the fictitious student's specialisation (languages vs. science) in the introductory statement of the online test; specialisation did not affect the grade.

For ETH professor Stern, the poorer grades for girls, as demonstrated in this study, are part of a more fundamental problem:

“Girls and women cannot count on being rewarded for their effort.”

At times they will be graded too generously, at other times too harshly. Their grades do not reflect actual performance as well as they do for boys and men, which makes it difficult for them to find their direction.

“As a girl, when you already have the feeling in school that you won’t be fairly graded in sciences, then you tend to lose interest in these subjects,” says Stern.

Instead, scientifically-gifted women all too frequently turn to other subjects in which they are likely to be more strongly promoted. This is something that should be taken into account in the ongoing STEM promotion programmes.

“Grades are the feedback that students receive for their performance, and they strongly affect their self-perception, motivation and willingness to make an effort,” says Hofer.

“Teachers should therefore take grades very seriously,” says Stern. Likewise, even greater attention should be paid to grading during teacher training. This will be done in the teacher training at ETH Zurich.

But even more fundamentally, stereotypes should be critically scrutinised, especially at school. When grading exam questions, a more structured approach with clear criteria could help teachers grade objectively and block out stereotypes.

“It would be important for teachers to use an evaluation scheme for each exam that outlines how many points should be awarded for which partial answers and which clearly defines what are careless mistakes and consequential errors.”

It would also be helpful if teachers covered the student’s name when grading.

Hofer, S. (2015). Studying Gender Bias in Physics Grading: The role of teaching experience and country. International Journal of Science Education, 37(17), 2879-2905. DOI: 10.1080/09500693.2015.1114190

3 responses

  1. takfurkaffe

    An excellent and thought provoking article with lessons to be learnt for encouraging equal opportunities in STEM!


    January 12, 2016 at 12:47 am

  2. This is very interesting, particularly the differences in grading between more and less experienced teachers, and in different countries; it really shows the huge cultural and societal influence these things have. I think the problem is likely to be worse when there’s no clear-cut right or wrong answer: essay grading springs to mind. It can only be exacerbated by the “stereotype threat” effect – the well-documented phenomenon whereby e.g. women, conditioned to think they will do less well on a maths exam, for example, will perform to that expectation. Certainly for my university exams, however, they were only marked on candidate number: name (and gender) was hidden.


    January 17, 2016 at 1:07 pm

    • That’s exactly how I think the problem could be fixed. Blind the teachers (or graders) to the student in question and let the work speak for itself. It would make for a much better, not to mention more fair, system. Especially in the no clear cut correct answer side of things. Excellent insights and thanks for sharing!


      January 17, 2016 at 1:15 pm
