Authentic Assessments Should Measure Real World Skills Not Tests
behavioral science9 min read1,746 words

Authentic Assessments Should Measure Real World Skills Not Tests

Authentic assessments measure real-world skills better than traditional tests, improving student engagement and career readiness.

S

Sahil Batra

Anthropologist and travel writer who has lived across five countries. Covers how...

The Test That Forgot What It Was Testing

skills assessment workshop
skills assessment workshop

When Jan McArthur started digging into the research on "authentic assessment," she found something strange. The term was everywhere in higher education. Professors used it to describe projects that mimicked real world tasks. Administrators used it to justify curriculum changes. Consultants used it to sell workshops.

But almost nobody, McArthur realized, had asked a basic question: What does "authentic" actually mean here?

McArthur, an education researcher at Lancaster University, traced the concept back to the 1980s, when educators in the United States first used it to describe assessments that went beyond multiple choice and memorization. The idea was simple: instead of testing whether students could recall facts, test whether they could do something that mattered. But over the decades, the meaning had shifted. By 2022, when McArthur published her analysis in Higher Education, the phrase had become a catchall for anything that looked vaguely practical.

The problem, she argued, was that most definitions of authentic assessment conflated two very different things: the "real world" and the "world of work." And that conflation, left unexamined, was quietly hollowing out the very purpose of education.

Why "Real World" Became a Trap

Here is the paradox McArthur identified. When educators say they want assessments to reflect the "real world," they usually mean they want students to perform tasks that resemble what professionals do. A nursing student runs a mock triage. An engineering student designs a bridge. A business student pitches to fake investors. These are better than multiple choice tests, sure. But they also carry an implicit assumption: that the purpose of education is to produce workers.

McArthur (2022) traced how this assumption became dominant. The shift started in the 1990s, as universities faced pressure to demonstrate their economic value. Governments wanted graduates who could fill jobs. Employers wanted recruits who needed less training. Authentic assessment, in this frame, became a way to prove that a degree translated into workplace competence.

But McArthur pushed back on this logic. She argued that the "world of work" is not the same as the "real world." Work is one part of life. The real world also includes citizenship, relationships, creativity, rest, and the ability to question the systems you inhabit. When assessments only measure workplace readiness, they quietly train students to see themselves as human capital rather than as humans.

The Philosophical Gap Nobody Noticed

McArthur's most surprising finding was not about what the research did show, but about what it did not address. She reviewed decades of literature on authentic assessment and found that almost none of it engaged with the rich philosophical tradition of thinking about authenticity. Philosophers from Heidegger to Taylor to Sartre had spent centuries wrestling with what it means to be authentic. Educators, meanwhile, were using the word as if its meaning were obvious.

This matters because philosophy gives us tools to ask harder questions. If authenticity means being true to yourself, then an authentic assessment should ask: True to which self? The self that performs for a grade? The self that follows instructions? Or the self that questions, creates, and finds meaning?

McArthur (2022) argued that the current approach to authentic assessment focuses almost entirely on the task rather than the person. It asks: Does this assignment look like something a professional would do? But it rarely asks: Does this assignment allow a student to be themselves, to find purpose, to connect their learning to something they genuinely care about?

What McArthur Actually Did

McArthur's paper is a theoretical analysis, not an experiment. She did not run a study with 500 students or measure effect sizes. Instead, she did something rarer and in some ways harder: she traced the intellectual history of a concept, identified its hidden assumptions, and proposed a more coherent framework.

Her method was philosophical and critical. She read widely across the assessment literature, identified patterns and gaps, and then brought in philosophical sources to challenge those patterns. She also looked at how the term "authentic assessment" had been used differently in different countries. In Australia and New Zealand, for example, the term had stronger connections to indigenous education and community knowledge. In the UK and US, it was more narrowly tied to employability.

This cross national comparison revealed something important. The meaning of "authentic" was not fixed. It was shaped by cultural and economic pressures. And those pressures, left unexamined, were steering assessment toward a narrow definition of value.

The Missing Element: Social Value

McArthur's central argument is that we should stop asking whether an assessment is "authentic" in the sense of resembling work. Instead, we should ask whether it has social value.

Social value, as she defined it, includes economic value but is not limited to it. An assessment with social value might help a student understand how to participate in democratic processes. It might help them think critically about media. It might help them develop empathy for people different from themselves. It might help them find work that feels meaningful, not just profitable.

McArthur (2022) wrote: "We should move from simply focusing on the authentic task to considering why that task matters." This shift in emphasis is subtle but profound. It changes the question from "Does this look real?" to "Does this matter in a way that connects the student to something larger than themselves?"

Three Ways Authentic Assessment Goes Wrong

McArthur identified several recurring problems in how authentic assessment is currently practiced.

  • The performative trap. When assessments are designed to mimic work, students quickly learn to perform competence rather than develop it. They focus on hitting the right notes for the evaluator rather than engaging with the material on their own terms. This is the opposite of authenticity.
  • The narrowness problem. Work based assessments tend to measure a limited set of skills: following instructions, meeting deadlines, producing standard outputs. They rarely measure curiosity, moral reasoning, creativity, or the ability to question authority. These are precisely the skills that matter most in a changing world.
  • The well being blind spot. McArthur found that the literature on authentic assessment almost never discussed student well being. The assumption seemed to be that if an assessment was "real," it was automatically good for students. But McArthur argued that authenticity should also mean allowing students to experience genuine achievement, which is closely tied to self worth and mental health.

What a Better Assessment Looks Like

McArthur did not prescribe a single model. But her analysis pointed toward several principles that could guide better practice.

An assessment with social value would ask students to engage with real problems, but not just problems from the workplace. It might ask a history student to create a public exhibit that helps a local community understand its past. It might ask a computer science student to build a tool that addresses a social need. It might ask a literature student to write a review that helps people decide what to read.

The key is that the assessment connects the student to a community beyond the classroom. It gives the student a reason to care beyond the grade. And it allows the student to see themselves as someone who can contribute, not just someone who can perform.

McArthur (2022) put it this way: "Senses of achievement can become richer, thus enhancing the students' sense of self, self worth, and well being."

What This Research Does Not Prove

McArthur's paper is a conceptual argument, not an empirical test. It does not prove that students who experience socially valuable assessments have better outcomes. It does not provide a checklist or a scoring rubric. It does not claim that all work based assessments are bad.

What it does is reveal the assumptions hiding inside a popular term. It shows that "authentic assessment" means different things to different people, and that the dominant meaning has been shaped by economic pressures rather than educational philosophy.

This leaves open a fascinating question: If we redesigned assessments around social value instead of employability, would students actually learn more? Would they be happier? Would they be better citizens? These are empirical questions that McArthur's framework invites researchers to explore.

The Real Problem with "Real World"

Here is what McArthur's analysis made clear to me. The phrase "real world" is doing a lot of ideological work. It makes a particular vision of education sound neutral and inevitable. Who could be against preparing students for the real world?

But the "real world" is not a single thing. It is a contested space. For some people, the real world is the office. For others, it is the community center, the voting booth, the protest, the garden, the home. When we define authenticity as alignment with the workplace, we are not just making a pedagogical choice. We are making a political one. We are saying that the most important thing a person can learn is how to be a productive worker.

McArthur's intervention is to say: That is not the only choice. And it might not even be the best one.

What This Actually Means

Here is what I take away from McArthur's analysis, translated into concrete shifts in thinking.

  • Stop asking if an assessment looks like work. Start asking if it matters. The question is not "Does this resemble a professional task?" but "Does this give the student a reason to care beyond the grade?" That is the real measure of authenticity.
  • Assess what you actually value. If you value curiosity, test curiosity. If you value collaboration, test collaboration. If you value moral reasoning, test moral reasoning. Do not test only the things that are easy to measure in a workplace simulation.
  • Connect students to communities, not just careers. An authentic assessment should help a student see how their knowledge matters to people outside the classroom. That might mean working with a local organization, creating something for a public audience, or tackling a problem that affects real lives.
  • Consider well being as part of the equation. An assessment that leaves students feeling hollow or anxious is not authentic, even if it looks like a real job. Authenticity should include the experience of genuine achievement, which is tied to self worth.
  • Be honest about what you are teaching. When you design an assessment, you are also teaching a lesson about what matters. If your assessments only measure workplace readiness, you are teaching students that their value is defined by their employability. You might not believe that. But your assessments are saying it for you.

References

  1. [1]Jan McArthur (2022). Rethinking authentic assessment: work, well-being, and society. Higher EducationDOI· 163 citations
#authentic assessment#real-world skills#education reform#student engagement
S

Sahil Batra

Anthropologist and travel writer who has lived across five countries. Covers how place shapes behaviour, what migration research reveals about identity, and the economics of movement.

Reader Comments (2)

Dr. Ananya Sharma★★★★★

Interesting. Our engineering grads ace theory but struggle with client projects. Real-world problem-solving is what industry needs, not just exam scores. How do we train faculty to design such assessments?

Ravi Iyer★★★★★

As a hiring manager, I see this daily. Candidates with great GPAs often lack practical judgment. We've started our own simulation-based tests. Glad research is catching up to ground reality.

Leave a comment

Related Articles