The Exam That Changed Everything

In 2023, a chatbot sat for the United States Medical Licensing Exam. It passed. Then it took the bar exam. It passed that too. Then the GRE, the SAT, and a battery of AP tests. ChatGPT cleared them all, often scoring in the 90th percentile or higher. The machine did not get nervous. It did not have test anxiety. It did not stay up late cramming.
Firuz Kamalov, David Santandreu Calonge, and Ikhlaas Gurrib, researchers at Canadian universities who published a major review in Sustainability (Kamalov et al., 2023), watched this unfold and realized something urgent: education had entered a new phase, and nobody had agreed on the rules. Their paper, which synthesizes hundreds of studies on AI in education, does not try to scare you. It does not try to sell you a utopia. Instead, it does something rarer. It asks what happens when a tool that can write a passable essay, solve a calculus problem, and diagnose a patient appears in every classroom simultaneously.
The answer is not what you expect. The authors conclude that AI will not replace teachers. But it will force them to change what teaching even means.
What the Research Actually Found

Kamalov and his colleagues did not run experiments with students. They did something arguably harder. They reviewed the existing literature across three axes: applications, advantages, and challenges. They wanted to know what the evidence actually says about AI in education, not what the hype claims.
The methodology is straightforward. They searched major databases for peer-reviewed studies on AI in education published between 2018 and 2023. They filtered for relevance, quality, and recency. Then they coded each study for its focus area: intelligent tutoring systems, automated assessment, personalized learning, or collaborative teacher-student interaction. They also tracked negative outcomes and ethical concerns.
What emerged is a picture more nuanced than either the techno-optimists or the doomsayers want to admit. The authors found that AI's strongest impact is not in replacing human judgment but in handling the mechanical parts of teaching that consume teachers' time and energy. Grading multiple-choice tests. Generating practice problems. Tracking which students are falling behind and which are bored.
But the authors also found something subtler. The same AI that can personalize learning for a student can also be gamed by that student. The same system that writes an essay can also plagiarize one. The same tool that tutors can also cheat.
The Three Things AI Does Better Than Teachers

Personalized Learning at Scale
Every teacher knows the problem. You have thirty students in a room. Three are lost. Three are bored. The rest are somewhere in between. You cannot teach each of them individually. You cannot.
AI can. Kamalov et al. (2023) document how intelligent tutoring systems already adapt to individual students' pace, style, and knowledge gaps. These systems do not just present material. They track which concepts a student struggles with, which they master quickly, and which they forget a week later. Then they adjust.
The authors cite studies showing that students using AI-driven personalized platforms outperform those in traditional lecture formats by measurable margins. But the key insight is not the performance boost. It is that the AI handles the granular, repetitive work of differentiation, freeing the teacher to do what only a human can.
Instant, Consistent Feedback
A teacher with 150 students cannot give detailed feedback on every essay every week. It is physically impossible. AI can. Kamalov et al. (2023) review automated assessment tools that evaluate not just spelling and grammar but argument structure, evidence use, and logical coherence.
But here is the catch the authors emphasize. The AI is good at pattern recognition. It can tell you that your thesis statement is weak because it has seen 10,000 weak thesis statements. It cannot tell you why your thesis statement matters to you, or how to find the courage to write a better one.
Predictive Intervention
The most powerful use of AI in education may be the least visible. Kamalov et al. (2023) describe predictive systems that analyze student data to identify who is at risk of dropping out, failing, or disengaging. These systems flag problems before the teacher sees them.
A student who stops logging in. A student whose quiz scores suddenly drop. A student who used to participate in discussions and now stays silent. The AI notices. The teacher acts.
The Three Things Teachers Do That AI Cannot
Contextual Judgment
AI can grade an essay on the American Revolution. It cannot understand why a student from a military family wrote about loyalty differently than a student whose parents immigrated as refugees. It cannot see the personal context that shapes a student's thinking.
Kamalov et al. (2023) are explicit about this limitation. They write that AI systems lack the ability to interpret the social, emotional, and cultural dimensions of learning. A machine can identify a wrong answer. It cannot understand why the student gave that answer, or what the student was actually trying to say.
Emotional Intelligence
A student who just lost a parent does not need a personalized learning plan. They need a human who notices they are not okay. Kamalov et al. (2023) review studies showing that teacher-student relationships are among the strongest predictors of academic success. AI cannot build those relationships.
The authors note that AI can simulate empathy. It can be programmed to say the right words at the right time. But simulation is not the same as presence. A student knows the difference between a chatbot that asks "How are you feeling?" and a teacher who actually wants to know.
Modeling Intellectual Courage
This is the one that matters most. Kamalov et al. (2023) argue that education is not just about transmitting information. It is about teaching students how to think, how to question, how to be wrong gracefully, and how to change their minds. AI can provide information. It cannot model intellectual humility.
A teacher who says "I don't know, let's find out together" is doing something no AI can replicate. A teacher who admits a mistake, who changes position based on new evidence, who shows that learning is a process not a product, is teaching the most important lesson of all.
The Ethical Minefield Nobody Is Talking About
Kamalov et al. (2023) do not gloss over the problems. They catalog them with the same rigor they apply to the benefits.
The Cheating Paradox
AI makes cheating easier and harder to detect. A student can ask ChatGPT to write an essay, submit it, and the teacher may never know. But the same student who uses AI to cheat is also failing to learn the skills they need. The authors point out that the solution is not better detection software. It is redesigning assignments so that cheating is pointless.
If the assignment asks for a summary of a text, the AI wins. If the assignment asks for a personal connection to the material, or a creative application, or a reflection on the process of learning itself, the AI is useless.
The Data Privacy Problem
Every interaction with an AI tutoring system generates data. Where the student struggles. What they search for. How long they take. Who they are. Kamalov et al. (2023) warn that this data can be used to improve education or to surveil students. The same system that identifies a struggling student can also identify a student who might be a political dissident, or who has learning disabilities, or who is questioning their gender identity.
The authors argue that without strong data protection frameworks, AI in education becomes a surveillance tool dressed as a tutor.
The Access Gap
AI costs money. The best systems require computing infrastructure, training, and maintenance. Kamalov et al. (2023) note that the schools that need AI the most, underfunded schools in rural areas or poor communities, are often the least likely to have access to it. The result is that AI could widen the achievement gap it is supposed to close.
What the Research Does Not Prove
The paper is a review, not an experiment. It synthesizes existing findings but does not generate new data. That means several important questions remain open.
First, the long term effects of AI on learning are unknown. Most studies reviewed by Kamalov et al. (2023) cover weeks or months, not years. We do not know what happens to a student who uses AI tutors from kindergarten through college. Does it make them more independent or more dependent? Does it accelerate learning or create a ceiling?
Second, the paper does not resolve the question of whether AI improves deep understanding or just surface performance. A student who uses AI to generate practice problems may score higher on tests without actually understanding the material at a conceptual level. The authors acknowledge this but cannot answer it.
Third, the research is heavily focused on Western, educated, industrialized, rich, and democratic contexts. We know very little about how AI tutors work in cultures with different educational traditions, different relationships to authority, or different definitions of success.
The Only Way Forward
Kamalov et al. (2023) end their paper with a clear conclusion. The only way forward is to embrace the new technology while implementing guardrails to prevent its abuse. This is not a compromise. It is a recognition that the genie is out of the bottle. AI is already in classrooms, whether schools have official policies or not. Students are already using it. The question is whether teachers will lead the integration or be dragged along.
The authors propose several guardrails. Transparency about when AI is being used. Data privacy protections. Equity of access. Teacher training. And most importantly, a shift in what we assess. If we keep testing the things AI can do, we will keep being outsmarted by machines. If we start testing the things only humans can do, we will finally see what education is actually for.
What This Actually Means
- ▸Stop assigning work that AI can do. If a chatbot can complete your homework in thirty seconds, that homework is not teaching anything worth learning. Redesign assignments around process, reflection, and personal connection. The AI can summarize a text. It cannot tell you why that text made you cry.
- ▸Use AI for the mechanical parts of teaching. Grading multiple choice. Generating practice problems. Tracking attendance. Identifying struggling students. These tasks consume teacher time without using teacher skill. Let the machine do them. Then use the reclaimed hours for the work that matters: one on one conversations, mentoring, and modeling how to think.
- ▸Teach students how to use AI critically. The students who will thrive are not the ones who ban AI. They are the ones who learn when to trust it, when to question it, and when to ignore it. This is a new literacy, as important as reading and writing.
- ▸Protect student data like it is a weapon. Because it is. The same data that helps a tutor personalize learning can be used to track, profile, and discriminate. Schools need policies that treat student data with the same seriousness as medical records.
- ▸Accept that the teacher's role is changing, not disappearing. The teacher of the future will not be a dispenser of information. That job is already gone. The teacher of the future will be a coach, a mentor, a guide, and a human being who shows up fully. That job cannot be automated. It can only be chosen.
References
- [1]Firuz Kamalov, David Santandreu Calonge, Ikhlaas Gurrib (2023). New Era of Artificial Intelligence in Education: Towards a Sustainable Multifaceted Revolution. SustainabilityDOI· 898 citations
