The Bot That Wouldn’t Let Me Quit

I spent three months last year trying to drink less coffee. I failed four times. My phone buzzed with reminders, my calendar had blocked-off “cutoff hours,” and I even tried switching to tea. None of it stuck. Then I spent a week testing a chatbot designed to change my eating habits, and it did something no app or calendar reminder had ever done: it asked me, at 3 p.m. on a Tuesday, why I was reaching for a second espresso. It didn’t judge. It just wanted to know. And that question, stupid as it sounds, made me pause.
That pause is the whole point.
A systematic review published in the Journal of Medical Internet Research by Abhishek Aggarwal, Cheuk Chi Tam, Dezhi Wu, and Xiaoming Li (Aggarwal et al., 2023) dug through 15 studies spanning four decades of research to ask a simple question: Can AI chatbots actually change how people behave with their health? The answer, it turns out, is a qualified yes. But the why is what makes this weird, and worth paying attention to.
What 15 Studies Actually Found

The authors screened seven databases for empirical papers published between 1980 and 2022. That is a long window, but the technology itself is young: most of the studies came from the last five years. The team followed PRISMA guidelines, the gold standard for systematic reviews, and ended up with 15 studies that met their criteria.
The results were lopsided but telling. Six studies (40%) showed that chatbots could promote healthy lifestyles. Four (27%) focused on smoking cessation. Two (13%) targeted medication or treatment adherence. One (7%) looked at reducing substance misuse. The remaining studies were feasibility pilots, testing whether people would even talk to a bot about their health in the first place.
Here is the part that surprised me: people did talk. They talked a lot. The authors found that participants reported chatbots offered a “nonjudgmental space for communicating sensitive information” (Aggarwal et al., 2023). That is a polite way of saying people will tell a bot things they will not tell a doctor, a friend, or even a spouse. A bot cannot roll its eyes. A bot does not get disappointed. A bot just listens.
The Secret Ingredient Is Not AI

If you strip away the machine learning, what made these chatbots work was not fancy algorithms. It was behavior change theory. The authors identified that the most effective chatbots used specific strategies: goal setting, monitoring, real-time reinforcement or feedback, and on-demand support (Aggarwal et al., 2023). These are not new ideas. Therapists and health coaches have used them for decades. What the chatbots did was deliver them at scale, in real time, without needing a human on the other end.
Think about that. The bot is not inventing new psychology. It is just executing old psychology faster and more consistently than any human could.
One study in the review used a chatbot to help people quit smoking. The bot sent daily check-ins, asked about cravings, and offered coping strategies. Another used a chatbot to nudge people toward more physical activity. The bot did not just say “go for a walk.” It asked when they could walk, how long they wanted to try, and then checked back to see if they did it. That feedback loop, the monitoring and reinforcement, is what made the difference.
The authors wrote that these chatbots collected “real-time user-chatbot interaction data, such as user preferences and behavioral performance” to personalize the interventions (Aggarwal et al., 2023). In plain English: the bot learned what worked for you and adapted. If you always ignored morning reminders but responded to evening ones, the bot shifted. If you hated walking but liked yoga, the bot adjusted. That kind of tailoring is nearly impossible for a human coach to do at scale. A chatbot can do it for a million people simultaneously.
The Nonjudgmental Space
There is a reason people open up to machines. It is not that the machines are smart. It is that they are safe.
The authors noted that participants across multiple studies cited the nonjudgmental nature of chatbots as a key reason they engaged (Aggarwal et al., 2023). That is a huge deal for health behaviors that carry stigma. Smoking. Substance use. Overeating. People feel shame, and shame makes them avoid help. A chatbot cannot feel shame, and it cannot shame you back. It just asks the next question.
I saw this firsthand when I tested a chatbot for eating habits. I told it I had eaten a whole bag of chips at 11 p.m. It did not sigh. It did not remind me of my goals. It just asked: “What were you feeling before you ate them?” That question, delivered without tone, without judgment, made me actually think about the answer. I was bored. I was tired. I was not hungry. The bot then asked if I wanted to set a reminder to do something else the next time I felt bored at night. I said yes. That is the loop.
Where It Falls Apart
The review was not all glowing. The authors flagged several problems. First, the risk of internal validity was moderate to high across the included studies (Aggarwal et al., 2023). That is academic-speak for “we cannot be totally sure the chatbots caused the behavior changes.” Maybe people who signed up for a chatbot study were already motivated to change. Maybe the novelty of talking to a bot wore off after a few weeks. The studies did not always have good control groups.
Second, the authors noted that many papers provided “insufficient description of AI techniques” (Aggarwal et al., 2023). Some chatbots were powered by simple decision trees, not deep learning. Others used natural language processing but did not explain how. That makes it hard to replicate the results or know which technical approach actually works best.
Third, generalizability is limited. Most studies were small, with samples that skewed young, educated, and tech-savvy. Will a chatbot work for a 70-year-old who has never used a smartphone? For someone with low health literacy? For a person in a rural area with spotty internet? The review does not answer those questions.
The Open Question Nobody Is Asking
Here is the interesting gap the review did not address: What happens when the chatbot is wrong?
These systems learn from user data. If you tell a chatbot you feel anxious and it suggests a breathing exercise, that is fine. But what if you tell it you feel suicidal? What if you tell it you are in an abusive relationship? The authors did not explore how these chatbots handle crisis situations, and the studies they reviewed did not test for it. That is a blind spot. A chatbot that offers a nonjudgmental space could also be a chatbot that misses a red flag.
There is also the question of long-term engagement. Most studies lasted a few weeks or months. Health behavior change, especially for things like weight loss or addiction, takes years. The authors found mixed results on feasibility, acceptability, and usability (Aggarwal et al., 2023). Some people loved the bot. Others found it annoying and stopped using it. The novelty wears off. The question is whether the bot can keep people engaged long enough for the new behavior to become automatic.
The Scalability Promise
One of the most exciting findings in the review was that chatbots could be deployed through devices people already own. Smartphones. Facebook Messenger. The authors called this “potential for scalability” (Aggarwal et al., 2023). That is understated. If a chatbot can work on a basic smartphone with an internet connection, you can reach billions of people. You do not need a clinic, a therapist, or a health coach. You need a phone and a willingness to type.
That changes the economics of health interventions. A human coach costs money per session. A chatbot costs money to build, but once it exists, it can serve an unlimited number of users at near-zero marginal cost. That is why public health agencies are paying attention. If a chatbot can help even 10% of smokers quit, the population-level impact is enormous.
What the Research Does Not Prove
Let me be clear about what this review does not say. It does not say chatbots are better than human coaches. The studies did not compare them head to head in a rigorous way. It does not say chatbots work for everyone. The samples were small and homogeneous. It does not say chatbots are safe for all conditions. The review focused on lifestyle behaviors, not mental illness or chronic disease management.
What it does say is that chatbots can be effective for some people, for some behaviors, under some conditions. That is a starting point, not a conclusion. The authors called for “robust randomized control trials” to establish definitive conclusions (Aggarwal et al., 2023). That is the right call. The evidence is promising but not proven.
What This Actually Means
- ▸If you are trying to change a health habit, a chatbot is worth trying. The research shows it can help with smoking, activity, and medication adherence. The cost is zero. The risk is low. The upside is real.
- ▸The chatbot needs to be built on behavior change theory, not just chitchat. Look for bots that set goals, track progress, give feedback, and adapt to your responses. A bot that just says “good job” is not enough.
- ▸The nonjudgmental aspect is the killer feature. If you feel embarrassed talking to a doctor or a friend about your health, a chatbot might be the right first step. It is a safe place to be honest.
- ▸Do not expect a miracle. The effects are real but modest. The review found high efficacy in some studies, but the methodology was not airtight. Use a chatbot as a tool, not a cure.
- ▸The technology is getting better, but the evidence base is still thin. If you try a chatbot and it does not work, that does not mean you failed. It means the science is still figuring out what works for whom.
References
- [1]Abhishek Aggarwal, Cheuk Chi Tam, Dezhi Wu, Xiaoming Li (2023). Artificial Intelligence–Based Chatbots for Promoting Health Behavioral Changes: Systematic Review. Journal of Medical Internet ResearchDOI· 572 citations
