Generative AI (genAI) chatbots have become part of the learners’ toolkit and will likely continue to become even more performant in the future. On the one hand, educators and researchers (Skulmowski, 2024; Zhai et al., 2024) are concerned about learners’ overreliance on genAI chatbots and the offloading of learning processes. On the other hand, these chatbots also provide rich opportunities for self-regulated learning (SRL; Pan et al., 2025). To support students in skillful learning with genAI chatbots, we need more insight into their actual interaction styles, as well as design knowledge on how to develop chatbots that facilitate rather than hinder deeper learning.
This study explores the interaction patterns of high school students in a research task with a chatbot. It investigates whether chatbot-initiated questions can serve as coregulatory prompts to enhance SRL, influencing students’ interaction patterns, mental effort, and learning outcomes.
While chatbots can provide interactive learning experiences, prior research indicates that students primarily engage with them as static content generators rather than conversational partners (Klar, 2025). Research also shows that learners perceive chatbots as easy-to-use (Ngo, 2023) and have high self-efficacy which could lead to overconfidence and reduced mental effort (Stadler et al., 2024). Students already use chatbots without external guidance in information tasks (von Garrel & Mayer, 2023). The design of chatbot system prompts, i.e., their permanent role, may shape how learners interact with them and encourage more effective self-regulated learning. For example, chatbots can output monitoring prompts in the shape of comprehension questions and reflection questions (Guo, 2022). As the underlying genAI technology improves further, chatbots could become more widespread and more reliable in the coregulation of learning (Hadwin et al., 2018) and a continuum of self-regulated, coregulated, and AI-regulated learning becomes increasingly feasible (Molenaar, 2022). Research that accompanies these technological developments and iteratively tests designs that enhance learning is needed (Reigeluth & Honebein, 2024). This study investigates whether chatbot-initiated questions, as a form of coregulation, can influence student behavior, promoting deeper learning strategies while maintaining learner autonomy.
A two-group randomized experimental study was conducted with 71 high school students (14.7 years old on average) who performed 15-minute exploratory research on conspiracy theories using a chatbot based on ChatGPT-4o. Participants were randomly assigned to:
Control Group (CG): Standard chatbot responses.
Experimental Group (EG): The chatbot was given a system prompt to ask comprehension and reflection questions at the end of each response, for example: “Why do you think people sometimes prefer simple explanations?”
Apart from the chatbot system prompts, both groups were equal regarding task, time, or instruction. Both groups were given the open-ended task to find out as much as they could about the topic of conspiracy theories. This open task was designed to reflect authentic information-seeking settings. The EG was not specifically informed about the chatbot role nor instructed to respond to the questions. Student interactions were recorded and their chatbot interactions were coded with qualitative content analysis. Mental effort was assessed using Paas’ (1992) self-report scale, and knowledge gain was measured through pre-post testing. The knowledge tests were open-ended questions and the texts were rated in terms of breadth, depth, and factuality. Two independent raters reached very good interrater agreement with an average weighted κ = .86 across the three criteria after two rounds. Group differences were analyzed with a MANOVA and post-hoc ANOVAs.
The quantitative findings suggest chatbot-initiated questions significantly influenced self-regulated learning behavior: The EG did answer chatbot questions in 41% of their interactions, indicating that they responded to the prompt for deeper processing in many cases while taking the liberty to not respond to them in other cases. As hypothesized, the CG used significantly more adaptation prompts like “Make it shorter” or “Give me more details on this aspect” (M = 2.5) than in the EG (M = 1.42, p = .043), suggesting that when chatbot responses included questions, students engaged in fewer modifications. Knowledge gain was slightly higher in the EG (M = 1.8) than in the CG (M = 1.35), but the difference was not statistically significant (p = .141). Mental effort ratings did not differ significantly (p = .960), contradicting the hypothesis that chatbot questions encourage investing more mental effort.
To gain a better understanding of the learners’ chatbot interactions, every prompt they entered was coded. Table 1 shows the occurrences of these chatbot interactions for both groups individually and in total. The experimental group used about two more prompts per chat than the control group (11.8 prompts on average in the EG; 8.9 in the CG). This overall higher engagement allowed the experimental group to pose slightly more questions despite also giving vastly more answers. In both groups, but especially in the control group, the interaction patterns can be described as using the chatbot as a static text generator. Only 31% of the students in both groups used prompts to adapt the chatbot output to their needs. In the control group, the students did not engage in a conversation with the chatbot: they rarely asked follow-up questions or showed other kinds of conversational interaction. There are more cases of prompts asking for coregulation in the CG than the EG, such as “teach me”. Although these numbers are small, they show that some students in this sample were aware of this chatbot affordance.
Table 1
Codes, Subcodes, and Their Frequencies for Chatbot Interaction
Code/Subcode | Instances n = 71 | EG n = 35 | CG n = 36 |
Total | 699 | 377 | 322 |
Question | 278 | 142 | 136 |
| 174 | 84 | 90 |
| 68 | 43 | 25 |
| 17 | 7 | 10 |
| 12 | 6 | 6 |
| 7 | 2 | 5 |
Answer | 169 | 156 | 13 |
| 133 | 122 | 11 |
| 24 | 23 | 1 |
| 12 | 11 | 1 |
Adapting Prompts | 140 | 50 | 90 |
| 77 | 23 | 54 |
| 16 | 6 | 10 |
| 15 | 4 | 11 |
| 13 | 7 | 6 |
| 11 | 1 | 10 |
| 6 | 2 | 4 |
| 55 | 23 | 28 |
| 37 | 18 | 19 |
| 8 | 2 | 6 |
| 12 | 4 | 8 |
| 5 | 3 | 2 |
| 3 | 1 | 2 |
| 2 | 0 | 2 |
| 2 | 0 | 2 |
Off-topic, e.g., “Best Winter perfumes”, “What is inflation?” | 36 | 7 | 29 |
Conversational Prompts | 55 | 19 | 36 |
| 26 | 6 | 20 |
| 13 | 7 | 6 |
| 9 | 3 | 6 |
| 6 | 2 | 4 |
| 1 | 1 | 0 |
Questions on the functionality of the chatbot, e.g., “Do you have an opinion?”, “Are you ChatGPT?” | 20 | 2 | 18 |
Note. Instances = number of coded student prompts; EG = experimental group; CG = control group.
Chatbot-initiated questions successfully encouraged learners to reflect on their current understanding, prior knowledge, and conceptions. However, their effect on mental effort and knowledge gain was limited, possibly due to students’ perception of chatbots as “easy” tools rather than as partners in learning. Overall, the students underutilized core chatbot affordances like engaging in deeper conversation and adapting the chatbot responses to their needs. The chatbot-initiated questions alleviated this to some degree. This shows that students are responsive to coregulatory chatbot designs that are easy to implement.
The investigated design in the form of a system prompt was just one of many ways to support self-regulated and coregulated learning. As this technology is evolving, there are innumerable options to explore further chatbot designs. The phase in which we need “research to improve” instructional designs rather than “research to prove” their effectiveness, as Reigeluth and Honebein (2024) suggest, will likely continue for some time. As this is a time of exploration for learners, educators, and researchers alike, why should we not include learners in participatory research for chatbot designs? Future research could work with learners (and educators) on designing the chatbot support that learners need and want (Amaefule et al., 2024; Newman et al., 2024). This would shift the discussion away from overreliance and offloading to learner empowerment and co-designed chatbots as partners in learning.