1 of 16

1

The Impact of Student-AI Collaborative Feedback Generation on Learning Outcomes

Anjali Singh

PhD Candidate

School of Information

Christopher Brooks

Associate Professor

School of Information

Xu Wang

Assistant Professor

Computer Science and Engineering

2 of 16

Background: Hint Generation at Scale

2

Peer Feedback

+ Effective form of active learning

- Writing good hints is challenging for students

AI for Generating Hints

+ Helpful for scaling and automation

- Can make learners over-reliant on AI support

- Limited success in generating good hints in complex domains

3 of 16

Proposed Solution: Student-AI Collaboration

Singh et al. 2024

  • Students revised GPT-4 generated hints for given incorrect solutions

Findings:

+ Students who revised GPT-4 hints wrote better phrased and more specific hints

- Students got biased by GPT-4 hint; Low accuracy GPT-4 hints led to low accuracy student generated hints

Implication: ��Providing AI-generated hints to students after they have attempted the task independently for a ‘second opinion’

3

Solution B:��<Incorrect code solution>

Singh, A., Brooks, C., Wang, X., Li, W., Kim, J., & Pandey, D. (2023). Bridging Learnersourcing and AI: Exploring the Dynamics of Student-AI Collaborative Feedback Generation. arXiv preprint arXiv:2311.12148.

4 of 16

A Three-arm Experiment

4

AI-assistance

Students can get immediate assistance from GPT-4 during hint-writing

Research Question: Impact of hint-writing activity design on students’ learning outcomes?

AI-revision

Students first write a hint, then see GPT-4 hint and then rewrite the hint�

No AI-support

Students write hint without any AI support

Level of AI support

5 of 16

Hint Writing Activity

5

.�.

.

6 of 16

Study Outline

6

Week 1

Pre-test consisting of 10 MCQs (single correct answer) on basic python programming

Week 2

Hint-writing assignment based on programming assignment for week-2

Week 3

Hint-writing assignment based on programming assignment for week-3

Week 4

Post-test consisting of 6 MCQs (2 with single correct answer and 4 with more than one correct answer) based on course concepts and debugging skills

7 of 16

Prompting GPT-4

Write a brief hint in less than 100 words on the given incorrect solution for the given assignment. The hint should be written keeping in mind that the hint receiver is a novice data science learner with only introductory Python programming and statistics knowledge. The hint should have the following qualities: �- It should help the student who wrote the incorrect solution identify their mistakes and fix them, without giving away the full solution to them. �- It should be specific, i.e., it should provide information about how and where the code does or does not meet the assignment goals. �- It should not refer to the correct solution provided below ��Start your response with “Hint:” and highlight keywords, variable names, messages, line numbers and error names in bold. Do not write any text in addition to the hint. �———————————————————————— �<Programming Assignment Problem Statement>�� Correct Solution: <Correct Code> ��Incorrect Solution: <Incorrect Code>

7

8 of 16

Impact of Hint-writing Designs on Learning Outcomes

8

9 of 16

Study Results

  • Selected total 55 after propensity score matching to get students from each group with similar pre-test scores.
  • AI-assistance group had lowest mean post-test score
  • Difference in post-test scores between conditions was not statistically significant (p = 0.18),

9

Level of AI support

10 of 16

Implications

  • Students can possibly learn more when prompted to first think of the solution on their own before seeking AI-based assistance. �
  • Need for research on designs that promote active student engagement with AI tools

10

11 of 16

Limitations

  • Small sample size
  • Study conducted in a single course at a single institution

11

12 of 16

Next Steps

12

Creating a model for evaluating the quality of hints generated by LLMs or humans based on accuracy and pedagogical attributes (e.g., specificity, phrasing)

Using the high-quality hints to improve LLM based hint generation models using chain of thought prompting and fine-tuning.

13 of 16

Concluding Thoughts

  • What is the right amount of AI support in a given educational context?
  • How can we design educational AI tools that encourage active student participation?

13

Level of AI support

AI-assistance

AI-revision

No AI-support

14 of 16

�Anjali Singh�singhanj@umich.edu� @singhanj13

Thank

You ☺

14

�Co-authors:

Christopher Brooks

Xu Wang

15 of 16

Data Collected

Per student, we collected the following data:

  • Time spent on worked example and hint writing activity
  • Incorrect code shown to them
  • Correct code shown to them
  • Hint written by them
  • Revised hint written by them (for students in AI-revision condition)
  • GPT-4 hint shown to them (if not in the Baseline condition)
  • Survey responses (qualitative and Likert scale ratings)
    • Hint-writing activity experience
    • Perceived learning benefits

15

16 of 16

Hint-writing System Design

16

Finish and submit programming assignment

Start hint-writing assignment

All final submissions

Incorrect submissions from previous cohort

Go through worked example

Compare I and C;�Identify the mistakes in I

Assign randomly to one of the 3 conditions

C: Fetch learner’s final submission if it is correct, else fetch instructor solution

I: Fetch incorrect code that is most similar to C

AI-assistance��GPT-4 hint for I available on demand by clicking “Show ChatGPT hint” button

AI-revision

  1. Write hint with no support
  2. Read GPT-4 hint
  3. Rewrite hint

Baseline

Write hint with no additional available support

Submit final hint