Bhagesh Gaur⧪, Karan Gupta⧪, Aseem Srivastava⧪, Manish Gupta♠, Md Shad Akhtar⧪
Assess and Prompt: A Generative RL Framework for Improving Engagement in Online Mental Health Communities
⧪ Indraprastha Institute of Information Technology Delhi (IIIT Delhi), India
♠ Microsoft, India
“Over 40% of help-seeking posts on Reddit mental health forums get no response.” (Sharma et al., 2020; Kim et al., 2023)
Even in supportive spaces, silence can deepen isolation.
Why do so many cries for help online go unanswered?
We aim to understand and bridge this communication gap.
Support-seeking posts often miss key ingredients of help
Clear expression is the bridge between asking for help and receiving it.
Posts without clear ‘support attributes’ fail to elicit engagement
Posts without clear ‘support attributes’ fail to elicit engagement
We shift focus from ‘how to respond’ → to ‘how to help users express better.’
Including Event, Effect, Requirement in post increases the number of comments
Can a language model identify missing support attributes in a post and prompt the user to express them?
To study this aspect and address the gaps, we propose two major contributions:
�
We propose REDDME, a manually annotated corpus of Reddit posts.
The following attributes are annotated
with spans (rationales), their intensity
levels and guided question as per
taxonomy.
Stats:
Total posts: 4760
Average Post Length: 179.62
Total Guided Questions: 7909
Dataset: REDDME
Taxonomy: CueTaxo
MH-COPILOT empowers support-seekers to tell their stories better.
Can we help users express what they need - before they give up asking?
MH-COPILOT: Assess → Prompt → Learn (RL)
POST
POST w/ Attribute Spans
Contextual Attribute Span Classifier
(CSpan)
Support Attribute Intensity Detection
(Intensity Classifier)
Attribute Level Intensity
TAXONOMY
Taxonomy-based Question Prompt
LM Layer (Dn)
LM Layer (Dn-1)
LM Layer (Dn-2)
Attribute Intensity
VERIFIER MODULE
Reference Model’s
Response Ranking
Reward Computation
DPO
What made you feel <X> ?
Can you elaborate more on <X> ?
What can help you overcome <X> ?
What made you feel anxious?
Can you elaborate more on how you feel?
What can help you overcome your anxiety?
Level 1
Level 2
Level 3
Level 4
Level 5
Structural
Assessor
Empathy
Assessor
Context Evaluator
Suggestive Question Generator
(Language Model)
MH-COPILOT: Assess → Prompt → Learn (RL)
POST
POST w/ Attribute Spans
Contextual Attribute Span Classifier
(CSpan)
Support Attribute Intensity Detection
(Intensity Classifier)
Attribute Level Intensity
TAXONOMY
Taxonomy-based Question Prompt
LM Layer (Dn)
LM Layer (Dn-1)
LM Layer (Dn-2)
Attribute Intensity
VERIFIER MODULE
Reference Model’s
Response Ranking
Reward Computation
DPO
What made you feel <X> ?
Can you elaborate more on <X> ?
What can help you overcome <X> ?
What made you feel anxious?
Can you elaborate more on how you feel?
What can help you overcome your anxiety?
Level 1
Level 2
Level 3
Level 4
Level 5
Structural
Assessor
Empathy
Assessor
Context Evaluator
Suggestive Question Generator
(Language Model)
MH-COPILOT: Assess → Prompt → Learn (RL)
POST
POST w/ Attribute Spans
Contextual Attribute Span Classifier
(CSpan)
Support Attribute Intensity Detection
(Intensity Classifier)
Attribute Level Intensity
TAXONOMY
Taxonomy-based Question Prompt
LM Layer (Dn)
LM Layer (Dn-1)
LM Layer (Dn-2)
Attribute Intensity
VERIFIER MODULE
Reference Model’s
Response Ranking
Reward Computation
DPO
What made you feel <X> ?
Can you elaborate more on <X> ?
What can help you overcome <X> ?
What made you feel anxious?
Can you elaborate more on how you feel?
What can help you overcome your anxiety?
Level 1
Level 2
Level 3
Level 4
Level 5
Structural
Assessor
Empathy
Assessor
Context Evaluator
Suggestive Question Generator
(Language Model)
MH-COPILOT: Assess → Prompt → Learn (RL)
POST
POST w/ Attribute Spans
Contextual Attribute Span Classifier
(CSpan)
Support Attribute Intensity Detection
(Intensity Classifier)
Attribute Level Intensity
TAXONOMY
Taxonomy-based Question Prompt
LM Layer (Dn)
LM Layer (Dn-1)
LM Layer (Dn-2)
Attribute Intensity
VERIFIER MODULE
Reference Model’s
Response Ranking
Reward Computation
DPO
What made you feel <X> ?
Can you elaborate more on <X> ?
What can help you overcome <X> ?
What made you feel anxious?
Can you elaborate more on how you feel?
What can help you overcome your anxiety?
Level 1
Level 2
Level 3
Level 4
Level 5
Structural
Assessor
Empathy
Assessor
Context Evaluator
Suggestive Question Generator
(Language Model)
MH-COPILOT: Assess → Prompt → Learn (RL)
Results
Annotators reported MH-COPILOT’s outputs “occasionally surpass gold standard”
Metric | w/o Verifier | w/ Verifier |
Empathy (D1) | 3.27 | 3.43 |
Relevance (D2) | 1.82 | 2.27 |
Context (D3) | 2.19 | 3.31 |
Fluency (L3) | 3.82 | 4.02 |
Human Eval
Verifier + Taxonomy → Quality Improvement Beyond Numbers
MH-COPILOT transforms generative RL from text optimization → social interaction enhancement.
Generative RL can teach models to ask better questions
Assess and Prompt: A Generative RL Framework for Improving Engagement in Online Mental Health Communities�Bhagesh Gaur, Karan Gupta, Aseem Srivastava, Manish Gupta, Md Shad Akhtar
Scan the QR code to access:
Thank You