Chat-Ghosting: Methods for Auto-Completion in Dialog Systems
Anubhav Mandal, Sandeep Mishra, Bishal Santra, Tushar Abhishek, Pawan Goyal, Manish Gupta
Main Conference Track
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Motivation: Why Chat-Ghosting ?
The Rise of Conversational AI�
Problem: Chat-ghosting has received little attention from NLP/ML community
Microsoft Copilot
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
What is Chat-Ghosting ?
Task Definition�
Input
Output
Goal: Predict completion such that [prefix; completion] is a valid response
Key Difference from Query Auto Completion (QAC):
Microsoft Copilot
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Research Gaps & Contributions
�Gaps:
Our Contributions:
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Benchmark Datasets
1. Daily Dialog (DD) - Open-Domain Human Conversations
2. DSTC7-Ubuntu (DU) - Technical Domain Conversations
�
3. Open Assistant (OASST) - Human-Bot Interactions
4. ShareGPT (SGPT) - Real-World LLM Conversations
S.No | Dataset | Type | Domain | Avg Words | Avg Chars | Train Size | Test Size |
1 | DD | Human-Human | Open | 12.37 | 57 | 69,216 | 7,986 |
2 | DU | Human-Human | Tech | 13.48 | 74 | 549,002 | 5,588 |
3 | OASST | Human-Bot | Mixed | 20.36 | 115 | 19,421 | 981 |
4 | SGPT | Human-Bot | Mixed | 53.27 | 341 | 328,078 | 1,088 |
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Methods Evaluated
We evaluated the following methods with/without context (previous utterances in conversation):
Standard QAC methods:
Neural Language Models
Prompt Engineering
Context Integration: Prepending for neural models, reranking for tries/n-grams
�
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Evaluation Metrics
Quality Metrics:
User Experience Metrics
Additional Evaluations (For Completeness)
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Key Results on DD and DU dataset
Chat-Ghosting task for the Daily Dialog (DD) dataset. PT=Pretrained, FT=Finetuned
Chat-Ghosting task for the DSTC7-Ubuntu (DU) dataset. PT=Pretrained, FT=Finetuned
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Key Results on OASST and SGPT dataset
Chat-Ghosting task for the Open Assistant (OASST) dataset. PT=Pretrained, FT=Finetuned
Chat-Ghosting task for the Share GPT (SGPT) dataset. PT=Pretrained, FT=Finetuned
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Key Results on DD and DU dataset with Context
Contextual Chat-Ghosting task for the Open Assistant (OASST) dataset. PT=Pretrained, FT=Finetuned
Contextual Chat-Ghosting task for the DSTC7-Ubuntu (DU) dataset. PT=Pretrained, FT=Finetuned
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Key Results on OASST and SGPT dataset with Context
Contextual Chat-Ghosting task for the Daily Dialog (DD) dataset. PT=Pretrained, FT=Finetuned
Contextual Chat-Ghosting task for the Share GPT (SGPT) dataset. PT=Pretrained, FT=Finetuned
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Key Takeaways
On Human-Human Interactions datasets: DD & DU
On Human-Bot Interactions datasets: OASST & SGPT
�Practical recommendation: Utilize a hybrid system. Use MPC for seen prefixes (memory-based precision), and T5 or QB for unseen prefixes depending on latency budget.
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Entropy-Based Early Stopping for Neural Language Models
Problem: Long predictions less likely to be accepted as whole by the end user.
Solution:
Benefit: +75% P-Prec improvement, +120% TES improvement with shorter predictions.
�
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Trade-offs: Accuracy vs. Inference Latency
Inference Latency for each chat-ghosting method (in milliseconds) at max Trigger Rate. PT=Pretrained, FT=Finetuned. cDD and cDU are Contextual datasets.
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Non-Contextual Ghosting Examples
Examples of suffixes predicted by various non-contextual models for different prefixes.
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Contextual Ghosting Examples
Examples of suffixes predicted by various contextual models for different prefixes.
Chat-Ghosting: Methods for Auto-Completion in Dialog Systems. EACL 2026.
Conclusion
Thank You
Anubhav Mandal, Sandeep Mishra, Bishal Santra, Tushar Abhishek, Pawan Goyal, Manish Gupta