Representation Learning for Dialog Models
Recent work
gmanish@microsoft.com
2
Agenda
gmanish@microsoft.com
3
What is dialog modeling?
gmanish@microsoft.com
4
S1: hi , how are you doing ? i am getting ready to do some cheetah chasing to stay in shape . |
S2: you must be very fast . hunting is one of my favorite hobbies . |
S1: i am ! for my hobby i like to do canning or some whittling . |
S2: i also remodel homes when i am not out bow hunting. |
Context
Response
Persona for S2: i like to remodel homes. i like to go hunting. i like to shoot a bow. my favorite holiday is halloween.
Context
Response
Reddit Dataset Example
PersonaChat Dataset Example
What is dialog modeling?
Karthik Gopalakrishnan, Behnam Hedayatnia, Qinlang Chen, Anna Gottardi, Sanjeev Kwatra, Anu Venkatesh, Raefer Gabriel, and Dilek Hakkani-Tür. 2019. Topical-chat: Towards knowledge-grounded open-domain conversations. Proc. Interspeech 2019, pages 1891–1895.
gmanish@microsoft.com
5
Context
Response
TopicalChat Dataset Example
Knowledge section for topic "Fish"
A fish is an aquatic, craniate, gill-bearing animal that lacks limbs with digits. Included in this definition are the living hagfish, lampreys, and cartilaginous and bony fish as well as various extinct related groups. Approximately 95% of living fish species are ray-finned fish, belonging to the class Actinopterygii, with around 99% of those being teleosts.
S1: I think fish are so cool there is actually a breed of jellyfish that is immortal. |
S2: i had rememered hearing about that before. Immortatlity is wasted on a jellyfish haha. did you know a seahorse is the only fish that has an actual neck? |
S1: That is so funny I guess I never considered a seahorse a fish. The black swallower fish sounds a lot like a snake because it can eat pray that is so large. |
S2: i guess they live up to their name then! |
S1: It seems they do. I also didn't know that there was a difference with how freshwater and saltwater fish drink. |
What is dialog modeling?
Agrawal, Harsh, Mishra, Aditya; Gupta, Manish; Mausam.: Multimodal persona based generation of comic dialogs. In ACL. pp. 14150–14164 (2023)
gmanish@microsoft.com
6
Context
Response
ComSet Dataset Example
Why care about dialog modeling?
https://www.ometrics.com/blog/list-of-fun-chatbots/ https://www.linkedin.com/pulse/global-chatbot-market-size-projected-surpass-around-usd-sundstrom/
gmanish@microsoft.com
7
What are popular approaches for dialog modeling?
gmanish@microsoft.com
8
What are popular metrics for evaluating dialog systems?
Agrawal, Harsh, Aditya Mishra, and Manish Gupta. "Multimodal Persona Based Generation of Comic Dialogs." In ACL, pp. 14150-14164. 2023.
gmanish@microsoft.com
9
Agenda
gmanish@microsoft.com
10
Why learn a new representation for dialog systems?
gmanish@microsoft.com
11
Word-level reasoning
Discourse-level reasoning
How do we optimize with dialog structure awareness?
gmanish@microsoft.com
12
How is InfoNCE-S computed?
13
gmanish@microsoft.com
Experimental Setup
14
Downstream task details. Adv.: Adversarial, Neg.: Negative
gmanish@microsoft.com
Probing (Frozen LM) Results
gmanish@microsoft.com
15
Std LMs
Dialog specific LMs
Finetuning Results: LM weights are also updated
gmanish@microsoft.com
16
Std LMs
Dialog specific LMs
Qualitative Analysis
17
Context:
User 1: Are you busy tomorrow morning ?
Response:
By User 2 (Human): I'm free . What's up ?
By DMI Model: No , not this weekend . I have too much work to do .
Context:
User 1: Hi , Dan . What's new ?
User 2: Where have you been ? I've been trying to get in touch with you for two days .
User 1: I went to Salt Lake City on business . What's up ?
Response
By User 2 (Human): I got fired .
By DMI Model: Not much . I had to pay an unexpected bill , so I needed the money back .
Example Predictions on Empathetic-Intent Task
gmanish@microsoft.com
Conclusions
gmanish@microsoft.com
18
Agenda
gmanish@microsoft.com
19
What is the comic dialog generation problem?
Agrawal, Harsh, Aditya Mishra, and Manish Gupta. "Multimodal Persona Based Generation of Comic Dialogs." In ACL, pp. 14150-14164. 2023.
20
gmanish@microsoft.com
MPDialog Model Architecture
Agrawal, Harsh, Aditya Mishra, and Manish Gupta. "Multimodal Persona Based Generation of Comic Dialogs." In ACL, pp. 14150-14164. 2023.
21
gmanish@microsoft.com
How does MPDialog perform?
Agrawal, Harsh, Aditya Mishra, and Manish Gupta. "Multimodal Persona Based Generation of Comic Dialogs." In ACL, pp. 14150-14164. 2023.
22
gmanish@microsoft.com
Comic-wise Quantitative Analysis
Agrawal, Harsh, Aditya Mishra, and Manish Gupta. "Multimodal Persona Based Generation of Comic Dialogs." In ACL, pp. 14150-14164. 2023.
23
BLEURT
MaUde
Perplexity
gmanish@microsoft.com
Qualitative Analysis
Agrawal, Harsh, Aditya Mishra, and Manish Gupta. "Multimodal Persona Based Generation of Comic Dialogs." In ACL, pp. 14150-14164. 2023.
24
Human Evaluation Results
gmanish@microsoft.com
Conclusions
25
Agrawal, Harsh, Aditya Mishra, and Manish Gupta. "Multimodal Persona Based Generation of Comic Dialogs." In ACL, pp. 14150-14164. 2023.
gmanish@microsoft.com
Agenda
gmanish@microsoft.com
26
Using LLMs as dialog models
gmanish@microsoft.com
27
Optimizing the prompts
28
gmanish@microsoft.com
Analysis of Prompt Lengths
29
gmanish@microsoft.com
Usable information-density (UID)
30
gmanish@microsoft.com
31
gmanish@microsoft.com
Conclusions
32
Agenda
gmanish@microsoft.com
33
Summary
gmanish@microsoft.com
34
Research Opportunities
gmanish@microsoft.com
35
Thanks!
gmanish@microsoft.com
36