1 of 28

CrowdLLM: A Synthetic Crowd Simulator for Crowdsourcing with LLM Workers

Augmented with Lightweight Generative Model

Feng (Ryan) Lin1, Hanming Zheng2, Keyu Tian2, Congjing Zhang1, Li Zeng2, Shuai Huang1

1University of Washington 2City University of Hong Kong

2 of 28

2

Collaborators

Dr. Shuai Huang

Ryan Lin

Congjing Zhang

Keyu Tian

Hanming Zheng

Dr. Li Zeng

3 of 28

Agenda

  • Introduction

  • Methodology

  • Experiments

  • Conclusion

3

4 of 28

Outline

  • Introduction

  • Methodology

  • Experiments

  • Conclusion

4

5 of 28

Crowd-based Decision-making

5

  • A variety of systems builds on the participation of humans

Crowdsourcing

Voting

Recommender System

6 of 28

2

Digital Population

6

Real Population

Decision-making

Can LLMs build a realistic Digital Population?

Real humans are costly, hard to recruit, …

7 of 28

Limitation of LLM-based Population

7

[1] Yang, Joshua C., et al. "LLm Voting: Human choices and ai collective decision-making." Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society. 2024.

[2] Y. Gao, D. Lee, G. Burtch, & S. Fazelpour, “Take caution in using LLMs as human surrogates”, PNAS, 2025.

Heuristic, lack clear analytical formulation

Human

GPT

Llama

Voting

8 of 28

Outline

  • Introduction

  • Methodology

  • Experiments

  • Conclusion

8

9 of 28

2

Our Proposed CrowdLLM

9

LLM

Real Population

Digital Population

Blender

Decision-making

Generative Model

Profile

Individual Belief

Reference

Decision

10 of 28

2

Architecture of CrowdLLM

10

11 of 28

2

Architecture of CrowdLLM

11

12 of 28

2

Architecture of CrowdLLM

12

13 of 28

1. Reference Generation

13

  • Reference decision is a purely LLM-based decision

Task problem

Prompt

Pretrained LLM

Decisions

Response

14 of 28

2. Profile Generator

14

We generate realistic profiles of the digital population by adopting the state-of-the-art generative models.

Easy-to-sample

distribution

Target population

15 of 28

15

Personalized

Bias Distribution

3. Belief Generator

 

Age: 25 years old

Gender: Male

Occupation: Student

Marriage: Single

 

Semi-Implicit VAE

 

 

Encoder

Decoder

 

 

 

Sample

Latent Space

16 of 28

4. Blender

16

  • We can correct the bias through a blender :

LLM

CrowdLLM

 

 

 

 

17 of 28

Training of CrowdLLM

17

Reconstruction Loss

Decision-making loss

Individual Belief

LLM Backbone

 

 

Profiles

Blender

 

 

 

18 of 28

Outline

  • Introduction

  • Methodology

  • Experiments

  • Conclusion

18

19 of 28

Experimental Studies

19

Real-World Case Studies

Simulation Studies

Ablation Study

Data efficiency

Cost effectiveness

Voting

Amazon Product Reviews

Crowdsourcing

Belief Diversity

20 of 28

Case I: Voting

20

State-of-the-art

Our CrowdLLM

21 of 28

Case II: Amazon Product Reviews

21

Beauty (448 products, 10957 users)

Music (629 products, 12396 users)

22 of 28

Case II: Amazon Product Reviews

22

23 of 28

Case II: Amazon Product Reviews

23

Real Human

CrowdLLM

LLMs

24 of 28

Case III: Crowdsourcing

24

Experiments on multiple datasets:

  • Offensiveness
  • QA Difficulty
  • Politeness
  • Mitigating Biases
  • Differences in Fairness

25 of 28

Experimental Studies

25

Real-World Case Studies

Simulation Studies

Ablation Study

Data efficiency

Cost effectiveness

Voting

Amazon Product Reviews

Crowdsourcing

Belief Diversity

26 of 28

Data Efficiency of CrowdLLM

26

CrowdLLM shows great superiority over other LLM-based methods with only 1% training data (human data).

27 of 28

Outline

  • Introduction

  • Methodology

  • Experiments

  • Conclusion

27

28 of 28

Conclusion

  • CrowdLLM creates digital populations through a combination of LLMs and generative models.

  • The digital population generated by CrowdLLM can capture the diversity of real human population.

  • CrowdLLM provides a promising solution to practical issues in crowd-based decision-making involving real human population.

28