Introduction to Trust & Safety
Camille François (Columbia University / Niantic Labs)
Mariana Olaizola Rosenblat (NYU Stern Center for Business and Human Rights)
Matthew Soeth (All Tech is Human / Tremau /Former TikTok)
Learning Objectives
Today we will:
A note on difficult content, and on class etiquette
Purpose and History of T&S
What drives T&S
Copyright: Trust and Safety Foundation
High-level taxonomy of relevant abuses
Violent & Criminal Behavior
Regulated Goods & Services
Offensive & Objectionable Content
High-level taxonomy of relevant abuses (cont.)
User Safety
Scaled Abuse
Deceptive & Fraudulent Behavior
Community-Specific Rules
Key point: relevant abuse types depend on your audiences, feature, product, etc.
History and evolution of T&S field
Approaches and Best Practices in T&S
Reactive vs. proactive models
Sample company response following a user report of T&S violation
Managing trade-offs
Where T&S fits in an organization
Gaining senior management support
Building a T&S team
Sample functions
Policy
Operations
Safety by Design
Engineering
Threat Detection, Intelligence
Child Safety
Product (tooling)
Enforcement & Investigations
Discussion: contrasting perspectives on building T&S teams
Technologies used to implement T&S
Overview of automated technologies
Shortcomings
Circumvention techniques
Biases in training data
Lack of transparency in how databases are populated
NLP classifiers struggle with nuance and can be under- or over-inclusive in their coverage
The Future of T&S
Advancements in large-language models (LLMs) and generative AI technology:
T&S in the “metaverse”:
Copyright: Sanal Savunma, https://www.sanalsavunma.com/what-is-metaverse/
Additional Reading & Materials
Books:�The 26 Words that Created the Internet
Speech Police�Prosocial��LinkedIn Learning�Becoming a Trust & Safety Leader�
Articles, Papers, & Standards:�The Santa Clara Principles
Making the Business Case for Trust and Safety�Oasis Consortium - trust & safety standards
US/EU: Joint Statement on Protecting Human Rights Defenders Online
Digital Thriving and Prosocial Design in Gaming