CSCI-SHU 205: Topics in Computer Science
Human-AI Alignment
Hua Shen
Course Website: https://hua-shen.org/src/course_bialign.html
2025-09-01
Lecture 1
Welcome to BiAlign course👏 !
Hello from Your Instructor!
Hua Shen
Outline
By joining today’s class –
· decide whether this course fits you well;
· learn an overview and motivation of HAI-Alignment;
· share what you want to get from this course.
Outline
Share Your Thoughts 🙌
What’s your understanding of “Human-AI Alignment”
AI systems are deeply integrated into our lives …
Autonomous Cars
Writing Assistant
Image Generation
AI ethics are NOT fully aligned with human values…
Crashes with Autonomous Cars
Writing Assistant
Generates Misinformation
Stereotypical Biases
In Image Generation
Challenges in state-of-the-art research
Human-centered and AI-centered research & skills are largely divided!
“Find” AI Problems for Humans
“Address” Problems in AI
Human-AI Alignment!
What is Human-AI Alignment?
What is Human-AI Alignment?
Towards Bidirectional Human-AI Alignment
What is Human-AI Alignment?
Towards Bidirectional Human-AI Alignment
Objective of Human-AI Alignment
Maximizing Capabilities AND Minimizing Risks in Human–AI Co-Evolution
Scope of Human-AI Alignment
Human-AI Alignment involves:
Who is the human in “Human-AI Alignment”?
Efforts towards Human–AI Alignment so far…
Efforts towards Human–AI Alignment so far…
2025 Tutorial on Human-AI Alignment
@Dec,2025
Hua Shen
Instructors:
Mitchell Gordon
Adam Tauman Kalai
Panelists:
Yoshua Bengio
Dawn Song
Monojit Choudhury
Hannah Kirk
Eric Gilbert
Outline
State-of-the-art Research
Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C., Mishkin, P., ... & Lowe, R. Training language models to follow instructions with human feedback. NeurIPS 2022.
Human feedback is primarily rating and ranking…
State-of-the-art Research
"Constitutional ai: Harmlessness from ai feedback." arXiv:2212.08073.
"Red teaming language models with language models." arXiv:2202.03286.
Responsible AI work commonly involves minimal human participation
Missing Diverse Human Participation…
Ion Stoica, Keynote: Reliability: An AI Challenge. Agentic AI Summit 2025. https://www.youtube.com/watch?v=c39fJ2WAj6A (1:27:07)
— Ion Stoica
Co-Founder, Databricks & Anyscale
Professor of UC Berkeley
@Agentic AI Summit 2025
Without humans in the loop of AI development & deployment…
How?
This BiAlign course equips you with fundamental knowledge, technical skills, and practical project experience to advance human-AI alignment in future research.
Outline
Course Overview
I. Foundations
II. Methods
III. Practice
III. Practice
I. Foundations (Week 1-2)
Course Overview
I. Foundations
II. Methods
III. Practice
III. Practice
II. Methods (Week 3-9)
Course Overview
I. Foundations
II. Methods
III. Practice
III. Practice
III. Practice (Week 10-15)
Course Goals:
Mapping Goals to Course Activities
Course Goals | Course Activities |
1. Gain Foundational Knowledge | Lectures (By the instructor) |
2. Familiarize Cutting-edge Research | Paper Presentations (Lead by You👏) |
3. Harness Hands-on Skills | Two Assignments
|
4. Practical Projects | One Final Project |
Check details at: https://hua-shen.org/src/course_bialign.html
Course Activities
Paper Presentation: 20%
Project: 50%
Assignments: 20%
Participation: 10%
Clarification on Several Course Activities
Paper Presentation
Paper Title
Name(s)
Clarification on Several Course Activities
Course Assignments
Assignment 1 (10%):
Assignment 2 (10%):
LLM Post-Training Alignment (AI)
Human-LLM Interactive Alignment (HCI)
Clarification on Several Course Activities
Projects
Course Policy
Computing Resource
— Generative AI Tools and Services in NYU Shanghai
Service | How to Access | Collect data? |
Commercial |
| Yes (personal use only) |
Institutional Licence @NYU IT |
| No (NYU wide license) |
Private By Request |
| No |
Outline
Love to know more about you!
What’s your experience + expectation on this course
Share Your Experience & Insights
Your feedback & discussion is always welcome!
Summary
What is Human-AI Alignment? (20 min)
Course Overview and Logistics (15 min)
Why Human-AI Alignment? What if Not? (20min)
Discussion: Know more about You! (20min)
Next class –
Overview: Evolving Challenges of AI Alignment and Human's Role
Assignment #0
Due: 11:59 PM, Sep 7, 2025 (Sun).
(China Standard Time)
Reading Materials
Thank You 💛
See you on Wednesday!