AI for Data Analysis
Mary Ton
DH Librarian
Cadence Cordell
MLIS Maven
Jess Hagman
Social Sciences Research Librarian
Today’s slides
AI is already part of our research process
Text-to-speech
Image recognition
Goals for today
General Considerations
Key Limitations
“An error” as imagined by Gemini
Check out “A Gentle Introduction to ChatGPT” for more examples.
Modeling ≠ meaning
Plots
Villain
Plans
Diabolical
Morrow
corn
1876
rotation
1843
crop
experimental
The Morrow Plots
1843?
1876?
Limited Knowledge and Bias
*Note: ChatGPT assumed that the librarian was an unmarried woman in multiple iterations of the song, even though the prompt was gender neutral.
Write a sea shanty about a librarian and a cat
Oh, Miss Lily and Whiskers, a tale to be told,
In the library they wandered, their stories unfold.
With books in their hands and a twinkle in their eyes,
They sang songs of adventure 'neath the endless skies.
There’s bias in image-based AI too…
A librarian and a cat
A librarian
Even the cat is white…
Data Privacy
Data Cleaning
Tools
ChatGPT
Copilot Enterprise
OpenRefine
Tasks
Sample Prompts
Generating Code
Tools
Gemini
Copilot Enterprise on Gitbub
Code and copyright*
“In the office’s view, it is well-established that copyright can protect only material that is the product of human creativity. Most fundamentally, the term ‘author,’ which is used in both the Constitution and the Copyright Act, excludes non-humans.”
Generally cannot claim copyright on text, images, and code that you generated with AI
May claim copyright on sufficiently creative prompts
May claim copyright on modifications that you make to generated text
*I am not a lawyer.
Generally cannot claim copyright on procedures
Sample prompts
Analyzing Your Data
AI in Major Qualitative Data Analysis Programs
MAXQDA
ATLAS.ti
NVivo
Other options
What can AI do for your qualitative analysis?
Interpretive qualitative data analysis relies on the researcher’s experiences and previous knowledge, as well as in-depth understanding of the data to develop rich and relevant analysis.
What is the AI doing? Description? Summarizing or paraphrasing? Coding?
Can the technology do that without the project-specific knowledge that you and your research team bring to the project.
What type of qualitative research are you doing?
Document Your Process
Ingredients of a good
AI disclosure statement
Document your decisions
Most qualitative research methods expect the researcher to be transparent and reflexive about their analysis process, making the documentation of how you have used AI even more important.
How does your own developing interpretation of the data rely on analysis you can get from an AI based tool.
Conclusions
Questions to consider
AI
Human
Automation Continuum
Questions to consider
AI
Human
Automation Continuum
Questions to consider
AI
Human
Automation Continuum
Questions to consider
AI
Human
Automation Continuum
Resources
Library Resources
Level up your skills through the Savvy Researcher workshop series:
Check out previous workshops on AI, copyright, and text mining on the DH@Illinois Media Space Channel:
https://go.illinois.edu/dhchannel
NEW! Generative AI LibGuide:
Questions?
Jess Hagman (jhagman@illinois.edu)
Mary Ton (maryton@illinois.edu)
Sara Benson (srbenson@illinois.edu)
Additional Resources