Smarter Searches, Deeper Discovery: Integrating Natural Language Search in Primo
Matthew Hartman, Timothy Kohn
October 17th, 2025
Who We Are
Matt Hartman
Senior Lead, Delivery Services & Library Applications
Tim Kohn
Resource Sharing Manager
2
Stony Brook Libraries, AI
3
Stony Brook Libraries, AI Team
4
Content Services and Management
Access and Users Services
Digital Services
Academic Engagement
Stony Brook Libraries, AI Team
5
6
7
Primo Analytics
8
7.5% of searches no results
Around half of sessions have more than one search
Facet usage is startlingly low
Growing number of natural language-style questions
Does the World Cup promote world unity or increase nationalistic tensions?
How does expenditure affect the percentage of votes for a candidate?
During the American Revolutionary War did local militia service records show that families with more property were more likely to enlist?
How did the penalties of witchcraft get worse over time?
9
10
11
12
13
14
15
16
17
(From ENUG 2024) What do users need from our systems?
18
Quick, reliable discovery of resources
Quick, reliable delivery of resources
Support for information literacy
Illustration of and support for skills that can be used in other platforms
Compelling incentives to use library tools to discover & access materials purchased/licensed for their research
Low stress, high reward, navigation through the research process
Webb, H. & Babb, N. (2024, October 18) Artificial Intelligence Across the Discovery Landscape. [Conference session]. Ex Libris Northeast User Group (ENUG) 2024, , New Brunswick, NJ, United States.
19
Varnum, K., Kessler, R., Zhu, J., Hazen, T., Patham, B., & Holloway, J. (2025). Generative Artificial Intelligence and Web-Scale Discovery: A report from the NISO Open Discovery Initiative Standing Committee
20
Varnum, K., Kessler, R., Zhu, J., Hazen, T., Patham, B., & Holloway, J. (2025). Generative Artificial Intelligence and Web-Scale Discovery: A report from the NISO Open Discovery Initiative Standing Committee
21
Varnum, K., Kessler, R., Zhu, J., Hazen, T., Patham, B., & Holloway, J. (2025). Generative Artificial Intelligence and Web-Scale Discovery: A report from the NISO Open Discovery Initiative Standing Committee
Painting the picture
Focus more on…
Focus less on…
22
We can do that!
23
Asks Question
Rewrites in Boolean string
Top 30 Search Results
Top 5 for user query
Knowledge
Base
Summarization
4o mini
4o mini
24
Output from Primo RA = 925,163 results, uses top 5 usable CDI results
(factors contributing to global warming) OR (causes of global warming) OR (global warming contributors) OR ((global warming) OR (climate change causes)) OR ((human activities) AND (global warming factors)) OR ("factors contributing to global warming") OR (greenhouse gases climate change global warming causes) OR (((anthropogenic) OR (human-induced) AND (global warming) OR (climate change) AND factors)) OR (carbon emissions deforestation industrialization global warming) OR (climate drivers)
OR (What factors contribute to global warming?)
25
26
27
28
29
30
31
library.edu/discovery/search?query=any,contains, [your query here] & [your default settings here]
Issues
32
33
&facet=tlevel,include,peer_reviewed
&mfacet=tlevel,include,online_resources,1
&mfacet=tlevel,include,available_p,1
&mfacet=searchcreationdate,include,2020%7C,%7C2025,1
34
Query
User natural language query moved to AI agent that parses the information.
Boolean Creation
AI agent uses training data to create a boolean string that best represents the topic of interest.
Apply Facets
Multiple AI agents work concurrently to apply relevant facets based on the input, including creation date, material type, peer-reviewed, local holdings, and online only facets.
Load Results
All of this information is brought together as a search results page using the Everything Search scope in our catalog.
Query
Topic
Date
Facets
35
Query
User natural language query moved to AI agent that parses the information.
Boolean Creation
AI agent uses training data to create a boolean string that best represents the topic of interest.
Apply Facets
Multiple AI agents work concurrently to apply relevant facets based on the input, including creation date, material type, peer-reviewed, local holdings, and online only facets.
Load Results
All of this information is brought together as a search results page using the Everything Search scope in our catalog.
Topic
(factor* OR cause* OR contributor* OR driver*) AND ("global warming" OR "climate change" OR "greenhouse effect")
36
Query
User natural language query moved to AI agent that parses the information.
Boolean Creation
AI agent uses training data to create a boolean string that best represents the topic of interest.
Apply Facets
Multiple AI agents work concurrently to apply relevant facets based on the input, including creation date, material type, peer-reviewed, local holdings, and online only facets.
Load Results
All of this information is brought together as a search results page using the Everything Search scope in our catalog.
Date
2020-2025
Recent
Format
Book
Local
37
Query
User natural language query moved to AI agent that parses the information.
Boolean Creation
AI agent uses training data to create a boolean string that best represents the topic of interest.
Apply Facets
Multiple AI agents work concurrently to apply relevant facets based on the input, including creation date, material type, peer-reviewed, local holdings, and online only facets.
Load Results
All of this information is brought together as a search results page using the Everything Search scope in our catalog.
library.edu /
search = BOOLEAN� & date = RECENT� & format = LOCAL BOOKS
Benefits of Agentic Model
38
User Interface Considerations
39
40
User Interface Considerations
41
User Interface Considerations
Homepage Widget
42
43
Codebase
44
Primo
OpenAI
Web Server
Timing
45
Element | Runtime (s) |
Non-API code runtime | .05 |
API calls | 2-6 |
WebServer Response | 1-2 |
Total | 3-8 |
Top 10% (p90) Latency - 90% of all queries load in under 5.5s, indicating predictable system speed for nearly all users.
OpenAI vs AI Server
OpenAI version (GPT 4.1-mini)
AI Server (llama 3.1)
46
Require Sign In?
Capture more data at the cost of limiting usage.
47
Data
48
We are retaining no personal information. No gathered data is used in training.
API Costs
49
.04 cents per search
4.5 cents per 100 uses
~ a dollar a month
Model | Input | Output |
4.1-mini | $0.40 | $1.60 |
4.1-mini-sbubool | $0.80 | $3.20 |
Model | Input | Output |
4.1-mini | $0.41 | $0.68 |
4.1-mini-sbubool | $0.66 | $1.21 |
Listed Pricing
Actuals
Usage (10/1 - 10/14)
50
1245 Advanced
421 SEARCH AI
Usage (10/1 - 10/14)
51
Informational
topic-based or reference-seeking queries like:
“AI bias and facial recognition” or
“Hamlet by Shakespeare”
Exploratory
multiconcept or relational searches like:
“give me peer reviewed articles about the future of creative writing and its relationship with AI”
Creative
open-ended or speculative queries:
“How does a college student's mental health affect their grades. Has it changed since the pandemic?”
Revisiting the goal
Focus more on…
Focus less on…
52
Limitations and User Feedback
53
What did we learn
Future Directions
54
search.library.stonybrook.edu
https://github.com/Stony-Brook-Libraries-Digital-Services
Learn more about this project and more!
THANK YOU!�ANY QUESTIONS?
We’re looking forward to hearing from you!
matthew.hartman@stonybrook.edu
timothy.kohn@stonybrook.edu