1 of 34

MEDLINE search retrieval issues: �A longitudinal query analysis of five vendor platforms

Sean Burns�Associate Professor, College of Communication and Information, University of Kentucky

Tyler Nix�Interim Asst. Director, Research and Informatics�Taubman Health Sciences Library, University of Michigan

2 of 34

Where can I find…

Available on the event site https://guides.lib.umich.edu/TaubmanTalks/MedlinePlatforms

  • Slides (now)
  • Padlet (now)
  • Link to article (now)
  • Search queries file (now)
  • Recording (~1 week)
  • Recording transcript (with recording)
  • Chat and Q&A files (with recording)

2

3 of 34

Today's Talk

What it is:

  • Presentation of research on search retrieval across Medline platforms
  • Hear from the authors!
  • Ask questions!
  • What do the the results mean for information professionals/expert searchers?

3

4 of 34

Today's Talk

What it isn't:

  • "How to search Medline" tutorial
  • Comparison of Medline platform tools and features
  • "What's wrong with my search?"

4

5 of 34

"What it isn't" resources

  • "How to search" tutorial(s) / Comparison of Medline platform tools and features
    • Lots of sources (platform documentation, MLA courses, workshops, Carrie Price's Youtube)
  • "What's wrong with my search?"

5

6 of 34

Questions!

We'll pause at 3 points during today's session to answer questions:

  • Methods
  • Results
  • Discussion

Please put your questions for the presenters into the "Q&A" option, NOT the chat!

For questions related to specific platforms that you'd like to crowdsource, please use the Padlet on the event page

6

7 of 34

MEDLINE search retrieval issues: �A longitudinal query analysis of five vendor platforms

Sean Burns�Associate Professor, College of Communication and Information, University of Kentucky

Tyler Nix�Interim Asst. Director, Research and Informatics�Taubman Health Sciences Library, University of Michigan

8 of 34

Research question

When controlling for search query syntax, are search results generally consistent across MEDLINE platforms?

8

9 of 34

Motivation

  • To investigate how search result counts vary when searching MEDLINE records across five platforms and controlling for search query syntax.
  • To document longitudinal consistencies and/or changes in these results

9

10 of 34

Methods

  • Developed 29 sets of search queries
  • A "set" is a group of equivalent queries for each of 5 MEDLINE platforms (PubMed, ProQuest, EBSCOhost, Web of Science, and Ovid)
  • 145 total queries, run monthly Oct 2018-�Sept 2019
  • Searches are logically and semantically equivalent (and platform-specific)

10

11 of 34

Methods

  • Search result counts were compared to PubMed’s MEDLINE as the baseline.
  • Results were compared using modified �z-scores

11

12 of 34

Acronyms

  • PMML: PubMed MEDLINE
  • PQML: ProQuest MEDLINE
  • EHML: EBSCOhost MEDLINE
  • WSML: Web of Science MEDLINE
  • OML: Ovid MEDLINE

12

13 of 34

Query example: Case 02�Single Mesh Term, No Explode

13

PMML

"neoplasms"[MH:NOEXP]

PQML

MESH.EXACT("neoplasms")

EHML

MH("neoplasms")

WSML

MH=("neoplasms")

OML

neoplasms.SH

14 of 34

Query example: Case 11�Single Mesh term, ≥2 branches, exploded, keyword, and date: 1950-2015

14

PMML

"dementia"[MH] AND "immune"[ALL] AND 1950:2015[DP]

PQML

MESH.EXACT.EXPLODE( "dementia") AND NOFT("immune") AND YR(1950-2015)

EHML

MH("dementia+") AND TX("immune") AND YR 1950-2015

WSML

MH:exp=("dementia") AND TS=("immune") AND PY=(1950-2015)

OML

1. EXP dementia/ AND immune.AF 2. limit 1 to YR=1950-2015

15 of 34

Methods Questions

with Mark MacEachern

15

16 of 34

Results

  • Results indicate there is substantial variance in search result counts across 5 MEDLINE platforms.

16

17 of 34

Results

17

18 of 34

Results, Fig. 3

Viewing results at macro (group) scale shows broad differences between the platforms…� �but does not capture fluctuation within each platform.

18

19 of 34

Fig 3. Macro view of cases #03–#11.

Burns CS, Nix T, Shapiro RM II, Huber JT (2021) MEDLINE search retrieval issues: A longitudinal query analysis of five vendor platforms. PLOS ONE 16(5): e0234221. https://doi.org/10.1371/journal.pone.0234221

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234221

20 of 34

Results, Fig. 4-6

Viewing results at micro (individual) scale shows significant fluctuation within platforms throughout the year.

20

21 of 34

Fig 4. Micro view of cases #03–#11.

Burns CS, Nix T, Shapiro RM II, Huber JT (2021) MEDLINE search retrieval issues: A longitudinal query analysis of five vendor platforms. PLOS ONE 16(5): e0234221. https://doi.org/10.1371/journal.pone.0234221

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234221

22 of 34

Fig 5. Micro view of cases #03–#11.

Burns CS, Nix T, Shapiro RM II, Huber JT (2021) MEDLINE search retrieval issues: A longitudinal query analysis of five vendor platforms. PLOS ONE 16(5): e0234221. https://doi.org/10.1371/journal.pone.0234221

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234221

23 of 34

Fig 6. Micro view of cases #03–#11.

Burns CS, Nix T, Shapiro RM II, Huber JT (2021) MEDLINE search retrieval issues: A longitudinal query analysis of five vendor platforms. PLOS ONE 16(5): e0234221. https://doi.org/10.1371/journal.pone.0234221

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234221

24 of 34

Results, Fig. 7-8

Variance and deviation in search result counts compared to PubMed, using modified z-score

24

25 of 34

Fig 7. Deviations per case and platform.

Burns CS, Nix T, Shapiro RM II, Huber JT (2021) MEDLINE search retrieval issues: A longitudinal query analysis of five vendor platforms. PLOS ONE 16(5): e0234221. https://doi.org/10.1371/journal.pone.0234221

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234221

26 of 34

Fig 8. Deviations per case and platform with outliers.

Burns CS, Nix T, Shapiro RM II, Huber JT (2021) MEDLINE search retrieval issues: A longitudinal query analysis of five vendor platforms. PLOS ONE 16(5): e0234221. https://doi.org/10.1371/journal.pone.0234221

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234221

27 of 34

Results, Fig. 9

MEDLINE update frequency across platforms

27

28 of 34

Fig 9. Search result count differences for COVID-19 related searches across PubMed based platforms.

Burns CS, Nix T, Shapiro RM II, Huber JT (2021) MEDLINE search retrieval issues: A longitudinal query analysis of five vendor platforms. PLOS ONE 16(5): e0234221. https://doi.org/10.1371/journal.pone.0234221

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234221

29 of 34

Results Questions

with Mark MacEachern

29

30 of 34

Discussion

  • Differences in metadata functionality across platforms (e.g. date and MeSH fields)
  • Online-first versus issue-based publications impact bibliographic control
  • Variable timing of MEDLINE update across platforms
  • Reproducibility appears to improve when combining multiple fields in query

30

31 of 34

Open Q&A Time

31

32 of 34

Thanks

Sean Burns – sean.burns@uky.edu�Associate Professor, College of Communication and Information, University of Kentucky

Tyler Nix - tnix@umich.edu�Interim Asst. Director, Research and Informatics�Taubman Health Sciences Library, University of Michigan

32

33 of 34

Wrap up

Available on the event site https://guides.lib.umich.edu/TaubmanTalks/MedlinePlatforms

  • Slides (now)
  • Padlet (now)
  • Link to article (now)
  • Search queries file (now)
  • Recording (~1 week) - sign up for notification!
  • Recording transcript (with recording)
  • Chat and Q&A files (with recording)

33

34 of 34

Coming soon to Taubman Library Talks…

March/April-ish:

34