Newspaper day 20/03/2023
Program
13u30-13h45: Introductions
13h45-13h55: Brecht Deseure, presentation of KBR newspaper department and CAMille project
13h55-14h05: Julie Birkholz, presentation of Digital Research Lab
14h05-14h25: Niklas Stenzel (ZOG)
14h25-14h45: Elias Degruyter (UGent)
14h45-15h05: coffee break
15u05-15u25: Vincent Ducatteeuw (UGent)
15h25-15u45: Leon Castelein (UGent)
15u45-16u05: Tess Dejaeghere (UGent)
16h05-16u25: Concluding discussion
3
WiFi network: KBR_Readers
Access code: KBR_2019
WiFi network: KBR_Events
Access code: Ghs442uF
Feedback
Green post-its
What can KBR do to improve access to the newspapers collections for researchers?
Yellow pos-its
Ideas for future newspapers days.
Establish a network/mailing list?
Yearly event with open cfp?
Focus on newspapers or also include periodicals?
…
4
KBR Newspapers department
•Largest and most diverse historical newspapers collection in Belgium
•From the 17th century until today
•Legal deposit: 1 physical issue of each Belgian newspaper deposited daily. Digital legal deposit underway
•Ca. 2000 Belgian titles of the ‘grande presse’
•Ca. 1000 Belgian titles of the ‘petite presse’
•Ca. 500 foreign newspapers from 60 countries
•Colonial newspapers
5
Special collections
•Fonds Gaston Mertens: over 60.000 specimens from ca. 1.650 Belgische localities (17th century - 1948)
•Fonds Philippe Vandermaelen: 2.405 specimens of foreign newspapers of the 19th century (Europa and USA)
•Fonds Jacques Delmelle: complete journalistic oeuvre of one Belgian journalist, comprising over 10.000 articles/publications
6
Access
•One of KBR’s most consulted and vulnerable collections
•Different supports: paper, microfilm (600 titles), digital
•127 digitized and OCR-ed titles (1814-1950) = ca. 4 million pages
•1950-1989 (= ca. 3,5 million pages) will be digitized within the following three years
•Until 1918: freely accessible, also remotely
•After 1918: only accessible on KBR computers (or remotely with special permission)
•Simple search functions
7
8
9
10
CAMille FED-tWIN (KBR-ULB)
Goals
Components
11
The CAMille platform
12
Database of Belgian journalists
13
Research projects
14
15
16
Digital Research Lab�
Julie M. Birkholz, Lead of KBR’s Digital Research Lab & Assistant Professor Digital Humanities,
Ghent University
Goals
17
in practice
18
DATA-KBR-BE
Facilitating data-level access to KBR’s digitised and born-digital collections for digital humanities research
Digitised Historical Newspapers as Data
Interdisciplinary Research Scenarios
From Collections to Corpora
Which newspapers are digitised?
Importance of historical context
Zentrum für Ostbelgische Geschichte
German Language Newspapers in Belgium
Discursive Identity Construction in East Belgium
�
A Linguistic and Discourse Historical Analysis of the Patterns of Language and Communication in East Belgian Mass Media
25
�
Agenda
26
�
Research Interest
27
�
Link to digital newspapers
28
Methodology
Niklas Stenzel
Zentrum für Ostbelgische Geschichte
Kaperberg 2-4
B-4700 Eupen
www.geschichte.be
�
Methodology
30
Methodology
AntConc
Freeware corpus analysis toolkit for concordancing and text analysis
Niklas Stenzel
Zentrum für Ostbelgische Geschichte
Kaperberg 2-4
B-4700 Eupen
www.geschichte.be
Discussion
Niklas Stenzel
Zentrum für Ostbelgische Geschichte
Kaperberg 2-4
B-4700 Eupen
www.geschichte.be
Presentation Elias Degruyter
33
Presentation Vincent Ducatteeuw
34
Presentation Leon Castelein
35
Ownership of public space
36
Data collection & newspapers
37
38
Spatial analysis
Katrina Navickas, Protest and the politics of space and place, 1789-1848, 180
39
40
Beyond Babylonian Confusion �a case-study based approach for multilingual NLP on historical literature
Author: Tess Dejaeghere
Supervisors: Julie Birkholz, Els Lefever, Christophe Verbruggen
This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 101004984
42
Personalia
Background
Ongoing Research
Natural Language Processing vs. (Digital) Humanities
(Digital) Humanities: analyze text(s) according to a hermeneutic model to make inferences about past and present.
Natural Language Processing: subfield of artificial intelligence and computational linguistics aimed at making natural language understandable to computers.
Ongoing Research
Natural Language Processing vs. (Digital) Humanities
(Digital) Humanities: analyze text(s) according to a hermeneutic model to make inferences about past and present.
Natural Language Processing: subfield of artificial intelligence and computational linguistics aimed at making natural language understandable to computers.
Ongoing Research
Natural Language Processing vs. (Digital) Humanities
Named Entity Recognition (NER)
Example: “Julie works in Ghent.”
Sentiment Analysis (SA)
Example: “Julie was happy with her cup of tea!” 🡪 POSITIVE
PERSON
LOCATION
Ongoing Research
The gaps
! Need for transparent, reproducible and durable NLP workflows which are tailored to heuristic research.
! Need for an overview of the possibilities and limitations of NLP-tools in literary-historical research settings.
! Need for practical insights regarding NLP application to data with literary-historical characteristics.
! Need for workflow communication standards.
🡪 Facilitate exchange of practices between NLP and DH.
Ongoing Research
Approach: case-study based
! Suggestions regarding tool selection, tool output interpretation, workflow dissemination
THANK �YOU
Tess.Dejaeghere@ugent.be
Feedback
Yellow post-its
What can KBR do to improve access to the newspapers collections for researchers?
Blue post-its
Ideas for future newspapers days.
Establish a network/mailing list?
Yearly event with open cfp?
Focus on newspapers or also include periodicals?
…
50