1 of 16

Share Family tenant infrastructure

2 of 16

Core principles

Redundancy of data is complex to manage
Linking entities is easier than duplicate data
Cooperate and maintain autonomy at the same time
Homogeneity of datasets and possible services to be shared
Centralize core data through a lightweight method
Distribute the technological load to achieve long-term sustainability
Profile levels of cooperation among systems and initiatives

Over time, from the initial Share-VDE and Share-Catalogue projects, the initiatives and sub-projects supported by the same LOD Platform technology have increased (e.g. Kubikat network in Germany, Parsifal network of ecclesiastical libraries in Rome etc.); SVDE itself is evolving into autonomous branches differentiated by scope and domain (the PCC data pool, the National bibliographies Working Group etc.).

This ecosystem falls under the broader community of the Share Family and has an impact on the architecture of individual systems that are autonomous (and will continue to be) but at the same time share the same technology.

Ultimately, the purpose of the architecture outlined in this document is to offer the possibility of connecting these systems to each other, as an added value of collaboration in the community of linked open data for libraries.

3 of 16

Solution in the Share Family architecture

Creation of more branches in the Share Family, named tenants

Consistent groups of institutions gathered by similar scope or from the same domain:

Share-VDE

Share-Catalogue

Kubikat-LOD

PCC data pool

Parsifal project (network of ecclesiastical university libraries in Rome)

National bibliographies Group

4 of 16

What is a tenant

Definition of tenant from Wikipedia:

“The term software multitenancy refers to a software architecture in which a single instance of software runs on a server and serves multiple tenants”.

“A tenant is a group of users who share a common access with specific privileges to the software instance. With a multitenant architecture, a software application is designed to provide every tenant a dedicated share of the instance - including its data, configuration, user management, tenant individual functionality” etc.

5 of 16

Benefits

More efficient data management

Technological sustainability

Dedicated applications and services tailored to the member institutions of the various branches

From the users' perspective, this enables richer and more specialized sets of resources to be consulted

6 of 16

Main components of the Share Family tenants

Each tenant of the Share Family will have its own components and data will live in autonomous storages.

Each tenant has:

its own CKB (e.g. Sapientia CKB, PCC CKB, Kubikat-LOD CKB, URBE CKB etc.)
its own Cluster (URI) Registry with specific namespace
its own J.Cricket CKB editor (= entity editor to manually update linked data entities)
its own datastores (i.e. RDBMS, Search Engine and RDF Store)
its own discovery portal with one or more skins (skin = filter on a subset of the tenant's whole data)

e.g. the general SVDE discovery portal skin https://svde.org plus other ad hoc skins such as https://penn.svde.org/
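
As a rough illustration of the component list above, a tenant and its skins could be modelled as follows. This is only a sketch: the class names, field names, sample URIs and institution labels are hypothetical and are not taken from the LOD Platform.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Record:
    """Simplified entity record; 'institution' marks the data provider."""
    uri: str
    label: str
    institution: str


@dataclass
class Tenant:
    """Hypothetical model of one Share Family tenant."""
    name: str
    namespace: str  # namespace used by the tenant's Cluster (URI) Registry
    records: list[Record] = field(default_factory=list)

    def portal(self) -> list[Record]:
        """Main discovery portal: exposes the whole tenant data."""
        return list(self.records)

    def skin(self, institutions: set[str]) -> list[Record]:
        """Skin: keeps only the records fed by the given institutions."""
        return [r for r in self.records if r.institution in institutions]


# Sample data, invented for illustration only.
svde = Tenant(
    name="Share-VDE",
    namespace="https://svde.org/",
    records=[
        Record("https://svde.org/agents/1", "Agent A", "Penn"),
        Record("https://svde.org/agents/2", "Agent B", "Other library"),
    ],
)
penn_skin = svde.skin({"Penn"})
```

Here the skin is nothing more than a filter over the same tenant data, which is why a skin needs no storage of its own in this sketch.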

7 of 16

[Diagram: Share Family tenant infrastructure. Labels shown: original records from Share-VDE libraries, Share-Catalogue libraries, Parsifal libraries, National bibliographies and PCC libraries; enrichment with external sources (VIAF, ISNI, LCSH, FAST etc.); a CKB and an Entity registry for each tenant (SVDE Sapientia, Share Catalogue, Parsifal, NatBib, PCC data pool); Triplestore (Stardog); Index; Share-VDE discovery portal and institutional skins (Penn institutional skin portal, other institutional skin portals); Share-Catalogue, Parsifal, NatBib and PCC data pool discovery interfaces.]

The Share Family of initiatives includes different branches and sister projects, supported by the same LOD Platform technology. Each branch or project is hosted in a specific tenant of the system architecture with a corresponding specific Cluster Knowledge Base and a dedicated web entity discovery portal. In some cases, within a single tenant a customised skin (i.e. a sub-portal of the main entity discovery portal) can be created to address ad hoc needs of an institution, or group of institutions, willing to expose only their own data or to integrate local services in the Share environment.

While the main entity discovery portal of a tenant shows the data of all the institutions feeding the tenant's Cluster Knowledge Base, the skin portal gives the ability to filter only the data of the institution or group of institutions that the skin portal has been designed for.

The creation of more branches in the Share Family stems from the need to gather institutions with a similar scope or from the same domain into consistent groups of members that share the same areas of interest with regard to the type of bibliographic information conveyed in their library catalogues. This approach is reflected in the Share software architecture, which is based on a tenant infrastructure.

Each tenant has its own components and the data resides in autonomous storages.

8 of 16

Share Family Index and Share Family Identifier

SFI - Share Family Index

index that centralizes core entity data from each CKB (e.g. entity URIs and a minimal amount of data, used only for search and redirection purposes)
central index able to point to all the URIs in all the CKBs of the different tenants

SFId - Share Family Identifier

each entity has a unique URI within the namespace of the tenant’s CKB
all the URIs for the same entity in different namespaces are grouped under a unique Share Family Identifier in the Share Family Index
the SFId is a unique identifier linking to URIs that identify the entities in each CKB
the SFId carries the minimum amount of data needed to identify the entities

9 of 16

Share Family Index and Share Family Identifier

E.g. Ernest Hemingway URIs in different CKB namespaces and the corresponding Share Family Identifier in the Index (the following URIs are for simulation purposes):

http://sfi/agents/456789 [Share Family Identifier]

sameAs

https://svde.org/agents/101631288986955

sameAs

https://svde.org/pcc/agents/7890123

sameAs

http://kubikat-lod.org/agents/456789

10 of 16

Central orchestration: the SFI - Share Family Index

The Share Family Index is:

a registry in charge of creating and assigning a unique Share Family Identifier (SFId)
a central index that aggregates URIs for entities that are stored in different CKBs
a metasearch engine able to run queries across all the Share Family tenants
an orchestrator of queries and messages between tenants according to service user agreements/profiles, functioning similarly to an ESB (Enterprise Service Bus)
a pointer to entity URIs in the individual CKBs of the different tenants

The Share Family Index is not:

a database/storage
a CKB
a Search Engine

11 of 16

Interaction through the Share Family Index

The SFI orchestrates the communication, among all tenants, of the changes made within an individual CKB

CKBs are decoupled: they never interact directly with each other; all exchanges are mediated by the SFI

The SFI acts as an Enterprise Service Bus (ESB): it provides routing, transformation, propagation, policies and search services across the interconnected CKB instances.

12 of 16

Examples of interaction use cases

1. Change alert only: two (or more) tenants hold the same entity; tenant 1 receives a notification that some data of the entity has been changed in tenant 2.

2. Propagate changes done to entity data: two (or more) tenants hold the same entity; tenant 1 changes some data of the entity and propagates the changes to tenant 2.

Example: A library using Sinopia creates an Instance that already exists in the PCC data pool of SVDE
Scenario 1: every time an entity undergoes changes, the SFI receives push notifications from Sinopia and vice versa
Scenario 2: periodic, unsolicited triggers collect and register changes
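
The two use cases above can be sketched as a mediator that applies a per-pair profile when a change is reported. Everything here is an assumption made for illustration: the class names, the `Profile` values, the short `pcc:1`-style identifiers and the rule that nothing is sent when no profile exists.

```python
from dataclasses import dataclass, field
from enum import Enum


class Profile(Enum):
    """Hypothetical service user profiles agreed between two tenants."""
    ALERT_ONLY = "alert_only"  # use case 1: notify, leave data untouched
    PROPAGATE = "propagate"    # use case 2: apply the change to the target


@dataclass
class TenantCKB:
    name: str
    entities: dict = field(default_factory=dict)  # local URI -> entity data
    alerts: list = field(default_factory=list)    # notifications received


class ShareFamilyIndexBus:
    """Mediator: tenants talk only to this object, never to each other."""

    def __init__(self):
        self.tenants = {}   # tenant name -> TenantCKB
        self.links = {}     # SFId -> {tenant name: local URI}
        self.profiles = {}  # (source tenant, target tenant) -> Profile

    def register(self, tenant):
        self.tenants[tenant.name] = tenant

    def link(self, sfid, tenant_name, uri):
        self.links.setdefault(sfid, {})[tenant_name] = uri

    def set_profile(self, source, target, profile):
        self.profiles[(source, target)] = profile

    def notify_change(self, sfid, source, changes):
        """Route a change made in `source` to the other tenants holding the entity."""
        for target_name, uri in self.links.get(sfid, {}).items():
            profile = self.profiles.get((source, target_name))
            if target_name == source or profile is None:
                continue  # no agreement in place: nothing is sent
            target = self.tenants[target_name]
            if profile is Profile.PROPAGATE:
                target.entities[uri].update(changes)
            else:
                target.alerts.append(f"{uri}: changed in {source}")


bus = ShareFamilyIndexBus()
pcc = TenantCKB("PCC", {"pcc:1": {"label": "Old"}})
svde = TenantCKB("SVDE", {"svde:1": {"label": "Old"}})
kubikat = TenantCKB("Kubikat-LOD", {"kub:1": {"label": "Old"}})
for tenant, uri in ((pcc, "pcc:1"), (svde, "svde:1"), (kubikat, "kub:1")):
    bus.register(tenant)
    bus.link("sfid:1", tenant.name, uri)

bus.set_profile("PCC", "SVDE", Profile.PROPAGATE)
bus.set_profile("PCC", "Kubikat-LOD", Profile.ALERT_ONLY)

pcc.entities["pcc:1"]["label"] = "New"               # change made in the source CKB
bus.notify_change("sfid:1", "PCC", {"label": "New"})  # push notification to the SFI
```

The sketch covers the push scenario; a periodic trigger would call `notify_change` from a scheduled job instead of from the editing tenant.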

13 of 16

Search mechanism

The SFI search layer sits on top of all tenants; it allows simultaneous searches across them (metasearch or federated search).

This function is the opposite of a traditional skin, which restricts searches to a subset of the data (for example, https://penn.svde.org/ is a skin that selects only Penn data from the whole Sapientia CKB, in which Penn data is included).
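
A minimal sketch of the metasearch idea, assuming each tenant exposes some search function that returns matching URIs. The function names and the in-memory sample data are hypothetical; the URIs are the simulated ones used earlier in this deck.

```python
from typing import Callable

# A tenant search takes a query string and returns matching entity URIs.
SearchFn = Callable[[str], list[str]]


def make_tenant_search(labels_by_uri: dict[str, str]) -> SearchFn:
    """Build a toy search over one tenant's labels (stand-in for its search engine)."""
    def search(query: str) -> list[str]:
        q = query.lower()
        return [uri for uri, label in labels_by_uri.items() if q in label.lower()]
    return search


def metasearch(query: str, tenants: dict[str, SearchFn]) -> dict[str, list[str]]:
    """Send the same query to every tenant and collect the hits per tenant."""
    return {name: search(query) for name, search in tenants.items()}


tenants = {
    "Share-VDE": make_tenant_search({
        "https://svde.org/agents/101631288986955": "Hemingway, Ernest",
    }),
    "PCC": make_tenant_search({
        "https://svde.org/pcc/agents/7890123": "Hemingway, Ernest",
    }),
    "Kubikat-LOD": make_tenant_search({
        "http://kubikat-lod.org/agents/456789": "Ernest Hemingway",
    }),
}

hits = metasearch("hemingway", tenants)
```

The tenants are queried one after another here for simplicity; a real metasearch layer would more likely issue the queries concurrently and merge the hits that share an SFId.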

14 of 16

[Diagram: Share Family metasearch. Labels shown: Share Family discovery interface; Share-VDE skin portal; Share Family metasearch layer (APIs layer); Share Family Index; Triplestore (Stardog); tenants Kubikat-LOD, Share-VDE, PCC, Share Catalogue and National libraries, each with its own CKB, URI registry and J.Cricket editor; Search on its own tenant; Multi-tenants metasearch.]

15 of 16

Share Family Index: an ESB++ for the tenants

[Diagram: Share Family Index with its functions: Policies, Routing, Search, Transform, Registry, Propagation.]

16 of 16

Levels of cooperation

Consensus from the groups of institutions involved

Ad hoc agreements to be set up among the communities/groups of institutions to establish Service User Profiles