Agreement to contribute new data to the SweLL infrastructure

JavaScript isn't enabled in your browser, so this file can't be opened. Enable and reload.

You are welcome to contribute your data to the SweLL infrastructure: https://spraakbanken.gu.se/en/projects/swell

Benefits

- Unified / comparable metadata between all datasets
- Storage and maintenance at Språkbanken Text (spraakbanken.gu.se)
- Use of SweLL tools for data annotation (SVALA, pseudonymization service, annotation management, servers, versioning, etc)
- Searchable repository (and downloadable data from other projects, where permitted)
- Upload to Korp and Strix for searches (if agreed)

Formalities

To contribute a new data set, please follow the steps below. Tick the boxes on the right where your confirmation is required:
(Steps below are inspired by the TALKBANK CHILDES: https://talkbank.org/share/contrib.html)

Step 1. Registration of interest

Please send an email message to swell@svenska.gu.se about your intentions.

If necessary, you will get instructions how to register to the SweLL-portal.

Message was sent

Required

Step 2. Files and metadata compliance *

We accept only transcribed (i.e. digital, machine readable) files (see guidelines https://gupea.ub.gu.se/handle/2077/69429 ) together with (1) copies of signed informed agreements from learners, (2) metadata about writers/speakers (e.g. https://docs.google.com/document/d/1-3b6h1dh1RVEz6UJhjkkS4S0b6Y-36WT11KJ8_QtBFE/edit?usp=sharing), (3) metadata about tasks they are performing (e.g. https://docs.google.com/document/d/1r-AcL8Iha2IYAv88lP0KDDEMVlJT3JEWLztG1lQd2RA/edit?tab=t.0 ), and (4) confirmation that the data that is pseudonymized (e.g. https://gupea.ub.gu.se/handle/2077/69431 ). You guarantee that as long as possible the metadata types are adjusted to the SweLL types for better comparison between the data sets

I guarantee all data has been collected following informed consents (signed and included)

I guarantee that the files are transcribed

Demographic metadata is included

Task metadata is included

The data/metadata are pseudonymized(i.e. contain no personal numbers, names, addresses, e-mail addresses, ...)

Required

Step 3. ID-numbering *

Texts are assigned a SweLL ID-number starting with a letter and followed by a combination of numbers and letters. Your ID-numbering might be lost in the import process.

I agree to the change of ID-numbering

Required

Step 4. Corpus documentation (Readme) *

Each contributed corpus needs to have a documentation file (e.g. readme), that is, a basic set of facts that are indispensable for the proper interpretation of the data by other researchers or users. We require that the facts below are included into the Documentation/Readme file (see an example for SweLL here: https://spraakbanken.gu.se/swell/portal/files/instructions/SweLL_-_Metadata_explanation.pdf). Please use the list below as a checklist.

Contact person details (e.g. your contact information; or ours if you transfer the rights to us)

Information on consent from informants and the project information that was provided to the informants.

Information on informants including demographic metadata description.

Acknowledgments (project, funding) and citation (articles). In addition, all users will be asked to cite a reference to SweLL infrastructure.

Warnings about limitations on the use or annotation of the data.

Pseudonym categories and specifications (if pseudonymization has been applied before import to SweLL)

Annotation description. Goals of data collection, balance, sampling, procedures, reliability checks, guidelines and code taxonomies.

Additional information

Restrictions or suggestions on access for new users

Required

Step 5 - Confirmation *

I confirm that the Readme/Documentation file is included and contains information according to the description in Step 4 - Corpus Documentation.

Required

Step 6. Pseudonymization of personal information *

I confirm that any personal information in the texts/transcripts is pseudonymized.

(if no pseudonymization has been applied) I agree that automatic pseudonymization service be applied to the current dataset before it is stored in SweLL database (LINK)

Required

Step 7. Access to the dataset *

By default we apply SweLL ACCESS restrictions (https://sunet.artologik.net/gu/swell) according to which only approved users can access the data. The data can be used ONLY for research & development or didactic purposes within language learning context. Any other alternatives need to be discussed with lawyers.

I agree to apply SweLL ACCESS principles to the data I import

No, I would like to suggest another access option

Other:

Required

If you said "No" above, please add your suggestion here

Step 8. Access via Korp (and Strix in the future) *

Where appropriate, we can upload your data to Korp for more effective searches

I agree to have this dataset searchable in Korp/Strix with password protection (with ACCESS conditions above)

I agree to have this dataset searchable in Korp/Strix without password protection

I do not grant access to this dataset via Korp/Strix

Required

Step 9. Name of the dataset *

Step 10. Date and place *

Step 11. Name and affiliation of the responsible contributor *

Step 12. Email of the contact person *

Step 13. Confirmation *

I confirm that the information provided above is correct and agree to contribute this data to the SweLL infrastructure.

Required

We are very thankful for the kindness and collegiality you are showing in contributing your hard-won data.

Please, contact sb-info@svenska.gu.se after submitting this agreement. Further details will be sent to you in a mail.

Submit

Clear form

Never submit passwords through Google Forms.

This content is neither created nor endorsed by Google. - Terms of Service - Privacy Policy

Does this form look suspicious? Report

Forms