Agreement to contribute new data to the SweLL infrastructure
You are welcome to contribute your data to the SweLL infrastructure: https://spraakbanken.gu.se/en/projects/swell 
Benefits
-       Unified / comparable metadata between all datasets
-       Storage and maintenance at Språkbanken Text (spraakbanken.gu.se)
-       Use of SweLL tools for data annotation (SVALA, pseudonymization service, annotation management, servers, versioning, etc.)
-       Searchable repository (and downloadable data from other projects, where permitted)
-       Upload to Korp and Strix for searches (if agreed)
Formalities
To contribute a new data set, please follow the steps below. Tick the boxes on the right where your confirmation is required:
(The steps below are inspired by TalkBank CHILDES: https://talkbank.org/share/contrib.html)
Step 1. Registration of interest

Please email swell@svenska.gu.se to tell us about your intentions.
If necessary, you will receive instructions on how to register on the SweLL portal.
Step 2. Files and metadata compliance *
We accept only transcribed (i.e. digital, machine-readable) files (see guidelines: https://gupea.ub.gu.se/handle/2077/69429), together with:
(1) copies of signed informed agreements from learners,
(2) metadata about the writers/speakers (e.g. https://docs.google.com/document/d/1-3b6h1dh1RVEz6UJhjkkS4S0b6Y-36WT11KJ8_QtBFE/edit?usp=sharing),
(3) metadata about the tasks they are performing (e.g. https://docs.google.com/document/d/1r-AcL8Iha2IYAv88lP0KDDEMVlJT3JEWLztG1lQd2RA/edit?tab=t.0), and
(4) confirmation that the data is pseudonymized (e.g. https://gupea.ub.gu.se/handle/2077/69431).
You guarantee that, as far as possible, the metadata types are aligned with the SweLL types to allow better comparison between the data sets.
Required
Step 3. ID-numbering *
Each text is assigned a SweLL ID-number that starts with a letter, followed by a combination of numbers and letters. Your own ID-numbering may be lost in the import process.
Required
Step 4. Corpus documentation (Readme) *
Each contributed corpus needs a documentation file (e.g. a readme), that is, a basic set of facts that are indispensable for the proper interpretation of the data by other researchers or users. We require that the facts below are included in the Documentation/Readme file (see an example for SweLL here: https://spraakbanken.gu.se/swell/portal/files/instructions/SweLL_-_Metadata_explanation.pdf). Please use the list below as a checklist.
Required
Step 5. Confirmation *
Required
Step 6. Pseudonymization of personal information *
Required
Step 7. Access to the dataset *
By default, we apply the SweLL ACCESS restrictions (https://sunet.artologik.net/gu/swell), according to which only approved users can access the data. The data can be used ONLY for research & development or didactic purposes within a language learning context. Any other arrangement needs to be discussed with lawyers.
Required
If you said "No" above, please add your suggestion here
Step 8. Access via Korp (and Strix in the future) *
Where appropriate, we can upload your data to Korp for more effective searches.
Required
Step 9. Name of the dataset *
Step 10. Date and place *
Step 11. Name and affiliation of the responsible contributor *
Step 12. Email of the contact person *
Step 13. Confirmation *
Required
We are very thankful for the kindness and collegiality you are showing in contributing your hard-won data.
Please contact sb-info@svenska.gu.se after submitting this agreement. Further details will be sent to you by email.