The British National Corpus 2014 is a major project led by Lancaster University to create a 100 million word corpus (a large collection of ‘real life’ language) of modern-day British English.This corpus is used by researchers to understand more about how language works and how it is evolving. Educators, dictionary compilers and the interested public will also be able to access it to find usage examples of modern British English in different genres.
To collect SMS messages (including WhatsApp and Facebook messages), we rely on the generosity of the British public to provide us with their messages to incorporate into the corpus. We are asking the public to provide us with copies of any messages sent between 2014 and 2018. The messages should be unedited, but anonymised if you wish.
You can submit these SMS messages as word documents (doc, docx, rtf etc.), .txt files, or any other common electronic format. Your contribution to this world-leading language resource will be fully credited in the corpus documentation.
To submit your WhatsApp messages:1. Open up WhatsApp on your mobile device.2. Select the conversation that you wish to share.3. Click on the name of the other person (or group) at the top of the chat.4. Scroll down, press "Export Chat", and then "Without Media".5. Email the chat log to yourself.6. You are welcome to anonymise the data by replacing any names or personal information with [anon]. We are able to automatically remove the timestamp and user ID, but you are responsible for anonymising personal information within the chat itself, if you wish.7. Upload the .txt file below.
To submit your Facebook messages:1. Open Google Chrome web browser.2. Click the three vertical dots in the task bar > More Tools > Extensions.3. Search for Message/Chat Downloader, and click ADD TO CHROME.4. Log into Facebook, go to your messages in the full messenger window (by selecting "See all in Messenger"). Select the conversation you want to download.5. Press the new messenger icon in the address bar.6. Select the date range you want and click "Start download". When the download is complete, the page will be updated to only show your selected conversation.7. Hold down Ctrl + A to select all, then copy the text into a .txt file.8. You are welcome to anonymise the data by replacing any names or personal information with [anon]. We are able to automatically remove the timestamp, but you are responsible for anonymising personal information within the chat itself, if you wish.9. Upload the .txt file below.
When submitting your messages, bear in mind that names often appear within messages that may not be picked up by our automatic anonymisation tool. For example, in the following message, we would automatically remove the timestamp and user ID, but you would be responsible for replacing the name (Jonathan) with [anon] if you wish.
Original message:"[13/07/2017, 9:47:16 pm] Matt: Jonathan, are you ready to start yet?"
With your anonymisation:"[13/07/2017, 9:47:16 pm] Matt: [anon], are you ready to start yet?"
With our anonymisation, as it will appear in the corpus:"[anon] [anon]: Jonathan, are you ready to start yet?"
Thank you very much for your contribution.
The Lancaster team.email: firstname.lastname@example.org