Private lives and public libraries.
A quantitative approach to the study of user behavior


A paper for


Libraries Plus: Adding Value in the Cultural Community


8th Northumbria International Conference on Performance Measurement in Libraries and Information Services (PM8)






Tord Høivik
Associate professor
Oslo University College

Oslo, 2009


Summary


Purpose


The paper presents and demonstrates an effective way of gathering statistical information about user activities (traffic) inside libraries. We outline the method, describe the procedures in sufficient detail for practical use, present traffic data from about twenty Norwegian public libraries and indicate how such data can be used for advocacy and strategic planning.


Design and methodology


Librarians have, in general, very little systematic information about activities inside their libraries. Staff meets users every day, so there is no shortage of impressions, ideas and mental images. But this information is ad hoc and qualitative rather than systematic and quantitative.


Count the traffic (CTT) is a cheap and simple method to gather such data. It is based on regular and systematic “tours of observation” through the public areas of the library. It can be carried out by the library’s own staff rather than by hired consultants. The data gathered will tell you, in some detail, about the structure of activities in the various parts (zones) of the library throughout the day (daily cycle) and the week (weekly cycle).


The data analysis is not technically difficult: standard spreadsheets will do. Many libraries should be able to do their own data processing without external assistance. The generation and evaluation of relevant tables requires some skills in standard social science research.


Findings


The method has now been applied in more than fifty Norwegian libraries – including public, academic, special and school libraries. Additional CTT studies will be carried out by library students in 2009.


When we compare the data with our intuitive expectations, it is fair to say that in public libraries


  1. the actual use of computers – including personal (mobile) computers – was higher than expected
  2. the frequency of activities carried out in groups – involving children and students in particular – was higher than expected
  3. purely social activities – not involving computers or media – were higher than expected


Research limitations


Gathering data by direct observation is generally time-consuming. CTT was designed to minimize the data collection effort by using time sampling. But we do lose some detail. In 2007 five of the largest public libraries in Norway carried out an observation-based study using a different methodology. Instead of “sweeping through” the building at regular intervals, the observers “shadowed” individual users from the moment they entered till the time they left the library. Since observation was combined with a brief exit interview, it was also possible to link background and behavioral data.


This project, which was called Storbyundersøkelsen (The Metropolitan Study), gives more information about individual behavior than the CTT approach. The two methods are highly compatible. They map and quantify the same types of activities. The shadowing method is much more labor intensive than the sweeps method, however. A combination of relatively frequent CTTs with occasional “shadow studies” may therefore be the best overall data collection strategy.


Practical implications


CTT was designed for practical use. It show how – and to what extent – the various parts of the library are used, throughout the day and through a typical week. Using traffic counts, libraries are able to document the type and intensity of use. This is useful for allocating resources and – not least – for reorganizing the library space.


In a municipal setting, CTTs generates new types of data that are likely to be of interest to politicians and senior administrators. Some data will also be useful in contacts with parents and other stakeholders among the general public.


At the regional and national level, reliable data on user behavior make “life in the library” visible in a new way. Loans and visits are rather abstract categories. Reading, talking, browsing and using computers provide more vivid images of what the physical library actually provides.


Originality


The method provides important new data on user behavior in the physical library. The method can be applied in all types of libraries – and indeed in all types of visitor-oriented institutions.


It is cheap enough to be repeated on a regular basis, and simple enough to be carried out by library staff or young students.  The observation categories are standardized, but easy to use, since they are based on normal social concepts.  All libraries that choose the CTT approach will generate comparable data. We publish our data sets on the open web – and encourage others  to do so as well.


Both the method and the data should be of interest to all persons that support evidence-based librarianship.


For more information, see: http://pliny.wordpress.com/events/florence-2009/




Introduction

Count the traffic (CTT) is a method for systematic mapping of user activities in libraries and other visitor oriented institutions. For many years libraries have mainly measured their success by counting the number of loans, or by the number of loans per capita – based on the population served. This statistic is still important. But usage patterns are changing. In many Western countries, the amount of lending is going down.


The number of library visits captures another important dimension of library use. The average Norwegian visits a public library five times a year – and borrows slightly more than five books or other media. But it is not the case that every visit results in a loan. About fifty percent of the visitors do not borrow anything at all. They visit the library for other purposes - to read, to play, to study or to use the internet. To understand what is happening in the public library sector, we must explore a wider range of indicators.


That is easier said done. Researchers may play around with many different types of data in their own projects. But meaningful library indicators require repetition and standardization. Today we need new types of data to understand new types of behavior - on a regular basis. But official library statistics are highly resistant to change. Once a set of procedures and publication patterns have been established, they tend to be contiued by their own momentum. Why create additional work for everybody involved?


CTT tries to develop something new in the space between the individual project and the statistical bureaucracy. Like LibQual, CTT is a standardized instrument designed to be used for comparative purposes. Standard methods of data gathering and analysis are common in most scientific and technical fields. We find scales of intelligence and physical agility - for individuals, of corruption and quality of life - for countries; and of windiness and hardness in nature - the Beaufort and the Moh scale respectively. The most developed instrument in the library field is probably the German library index BIX, which combines seventeen statistical variables into a single additive index.


CTT is not a scale or an indicator as such, but a lightweight method of data collection that can be used to develop useful indicators. To realize its knowledge potential, libraries that use it must provide open access to their data sets.


Libraries are more than collections – they are places where people participate in a wide range of activities. Standard library statistics reveals very little about activities inside libraries. In this paper we show how to gather and analyze such data. The method has been tested by library staff in several Norwegian libraries, with guidance from the author, and also tried out by more than fifty second-year library students as part of their second-year practice periods (five weeks in late spring).


The basic method is simple: at fixed intervals - usually once an hour - an observer should walk through the public areas and note down the number of visitors, and what they are doing, in various parts of the library. This should be repeated for at least one weekly cycle. The counting days should preferably be spread out during several weeks or months, to reduce the impact of random or seasonal events. All the information should collected in spreadsheets and analyzed.


In the English-speaking world the method is known as seating sweeps. Two Canadian researchers, Lisa Given and Gloria Leckie, first used the sweeps method in 1999, to study user behavior at the Toronto Reference Library and the Vancouver Public Library. The Norwegian version has been developed in close collaboration with several Norwegian libraries and has now been tested out in more than fifty institutions. We have labelled the method CTT - or Count The Traffic -  in English, from the expression  In Norway it is known as TTT – from the expression Tverrgående TrafikkTelling (transversal traffic counting)


In Norway, the method is introduced to library students as follows:


The purpose of this study is to investigate how library visitors actually use library services – inside the library itself. The method has been tested in Lillehammer and Drammen [two medium-sized cities], and you may therefore compare your results with those of other libraries. CTT can also be applied ito academic and special libraries and to other institutions with extensive visitor areas and acitivities.

What we observe in this study is not the individual user, but the different activities that take place inside the library.


All library workers know that the level of activity follows certain patterns or cycluses. The traffic varies throughout the day, the week and the year. More accidental variations come on top of this, for instance due to the weather, special events and moveable holidays (Easter, Whitsun). We do not try to observe on a continual basis. Our observation schedule depends on a sample of observation times. We must see to it that this sample is representative. Observations should therefore be spread evenly throughout the day, while the actual counting days ought to be distributed over several weeks.


To undertake this study, you will need the following resources:



The list of activities is standardized. Additional categories and sub-categories may be added, but the basic structure should be retained, to facilitate comparisons between libraries. CTT includes fourteen basic categories - with a fifteenth for anything not covered by the basic set.


Table A. Fourteen basic categories


1
Walks or stands alone.Covers standing or walking around without browsing and without relating to library staff or other users.
2
Walks or stands in company. Participates in a group of two or more persons that stands or walks around without browsing and without relating to library staff.
3
Sits alone.
Sits alone without relating to media, to library staff or to other users.
4
Sits in a group without media.
Participates in a group of two or more persons that does not relate to books or other media or to library staff.
5
Browses alone
Covers browsing or scanning of items on shelves while standing or walking around.
6
Browses in company
Participates in a group of two or more persons that browse or scan items on shelves together while standing or walking around.
7
Sits alone reading (or writing) . Sits and reads by her/himself. Includes individual work - reading and or writing - without using ICT. Includes listening to music, watching videos and using other media - but not the use of computers.
8
Sits in a group with media. Participates in a group where at least one person relates to books or other media. [Use (10) for groups with active PC].
9
Sits alone with own computer. Sits alone with active mobile computer (active screen)
10
Sits in a group with own computer(s). Participates in a group where at least one person is using a PC of their own (active screen).
11
Sits alone with library computer.Sits alone with active computer (active screen).
12
Sits in a group with library computer(s)Participates in a group of two or more persons that is using one or more library PCs (active screen).
13
Contact with staff. Covers all direct contact with staff. Here we want to register activities where staff spends time with the users, whether it involves speaking, writing, demonstrating or walking around.
14
Queuing
Covers all visible waiting for service or facilities, whether in a proper line or not: waiting for staff, waiting for access to equipment, toilet queues, aso.
15
Other
Activities not covered by 1-14.

The actual floor plan and the division into zones will of course differ greatly from library to library. An architectural plan may be available – otherwise the students were asked to make their own sketch of all areas open to the public.


The data collected allow us to calculate the number of hours library users spent on each of the fifteen activities, in each designated zone, for each hour of the day and each day of the week. This means that the level of detail is high. The many factual details may be of great interest to the staff in that particular library. But for comparative purposes - between libraries and years - we need to summarize the data.


Sandvika Public Library


The most basic summary looks at the activities only - disregarding time and space, so to speak. As an example I take a data set from Bærum, a large municipality just west of Oslo. Sandvika Public Library is a large branch library in an urban area, serving more than twenty thousand people in the western part of Bærum.


Table B. The distribution of time between activities, by rank


 

Activity
Time spent


Sits alone reading (or writing)
30%
Sits alone with library computer
17%
Browses alone 8%
Walks or stands alone 7%
Sits in a group without media 7%
Sits alone with own computer
6%
Sits in a group with own computer(s)
5%

Walks or stands in company
5%
Contact with staff 4%
Sits in a group with media 4%
Sits alone 3%
Sits in a group with library computer(s) 2%
Browses in company 1%
Queuing 1%
Other activities 1%
SUM
100%
N
2974

Data source


I would not use the fifteen observational categories for comparisons, however. They provide too much detail. The pattern of activities becomes clearer if we aggregate categories into larger sets. One approach is to distinguish between traditional and modern library activities. Here I take modern to cover computer based and group activities, while everything else - except other - is categorized as traditional

Table C. The distribution of time between aggregated activities

Activity
Time spent
Modern activities
46%
involving groups
23%
involving computers
30%
involving both*
7%
Traditional activities
52%
sitting down
32%
standing up
16%
contact with staff
4%
Other activities
1%
SUM
99%
* to be subtracted
N = 2974

Data source: Table B

Indicators are statistical variables that express or represent interesting properties of the libraries - or entities - that we study. The CTT approach allows us to study a number of properties that are not covered by official library statistics. The most important ones, in my view, are the following:

  1. To what extent does library use have a modern character?
  2. To what extent do activities involve groups rather than individuals?
  3. To what extent are computers being used?
  4. How much of the time at the library is spent interacting with staff?

Some additional questions are also of interest:

5. To what extent do people use their own computers?
6. How much time is spent browsing the shelves (and other media displays)
7. How much time at the library is spent standing up or walking about?
8. How much time is spent without interacting with media or computers?

The corresponding indicators are easily defined by grouping the appropriate categories. The four main indicators - let me call them A as a group, covers:

  1. Modernity: 2 + 4 + 6 + 8 + 9 + 10 + 11 + 12
  2. Group use:  2 + 4 + 6 + 8 + 10 + 12
  3. Computer use: 9 + 10 + 11 + 12
  4. Staff contact: 9

The three in the second rank (B) follow:

  1. Own computer ratio:  (9 + 10)/(9 + 10 + 11 + 12)
  2. Browsing time: 5 + 6
  3. Non-media time: 1 + 2 + 3 + 4 + 14
  4. Stand-up time: 1 + 2 + 5 + 6 + 14

It would be more correct to remove category 15 - other activities - from the percentage basis. But since it is generally quite low, I have not bothered to do so in this paper.

Sandvika and Høvik

To see the indicators at work, we compare the branch library in Sandvika with Høvik,  a small branch library in the same municipality. The time use patterns differ substantially:

Table D. Eight observation based indicators, Høvik and Sandvika


Høvik
Sandvika
A1. Modernity
31%
46%
A2. Group use
6%
19%
A3. Computer use
25%
30%
A4. Staff contact 4%
4%
***
***
***
B1. Own computer ratio 9% 36%
B2. Browsing time 10%9%
B3. Non-media time 26% 22%
B4. Stand-up time31%
21%
N
128
2974
Time
May 2008 and February 2009 (combined into one weekly cycle) May 2008 (one weekly cycle) and February 2009 (one weekly cycle)
Source
Sydsæter-Knudsen (2009), p. 43Sydsæter-Knudsen (2009), p. 41


The differences between Sandvika and Høvik are clearly related to demographic as well as material factor


The smaller library is located in a building belonging to the local parish, in an old and established residential area. The Sandvika branch is very centrally located in the administrative, commercial and cultural centre of Bærum. The Høvik branch is small, traditional and lacks group rooms and other spaces for social activities. For several years Bærum Public Library has actually tried to close it down - but the local residents have resisted ferociously and managed to save it for the time being.


The student who carried out these studies (as her final bachelor project), tried to estimate the age of the visitors as well as their activities.  Høvik was overwhelmingly frequented by older people:

Table E. Visitors by age group, percent

Library branch
Children
Youth
Adults
Seniors
Sum
Høvik
7%
7
34
52
100% (N = 128)
Sandvika
8%
15
62
15
100% (N=1464)*

* 2009 only. Source: Sydsæter-Knudsen (2009), p. 85


The demographic differences among the users are mainly due to self-selection. The catchment areas of Høvik and Sandvika are quite similar in their age distributions. In Table F we show the relative visitor rates for the different age groups. The numbers are quite approximate, of course, but since the tendencies are so strong, I am inclined to trust them.


In Sandvika, the age distribution among the visitors corresponds roughly to that of the population for adults and seniors. Young people are somewhat overrepresented, while children are somewhat underrepresented.  But note that we do not measure the number of visitors, but rather the time they spend inside the library. A youngster spending an hour on homework is equivalent to four children spending fifteen minutes each - dropping off the books they have finished and retrieving some juicy new volumes for the next few weeks.


Table F. Relative visitor rates by age group


Library branch
Children
Youth
Adults
Seniors
Sum
Høvik
0.39
0.70
0.60
3.47
1.00
Sandvika
0.50
1.50
1.02
1.15
1.00
Høvik, population
18%
10
57
15
100%
Sandvika, population
16%
10
61
13
100%

Source: Sydsæter-Knudsen (2009), p. 40


Twenty public libraries


A spreadsheet showing the values for twenty public libraries has been published - on Google Docs - by Plinius Data:



Some main results are given in Table G.


Table G. Ranges and median values for eight observation based indicators, based on twenty public libraries. Norway 2008-2009



Range
Communities
Median
A1. Modernity
31 - 66 %
Høvik - Drammen
45%
A2. Group use
6 - 61 %
Høvik - Sør-Varanger
24 %
A3. Computer use
13 - 41 %
Sande - Drammen
31 %
A4. Staff contact 1 - 19 %
Rygge - Høvik
6%
***



B1. Own computer ratio 0 - 91 %
Løten - Rygge
38%
B2. Browsing time 0 - 48 %
Gjøvik - Sande
11%
B3. Non-media time 7 - 60 %
Bodø - Gjøvik
19%
B4. Stand-up time1 - 52 %
Rygge - Sande
21%
N
20

20

We note - first of all - the great variation. If we accept these indicators as meaningful, they reveal substantial differences between libraries - even within the same municipality.

The typical values - as measured by the median - are also revealing. The proposed modernity indicator - the time spent on group and digital activities - is approaching fifty percent. Both group use and computer use is higher than most people are aware of.

Only six percent of the time is actually spent talking with library staff. Even that figure may be too high: it is likely that the amount of contact time decreases as we go from smaller to larger library units. A weighted average (or even a weighted median) would then give a more realistic picture.

Cell phones are replacing stationary phones - and mobile computing is replacing the desktops. But in 2009 the demand for stationary computers in libraries is still outstripping the supply. Computer time is often allocated in thirty minute chunks, which is a real nuisance to our patrons. We might as well introduce overnight lending of popular novels ...

As more and more libraries provide free internet zones, and more and more people can afford small laptops, this particular shortage will disappear. Already, more than one third of library computer use is actually based on personal data equipment. The interesting thing is that people still come to the library to use their portables there rather than at home.

The last three indicators - B2, B3 and B4 - reveal the non-book, non-media, non-sitting-down nature of public libraries. By measuring these minor aspects of life at the library, we may bring some extra attention to areas of service and improvement that are often overlooked. I think of the space between shelves and books within reach, of color schemes and visual zoning, of air and light, of sound and silence.

Appreciation

I would like to thank the many students who have chosen Count The Traffic as their research training exercise during the practicum periods in 2008 and 2009. Without their diligent work the type of analysis undertaken in this paper would have been impossible. A special thank to Benedicte Charlotte Sydsæter-Knudsen and Lise Nybakk who went one step further, choosing CTT as the subject of their final third-year study.




Bibliography

  1. Given, Lisa M. and Gloria J. Leckie (2003). “Sweeping” the library: Mapping the social activity space of the public library. Library & Information Science Research. Vol. 25, Issue 4, Winter, pp. 365-385. 

  2. Høivik, Tord (2008). Count the traffic. Paper for IFLA in Quebec.

  3. Høivik, Tord (2009). Students in action [= PL 29/09]. A traffic study from Oslo University College.
  4. Leckie, G.J. & Hopkins, J. (2002). The public place of central libraries: Findings from Toronto and Vancouver. Library Quarterly, 72, pp. 326-372.

  5. Nybakk, Lise (2009). Når brukeren beveger seg. En studie av brukere og brukeratferd ved Løten folkebibliotek. Oslo: Oslo University College. - Unpublished BA paper. [= A study of users and user behavior in Løten Public Library]
  6. Plinius Data: Patterns of activity, Public libraries. Norway. 2008-2009.
  7. Sydsæter-Knudsen, Benedicte Charlotte (2009). En observasjonsstudie av brukeradferden i Sandvika og Høvik bibliotek. Oslo: Oslo University College. - Unpublished BA paper. [= An observational study of user behavior in Sandvika and Høvik libraries]