JavaScript isn't enabled in your browser, so this file can't be opened. Enable and reload.

1 of 55

Process measures and data analysis��PhD Summer School on Translation Processes Research�CBS, August 2011

1

Kristian Tangsgaard Hvelplund�

^{kthj.isv@cbs.dk}

2 of 55

Tutorials and hands-on sessions�
Groups of 4�

Come up with a research project
Design an experiment
Consider relevant process measures and how to analyse the data
Run the experiment
Data analysis��

2

3 of 55

Translation as information processing��Based on Baddeley and Hitch (1974), Baddeley (2007), Bennaroch (2006), Eysenck and Keane (2010), Jaekl and Harris (2007)��

3

�Working memory

7 items (+/- 2)�< 18 seconds��

Long-term memory�∞

Sensory memory�< 500 ms�-> 60 ms

Motor system�-> 200 ms

Attentional control

4 of 55

Methods of data elicitation�
Eye tracking

Cognitive sciences
Psycholinguistics
Psychology
Human-computer interaction
Marketing research
Etc.

Key logging

Translation process studies
Writing process studies��

4

5 of 55

Advantages of eye tracking and key logging�
Reliability

We can be fairly certain that eye-tracking data and key-logging data are manifestations of ongoing cognitive processing

Nonintrusive

The reliability of the data as reflections of the participant’s translation process is not compromised by the research process

Completeness and level of detail

Eye-tracking data and key-logging data in combination offer a highly complete and detailed representation of the translation process��

5

6 of 55

Typical key-logging based measures�
Pauses�Character count�Revision behaviour�Editing�Etc.��

6

7 of 55

Typical eye-movement based measures�
Fixation duration

The time the eye fixates on a single locale – often measured in milliseconds (ms)

Fixation count

More fixations reflects more cognitive effort / fewer fixations reflects less cognitive effort

Pupil size

Pupil size reflects cognitive processing intensity

Total gaze time

More time spent in a region reflects more cognitive effort / less time spent in a region reflects less cognitive effort

7

8 of 55

Typical eye-movement based measures�
First pass gaze duration / fixation count / pupil size

Reflects the cognitive effort / intensity during the initial reading of a word or segment�Often character count is introduced as control variable

Second pass gaze duration / fixation count / pupil size

Reflects the cognitive effort / intensity during the subsequent reading of a word or segment�Is often compared with first pass

Regressions

May indicate uncertainty on the part of the reader in comprehending the text�More regressions = problems with comprehension

Transitions

Reflects the number of times attention shifts between two tasks��

8

9 of 55

Types of eye movements�
Reading consists of two types of eye movements + pupillary movement:�

(Visual) fixations
Saccades
Pupil dilation and pupil constriction

9

10 of 55

Visual fixation�
The continued maintenance of visual gaze at a specific location so that the retina is stabilised over an object of interest (Duchowski 2007: 46).�
Eye-mind and immediacy assumptions (Just and Carpenter 1980: 331)��“there is no appreciable lag between what is being fixated and what is being processed”, and��“the interpretations at all levels of processing are not deferred; they occur as soon as possible”��

10

11 of 55

Interpretation and problems�
Visual focus of a word = cognitive resources allocation to this word

Short fixations = less cognitive effort
Long fixations = more cognitive effort�

Covert attention

Attention can shift independently from eye movement

Saccades

The eye is blinded during saccadic eye movements��

11

12 of 55

Visual fixation – durations based on task�
Silent reading = 225 ms
Reading aloud = 275 ms
Reading emerging text (reading while typing) = 400 ms (Rayner 1998: 373)
Reading during translation (Jakobsen and Jensen 2008)

Source text reading = 218 ms
Target text reading = 259 ms

��

12

13 of 55

Saccades�
Rapid eye movements between actual fixations. No visual information transmitted to the cognitive system. �
Saccade speed -> 500 degrees per second�Typical saccade length during reading -> 2 degrees (8 letter characters)�Typical saccade duration -> 30 ms�
Saccades account for around 5-15 percent of all eye movements during reading. ��

13

14 of 55

Pupillary movement�
Small adjustable opening in the centre of the eye’s iris that allows light to enter the eye’s retina.

Interpretation of pupillary movement relies also on the eye-mind assumption (Just and Carpenter 1980: 331); �
Pupillary movement when fixation = relative change in cognitive resources allocated��-> Smaller pupils = relatively less cognitive effort�-> Larger pupils = relatively more cognitive effort��

14

15 of 55

Three types of eye trackers�

Head-supported

Head-mounted

Remote��

15

16 of 55

Comparison chart�
Spatial resolution�-> The smallest change in eye position that can be measured
Temporal resolution�-> Number of recorded eye positions per second��

	Spatial resolution	Temporal resolution	Intrusiveness
Head-supported	High�0.25 degree inaccuracy ~0.5 cm inaccuracy	Very high�> 1000 Hz	Very high�No head movement
Head-mounted	Medium 0.5-1 degree inacc. ~1-2 cm inaccuracy	Medium to high�30 to 200 Hz	High�Free head and body movement
Remote	Medium�0.5 degree inaccuracy �~1 cm inaccuracy	Medium�50 to 120 Hz	Moderate�Free head movement

16

17 of 55

Implications for research design �
Use large fonts (20 pitch tahoma or larger)
Use short texts (no longer than 200 source text words)
Consider a design in which no online or offline translation aids are available
Consider having objects of interest that cover large areas of the screen

17

18 of 55

Analysing eye-tracking data��with ClearView and Tobii Studio

Recording scenes

Temporal object of interest

Areas of interest (AOIs)

Spatial object of interest��

18

19 of 55

Eye-tracking measures��with ClearView and Tobii Studio��

Quantitative measures
Average fixation duration�Fixation count Transitions	Gaze time / fixation count�Number of fixations Number of attention shift from one area/task to another

19

Qualitative tools
Hot spot / heat maps�Gaze replay�Gaze plot	Static background image and hotspot mask Dynamic background image and fixations �Static background image and fixations

20 of 55

Hot spot visualisation

�Reading

experiment

20

21 of 55

Gaze plot visualisation��Translation of the�Spielberg text�(EN->DA)

21

22 of 55

Gaze replay ��

22

23 of 55

Limitations of these measures�
Qualitative measures are relevant when

Getting an initial impression of processing pattern
Visualising processing patterns

Quantitative measures are few ...

Average fixation duration across one participant / task
Fixation count
What about pupil size information?
What about first/second pass information? Regression data?
What about key-logging information?

... and potentially misleading

Average fixation duration doesn’t consider variance between individual fixation durations

Solution

Raw data

��

23

24 of 55

Raw eye-tracking data ��

24

25 of 55

Raw data – time stamp��

25

26 of 55

Raw data – fixations��

26

27 of 55

Raw data – fixation coordinates��

27

28 of 55

Raw data – pupil diameter (mm)��

28

29 of 55

Raw data – key-logging��

29

30 of 55

Raw data – fixation annotation��

30

31 of 55

Advantages of analysing raw data�
The process data set is much richer / many more items
Pupil size values
Individual fixation durations
Simultaneous reading and typing
First / second pass gaze duration

Disadvantages of analysing raw data�
Very labour intensive

31

32 of 55

Eye-tracking data quality�
The quality of the eye-tracking data is sensitive to various factors, including: (cf. e.g. O’Brien 2009, Hvelplund 2011)

Lighting conditions
Distance to the eye tracker
Too much head movement
Eye colour
Optical aids�

Precautionary steps:

Maintain the same dimmed lighting conditions (preferably articifial light)
Distance no more than 55-65 cm from the eye tracker
Avoid too much movement
Avoid participants who have very dark eyes*
Avoid participants who wear glasses or contact lenses

32

33 of 55

Measuring eye-tracking data quality�
Still risk of poor eye-tracking data. Three ways to see if the data quality is acceptable:�

Mean Fixation Duration
Gaze Time on Screen as percentage of total production time
Gaze sample to Fixation Percentage��

33

34 of 55

Mean fixation duration�
Typical fixation duration during reading is 225 to 275 ms (Rayner 1998)

If fixation duration around 175-200 ms or shorter, perhaps flawed data�

Example = Participant A, Task 1: Mean fixation duration = 146 ms

34

35 of 55

Gaze time on screen�
Written translation involves some amount of ST reading and TT reading.

If very limited ST and TT reading during the task, perhaps flawed data�

Example = Participant A, Task 1: Total gaze time on screen = 24 seconds / 8 percent ��

35

36 of 55

Gaze sample to fixation percentage�
Fixations account for 85-95 percent of all eye movements
Saccades account for 15-5 percent of all eye movements

If the raw data reflects a distorted distribution, perhaps flawed data�

Example = Participant A, Task 1: Fixations = 50 percent, saccades = 50 percent ��

36

37 of 55

Data quality comparison��

37

38 of 55

Example – reading for different purposes

Eye movement behaviour across four different types of reading task (Jakobsen and Jensen 2008)

Six professional translators & six student translators
L2 English -> L1 Danish

��

38

	Average fixation duration (n =12)	Average fixation count (n =12)
Reading for comprehension�Reading for translation�Reading while speaking a translation�Reading while typing a translation	205 ms 205 ms 235 ms 218 ms (ST) 259 ms (TT)	145�223�520�708 (ST) 882 (TT)

39 of 55

Example – translation directionality

Eye tracking translation directionality (Pavlovic and Jensen 2009)

Four professional translators & four student translators�
L2 English -> L1 Danish (~250 words)�L1 Danish -> L2 English (~250 words)

��

39

40 of 55

Example – translation directionality

Both hypotheses confirmed. All comparisons p < 0.05.

��

40

Hypothesis	Average gaze time (n =8)	Average fixation duration (n = 8)	Average pupil size (n = 8)
TT processing requires more effort than ST processing into L1	385.5 sec (TT) 212.8 sec (ST)	415 ms (TT) 248 ms (ST)	3.45 mm (TT) 3.38 mm (ST)
TT processing requires more effort than ST processing into L2	378.8 sec (TT) 212.8 sec (ST)	399 ms (TT) 245 ms (ST)	3.52 mm (TT) 3.42 mm (ST)

41 of 55

Example – translation directionality

Partially (tentatively) confirmed. No statistical test performed.

��

41

Hypothesis	Average fixation duration in ms (n = 2)	Average pupil size in mm (n = 2)
L2 translation requires more cognitive effort than L1 translation	ST = 258 (L1) & 247 (L2)�TT = 395 (L1) & 383 (L2)	ST = 3.37 (L1) & 3.42 (L2)�TT = 3.45 (L1) & 3.51 (L2)

42 of 55

Allocation of cognitive resources in translation ��Investigate how translators allocate cognitive resources during translation.�Two indicators are employed to investigate allocation of cognitive resources to the source text and the target text:�
Cognitive resource management
Cognitive load��How are these indicators affected by:�
Different types of processing
Differences in translational expertise
Differences in time conditions

42

43 of 55

The processes of translation ��

Source text (ST) processing (cf. e.g. Kintsch 1988)
Source text reading	Orthographic analysis
Source text comprehension	Lexical analysis�Propositional analysis�Text representation and LTM transfer

43

Target text (TT) processing (cf. e.g. Kellogg 1996)
TT reformulation	Planning�Encoding�Verification of translation
TT typing	Finger movement programming�Executing finger movement
TT reading	Orthographic analysis

44 of 55

Sequential and parallel processing��

Sequential processing (cf. e.g. Seleskovitch 1976)
Identification of source text meaning is processed independently before target language production can begin

44

Parallel processing (cf. e.g. Gerver 1976, de Groot 1997)
Identification of source text meaning is processed simultaneously with target language reformulation

ST

TT

ST

TT

ST

TT

ST

TT

ST

TT

ST

TT

ST

TT

45 of 55

Data collection and analysis��

45

Eye-tracking data (fixation data, saccade data, pupil data)
Tobii 1750 (50 Hz) eye-tracker
Fixations and saccades -> source text processing and target text processing�Changes in pupil size -> changes in cognitive load

Key-logging data (typing events)
ClearView software
Key-logging data -> target text processing

AOIs
Source text -> large AOI
Target text -> large AOI

46 of 55

Independent variables�

46

Independent variables
Processing type	Source text, target text, parallel processing
Translational expertise	12 professional translators, 12 student translators
Text complexity	Easy text, difficult text
Time constraint	No time pressure, heavy time pressure

47 of 55

Dependent variables�

��

47

Dependent variables
Attention units (AU)�Duration between attention shifts (milliseconds)	�Management of cognitive resources
Pupil size�AU pupil size measurements (millimeters)	�Cognitive load

48 of 55

Hypotheses��

48

Cognitive load (pupil size)�-> How does cognitive load vary during translation?
Processing type	TT processing > ST processing
Processing type	Parallel ST/TT processing > ST processing & TT processing
Expertise	Student translators > professional translators
Time pressure	Time pressure > no time pressure

Cognitive resource management (AU duration)�-> How are cognitive resources managed during translation?
Processing type	TTAUs > STAUs
Processing type	PAUs < STAUs & TTAUs
Expertise	Students’ AUs > professionals’ AUs
Time pressure	Time pressure AUs < no time pressure AUs

49 of 55

Findings – cognitive resource management��

49

TTAUs > STAUs = confirmed
Nearly all (12 of 13) relevant comparisons were significant.
-> Comprehension is less cognitively demanding than reformulation as it is performed more� quickly.�-> Large difference between professionals and students, indicating that professionals are� better at flexibly adjusting resource allocation.

PAUs < STAUs & TTAUs = confirmed
All relevant comparisons were significant.�PAU duration across factors was non-significantly different (429 ms)
-> Parallel ST/TT processing occurs in translation�-> Parallel ST/TT processing is subject to WM storage and/or processing limitations�-> Upper parallel processing limit on the cognitive system

50 of 55

Findings – cognitive resource management��

50

Students’ AUs > professionals’ AUs = partially confirmed
Students’ STAUs were generally significantly longer than professionals’ STAUs�Students’ TTAUs were generally significantly shorter than professionals’ TTAUs
Professional translators are better at quickly arriving at a meaning hypothesis�Students become satisfied with a translation more quickly than professionals

Time pressure AUs < no time pressure AUs = partially confirmed
STAUs were generally significantly shorter under time pressure�TTAUs were non-significantly different under the two time conditions
Time pressure only affects comprehension and not reformulation; TT reformulation is fairly static.

51 of 55

Findings – cognitive load��

51

TT processing > ST processing = confirmed
All relevant comparisons were significant; TT pupils were systematically larger than ST pupils
Language comprehension in translation is cognitively less demanding than language production in translation. Provides further support for the management hypothesis.

Parallel ST/TT processing > ST processing & TT processing = partially confirmed
Parallel ST/TT pupils were systematically larger than ST pupils�Parallel ST/TT pupils were generally smaller than TT pupils
Automatic processing of ST or TT content occurs in translation.�Professional translators rely more on automatic processing than student translators.

52 of 55

Findings – cognitive load��

52

Student translators > professional translators = confirmed
All relevant comparisons were significant; students’ pupils were systematically larger than professionals’.
Cognitive load is higher for student translators than for professional translators.�-> professional translators rely more on automatic processing.�-> cognitive cost of task switching between ST and TT is higher for students

Time pressure > no time pressure = confirmed
All relevant comparisons were significant; pupils were systematically larger under time pressure than under no time pressure
Cognitive load is higher when translation under time pressure than when translating under no time pressure

53 of 55

Findings summary��

53

Language production in translation is generally more effortful than language comprehension
Parallel processing taxes heavily on the cognitive system
Professional translators show greater flexibility with respect to resource allocation than student translators
Time pressure affects mainly the comprehension aspect of translation rather than the production aspect
Professional translators are likely to rely more on automatic processing than student translators
The cost of switching between tasks is higher for students than for professionals

54 of 55

What does the quantitive data not tell us?

Correlation between more cognitive effort and translation quality?
Better translation product if more time is available?
Can we readily assume that an ’optimal’ translation process will lead to better translation quality?
How did the translators experience the time comstraints?
How did the translators experience text complexity? ��

54

55 of 55

Quantitative data needs to be supplemented with ...

Questionnaire data
Translation quality assessment
Verbal protocols which do not interfere with the translation process
Etc.

55