1 of 55

Process measures and data analysisPhD Summer School on Translation Processes Research�CBS, August 2011

1

Kristian Tangsgaard Hvelplund�

kthj.isv@cbs.dk

2 of 55

  • Tutorials and hands-on sessions
  • Groups of 4�
    • Come up with a research project
    • Design an experiment
    • Consider relevant process measures and how to analyse the data
    • Run the experiment
    • Data analysis����

2

3 of 55

  • Translation as information processing�����������Based on Baddeley and Hitch (1974), Baddeley (2007), Bennaroch (2006), Eysenck and Keane (2010), Jaekl and Harris (2007)���

3

�Working memory

7 items (+/- 2)�< 18 seconds��

Long-term memory�∞

Sensory memory�< 500 ms�-> 60 ms

Motor system�-> 200 ms

Attentional control

4 of 55

  • Methods of data elicitation
  • Eye tracking
    • Cognitive sciences
    • Psycholinguistics
    • Psychology
    • Human-computer interaction
    • Marketing research
    • Etc.

  • Key logging
    • Translation process studies
    • Writing process studies����

4

5 of 55

  • Advantages of eye tracking and key logging
  • Reliability
    • We can be fairly certain that eye-tracking data and key-logging data are manifestations of ongoing cognitive processing

  • Nonintrusive
    • The reliability of the data as reflections of the participant’s translation process is not compromised by the research process

  • Completeness and level of detail
    • Eye-tracking data and key-logging data in combination offer a highly complete and detailed representation of the translation process���

5

6 of 55

  • Typical key-logging based measures
  • Pauses�Character count�Revision behaviour�Editing�Etc.��

6

7 of 55

  • Typical eye-movement based measures
  • Fixation duration
    • The time the eye fixates on a single locale – often measured in milliseconds (ms)

  • Fixation count
    • More fixations reflects more cognitive effort / fewer fixations reflects less cognitive effort

  • Pupil size
    • Pupil size reflects cognitive processing intensity

  • Total gaze time
    • More time spent in a region reflects more cognitive effort / less time spent in a region reflects less cognitive effort

7

8 of 55

  • Typical eye-movement based measures
  • First pass gaze duration / fixation count / pupil size
    • Reflects the cognitive effort / intensity during the initial reading of a word or segment�Often character count is introduced as control variable

  • Second pass gaze duration / fixation count / pupil size
    • Reflects the cognitive effort / intensity during the subsequent reading of a word or segment�Is often compared with first pass

  • Regressions
    • May indicate uncertainty on the part of the reader in comprehending the text�More regressions = problems with comprehension

  • Transitions
    • Reflects the number of times attention shifts between two tasks��

8

9 of 55

  • Types of eye movements
  • Reading consists of two types of eye movements + pupillary movement:�
    • (Visual) fixations
    • Saccades
    • Pupil dilation and pupil constriction

9

10 of 55

  • Visual fixation
  • The continued maintenance of visual gaze at a specific location so that the retina is stabilised over an object of interest (Duchowski 2007: 46).�
  • Eye-mind and immediacy assumptions (Just and Carpenter 1980: 331)��“there is no appreciable lag between what is being fixated and what is being processed”, and��“the interpretations at all levels of processing are not deferred; they occur as soon as possible���

10

11 of 55

  • Interpretation and problems
  • Visual focus of a word = cognitive resources allocation to this word
    • Short fixations = less cognitive effort
    • Long fixations = more cognitive effort�
  • Covert attention
    • Attention can shift independently from eye movement

  • Saccades
    • The eye is blinded during saccadic eye movements���

11

12 of 55

  • Visual fixation – durations based on task
  • Silent reading = 225 ms
  • Reading aloud = 275 ms
  • Reading emerging text (reading while typing) = 400 ms (Rayner 1998: 373)
  • Reading during translation (Jakobsen and Jensen 2008)
    • Source text reading = 218 ms
    • Target text reading = 259 ms

���

12

13 of 55

  • Saccades
  • Rapid eye movements between actual fixations. No visual information transmitted to the cognitive system. �
  • Saccade speed -> 500 degrees per second�Typical saccade length during reading -> 2 degrees (8 letter characters)�Typical saccade duration -> 30 ms�
  • Saccades account for around 5-15 percent of all eye movements during reading. ����

13

14 of 55

  • Pupillary movement
  • Small adjustable opening in the centre of the eye’s iris that allows light to enter the eye’s retina.

  • Interpretation of pupillary movement relies also on the eye-mind assumption (Just and Carpenter 1980: 331); �
  • Pupillary movement when fixation = relative change in cognitive resources allocated��-> Smaller pupils = relatively less cognitive effort�-> Larger pupils = relatively more cognitive effort����

14

15 of 55

  • Three types of eye trackers

Head-supported

Head-mounted

Remote����

15

16 of 55

  • Comparison chart�
  • Spatial resolution-> The smallest change in eye position that can be measured
  • Temporal resolution-> Number of recorded eye positions per second���������

Spatial resolution

Temporal resolution

Intrusiveness

Head-supported

High0.25 degree inaccuracy

~0.5 cm inaccuracy

Very high> 1000 Hz

Very highNo head movement

Head-mounted

Medium

0.5-1 degree inacc.

~1-2 cm inaccuracy

Medium to high�30 to 200 Hz

High�Free head and body movement

Remote

Medium0.5 degree inaccuracy~1 cm inaccuracy

Medium50 to 120 Hz

ModerateFree head movement

16

17 of 55

  • Implications for research design
  • Use large fonts (20 pitch tahoma or larger)
  • Use short texts (no longer than 200 source text words)
  • Consider a design in which no online or offline translation aids are available
  • Consider having objects of interest that cover large areas of the screen

17

18 of 55

  • Analysing eye-tracking data�with ClearView and Tobii Studio

  • Recording scenes
    • Temporal object of interest

  • Areas of interest (AOIs)
    • Spatial object of interest�����������

18

19 of 55

  • Eye-tracking measures�with ClearView and Tobii Studio�����������

Quantitative measures

Average fixation duration�Fixation count

Transitions

Gaze time / fixation count�Number of fixations

Number of attention shift from one area/task to another

19

Qualitative tools

Hot spot / heat maps�Gaze replay�Gaze plot

Static background image and hotspot mask

Dynamic background image and fixations �Static background image and fixations

20 of 55

  • Hot spot visualisation

�Reading

experiment

20

21 of 55

  • Gaze plot visualisation�Translation of the�Spielberg text�(EN->DA)

21

22 of 55

  • Gaze replay �����������

22

23 of 55

  • Limitations of these measures
  • Qualitative measures are relevant when
    • Getting an initial impression of processing pattern
    • Visualising processing patterns

  • Quantitative measures are few ...
    • Average fixation duration across one participant / task
    • Fixation count
    • What about pupil size information?
    • What about first/second pass information? Regression data?
    • What about key-logging information?

  • ... and potentially misleading
    • Average fixation duration doesn’t consider variance between individual fixation durations

  • Solution
    • Raw data

��

23

24 of 55

  • Raw eye-tracking data �����������

24

25 of 55

  • Raw data – time stamp�����������

25

26 of 55

  • Raw data – fixations�����������

26

27 of 55

  • Raw data – fixation coordinates�����������

27

28 of 55

  • Raw data – pupil diameter (mm)�����������

28

29 of 55

  • Raw data – key-logging�����������

29

30 of 55

  • Raw data – fixation annotation�����������

30

31 of 55

  • Advantages of analysing raw data
  • The process data set is much richer / many more items
  • Pupil size values
  • Individual fixation durations
  • Simultaneous reading and typing
  • First / second pass gaze duration

  • Disadvantages of analysing raw data
  • Very labour intensive

31

32 of 55

  • Eye-tracking data quality
  • The quality of the eye-tracking data is sensitive to various factors, including: (cf. e.g. O’Brien 2009, Hvelplund 2011)

    • Lighting conditions
    • Distance to the eye tracker
    • Too much head movement
    • Eye colour
    • Optical aids
  • Precautionary steps:

    • Maintain the same dimmed lighting conditions (preferably articifial light)
    • Distance no more than 55-65 cm from the eye tracker
    • Avoid too much movement
    • Avoid participants who have very dark eyes*
    • Avoid participants who wear glasses or contact lenses

32

33 of 55

  • Measuring eye-tracking data quality
  • Still risk of poor eye-tracking data. Three ways to see if the data quality is acceptable:
    • Mean Fixation Duration
    • Gaze Time on Screen as percentage of total production time
    • Gaze sample to Fixation Percentage��������

33

34 of 55

  • Mean fixation duration
  • Typical fixation duration during reading is 225 to 275 ms (Rayner 1998)

  • If fixation duration around 175-200 ms or shorter, perhaps flawed data�
    • Example = Participant A, Task 1: Mean fixation duration = 146 ms

34

35 of 55

  • Gaze time on screen
  • Written translation involves some amount of ST reading and TT reading.

  • If very limited ST and TT reading during the task, perhaps flawed data�
    • Example = Participant A, Task 1: Total gaze time on screen = 24 seconds / 8 percent ���������

35

36 of 55

  • Gaze sample to fixation percentage
  • Fixations account for 85-95 percent of all eye movements
  • Saccades account for 15-5 percent of all eye movements

  • If the raw data reflects a distorted distribution, perhaps flawed data�
    • Example = Participant A, Task 1: Fixations = 50 percent, saccades = 50 percent��������

36

37 of 55

  • Data quality comparison��������

37

38 of 55

  • Example – reading for different purposes

  • Eye movement behaviour across four different types of reading task (Jakobsen and Jensen 2008)

  • Six professional translators & six student translators
  • L2 English -> L1 Danish

������������

38

Average fixation duration (n =12)

Average fixation count (n =12)

Reading for comprehension�Reading for translation�Reading while speaking a translation�Reading while typing a translation

205 ms

205 ms

235 ms

218 ms (ST)

259 ms (TT)

145�223�520�708 (ST)

882 (TT)

39 of 55

  • Example – translation directionality

  • Eye tracking translation directionality (Pavlovic and Jensen 2009)

  • Four professional translators & four student translators�
  • L2 English -> L1 Danish (~250 words)�L1 Danish -> L2 English (~250 words)

������������

39

40 of 55

  • Example – translation directionality

  • Both hypotheses confirmed. All comparisons p < 0.05.

������������

40

Hypothesis

Average gaze time (n =8)

Average fixation duration (n = 8)

Average pupil size (n = 8)

TT processing requires more effort than ST processing into L1

385.5 sec (TT)

212.8 sec (ST)

415 ms (TT)

248 ms (ST)

3.45 mm (TT)

3.38 mm (ST)

TT processing requires more effort than ST processing into L2

378.8 sec (TT)

212.8 sec (ST)

399 ms (TT)

245 ms (ST)

3.52 mm (TT)

3.42 mm (ST)

41 of 55

  • Example – translation directionality

  • Partially (tentatively) confirmed. No statistical test performed.

������������

41

Hypothesis

Average fixation duration in ms (n = 2)

Average pupil size in mm (n = 2)

L2 translation requires more cognitive effort than L1 translation

ST = 258 (L1) & 247 (L2)�TT = 395 (L1) & 383 (L2)

ST = 3.37 (L1) & 3.42 (L2)�TT = 3.45 (L1) & 3.51 (L2)

42 of 55

  • Allocation of cognitive resources in translation Investigate how translators allocate cognitive resources during translation.�Two indicators are employed to investigate allocation of cognitive resources to the source text and the target text:�
  • Cognitive resource management
  • Cognitive load��How are these indicators affected by:
  • Different types of processing
  • Differences in translational expertise
  • Differences in time conditions

42

43 of 55

  • The processes of translation ���

Source text (ST) processing (cf. e.g. Kintsch 1988)

Source text reading

Orthographic analysis

Source text comprehension

Lexical analysis�Propositional analysis�Text representation and LTM transfer

43

Target text (TT) processing (cf. e.g. Kellogg 1996)

TT reformulation

Planning�Encoding�Verification of translation

TT typing

Finger movement programming�Executing finger movement

TT reading

Orthographic analysis

44 of 55

  • Sequential and parallel processing�����

Sequential processing (cf. e.g. Seleskovitch 1976)

Identification of source text meaning is processed independently before target language production can begin

44

Parallel processing (cf. e.g. Gerver 1976, de Groot 1997)

Identification of source text meaning is processed simultaneously with target language reformulation

ST

TT

ST

TT

ST

TT

ST

TT

ST

TT

ST

TT

ST

TT

45 of 55

  • Data collection and analysis

45

Eye-tracking data (fixation data, saccade data, pupil data)

Tobii 1750 (50 Hz) eye-tracker

Fixations and saccades -> source text processing and target text processing�Changes in pupil size -> changes in cognitive load

Key-logging data (typing events)

ClearView software

Key-logging data -> target text processing

AOIs

Source text -> large AOI

Target text -> large AOI

46 of 55

  • Independent variables

46

Independent variables

Processing type

Source text, target text, parallel processing

Translational expertise

12 professional translators, 12 student translators

Text complexity

Easy text, difficult text

Time constraint

No time pressure, heavy time pressure

47 of 55

  • Dependent variables

���

47

Dependent variables

Attention units (AU)�Duration between attention shifts (milliseconds)

�Management of cognitive resources

Pupil size�AU pupil size measurements (millimeters)

�Cognitive load

48 of 55

  • Hypotheses����

48

Cognitive load (pupil size)�-> How does cognitive load vary during translation?

Processing type

TT processing > ST processing

Parallel ST/TT processing > ST processing & TT processing

Expertise

Student translators > professional translators

Time pressure

Time pressure > no time pressure

Cognitive resource management (AU duration)�-> How are cognitive resources managed during translation?

Processing type

TTAUs > STAUs

PAUs < STAUs & TTAUs

Expertise

Students’ AUs > professionals’ AUs

Time pressure

Time pressure AUs < no time pressure AUs

49 of 55

  • Findings – cognitive resource management

49

TTAUs > STAUs = confirmed

Nearly all (12 of 13) relevant comparisons were significant.

-> Comprehension is less cognitively demanding than reformulation as it is performed more� quickly.�-> Large difference between professionals and students, indicating that professionals are� better at flexibly adjusting resource allocation.

PAUs < STAUs & TTAUs = confirmed

All relevant comparisons were significant.�PAU duration across factors was non-significantly different (429 ms)

-> Parallel ST/TT processing occurs in translation�-> Parallel ST/TT processing is subject to WM storage and/or processing limitations�-> Upper parallel processing limit on the cognitive system

50 of 55

  • Findings – cognitive resource management

50

Students’ AUs > professionals’ AUs = partially confirmed

Students’ STAUs were generally significantly longer than professionals’ STAUs�Students’ TTAUs were generally significantly shorter than professionals’ TTAUs

Professional translators are better at quickly arriving at a meaning hypothesis�Students become satisfied with a translation more quickly than professionals

Time pressure AUs < no time pressure AUs = partially confirmed

STAUs were generally significantly shorter under time pressure�TTAUs were non-significantly different under the two time conditions

Time pressure only affects comprehension and not reformulation; TT reformulation is fairly static.

51 of 55

  • Findings – cognitive load

51

TT processing > ST processing = confirmed

All relevant comparisons were significant; TT pupils were systematically larger than ST pupils

Language comprehension in translation is cognitively less demanding than language production in translation. Provides further support for the management hypothesis.

Parallel ST/TT processing > ST processing & TT processing = partially confirmed

Parallel ST/TT pupils were systematically larger than ST pupilsParallel ST/TT pupils were generally smaller than TT pupils

Automatic processing of ST or TT content occurs in translation.�Professional translators rely more on automatic processing than student translators.

52 of 55

  • Findings – cognitive load

52

Student translators > professional translators = confirmed

All relevant comparisons were significant; students’ pupils were systematically larger than professionals’.

Cognitive load is higher for student translators than for professional translators.�-> professional translators rely more on automatic processing.�-> cognitive cost of task switching between ST and TT is higher for students

Time pressure > no time pressure = confirmed

All relevant comparisons were significant; pupils were systematically larger under time pressure than under no time pressure

Cognitive load is higher when translation under time pressure than when translating under no time pressure

53 of 55

  • Findings summary

53

Language production in translation is generally more effortful than language comprehension

Parallel processing taxes heavily on the cognitive system

Professional translators show greater flexibility with respect to resource allocation than student translators

Time pressure affects mainly the comprehension aspect of translation rather than the production aspect

Professional translators are likely to rely more on automatic processing than student translators

The cost of switching between tasks is higher for students than for professionals

54 of 55

  • What does the quantitive data not tell us?

  • Correlation between more cognitive effort and translation quality?
  • Better translation product if more time is available?
  • Can we readily assume that an ’optimal’ translation process will lead to better translation quality?
  • How did the translators experience the time comstraints?
  • How did the translators experience text complexity?�����������

54

55 of 55

  • Quantitative data needs to be supplemented with ...

  • Questionnaire data
  • Translation quality assessment
  • Verbal protocols which do not interfere with the translation process
  • Etc.

55