1 of 23

27^th International Command and Control Research and Technology Symposium, Quebec City, Oct 2022

A Data Collection Approach for Evaluating National Security Collective Training Events

Jerzy Jarmasz

Defence R&D Canada, Toronto Research Centre

Michael Thomson, Jonathan Fugelsang, Evan Risko

HumanSystems Inc.

CAN UNCLASSIFIED

2 of 23

Overview

Overview of Exercise Global Archer 2020
Measurement challenges
Approach
Review of data collection
What did we learn?
Next steps

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

3 of 23

Overview of the Exercise: Global Archer 2020 (GA20)

Collective “field exercise” Training Event, led by Canadian Special Operations Forces Command (CANSOFCOM) in 2020.

Expedited National Security (NS) crisis incident scenario
NS partners collaborated to develop coordinated, Whole of Government (WoG) response to a potential National threat in a very compressed timeframe
Activities at Tactical, Operational and Strategic levels
Decisions centred on determining authorities (legal and functional) and roles to enable expedited, lawful, pan-domain response to the threat

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

4 of 23

Measurement Challenges

No a priori “right way” of achieving strategic objectives

No objective criteria for inter-agency activity effectiveness

No direct access to objective activity data (e.g., comms logs)
“Field exercise” nature of event:

Varying degrees of access to activities across Strategic, Operational and Tactical levels
Varying degrees of access to partner organizations other than DND/CAF

Real-life requirements led to real-time scenario changes

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

5 of 23

The approach

Focus on Inter-agency activity at Op/Strat Levels
Triangulation: combine types of measures for multiple perspectives
Draw on various literatures:

C2 (esp. in WoG/pan-domain contexts)
Naturalistic decision making
Multi-team systems
Collective training design and assessment

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

6 of 23

The measures developed

Survey instrument

Post-exercise questionnaire, administered to all

Adapted Social Network Analysis (SNA) method

Probing perceptions of information sharing, pre & post exercise

Semi-structured interviews (supporting SNA)
Informal observations at planning/lead up events and GA20 itself

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

7 of 23

Post-Exercise Questionnaire

4 Survey themes, developed with CANSOFCOM:

WoG coordination and Planning
Exercise conduct
Exercise fidelity and realism
Exercise value and effectiveness

Was sent to all participating GC entities by CANSOF; responses were encouraged but voluntary (27 returns received from 9 orgs)
Only descriptive stats analyzed; number of responses did not support statistical comparisons

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

8 of 23

Sample questions

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

9 of 23

SNA instrument

SNA methodology probing information sharing

Adapted by asking about perceptions of info sharing (objective data not accessible)
Asked variety of questions: was info shared, was it timely, relevant etc.

Asked questions pre & post (e.g., what info sharing did you expect vs. what did you experience)
Responses collected with 3-pt rating scale
Various metrics (e.g., frequency, centrality) computed
Data collected from 6 consenting orgs

DND/CAF entities counted as one org in the following analyses

Wide range of analyses possible; only select few presented here due to time constraints

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

10 of 23

SNA questionnaire sample

In response to a national security crisis, my organization would typically send information to [or: rely upon, or expect info from]:
	Org 1	Org 2	…	Org X
Not at all	Coded as 1
Infrequently	Coded as 2
Frequently	Coded as 3

During the Ex, my organization sent information to:
	Org 1	Org 2	…	Org X
Not at all
Infrequently
Frequently

Sample Pre-exercise question:

Sample post-exercise question:

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

11 of 23

Big picture: Caveats and Findings

Some important caveats:

Small # of respondents (limits statistical validity)
Measures not (yet) validated
Mostly subjective measures
Key decision making events in the scenario had to be scripted out due to real-world operational requirements

Data shed light on inter-agency activity as a whole, but evaluation of any single org, or comparison of performance of participating orgs in GA20 to be avoided.

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

12 of 23

Survey findings: Theme 1 WoG Coordination and Planning

To what extent did…	Mean rating	SD
Q1: inter-departmental info sharing effectively support WoG coordination and planning?	3.47	1.3
Q2: current governance frameworks (e.g., FTRP) and policies support WoG cordination and planning?	3.05	1.24
Q3: the departments effectively balance the need to protect vs. the need to share information?	3.78	1.24
Q4: departments develop a shared understanding of the governance structure?	3.20	1.12
Q5: departments develop shared Situation Awareness during the exercise	3.34	1.47
Q6: departments develop coordinated objectives?	3.35	1.44
Q7: the national security partners effectively coordinate the execution of their activities?	3.79	1.3
Theme Overall	3.46	0.89
Internal consistency (Cronbach’s Alpha): 0.94

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

13 of 23

Survey findings: Theme 2 Exercise Conduct

To what extent were… (* = reverse coded post-hoc)	Mean Rating	SD
Q1: exercise injects disseminated appropriately (e.g., on time, to the right players)?	3.84	0.99
*Q2: blue Force objectives hindered by ineffective inject dissemination or reception?	4.07	0.59
Q3: requests for Information (RFIs) made by Blue Force players to other Blue Force players justified?	4.09	1.66
Q4: requests for Information (RFIs) made to Blue Force players responded to appropriately?	3.50	1.78
Q5: requests for Information (RFIs) made by Blue Force players to EXCON justified?	3.75	1.95
Q6: backstops employed by Trusted Agents to ensure the good progress of the scenario?	3.87	0.96
Theme Overall	3.86	0.66
Internal consistency (Cronbach’s Alpha): 0.82

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

14 of 23

Survey findings: Theme 3 Exercise fidelity and realism

To what extent did… (* = reverse coded post-hoc)	Mean rating	SD
Q1: the governance structure employed in the scenario represent a plausible governance structure for an actual NS crisis incident	4.05	1.25
Q2: the information sharing protocols employed in the exercise reflect real GC protocols (e.g., FTRP) and their use?	3.72	1.53
Q3: the scenario support the use of inter-departmental or WoG information sharing, situation awareness or planning systems/tools?	4.05	1.43
*Q4: the exercise impose artificial challenges or frictions on the Blue Force players?	3.70	0.82
Q5: the exercise capture important real-life challenges that would face the WoG apparatus during a NS crisis incident?	4.38	0.65
Q6: the exercise involve the staff and planners who would or should be involved in a real NS crisis incident?	3.92	1.27
Q7: the exercise involve the decision makers (e.g., ADMs or above) who would or should be involved in a real NS crisis incident?	3.31	1.84
Q8: the scenario reflect a realistic level of complexity for NS crisis information sharing and decision making?	4.38	0.71
Q9: the scenario capture the stress of real NS WoG activities?	3.67	1.15
*Q10: the various planning conferences provide an artificial “heads up” to the Blue Force?	3.06	1.39
Theme Overall	3.81	0.86
Internal consistency (Cronbach’s Alpha): 0.23

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

15 of 23

Survey findings: Theme 4 Exercise value and effectiveness

To what extent did the exercise, and/or its planning over the last 18 months…	Mean rating	SD
Q1: improve your department’s understanding of WoG governance during a NS crisis incident?	3.55	1.20
Q2: improve your department’s ability to support WoG decision making during a NS crisis?	3.68	1.43
Q3: enable departments to identify key areas for improvement in their ability to support WoG NS crisis planning?	4.00	1.43
Q4: improve WoG information sharing?	3.38	1.63
Q5: improve WoG common situation awareness?	3.63	1.64
Q6: improve WoG planning and objective setting?	3.57	1.60
Q7: improve WoG execution of coordinated effects?	3.53	1.49
Theme Overall	3.81	0.86
Internal consistency (Cronbach’s Alpha): 0.93

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

16 of 23

SNA send/receive ratings

Sub-organization	Pre	Post	Correlation
DND	2.3	2.3	0.64
RCMP	2.3	2.1	0.71
ITAC	2.8	1.4	0.2
IRCC	2	1.3	0.24
FINTRAC	1.7	1.4	0.60

Sub-organization	Pre	Post	Correlation
DND	2.5	1.8	0.45
RCMP	2.3	2.1	0.70
ITAC	2.8	2.1	0.51
IRCC	1.8	1.3	-0.16
FINTRAC	1.8	1.8	0.68

Mean info “would send to” (Pre) and “we did send to” (post) ratings and correlations

Mean info “would receive from” (pre) and “we did receive” (post) ratings and correlations

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

17 of 23

Alternate metric: SNA centrality graphs (1/2)

Comparison between sub-organization’s expectations about who they need to send information to both pre-exercise (left panel) and post-exercise (right panel)

Note: node size and colour reflects centrality, i.e., larger and darker nodes reflects higher centrality; line weights and colour reflects the expected frequency with which information is expected to be sent to a given sub-organization, i.e., heavier and darker lines reflect more frequent sending of information.)

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

18 of 23

Alternate metric: SNA centrality graphs (2/2)

Comparison between sub-organization’s expectations about who they need to receive information from both pre-exercise (left panel) and post-exercise (right panel)

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

19 of 23

Example of possible additional analysis: �Alignment of info sharing

Alignment of reported info sharing expectations (pre-exercise) between select participating sub-organizations

Each arrow represents alignment of perceptions by two organizations of the information flow from the first to the second:

Red solid line: misaligned

Blue dashed line: moderately aligned

Green dashed line: well aligned

While the earlier caveats on drawing inferences comparing organizations apply, this analysis illustrates a metric that could be used to monitor expectations between participants during an exercise.

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

20 of 23

Discussion: Insights about GA20

Observed high willingness for inter-agency collaboration, as well as a large amount of inter-agency activity

reflected in data and informal observations
Inter-agency activity occurs at all levels, not just Strat or Op/Strat; many interlocking loops of activity

The planning & work-up activities were instrumental in priming this collaboration
Flexible scenario design allowed the Ex to proceed despite real-life interference

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

21 of 23

Discussion: Insights about Data Collection

Post-Ex survey design: all themes except one had robust internal consistency

Theme 3 needs a review; but the questions overall can serve as basis for future exercise evaluations

SNA is a very promising methodology; its full potential remains to be explored

While objective data on info sharing would have been preferable, adapted SNA allows investigation of dynamics of expectations and “mental models” of inter-agency activity

Using multiple metrics worked well

Ensured we got some useable data despite real-time scenario changes
Provided multiple perspectives on complex event

CAN UNCLASSIFIED

CAN CLASSIFICATION // REL TO

1 of 23

2 of 23

3 of 23

4 of 23

5 of 23

6 of 23

7 of 23

8 of 23

9 of 23

10 of 23

11 of 23

12 of 23

13 of 23

14 of 23

15 of 23

16 of 23

17 of 23

18 of 23

19 of 23

20 of 23

21 of 23

22 of 23

23 of 23