ABCDEFGHIJKLMNOPQRSTUVWXYZAAABACADAEAFAGAHAIAJAKALAMANAOAPAQARASATAUAVAWAXAYAZ
1
2
Pearson's Correlation Matrix--Fully Analytical Inference Under the (Gaussian) Identity Matrix
1.
p-value for the entire matrix
3
These results are obtainable under fully generalized data conditions and correlation values via
2.
CDF matrix
4
the NAbC algorithm from Opdyke, JD, (2022), "Beating the Correlation Breakdown, for Pearson's
3.
Inverse CDF ('quantile') Correlation Matrix
5
and Beyond: Robust Inference and Flexible Scenarios and Stress Testing for Financial Portfolios"
4.
Upper/Lower 95% Correlation Matrices (i.e. Simultaneous Confidence Interval matrices,
6
(on SSRN, ResearchGate, www.DataMineit.com)
as well as Individual Cell Confidence Intervals)
7
8
This is a fully analytic solution, requiring no sampling ('rejection' methods or otherwise). The null hypothesis is the Gaussian identity matrix. Green cells require valid user input.
9
For 1. & 2.:
Insert a (positive definite) correlation matrix in the green cells of the top matrix (rows 17-20) to obtain the unique corresponding CDF matrix in the orange cells (as well as the 2-sided p-value of the matrix).
10
For 1. & 3.:
Conversely, insert a matrix of CDF values in the green cells of the next matrix (rows 27-30) to obtain the unique corresponding correlation matrix in the orange cells (as well as the 2-sided p-value of the matrix).
11
For 1. & 4.:
Insert the desired α (cell E36) to obtain, via Simultaneous Confidence Intervals, the Upper/Lower (1-α)% C.I. Correlation Matrices in the orange cells in rows 36-48 (as well as the 1-sided p-value each of the two matrices).
12
numeric tolerance
(user-specified input)
13
(user-specified input)
(user-specified input)
1.00E-307
for CDF values near 0 or 1
14
N sample size =
252
estimated/observed Correlation Matrix
Cholesky factorization
Spherical Angles
corresponding CDF matrix
(recommended value=1.00E-307)
15
"'distance" = LNP =
16
10.60000.20000.00000.20001.000000000.00000.00070.50000.0007
LNP = ln( product of all 2-sided p-values )
17
1. & 2.:
0.60001-0.10000.10000.20000.60000.80000000.92730.00001.00000.02400.0000
matrix (2-sided) p-value =
= SUM [ ln(each p-value) ]
18
0.2000-0.100010.1000-0.20000.2000-0.27500.9404001.36941.85530.00071.00000.01140.99891.000000-762.91
19
Null = Gaussian identity matrix
0.00000.10000.10001-0.20000.00000.12500.14290.981801.57081.44551.42630.50000.02400.01140.9998
20
0.20000.2000-0.2000-0.200010.20000.2500-0.1821-0.20900.90591.36941.31281.76421.79760.00070.00000.99890.9998
21
10000k=4321
22
01000
23
00100
(user-specified input)
24
00010
CDF matrix
Spherical Angles
Cholesky factorization
corresponding ('quantile') Correlation Matrix
25
00001
"'distance" = LNP =
26
0.00260.00260.00260.00261.0000000010.17590.17590.17590.1759
LNP = ln( product of all 2-sided p-values )
27
0.0026 0.00260.00260.00261.39400.17590.98440000.175910.20170.20170.2017
matrix (2-sided) p-value =
= SUM [ ln(each p-value) ]
28
1. & 3.:
0.0026 0.0026 0.00260.00261.39401.39370.17590.17350.9690000.17590.201710.22680.22680.050000-52.75
29
0.0026 0.0026 0.0026 0.00261.39401.39371.39330.17590.17350.17110.953800.17590.20170.226810.2512
30
0.0026 0.0026 0.0026 0.0026 1.39401.39371.39331.39300.17590.17350.17110.16870.93870.17590.20170.22680.25121
31
k=4321
32
33
SIMULTANEOUS CONFIDENCE INTERVALS (lower CDF values are associated with higher Correlation values, and vice versa)
34
(Upper) CDF matrix (α/2)
Spherical Angles
Cholesky factorization
UPPER (1-α/2)% Confidence Interval Correlation Matrix
35
(user-specified input)
"'distance" = LNP =
36
α0.050
1. & 4.:
0.00250.00250.00250.00251.0000000010.17610.17610.17610.1761
LNP = ln( product of all 1-sided p-values )
37
CI: 1-α
0.9500.00250.00250.00250.00251.39380.17610.98440000.176110.20200.20200.2020
matrix (1-sided) p-value =
= SUM [ ln(each p-value) ]
38
0.00250.00250.00250.00251.39381.39340.17610.17370.9689000.17610.202010.22710.22710.025000-59.80
39
('back into' individual rather
0.00250.00250.00250.00251.39381.39341.39310.17610.17370.17130.953700.17610.20200.227110.2516
40
than simultaneous CI's; for
0.00250.00250.00250.00251.39381.39341.39311.39270.17610.17370.17130.16890.93860.17610.20200.22710.25161
41
example, α=0.44734 gives 95%
42
CI's for all cells individually
(Lower) CDF matrix (1-α/2)%
Spherical Angles
Cholesky factorization
LOWER (α/2)% Confidence Interval Correlation Matrix
43
for N=252)
"'distance" = LNP =
44
1. & 4.:
0.99750.99750.99750.99751.000000001-0.1761-0.1761-0.1761-0.1761
LNP = ln( product of all 1-sided p-values )
45
0.99750.99750.99750.99751.7478-0.17610.9844000-0.17611-0.1400-0.1400-0.1400
matrix (1-sided) p-value =
= SUM [ ln(each p-value) ]
46
0.99750.99750.99750.99751.74781.7482-0.1761-0.17370.968900-0.1761-0.14001-0.1048-0.10480.025000-59.80
47
0.99750.99750.99750.99751.74781.74821.7485-0.1761-0.1737-0.17130.95370-0.1761-0.1400-0.10481-0.0706
48
0.99750.99750.99750.99751.74781.74821.74851.7489-0.1761-0.1737-0.1713-0.16890.9386-0.1761-0.1400-0.1048-0.07061
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100