Controlled Abstention Networks (CAN)
Prof. Elizabeth A. Barnes & Prof. Randal J. Barnes
presentation to AI2ES
May 2021
Controlled Abstention Networks (CAN)
The earth system is exceedingly complex and often chaotic in nature, making prediction incredibly challenging:
We cannot expect to make perfect predictions all of the time.
The abstention loss works by incorporating uncertainty in the network’s prediction to identify the more confident samples and abstain (say “I don’t know”) on the less confident samples.
...the abstention loss is applied during training to preferentially learn from the more confident samples.
Launching point
Had idea of abstention networks back in 2018 (i.e. IDK networks) but had no clue how to do it.
Never would have made it this far without the great PhD work of Dr. Sunil Thulasidasan!
Our results build off of the groundwork put down by his paper and dissertation for classification tasks
Manuscripts submitted + arXiv
General Idea
Adding uncertainty to regression tasks
* write-up coming soon...
Uncertainty for regression tasks
Barnes and Barnes (in prep)
Uncertainty for regression tasks
Barnes and Barnes (in prep)
Uncertainty for regression tasks
Barnes and Barnes (2021)
Barnes and Barnes (in prep)
Powerful Baseline Approach
Adding abstention
Abstention During Training
Abstention During Training
prediction weight
baseline -log(p)
controls amount of abstention
data-specific scale
Abstention During Training
prediction weight
baseline -log(p)
controls amount of abstention
data-specific scale
A simple 1D example
Barnes and Barnes (2021)
20% of the data
80% of the data
A simple 1D example
Barnes and Barnes (2021)
20% of the data
80% of the data
A simple 1D example
Barnes and Barnes (2021)
20% of the data
80% of the data
A simple 1D example
Barnes and Barnes (2021)
20% of the data
80% of the data
A more complex example
Synthetic Climate Data
Mamalakis, Ebert-Uphoff and Barnes (2021)
y = -.019
Synthetic Climate Data
Mamalakis, Ebert-Uphoff and Barnes (2021)
Barnes and Barnes (2021)
output
layer
input
layer
hidden
layer
hidden
layer
y = -.019
Synthetic Climate Data
EXPERIMENT: Forecasts of Opportunity
Mamalakis, Ebert-Uphoff and Barnes (2021)
Barnes and Barnes (2021)
output
layer
input
layer
hidden
layer
hidden
layer
y = -.019
Forecast of Opportunity when the average in this box is > 0.5
Abstention outperforms baseline
Barnes and Barnes (2021)
Abstention outperforms baseline
Barnes and Barnes (2021)
Abstention outperforms baseline
Specific labels corrupted (structured noise)
Shuffled sample labels
(arbitrary label noise)
Forecasts of opportunity
(skillful predictions)
Corrupted inputs
(input data cleaner)
CAN outperforms baseline networks
Barnes and Barnes (2021a)
Barnes and Barnes (2021b)
Take home ideas
https://github.com/eabarnes1010/controlled_abstention_networks