A WORKFLOW FOR ERROR ANALYSIS FOR DRUG RESPONSE PREDICTION
VIA STATISTICAL STANDARDIZATION AND DISTRIBUTION ANALYSIS
JAKE GWINN, JUSTIN M. WOZNIAK, RAJEEV JAIN, YITAN ZHU, �ALEXANDER PARTIN, THOMAS BRETTIN, AND RICK STEVENS�DATA SCIENCE & LEARNING�ARGONNE NATIONAL LABORATORY
WORKS @ SC — November 17, 2025 — St. Louis
OVERVIEW: HPC WORKFLOWS
Workflows are more relevant than ever in a time of increasing automation
Workflows
Cancer�Data�Sets
+
Training�Tasks
OUTLINE
3
SWIFT/T: DESIGNED FOR EXASCALE
Hierarchical concurrency in MPI environments
Finalist 2020
$ conda install -c swift-t swift-t
ML
ML
ML
SIM
ML
ML
ML
SIM
THE SWIFT PROGRAMMING MODEL�
All progress driven by concurrent dataflow
(int r) myproc (int i, int j)
{
int x = F(i);
int y = G(j);
r = x + y;
}
SWIFT SYNTAX
int i = 4;
string s = "hello world";
file image<"snapshot.jpg">;
app (file o) myapp(file f, int i)
{ mysim "-s" i @f @o ; }
typedef image file;
image A[];
type protein_run {
file pdb_in; file sim_out;
}
bag<blob>[] B;
if (x == 3) {
y = x+2;
s = strcat("y: ", y);
}
foreach f,i in A {
B[i] = convert(A[i]);
}
merge(analyze(B[0], B[1]),
analyze(B[2], B[3]));
ASYNCHRONOUS DYNAMIC LOAD BALANCER
ADLB for short
MPI: THE MESSAGE PASSING INTERFACE
CANDLE/SUPERVISOR OVERVIEW
9
CANDLE/SUPERVISOR IMPLEMENTATION
10
LEARNING ON REAL SUPERCOMPUTERS
Steep learning curve with myriad technologies
Deep learning (Keras, TensorFlow, Horovod)
etc. …
EXAMPLE: INCREMENTAL LEARNING
run_stage(int N, int S, string this, int stage,
void block, string plan_id) {
void parent = run_single(this, stage, block, plan_id);
if (stage < S) {
foreach id_child in [1:N] {
run_stage(N, S, this+"."+id_child, stage+1, parent,
plan_id, db_file, runtype);
}}
}
run_single(string node, int stage, void block) {
json_fragment = make_json_fragment(node, stage);
json = "{\"node\": \"%s\", %s}" % (node, json_fragment);
block => obj(json, node);
}
HIGH ERROR DRUGS (HEDS)
DISTRIBUTION OF ERRORS
NEW ERROR METRIC
WORKFLOW SPECIFICATION
VARIOUS I/O IN HED WORKFLOW
RESULTS FOR 10 AUC RANGES
RESULTS FOR 2 AUC RANGES
CHALLENGES: INCREASING COMPLEXITY
?�Deployment�?
TAKEAWAYS
Reusability in a framework for deep learning and cancer
THANKS
22
ACKNOWLEDGMENTS
QUESTIONS?
WOZ@ANL.GOV
www.anl.gov