1 of 21

Why and how the external STG interpreter is useful

Csaba Hruska

2 of 21

GRIN Compiler

Optimizer for lazy and strict functional languages with optional whole program analysis support

Two sub-projects:

GRIN optimizer�https://github.com/grin-compiler/grin�
GHC Haskell frontend https://github.com/grin-compiler/ghc-whole-program-compiler-project

GRIN IR:� Urban Boquist’s PhD thesis (1999),� Thomas Johnsson (advisor, inventor of compiled graph reduction, G-Machine)

3 of 21

Haskell frontend: GHC whole program compiler project�https://github.com/grin-compiler/ghc-whole-program-compiler-project

GHC Haskell frontend: GHC-WPC, exports IR of the whole program

new output format: .modpak, .ghc_stgapp (module IR + program dependencies)�new output binary: _cbits.a, _stubs.a (package’s C + generated FFI glue code)

components (cabal projects):

external-stg - small package, independent of GHC
stg-compiler - uses GHC’s codegen & RTS
stg-interpreter - run programs, independent of GHC

4 of 21

Goal: Observability

GHC compiler pipeline is a black box�Haskell runtime evaluation is a black box

I’d like to know these:

Why is the compiled executable so large? What does it contain?
Does my code run frequently? When?
What are the application’s Haskell module and C lib dependencies?
What primops and RTS features are used in the application?
When does the code do direct and indirect function calls or jumps?
What’s in the memory? Is there anything that should not be there?
When does my code allocate memory? Stack or heap?

5 of 21

.modpak & .fullpak are zip files

Zip file format = standard container�Zstd compression = speed and space efficiency

One .modpak for each Haskell module, containing:

Haskell source
Core (text)
STG (text)
Cmm (text)
Asm (text)
Compile options (text)
External STG (binary)

�The .fullpak contains the full program, all .modpak content combined.

6 of 21

Compiler pipeline design

souffle datalog compiler for static analysis implementation
ghc-wpc + external-stg
external-stg-compiler
external-stg-interpreter
gephi graph analysis and visualization tool

7 of 21

External STG

Small package, independent of GHC�External STG is a self-contained copy of GHC STG IR data type

STG defines the evaluation of Haskell programs (operational semantics)�STG is a simple functional language

Project wide unique names

Public name: package + module + binder
Private name: package + module + binder + uniq

Why not GHC Core? Core changes too often, STG is stable.�STG = Core without type lambdas in A-Normal Form, to make codegen simpler

8 of 21

External STG Interpreter

Goal: Run all Haskell applications that GHC can compile� Be independent of GHC�Design: Simple Haskell, purely functional design (StateT StgState IO)�Heap model: IntMap with monotonic address space, no address reuse

�High-level model of GHC primops and Runtime System�(manually extracted from the native backend implementation)

�Validation: GHC testsuite + quickcheck tests for simple primops�

When things go wrong we get nice pattern match error instead of a segfault

9 of 21

Experience & Insights

The interpreter is surprisingly useful for various things:

learn GHC primop + RTS semantics
debug Haskell programs
profile Haskell programs
help frontend developers, i.e. Idris2 on ext-stg�get nice pattern match error instead of segfaults
research: full trace and call graph export and visualization

10 of 21

Interpreter = Learning tool

The STG interpreter has its own implementation of primops and RTS in Haskell. (independent of GHC)

If you can read Haskell, then you will understand:

closure / lazy evaluation / partial application handling
complex primitive types (MVar, Array, ByteArray, etc)
exception handling
thread handling
FFI

11 of 21

Interpreter = Debugger & Profiler

It’s easy to collect and observe runtime data in the pure State monad.�New property to track = new field in StgState data type

Profiler features:

Full trace
Whole program call graph
Execution region call graph
Full state dump

Debug features:

Breakpoints
Stack trace
Step by step evaluation
Observe variables
Observe heap
Program execution regions
Track heap object origin
Debug script

12 of 21

Tech stack for debugging

Gephi - visualization, exploratory data analysis�Souffle datalog - analysis of the exported program state and traces�Haskell - collect data during the interpretation

.tsv file format for traces and output data (tab separated values)

�https://gephi.org�https://souffle-lang.github.io

13 of 21

Ext-STG debugger defects

Perfect program-point precision in STG level, but not in Haskell level

Problem: GHC does not provide enough source location ➞� can not map STG program-points to Haskell source accurately

Solution: connect STG code to Haskell manually, check the STG/Core IRs

�Precise module level program-point granularity�Manual mapping is needed for module internals

14 of 21

Debug and profile experience

GHC

Full recompile in profile mode
Invazive code changes�HasCallStack, cost centre, trace
Limited runtime observation�eventlog, ghc-debug
New debug feature =�recompile GHC and the project
No step by step debugger

STG Interpreter

No profile mode recompilation�use the same .fullpak
Everything is observable�primops, heap, foreign calls, memory access patterns, StgState
New debug feature =�interpreter recompile +�run the same .fullpak
Programmable debugger�debug script

15 of 21

Haskell design defects

Requiring invasive code changes by design is wrong!

cost centres = bad developer experience, blocks debugging�profile mode & cost centre recompilation = bad UX/DX��GHC is black box�Runtime System is black box��It’s bad when people make their code less readable or lower level just to allow debugging or better performance.

Make the optimizer and the runtime evaluation observable instead.

It is bad when GHC optimizer capabilities shapes the Haskell programming style.

16 of 21

Accidental complexity = community damage

How much effort would you invest to learn the runtime evaluation of Haskell programs?�For GHC Haskell you need to understand:� core, stg, cmm, C, primops, GHC’s codegen, RTS and design decisions�What if this exceeds your resource budget?�Would you give up and switch language?�Would you wait for others to fix your problem or explore your research idea?

It is a myth that Haskell evaluation is complex and tricky.�It’s an engineering and implementation issue.

17 of 21

My experience with GHC dev community

no feedback

no brainstorming��

You think differently ➞ You are alone

18 of 21

DEMO

Simple program

OpenGL minigame

GHC compiling hello.hs

19 of 21

Conclusion

Developer experience matters

Persistent IR

Simplicity

Observability

Data visualization

20 of 21

Support on Patreon

patreon.com/csaba_hruska

21 of 21

Consulting

csabahruska.com