Technical Reporting:�AI-generated zero draft content�Scaffolding, not substitute
27.2.26
Jules Colomer, Leader Portfolio Performance and Results
1
Purpose & Contents
2
An evolving Performance & Results Management context: More juice, less squeeze
Questions are: More interconnected (e.g. $ and results). X-Integrated partnership (Centers/Independent/System), Domain/Entity/Geog specific. Requirements are: More (dis-)aggregable. Quicker. Simpler. More transparent. Higher Quality. Repeatable. Tunable. Made for AI.
Expectations
Delivery cost
Resources
3
AI tool use in Performance and Results Management
Goals:
Efficiency at scale without sacrificing rigor:
AI reduces duplication, accelerates evidence aggregation, and supports traceability. Tools integrate directly into CGIAR workflows, dashboards, and reporting processes.
Credibility through human-in-the-loop validation:
Unlike many general-purpose AIs, PPT’s AI outputs remain transparent, verifiable, and linked to underlying sources. Human-expert in the loop quality assurance and safeguards maintain accuracy.
Status:
We have moved beyond isolated AI pilots and are now demonstrating an integrated, human-in-the-loop architecture for accelerating evidence generation, validation, and synthesis across CGIAR.
Objective of providing AI-generated zero draft
The zero draft removes the heavy lifting of evidence compilation so teams can focus on interpretation, narrative, and decision-relevant insight.
5
Read the report: https://cgspace.cgiar.org/items/5ca2c041-3ea3-4f5b-84b4-4431230acfb1
6
PRMS 2022-24
Human expert: Conflict transformation and peacebuilding specialist
Product
What it is/isn’t
Version 0 (v0) — AI-Generated Zero Draft
evidence pre-assembly.
The v0 is a machine-generated synthesis created from evidence in PRMS. It is intended to provide a comprehensive, evidence-linked starting point, not an approved narrative.
Authorship
Generated using available data.
No human author.
No institutional ownership.
Purpose
Reduce blank-page effort.
Surface relevant evidence, trends, and signals.
Provide a structured scaffold aligned with the reporting template.
Nature of content
Evidence-compiled, not judgement-based.
May include gaps, inconsistencies, or weak signals.
May contain interpretation that requires verification.
Accountability
v1+ = human-authored report
sense-making and narrative construction.
The moment a human edits the v0, the document becomes a human-authored report. From that point onward, responsibility shifts fully to the Programme/Accelerator team.
Authorship
Owned by Programme/Accelerator leadership and contributors.
Human judgement becomes primary.
Purpose
Interpret evidence.
Apply context, nuance, and strategic judgement.
Make claims, narratives, and positioning decisions.
Nature of content
Selective, intentional, and curated.
Includes interpretation, framing, and emphasis.
Reflects managerial and scientific judgement.
Accountability
v0 assembles evidence; human drafts create meaning and assume responsibility.
Scaffolding NOT Substitute
8
Data/Information/Knowledge/Wisdom
Wisdom: "integrated knowledge—information made super-useful“ & the ability to make sound judgments and decisions apparently without thought
Knowledge: Knowledge is a fluid mix of framed experience, values, contextual information, expert insight and grounded intuition that provides an environment and framework for evaluating and incorporating new experiences and information.
FRAMEWORK = TOC
Information: organized or structured data, which has been processed in such a way that the information now has relevance for a specific purpose or context, and is therefore meaningful, valuable, useful and relevant.
Data: discrete, objective facts or observations, which are unorganized and unprocessed and therefore have no meaning or value because of lack of context and interpretation
FRAMEWORK
9
Pipeline Steps — Overview
1
Data Consolidation
PRMS and ToC data combined into a single consolidated data source
2
Section Slicing
Python script slices data — creates 1 slice per Annual Report Section
3
Mini-Slicing & LLM Narrative
Per section: Python creates mini-slices (e.g. partners by country); LLM produces targeted narrative
4
Standard Visuals
Python also creates standard charts and visuals for each section
5
Assembly & Editing
All sections assembled into full editable report with prompts to re-generate or discard
6
Word Export
Export as .docx with watermarks — watermarks must be removed before official use
⚠ Important: The LLM is focused on putting the narrative together, NOT on calculating numbers. All numerical computations are handled by the Python pipeline.
PRMS + ToC → Python Slicing → LLM Narrative → Editable Report → Word Export
Annual Report Generation Pipeline
INPUT DATA
CONSOLIDATION
SLICING
SECTIONS
PER-SECTION PROCESSING
PRMS Data
ToC Data
Consolidated
Data Source
Python Script
Slices data into sections
1 per Annual Report Section
Section 1
Section 2
...
Section N
Per-Section Processing
Python creates mini-slices
e.g. partners by country,
KPIs by region, etc.
LLM reviews relevant slices
Produces targeted narrative
Python creates standard visuals
Narr. 1
Narr. 2
Narr. N
⚠ Important: The LLM focuses on putting the narrative together, NOT on calculating numbers.
HOLISTIC REVIEW
ASSEMBLY
EXPORT
LLM Holistic Review
Reviews each section holistically
and produces cohesive narrative
Full Editable Report
All sections assembled
Access to prompts to
re-generate or discard sections
Export: Word (.docx)
Includes watermarks that
need to be removed
before use
PRMS + ToC → Python Slicing → LLM Narrative → Editable Report → Word Export
What gets generated/what sections it covers
An AI-generated draft narrative and a set of draft graphs will be produced in Word for the following sections:
The Word document will be watermarked as Draft – Not for Publication, and is intended for consideration by P/A teams for inclusion in the Annual Technical Report.
The Word doc. will be preceded by an Excel export of the underlying data used to generate the draft narrative and graphs.
# | Section title | Content | Use of AI tool to generate text + graphs |
1 | Overview for 2025 | 2 pages: Factsheet/Info about the P/A/Budget/Executive summary/high-level results diagram | Yes |
2 | Annual progress | 6 pages: Program-level ToC progress + AoW progress + AoW ToC diagram + traffic light progress ratings Highlight boxes; stories from countries /regions; quotes from local partners | Yes |
3 | Partnerships | 2 pages: Overview of partnerships +1-2 optional figures *For Accelerator reports, the focus is Portfolio linkages, with an option to also include partnerships, depending on the operating model of the Accelerator. | Yes |
4 | Adaptive management | 2 pages: Narrative across all adaptive management enablers | No |
5 | Key result story | 2 pages: In-focus story; similar to previous reports | No |
12
Next steps
13
P/A Technical Report: key dates and deliverables
P/A Annual Technical Report | 4 March 2026 |
|
2-6 March 2026 |
| |
9 March 2026 |
| |
Throughout March 2026 |
| |
27 March 2026 |
| |
April 2026 |
| |
13 – 24 April 2026 |
| |
28 April – 5 May 2026 |
| |
6 May 2026 |
| |
6 May 2026 |
|
14
Guardrails – workflow and principle
CGIAR uses AI-assisted tools within its reporting workflows to support evidence synthesis and draft preparation. All published content is reviewed, validated, and approved by responsible human authors prior to release.
15
16
Q&A
Is this replacing us?
Will this misrepresent our work?
Is this going to increase our workload?
Who is accountable if something is wrong?
Are we required to use it?
What about bias or hallucinations?
What if our data in PRMS is incomplete or wrong?
Will funders see AI-generated text?
Do we disclose use of AI?
17
Resources
AI tools for evidence exploration and use
From Food Security to Food for Security (2 pager)
A Review of CGIAR’s Engagement in Fragility and Conflict: Summary Report (10 pager)
2022 CGIAR Technical Reports �2023 CGIAR Technical Reports�2024 CGIAR Technical Reports
CGIAR Technical Reporting Arrangement
CGIAR Performance and Results Management Framework (PRMF)
Innovation Portfolio Management
Reports on Responsible Innovations & Scaling
Scaling Readiness website�Online course on Innovation and Scaling�Readiness Calculator
Performance & Results Hub�Evaluation and Management Response Actions Tracker
Email: performanceandresults@cgiar.org
18