Project Expo 2025
13:30–15:00, Friday, May 30th, 2025
Lecture Hall, Kavli IPMU
CD3 Year 2 Summary
Jia Liu
google font "Chango"
color #3ec70b
Theoretical Physics
Mathematics
Experimental (Astro)physics
Data Science
& AI/ML
Center for Data-Driven Discovery
Est. April 2023 cd3.ipmu.jp/
Workshops, Conferences, and Schools
Baryons in the Universe 2024
FAIRS-Japan 2024
LSS QUEST
AstroAI Asian Network Summer School
PFS Collaboration Meeting
Galaxy and Black-hole Evolution
Focus Week on Primordial Black Holes
LiteBIRD F2F Meeting
Code of Conduct
CD3 is committed to creating a safe, inclusive, and respectful environment for all members of our community. CD3 members are required to:
Treat each other with respect and professionalism. Discrimination, harassment, and retaliation of any kind will not be tolerated.
Follow the highest standards of academic integrity and honesty. This includes, but is not limited to, avoiding plagiarism, fabrication, and falsification of data. All members are required to comply with Kavli IPMU’s Research Ethics.
Researchers using AI tools should document this use in the methods, acknowledgements, or other appropriate sections in the paper. Do not use AI tools as a credited author on a research paper, as it can not carry accountability.
Housekeeping & Announcements
Incoming Postdocs
Otavio Alves
Michele Veronesi
Vera Maiboroda
Takafumi Tsukui
Andrew Santos
Luca Marchetti
Rahul Ramesh
Nisha Grewal
3rd Year Focuses
AI for
literature review
Joaquin Armijo, Linda Blot
AI tools for literature review
Why use AI Tools?
How do AI tools help?
What tools are available?
AI-powered software for searching, analyzing, and summarizing academic literature.
Examples (not limited to): ChatGPT, Scispace (formerly known as typeset.io), Research Rabbit and Anara
Using ChatGPT for literature and insight extraction
Example Task: “Summarize key ML approaches used in weak lensing.”->
Useful for early-stage scoping, defining keywords, identifying popular methods
Limitations:
Using ChatGPT for literature and insight extraction
Good level of knowledge. Provides references when needed!
Using ChatGPT for literature and insight extraction
Not only papers, but useful links to different tools (in github). Ml-in-cosmology, CNN_mass_maps
SciSpace for Deep Reading and Concept Linking
Example Task: “Understand methodology sections of several weak lensing + ML papers.”->
Not as conversational as ChatGPT—designed for deep dives, not exploration.
Provides synthesis of information.
SciSpace for Deep Reading and Concept Linking
ResearchRabbit for Visual Literature Exploration
Example Task: “Map the key papers and authors in ML + weak lensing.”->
For example: Input the papers from queried by ChatGPT, SciSpace and generate graph (network/timeline)
Combining the Tools in a Literature Pipeline
Step | Tool | Purpose |
1 | ChatGPT | Define scope, suggest keywords and themes |
2 | ResearchRabbit | Find key authors, citation networks |
3 | SciSpace | Read and digest the hardest papers |
4 | ChatGPT/Anara | Summarize, compare, ask questions |
Suggested by ChatGPT
Summary & Conclusions
AI for Coding
Jess Cowell, Ben Horowitz
AI
PHYSICS
cd3 expo 2025
AI FOR CODING
HIGHSCORE 2500
WHAT IS AI-ASSISTED CODING?
01
12
07
WHY WOULD YOU USE?
01
12
07
TOPICS COVERED
level 1
level 2
level 3
final boss
AGENDA
prompting
tips and tricks
chatgpt
ide editors
PROMPTS
A prompt is the input you give to an AI model to produce a desired response. It can be a:
Prompts guide the AI’s behavior and output.
prompts are so important people even
do 18 hour courses on them!
HOW TO PROMPT
Good prompts
Bad Prompts
Examples:
“Write a code.”
“Fix this.”
“Help me.”
“Why isn’t it working?”
GOOD FOR:
01
12
07
LEVEL 1: CHAT GPT
NOT THE BEST FOR
Can you explain how to use except statements in python?
EXAMPLE 1: NEW CONCEPTS
Here’s a concise 5-line summary of what the code does:
Can you summarise this code in 5 lines?
<PASTE SOME CLASS CODE SNIPPET HERE>
EXAMPLE: CLASS CODE SNIPPET
Can you explain how it calculates the Hubble rate?
EXAMPLE: CLASS CODE SNIPPET
GOOD FOR:
pro is free for students
basic version free for everyone
BAD FOR:
LEVEL 2: GITHUB COPILOT PLUGIN
An AI assistant plugin for code editors (e.g. VS code, cursor)
Suggests code as you type- like ctrl tab but generates entire lines of code
Trained on open-source code from GitHub
NEW CODING AGENT...? (MAY 19TH)
BONUS LEVEL: TERMINAL INTEGRATION
Use the Right Tools
Customize Your Experience
Work Smarter with AI
🚨 Always Double-Check
TIPS AND TRICKS
A WORD OF WARNING
Beware of physics or maths mistakes in your code! They can be subtle...
Real life examples;
<Ai makes mistakes>
“When AI Gets the Science Wrong”
DON’T LET IDE EDITORS GET CARRIED AWAY
Real Example:
IDEs like cursor can rewrite all files in your directory if you let them
DON’T END UP LIKE THIS...
Using AI responsibly
Katya Vovk, Joaquin Armijo
How you use AI
(in writing, coding, translating, reviewing, etc.)
How AI is designed
(by developers, to make systems safe and fair)
Responsible AI
Using AI Responsibly
Privacy & Data Governance
Transparency & Reproducibility
Fairness
Plagiarism by AI. Even with vague description, it generates copyrights protected images
Privacy & Data Governance
Transparency & Reproducibility
Fairness
Privacy & Data Governance
Transparency & Reproducibility
Fairness
Lapushkin et al. (2019)
Request to generate just soda results in certain brand exclusively.
Privacy & Data Governance
Transparency & Reproducibility
Fairness
Safe to Do | Check Carefully | Big No |
Use AI to brainstorm or outline ideas | Cross-check factual or technical outputs | Don’t plagiarize AI-generated text/images |
Disclose AI use when relevant | Verify AI-translated or summarized content | Don’t input confidential data into public tools |
Use AI to support learning (summaries, comparisons) | Test AI-suggested code, citations, conclusions | Don’t fake fluency with AI translations |
| | Don’t blindly trust AI for decision-making |
The Golden Rule
Read the guidelines: every journal, funder, and institution has evolving policies.
If you’re unsure whether an AI use is appropriate, ask:
”Would I be okay justifying this in a peer review or to a grant panel?”
CD3 Project Expo 2025
Stay after for a group photo!
Project | Presenter (non-expert) | Experts | Time (minutes) |
Neutrino programs: SK, HK, T2K | Elisa Ferreira | Patrick de Perio, Mark Vagins | 6 |
Subaru programs: PFS, HSC | Tom Melia | KG Lee, Jingjing Shi, John Silverman, Masahiro Takada | 6 |
CMB programs: LiteBIRD, Simons Observatory, CMB-S4 | Linda Blot | Tomotake Matsumura, Guillaume Patanchon, Toshiya Namikawa | 6 |
XENON | Jingjing Shi | Masaki Yamashita, Kai Martens | 4 |
Belle II | Masaki Yamashita | Takeo Higuchi | 4 |
Machine learning theory | Guillaume Patanchon | Simeon Hellerman, Elisa Ferreira, Leander Thiele, Masahito Yamazaki | 4 |
ELSI | Leander Thiele | Hiromi Yokoyama | 4 |
Euclid, LSST | Masahito Yamazaki | Linda Blot, Masahiro Takada | 4 |
COSI | KG Lee | Tad Takahashi | 4 |
Neutrino programs: SK, HK, T2K
Presenter: Elisa Ferreira
Experts: Patrick de Perio, Mark Vagins, Tsui Ka Ming
Elisa Ferreira
Experts: Mark Vagins, Patrick de Perio, Tsui Ka Ming
Neutrino programs:
T2K, SuperK, HyperK,
Super-Kamiokande=Super-Kamioka Neutrino Detection Experiment
Science goals:
Neutrino scatter off electrons or nucleus, relativistic charged particle
=> Cherenkov light is emitted in a cone shape to the direction of a charged particle, detected by the PMT on the tank’s wall
Gadolinium upgrade (SuperK-Gd)
Enhances ability to detect neutrino by increasing the efficiency of neutron capture on gadolinium, which improves the detector's sensitivity to neutrino interactions
Science goal:
Science goal:
Gadolinium upgrade (SuperK-Gd)
Enhances ability to detect neutrino by increasing the efficiency of neutron capture on gadolinium, which improves the detector's sensitivity to neutrino interactions
Science goals:
Tokai-to-Kamioka
Neutrino beam created at J-PARC (accelerator)
Near detector: measure neutrino before it oscillates
SK: Measure neutrino beam after it oscillates
Tokai-to-Kamioka
Neutrino beam created at J-PARC (accelerator)
Near detector: measure neutrino before it oscillates
SK: Measure neutrino beam after it oscillates
Science goals:
Detector:
Science goals:
Tokai-to-Kamioka
Neutrino beam created at J-PARC (accelerator)
Near detector: measure neutrino before it oscillates
SK: Measure neutrino beam after it oscillates
Detector:
September 20, 2024
World's largest artificial underground cavity
(45 out of 71 m deep)
Public tour June 29! �Space limited, apply here by Sunday night! https://www-sk.icrr.u-tokyo.ac.jp/news/detail/1705
Hyper-Kamiokande
188 kton water
Near Detectors
J-PARC
Main science goal:
Hyper-Kamiokande
188 kton water
Near Detectors
J-PARC
Main science goal:
Error Source | % Error for CPV search |
φ + σ (ND constrained) | 2.7 |
φ + σ (ND unconstrained) | 1.2 |
Nucleon removal energy | 3.6 |
π re-interactions | 1.6 |
σ(νe), σ(ν̅e) | 3.0 |
NC γ + other | 1.5 |
SK far detector | 1.5 |
Total | 6.0 |
Need to reduce to <3% for future!
T2K 2021 sys
Hyper-Kamiokande
188 kton water
Near Detectors
J-PARC
Main science goal:
Differentiable Detector Simulator (DDSim)
Differentiable Detector Simulator (DDSim)
Automation of physics model tuning (via backpropagation)
Subaru programs: PFS, HSC
Presenter: Tom Melia
Experts: KG Lee, Jingjing Shi, �John Silverman, Masahiro Takada
CMB programs: LiteBIRD, Simons Observatory, CMB-S4
Presenter: Linda Blot
Experts: Tomotake Matsumura, Guillaume Patanchon, Toshiya Namikawa
CMB
Linda Blot
Cosmic Microwave Background (CMB)
Main science goals:
Large scales: primordial gravitational waves from inflation
Small scales:
Cosmic Microwave Background (CMB)
Space:
Ground:
Cosmic Microwave Background (CMB)
Space:
Ground:
Data Analysis Challenges:
The Lite (Light) satellite for the study of B-mode polarization and Inflation from cosmic background Radiation Detection
The LiteBIRD is currently under reformation so the detailed specifications are subject to change
Prototype
@ IPMU 1F
Atacama desert in Chile ~5200m, high and dry: better atmospheric conditions
Construction is done! LAT first light in February, currently being calibrated, 3SATs are online and taking data. 3 more SATs are under construction thanks to SO Japan, and SO UK
15m
14m
2 types of telescopes: (SAT 10% of the sky, LAT 50 % of the sky)
SAT
LAT
Large-area survey: light relics
Small-area survey: primordial gravitational waves
1 LAT: 5m telescope, 130k detectors
9 SATs: 75k detectors in total
2 LATs: 6m telescope, 130k detectors each
All detectors cooled to 0.1 Kelvin
XENON
Presenter: Jingjing Shi
Experts: Masaki Yamashita, Kai Martens
Jingjing Shi
Expert: Masaki Yamashita
Science Goal:
Rare event:
Looking for dark matter from underground
Detector:
Gas-liquid dual phase Xe Time projection chamber (TPC)
Gd-loaded Water Tank
LXe Detector
Photosensor(PMT) array
XENON with CD3
8B Neutrinos Data Analysis :
Boosted decision tree (BDT) classifiers are developed to distinguish between 8B CEνNS signals and the background events.
Not only Dark Matter ...
Highlight from XENON
First indication of Solar 8B Neutrinos
via Coherent Elastic Neutrino-Nucleus
PRL 133, 191002 (2024)
8B signal
Background
8B signal
Background
Light(S1) and Charge(S2) signal from Photosensors
From left: Kai Martens, Masaki Yamashita
Tianyu Zhu (Postdoc)
Caio Ishikawa (PhD student)
Xiaoxin Wang (student)
Belle II
Presenter: Masaki Yamashita
Experts: Takeo Higuchi
Masaki Yamashita
Expert: Takeo Higuchi
Overview:
Science goals:
Kavli IPMU members
Takeo Higuchi ... faculty Fanli Zeng ... D3
Fumiaki Otani ... D3
Tomoyuki Shimasaki ... D3
Deven Misra ... M1
19 detector modules produced in IPMU’s clean room, delivered in 2019
Source: arXiv:2402.17260
GFlaT: new machine learning tool using graph neural network.
Drastic improvement: 18% better tagging efficiency compared to previous algorithm (see arXiv:2402.17260)
Belle II will restart in 2025 Oct!
Silicon Vertex Detector
Data analysis challenge: flavor tagging
i.e. identify from from their disintegration products.
Machine learning theory
Presenter: Guillaume Patanchon
Experts: Simeon Hellerman, Elisa Ferreira, Leander Thiele, Masahito Yamazaki
Deep learning principles
Physics informed ML: QFT/ML
Euclid, LSST
Presenter: Masahito Yamazaki
Experts: Linda Blot, Masahiro Takada
COSI
Presenter: KG Lee
Experts: Tad Takahashi
COSI (Presenter: KG Lee; Expert: Tadayuki Takahashi)
Technical Highlights
Scientific Goals – Annihilation Lines & Nucleosynthesis
Multi-Messenger Astronomy
Exploring DM
ELSI
Presenter: Leander Thiele
Experts: Hiromi Yokoyama
Social Science (ELSI)
Expert:
Hiromi Yokoyama
Presenter:
Leander Thiele
ELSI = Ethical, Legal and Social Implications
more negative attitude
General method:
– conduct online surveys (careful about biases!)
– identify predictors, models
– measure ethical stance of public
Measuring ethics
AI ethics
AI ethics