PSA: how to use GPGPU
In general: do not leave jupyter notebooks idle while occupying GPU vram
Wishlist (to learn/discuss in the future) pls keep at top
Done
Date | Lecture Title | Discussion leader |
24/3/1 | Boris | |
24/3/8 | Intro to deep learning (lecture 3) | Jia |
24/3/15 | Katya | |
24/3/22 | Hands on session | |
24/3/29 | Linda | |
24/4/5 | Object detection & Semantic segmentation (7&8) | Cesar |
24/4/19 | VAEs, GANs (9-11) | Leander |
24/4/26 | Cesar | |
24/6/14 | Sequence modeling & transformer (13-14) | |
24/6/21 | ViT (15) | Katya |
| | |
Friday Jan 31
HAC preparation
Dec 18th 2024, Ben, Jia
Plan
Project | Human | AI mentor |
Astrophysics: Search for evidence/possible duration of AGN duty cycle in MaNGA galaxies | Katya, Ievgen | Kevin & Suyog |
The galaxy-dark matter halo connection of Lyman-break galaxies (LBGs): occupation models, physical properties, and impact on constraints of growth-rate of LSS from LBGxLBG clustering signal in configuration-space (2PCF). (Advisor: Kevin) | SUYOG GARG & Kevin | Katya & Ievgen |
Diffusion Model for Baryon Painting (Advisor: Ben) | Omar Yehia | Masaya Ichikawa (&Jingjing) |
Emulator for dark matter halo intrinsic alignment (Advisor: Jingjing) | Masaya Ichikawa | Omar Yehia (Ben) |
Prediction of the shape of anti-reflective subwavelength structure based on ultra-short pulse laser parameters | Kosuke Aizawa | Jessica Cowell |
Radio galaxies as CMB foreground for HalfDome simulation (Advisor: Jia) | Jessica Cowell | Cesar Jesus-Valls |
Neutrino project TBD | Cesar Jesus-Valls | Kosuke Aizawa |
CD3 Hack Friday
Dec 13th 2024, 1:30-4:30pm 3F seminar room
What do you want to explore this afternoon?
Katya
CD3 Hack Friday
Nov 29th 2024, 1:30-4:30pm 3F seminar room
What do you want to explore this afternoon?
Jia’s course outline
Intro: Through this course, we will explore the history of our universe and its components. We will also practice using the tools scientists employ to study the cosmos. Emphasis will be placed on developing essential skills for scientific research—problem-solving, critical thinking, and computational analysis.
10 topics: big bang and a brief history, dark matter, dark energy, galaxies, stars, black holes, planets, scientific measurements, scientific programming, and statistics.
Leander
Can’t show much from ACT…
CD3 Hack Friday
Oct 11th 2024, 1:30-4:30pm 3F seminar room
What do you want to explore this afternoon?
CD3 Hack Friday
Oct 4th 2024, 1:30-4:30pm 3F seminar room
What do you want to explore this afternoon?
CD3 Hack Friday
Sept 27 2024, 1:30-4:30pm 3F seminar room
What do you want to explore this afternoon?
Katya
CD3 Hack Friday
Sept 20 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
NG MFs - Joaquin
Found some high S2N for different fields (cutting the edges)
More NG features when combining fields
Akira - Stats of equator-masked Healpix map
Q: Sub box from bigbox and tile it as for small box
It will have same replication effect.
Katya
CD3 Hack Friday
Sept 13 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Topics of interest for cosmology x ML
— implicit-likelihood inference / simulation-based inference / likelihood-free inference
— emulators for summary statistics
— field-based surrogate models (e.g., baryon painting)
— differentiable simulations
Anything missing?
Jia: spent whole afternoon admiring lectures from Lukas Heinrich, TUM
CD3 Hack Friday
July 26 2024, 1:30-5pm 3F seminar room
Gender Diversity in STEM in Japan: Challenges & Solutions with Insights from Researchers
Shawn Mobley
2024-07-26
Japan is awesome
But it struggles with gender equality
Many universities: undergraduate student body 20% female
For context
Especially in STEM
Following statistics: Sasada & Bannai, 2021 – mathematics
Estimates: High school around 20%, Undergraduate around 15%
Compared to select OECD nations
Share of females keeps diminishing
Left with 2 separate but related problems: entrance and retention
Many possible causes and many possible solutions
Many projects underway
Ikkatai et al., 2021
Ceci et al., 2009
Wanted to hear from researchers
Some initial findings / observations
Some considerations
Next steps
Shoutouts
My travels around Japan
Thank you!
Astro Visualization
Bonny Y. Wang
Interactive Examples
Web Interactive Example
Particle Data
Density Field Data
Basic Setup + Cheatsheet
BlenderExamples/IPMU_Workshop/MergerGas_4.npz
BlenderExamples/DensityFeildVis.py
Work with Density Field Animation
Use
BlenderExamples/IPMU_Workshop/gas_{0-100}.vdb
Shift + A -> Volume -> Import OpenVDB
Render on Cluster
wget https://mirror.freedif.org/blender/release/Blender4.2/blender-4.2.0-linux-x64.tar.xz
tar -xf blender-4.2.0-linux-x64.tar.xz
# Render the first frame
./blender-4.2.0-linux-x64/blender -b file.blend -f 1
# Render the animation
./blender-4.2.0-linux-x64/blender -b file.blend -a
Other Useful Things
CD3 Hack Friday
July 12th 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Leander: debaryonification
Float Precisions
Always 10^(original magnitude - 8)
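A quick way to check this rule of thumb, assuming the note refers to single precision (float32, roughly 7 significant decimal digits). `np.spacing` returns the gap to the next representable number, which can be compared with 10^(magnitude - 8). The helper name `float32_gap` is made up for this sketch.

```python
import numpy as np

def float32_gap(x):
    """Gap between x and the next representable float32 (one ULP)."""
    return float(np.spacing(np.float32(x)))

for magnitude in [0, 3, 6, 9]:
    value = 10.0 ** magnitude
    rule_of_thumb = 10.0 ** (magnitude - 8)
    print(f"1e{magnitude}: gap = {float32_gap(value):.2e}, "
          f"10^(mag-8) = {rule_of_thumb:.0e}")
```

The gap grows in proportion to the magnitude of the number, which is the point of the note; the exact prefactor depends on where the value sits between powers of two.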
CD3 Hack Friday
July 5th 2024, 1:30-5pm 3F seminar room
PSA: how to use GPGPU
In general: do not leave jupyter notebooks idle while occupying GPU vram
What do you want to explore this afternoon? (5/7)
Leander: wrote some introduction and made a nice figure
Observations of a grand-design spiral starburst galaxy at z=1.463
CD3 Hack Friday
June 28th 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Akira - Fibonacci Grid (link)
How to chop as many flat patches as possible from the full sky
Some NFW integrals - Leander
Jess :(
Jia, playing with JaxPM: https://github.com/DifferentiableUniverseInitiative/JaxPM/blob/main/notebooks/Introduction.ipynb
What are these 2 fields?? ^
Why use galaxy dynamics to measure cluster mass
M_Sigma relationship
CD3 Hack Friday
“Project Brainstorming” Informal Discussion
June 21st 2024, 1:30-5pm 3F seminar room
Name - Project description
Please introduce the motivation and general picture of your data-analysis/simulation/ML project. Please also emphasize the part new people can join, or other groups can collaborate with.
Each project should take 5-8 min and a couple of slides..
Subaru HSC – weak lensing
Subaru PFS – BAO, redshift-space distortion, full-shape …
Simulations that MT is aware of
Sims | N-body | Models | Products | Lead |
Dark Quest I | N=2048^3 L=1 or 2 Gpc/h 21 outputs in z=[0,1.5] | 101 models for wCDM | Halo, matter N_h(M), hh, hm, mm Redshift-space hh | T. Nishimichi Y. Kobayashi |
Dark Quest II | N=3000^3, 2048^3 L=1Gpc/h z range ? | 1000 for w0waCDM+nu | So far, only mm | T. Nishimichi S. Tanaka |
TBD (within FY2024) | N=4096^3 L=1 Gpc/h z range ? | 1000 realizations for LCDM | halos, subhalos (planned) | T. Ishiyama |
TBD (ongoing?) | N=2048^3 (need to check) | XX for LCDM | Light-cone sims WL, halos, IA, … | K. Osato |
Field Level Inference with Subaru PFS and FRBs (KG)
Project I: FRBs and spectroscopic surveys
Use dispersion measure DM = ∫ n_e dl from FRBs to probe the cosmic baryon distribution
Example of field-level inference: build bespoke IGM+CGM models for each FRB sightline
TBD: (i) NN (or other ML)-based models trained directly on hydro sims (ii) Developing new likelihood framework
Simba simulations (Credit: D. Sorini)
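The dispersion measure integral used in Project I can be estimated numerically along a sightline. This is a minimal sketch with a hand-written trapezoid rule; the function name, the uniform toy density and the path length are illustrative assumptions, and the cosmological (1+z) weighting needed for extragalactic sightlines is left out.

```python
import numpy as np

def dispersion_measure(n_e, path_pc):
    """Trapezoid-rule estimate of DM = integral of n_e dl, in pc cm^-3.

    n_e     : electron density samples along the sightline [cm^-3]
    path_pc : distance of each sample from the observer [pc]
    """
    n_e = np.asarray(n_e, dtype=float)
    path_pc = np.asarray(path_pc, dtype=float)
    segment_lengths = np.diff(path_pc)
    mean_density = 0.5 * (n_e[1:] + n_e[:-1])
    return float(np.sum(mean_density * segment_lengths))

# Toy sightline: uniform n_e = 0.03 cm^-3 over 1 kpc (illustrative values only).
path = np.linspace(0.0, 1000.0, 101)
density = np.full_like(path, 0.03)
dm = dispersion_measure(density, path)
print(f"DM = {dm:.1f} pc cm^-3")
```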
Project II: Subaru PFS GE Survey
PFS Galaxy Evolution Survey will:
Long-term goal: direct connections between galaxy evolution models and cosmology. E.g. Ben H is developing a differentiable cosmological hydro code
HSC Y3/Y6 weak lensing simulation-based-inference / higher order statistics
HSC Y1 weak lensing higher-order stats
Thiele+2023
Marques+2024
Grandon+2024
Cheng+2024
HalfDome Cosmological Simulations https://halfdomesims.github.io/
* Inputs from SO, CMB-S4, LSST, DESI, PFS, SPHEREx, Roman collaborators
| mmDL Sehgal+2010 Han+2021 | Websky Stein+2020 Li+2022 | Agora Omori 2022 | Stage IV requirements* | Half Dome 2024- |
N-body box, N_particles | 1 Gpc/h, 1024^3 | 7.7 Gpc, 6144^3 | 1 Gpc/h, 3840^3 | a few Gpc | 3.5 Gpc/h, 6144^3 |
Min. M_halo | 10^13 M⊙ | 1.2 x 10^13 M⊙ | 1.5 x 10^9 M⊙/h | 10^12 M⊙ | 10^12 M⊙ |
LSS observables | None | None | 𝜅, clusters, LIM | 𝜅, galaxies, clusters | 𝜅, galaxies, clusters, +more |
No. realizations | 1 | 1 | 1 | 10–100 | 11 + 1 fNL (more to come) |
LSST Y1 simulations for higher-order stats (Joaquin, Jia)
simulation | ICs | nbody | ray-tracing | catalog | stats | note |
A (check volume effect): bigger box, use outer rim, no new run needed: 5.225Gpc, Np=10240^3 | NA | Done (OuterRim) | done | done | done | The 4th and fifth bins are problematic at the moment |
B (new baseline for cosmo-varying runs): Lbox=0.6Gpc/h, Np=2048^3 | done | done | done (optimizing z-sampling and sub-sampling) | done (optimizing z-sampling and sub-sampling) | ongoing | |
F (check resolution effect) - Higher-res: Lbox: 0.6Gpc/h, Np = 3072^3 | Done | Scheduled | | | | |
G (check if our res overkill) Lower res for baseline B: Lbox: 0.6Gpc/h, Np = 1024^3 | done | done | Two seeds at NERSC, ready to ray trace | | | |
Water Cherenkov (WC) Detector Principle
CD3 Project Brainstorming - Patrick de Perio 113
Neutrino Interactions
T2K (~HK)
Accelerator
Neutrino Flux
Hyper-K Range of Sensitivity
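[Figure: Feynman diagrams of the neutrino interaction channels, ordered by increasing energy: IBD (ν̄ + p → e+ + n via W-) at O(1)-O(10) MeV, CCQE (νl + n → l- + p via W+) at O(100) MeV, CC RES (νl + p → l- + p + π+ via W) at O(1) GeV, CC DIS at O(10) GeV]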
WC Event Topologies
[Figure: WC event topologies for IBD, CCQE, CC RES and CC DIS, with example low-energy (10 MeV e-) and high-energy (1 GeV e-) events]
A rich problem for event reconstruction:
Analysis Pipeline
“Event reconstruction”: A method that encodes our physics knowledge for quickly inferring the original particle properties from detector data
Tuning/training MC
Detector Physics Simulator
Calibration data
Physics MC
Physics data
Reconstructed MC
Reconstructed data
Physics analysis
Reconstruction
Reconstruction algorithm tuning
training
ML model
Event Reconstruction
Detector Physics Modeling
Current Paradigm of Detector Physics Modeling
Detector Physics Simulator
Detector Output
(ADC)
Simulation Input
(true dE/dX)
Reconstructed
(Calibrated dE/dX)
Reconstruction
Calibration data
Detector
physics model parameters
…
Geometry
Cherenkov physics
Water properties (light scattering, absorption)
PMT and wall reflectivity
Residual magnetic fields
PMT+electronics response
Calibration Source
Automation of Detector Physics Model Tuning
Research Proposal: differentiable detector physics simulator (DDSim)
Detector Physics Simulator
Key Elements:
Detector Output
(ADC)
Simulation Input
(true dE/dX)
Reconstruction
Calibration
data
Reconstructed
(Calibrated dE/dX)
Detector
physics model parameters
Software Frameworks and Logistics
CD3 Hack Friday
June 14th 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Jia: summary of https://ui.adsabs.harvard.edu/abs/2023A%26A...670L..23H/abstract
HMC runs with different choices of likelihood parameter. Need to wait and then stare.
CD3 Hack Friday
May 31st 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Linda: tSZ and kSZ for HalfDome
CD3 Hack Friday
May 24th 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Akira -
Kappa map generated from tiled box has higher value in zs = 3.0
CD3 Hack Friday
May 17th 2024, 1:30-5pm 3F seminar room
Diffusion Models (CS198-126 Lecture 12) - Cesar
A quick recap on other generative models (GANs)
A quick recap on other generative models (VAEs)
A quick recap on other generative models (Flows)
Diffusion
Summarized comparison
Fwd & Bwd in Diffusion
Forward Process
Backward Process
Some details on how to actually do it
About the Architecture and Loss
Training and Usage
Some example
Stable diffusion
Bonus
What do you want to explore this afternoon?
CD3 Hack Friday
Apr 19th 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Angela:
GAN
GAN mode collapse
VAE – quick and easy stochasticity
basic autoregressive flow
Continuous normalizing flow
topological restrictions
Out-of-distribution fail
CIFAR10
SVHN
Normalizing flow issues
CD3 Hack Friday
Apr 5th 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Jia: did not learn about Lagrangian perturbation theory, but learned about the Commander pipeline in CMB
CD3 Hack Friday
Mar 29 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
CosmoGrid sims
Full sky simulations (lightcone) of matter distribution
Smoothing
CD3 Hack Friday
Mar 22 2024, 1:30-5pm 3F seminar room
Hands on Deep Learning
Camila: Meeting
Katya: poking the ML notebooks
CD3 Hack Friday
Mar 15 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
CD3 Hack Friday
Mar 8 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Leander: got VIDE to work on the galaxy catalogs
example QUIJOTE 1Gpc/h box
CD3 Hack Friday
Mar 1 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Cesar - I
Cesar - II
I want to do some classification of the MNIST dataset using keras, provide code for it
Open projects (anyone welcome to join/contribute)
Please list one project per bullet point for anyone to join, and point to the description page (day of the hack you presented)
CD3 Hack Friday
Feb 16 2024, 1:30-5pm 3F seminar room
Schedule for ML topics (1-slide, 5mins max)
Date | ML Topic | Discussion leader |
23/10/6 | Vision transformer | Akira |
23/10/20 | Vision transformer I | Hideki |
23/11/10 | Normalizing Flows for Probabilistic Modeling and Inference. | Joaquin |
23/11/17 | Vision transformer II + Globus on idark | Hideki |
23/11/24 | Lego day | |
23/12/01 | Symbolic regression | Cesar |
| | |
| | |
ToDo
What do you want to explore this afternoon?
ML courses:
Stanford machine learning (2022, 20 x 1.2 hours)
Machine learning for physics (2020, 11 x 1.5 hours)
Stanford computer vision (2017, 16 x 1 hour)
Harvard Intro to AI (2024, 6 x 1 hour)
AI-statistics and Astro-ML for beginners (2023, 10 x 1.5hours) (Good for more basic statistics mixed with ML and specific for Astro)
Introduction to Deep Learning (2018-2023, 63 videos, ~40 mins each, a bit random, MIT)
Applied Machine Learning (Cornell Tech CS 5787, Fall 2020)
Berkeley Computer Vision (2022, 22 x ~45 min) (Linda’s favorite in terms of breadth)
Akira
The strategy for iterative generation on healpix map
used binary & quaternary number
Camila
How to come up with a project (Jia’s version)
CD3 Hack Friday
Feb 9 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
CD3 Hack Friday
Feb 2 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Katya (with help of Chat GPT 3.5)
+config file
Junjie
old
new
target
prediction
CD3 Hack Friday
Jan 19 2024, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Possible projects for 2024
HST G141 FOV
Zoom-in
HST F160W
Direction of dispersion
H alpha
Proposed collaborative project:
Faraday Rotation Map from Planck (Contact: Jia, Toshiya)
Big question:
Boris: Finding galaxy cores hidden from view
Similar to near-IR JWST image
HST UV+Optical image contour
(with Dust emission in color)
missing flux
Mass peak
Science: How do galaxies form bulges?
Aim: Predict which galaxies have hidden mass cores without JWST near-IR images
CD3 Hack Friday
Dec 15, 2023, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
UV near-IR
CD3 Hack Friday
Dec 1, 2023, 1:30-5pm 3F seminar room
A brief introduction to Symbolic Regression
Crash course by César
What is symbolic regression?
We are all more than used to regression analysis, a.k.a. doing ‘fits’ to data.
However, what happens if we don’t know what function we should use to fit the data?
→ Symbolic regression is a technique that allows us to search the space of mathematical expressions for the model that best fits the data in terms of both accuracy and simplicity.
→ Analytic expressions that reproduce the data well can provide direct intuition on the behaviour of a system without human bias and alleviate limitations of discrete data (e.g. analytical differentiation/integration or sampling).
How does it work?
Symbolic regression is extremely powerful… however, it is an NP-hard problem, i.e. computationally very difficult → a solution to your problem is not guaranteed.
To deal with the immense number of possibilities, the idea is to work with expression trees. The building blocks in symbolic regression are (typically) variables (x, y, z…), binary operators (“+”, “-”, “/”, “*”), unary operators (sin, cos, exp…), and constants.
The basic idea is to use a genetic algorithm:
Start with some random expression trees consisting of few variables, operators and constants.
Check fitness of each expression (cost function).
Select best expressions → generate new expression from them.
Next iteration (until the convergence criterion is met).
Genetic algorithms
Genetic algorithms are inspired by biology. Each iteration defines a ‘generation’ consisting of a pool of ‘candidate solutions’. In every iteration, expressions are individually tuned or combined, sorted by fitness, and used to replace the older generation.
How to define ‘Fitness’?
Imagine we define some complexity for each operator. Then one can calculate the complexity of any given expression:
X: complexity 1
X*X: complexity 2
cos(X)*cos(X)+X^3: complexity 5
… etc!
In general, simpler is better. But we can also calculate the distance from the expression to the data, e.g. the MSE. We have two criteria… so which one do we choose?
Look at the Pareto front!
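To make the Pareto-front idea concrete, here is a minimal sketch in plain Python. It keeps every candidate for which no other candidate is both simpler and more accurate. The candidate expressions, their complexities and their MSE values are made up for illustration, and `pareto_front` is a hypothetical helper, not part of any symbolic regression package.

```python
def pareto_front(candidates):
    """Return candidates not dominated in (complexity, mse); lower is better for both.

    candidates: list of (expression, complexity, mse) tuples.
    """
    front = []
    best_mse = float("inf")
    # Sort by complexity, then by error, and keep each expression that
    # improves on the best error seen at lower or equal complexity.
    for expr, complexity, mse in sorted(candidates, key=lambda c: (c[1], c[2])):
        if mse < best_mse:
            front.append((expr, complexity, mse))
            best_mse = mse
    return front

# Hypothetical candidates: the expressions and scores are invented for this example.
candidates = [
    ("x", 1, 4.0),
    ("x*x", 2, 1.5),
    ("x + 1", 2, 3.0),
    ("cos(x)", 2, 1.8),
    ("exp(x) + x", 4, 2.0),
    ("cos(x)*cos(x) + x^3", 5, 0.2),
]
front = pareto_front(candidates)
for expr, complexity, mse in front:
    print(f"complexity {complexity}: {expr} (MSE {mse})")
```

The final model is then picked from this short list, for example where the error drops sharply for a small increase in complexity.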
How to get started with symbolic regression?
Let’s assume you want to try it for yourself, see how it works and explore if it fits your problem. How hard is it to get started?
I recommend getting started with PySR, a very nice and user-friendly implementation by Miles Cranmer (who will visit IPMU in January).
→ In less than 1 hack session you can do some tests online using this fantastic demo in Google Colab, install PySR and start doing your own tests (I did it…).
Bonus: you can use PySR to learn analytic expressions from trained neural networks!
If you are interested, check out this seminar by M. Cranmer on YouTube.
What do you want to explore this afternoon?
Katya:
Histograms are showing the overlap between my classification of objects and catalogue types.
Boris
Akira
Jess :(
Camila:
César:
Complexity 4:
Complexity 10:
Claytor et al. 2021
CNN
w/ log-laplacian Adam optimizer
CD3 Hack Friday
Nov 24, 2023, 1:30-5pm 3F seminar room
CD3 x DLX
Special LEGO Hackathon
Nov 24 13:30, Kavli IPMU 3F
Hi There!
CD3 x DLX lego workshop
© 2023 DLX Design Lab
We are an international team of designers collaborating with researchers, industry, and the public to bridge the gap between academia and society.
LOCATIONS
MEMBERS
PORTFOLIO
25+ people:
What do we do
EVERYTHING
WE DO IS IN
COLLABORATION
Defining product propositions from early stage technologies
Developing speculative scenarios based on fundamental research
Communicating research to ease understanding and spread public awareness
PRODUCT
VISION
DISSEMINATION
OMNI
Collaborator: 藤井研究室(生産技術研究所)
Low cost and open-source platform to gather
and analyse critical data from the seas
OMNI Microplastics
Citizen science project about engaging more people in the problem of ocean microplastics.
TALKING WITH NEURONS
Collaborator: 池内研究室(生産技術研究所)
An interactive installation that enables people to remotely connect with live neural tissues.
"Talking with Neurons" invites audiences to conduct a reciprocal conversation through sending voice inputs and receiving audio-visual representations of neuronal activity by in vitro neurons that respond to their vocal cues.
Every collaboration starts with a workshop
“The best way to have a good idea is to have a lot of ideas”
Linus Pauling
What is a workshop?
A group interactive activity that generates ideas through active participation in structured exercises.
Workshop etiquette
Keep it visual!
Generate creative ideas
Workshop etiquette
Keep it visual! physical!
Generate creative ideas
Not a Lego competition!
Today’s plan
Find a team mate
Build!
Tea break
20’
60’
30’
Wrap up
Show/Tell
30’
30’
PRESENTATION TIME
CD3 Hack Friday
Nov 17, 2023, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Actual data
Simulations
GAS
STARS
Boris
Cesar
COBAYA - Joaquin
CD3 Hack Friday
Nov 10, 2023, 1:30-5pm 3F seminar room
Normalizing flows for probabilistic modeling and inference
DeepMind paper: arXiv:1912.02762
Crash course by Joaquin
Probability density transformation
From base distribution to target distribution
p(u)
p(x)
Transformations: T(u)
How do we define the transformations T if we don’t know how p_u and p_x are related?
We can try to model p(u) applying transformations to x
Invertible transforms
The Jacobian
If T is invertible and differentiable then we can compose them:
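The equations on these slides were images and did not survive in the notes. For reference, the standard change-of-variables and composition relations from the cited review (arXiv:1912.02762), written with the slide's symbols u, x and T, are:

```latex
\begin{align}
x &= T(u), \qquad u \sim p_u(u), \\
p_x(x) &= p_u(u)\,\left|\det J_T(u)\right|^{-1}
        = p_u\!\left(T^{-1}(x)\right)\left|\det J_{T^{-1}}(x)\right|, \\
(T_2 \circ T_1)^{-1} &= T_1^{-1} \circ T_2^{-1}, \\
\det J_{T_2 \circ T_1}(u) &= \det J_{T_2}\!\left(T_1(u)\right)\,\det J_{T_1}(u).
\end{align}
```

The last two lines are what make deep flows practical: a stack of simple invertible layers is itself invertible, and its log-determinant is the sum of the layers' log-determinants.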
Flows, inference, and maximum likelihood estimation
p(x) → p(x;θ) fitting a target distribution function p_x(x)
The KL (Kullback-Leibler) divergence measures how well p(x;θ) explains the target distribution. For a sample {x_0,x_1,...,x_n} from p_x(x), we can estimate the expectation:
Where the fit can be estimated using maximum likelihood.
Params. from transformation T^-1
Params. from base distribution
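As a toy illustration of fitting a flow by maximum likelihood, here is a sketch with the simplest possible flow: a single affine transform x = scale * u + shift with a fixed standard-normal base, so only the transformation has parameters. The target samples are synthetic and all names are assumptions made for this example. For this one-layer case the maximum-likelihood solution is known in closed form, so no optimizer is needed.

```python
import numpy as np

def log_prob(x, scale, shift):
    """log p(x; theta) for the flow x = scale * u + shift with u ~ N(0, 1)."""
    u = (x - shift) / scale                         # inverse transform T^-1
    log_base = -0.5 * (u**2 + np.log(2.0 * np.pi))  # log p_u(u)
    log_det_inverse = -np.log(np.abs(scale))        # log |det J_{T^-1}|
    return log_base + log_det_inverse

rng = np.random.default_rng(0)
samples = rng.normal(loc=2.0, scale=3.0, size=5000)  # stand-in for the target p_x

# Closed-form maximum-likelihood parameters for this flow:
# the sample mean and the (biased) sample standard deviation.
shift_mle = samples.mean()
scale_mle = samples.std()

nll_mle = -log_prob(samples, scale_mle, shift_mle).mean()
nll_off = -log_prob(samples, 1.0, 0.0).mean()
print(f"mean negative log-likelihood: fitted {nll_mle:.3f}, unfitted {nll_off:.3f}")
```

With deeper flows the same negative log-likelihood is minimized by gradient descent over the parameters of every layer.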
Constructing flows: Finite composition
First type of flows: Autoregressive flow
With 𝛕 invertible:
This kind of transform has a triangular Jacobian matrix.
Implementing flows
Transformers
Conic combination can include (non-linear) invertible activation functions: sigmoid, tanh, leaky ReLU.
Implementing flows
Conditioners
Coupling layers:
Coupling layers try to approximate the choice of h and c by creating a function F (which can be a neural network).
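A minimal sketch of an affine coupling layer in NumPy, to show why the construction is invertible whatever F is. Here F is a fixed random linear map standing in for a neural network; the split into two halves of size 2, the weights and the function names are all assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
W_scale = rng.normal(scale=0.1, size=(2, 2))  # stand-in weights for F
W_shift = rng.normal(scale=0.1, size=(2, 2))

def conditioner(x_a):
    """Stand-in for F: maps the untouched half to (log_scale, shift)."""
    return np.tanh(x_a @ W_scale), x_a @ W_shift

def coupling_forward(x):
    x_a, x_b = x[:, :2], x[:, 2:]
    log_scale, shift = conditioner(x_a)
    y_b = x_b * np.exp(log_scale) + shift     # element-wise affine transformer
    log_det = log_scale.sum(axis=1)           # Jacobian is triangular
    return np.concatenate([x_a, y_b], axis=1), log_det

def coupling_inverse(y):
    y_a, y_b = y[:, :2], y[:, 2:]
    log_scale, shift = conditioner(y_a)       # same F output, since y_a == x_a
    x_b = (y_b - shift) * np.exp(-log_scale)
    return np.concatenate([y_a, x_b], axis=1)

x = rng.normal(size=(5, 4))
y, log_det = coupling_forward(x)
x_recovered = coupling_inverse(y)
print(np.allclose(x, x_recovered))
```

F is never inverted: the first half passes through unchanged, so the inverse can recompute the same scale and shift. In practice several such layers are stacked with the halves swapped in between.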
Probabilistic modeling
From gaussian base(s):
What do you want to explore this afternoon?
Jia: message passing in GNN
Typst
CD3 Hack Friday
Oct 27, 2023, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Katya: the automatically detected center was offset. Solved by pinning the initial model center to the center of the cutout
What kind of tool do you use when debugging python code?
import pdb; pdb.set_trace()
%debug
CD3 Hack Friday
Oct 20, 2023, 1:30-5pm 3F seminar room
Hideki
Akira: Vision Transformer
From here
Token: 1d array & position
Scaled dot-product attention
Read the 0-th head (about class)
From here
Not trainable
trainable
Available code: here
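For readers without the slides, here is a minimal NumPy sketch of scaled dot-product attention for a single head. It leaves out the learned query/key/value projections and uses toy sizes; treating index 0 as the class token is an assumption about what the "0-th" entry on the slide refers to.

```python
import numpy as np

def scaled_dot_product_attention(queries, keys, values):
    """softmax(Q K^T / sqrt(d_k)) V for a single head.

    queries: (n_tokens, d_k), keys: (n_tokens, d_k), values: (n_tokens, d_v)
    """
    d_k = queries.shape[-1]
    scores = queries @ keys.T / np.sqrt(d_k)
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ values, weights

rng = np.random.default_rng(0)
n_tokens, d_model = 5, 8             # e.g. 1 class token + 4 patch tokens (toy sizes)
tokens = rng.normal(size=(n_tokens, d_model))
# Self-attention without learned projections, to keep the sketch short.
output, weights = scaled_dot_product_attention(tokens, tokens, tokens)
class_token_output = output[0]       # token 0 plays the role of the class token
print(weights.sum(axis=1))           # each row of attention weights sums to 1
```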
What do you want to explore this afternoon?
Akira: Super-resolution diffusion on 3d dark matter density
Baptiste: make plot from test run for polarization angle requirements
(no ticks on the axis for you all, no spoilers :p )
To do:
CD3 Hack Friday
Oct 13, 2023, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
CD3 Hack Friday
Oct 6, 2023, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Akira: Vision Transformer
From here
Token: 1d array & position
Scaled dot-product attention
Read the 0-th head (about class)
From here
Not trainable
trainable
Available code: here
Structure Identification with reconstructed DM and Galaxies (Hideki)
64^3 grids for 128^3 Mpc/h box
Structure Identification with reconstructed DM and Galaxies (Hideki)
32^3 grids for 128^3 Mpc/h box
CD3 Hack Friday
Sept 29, 2023, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Testing emulator for Minkowski functionals – Joaquin
First unit test:
Replicate f(parameter) for all the parameters used in training.
I have a 10% error in the largest bin, which comes from the individual realizations and is probably driven by shot noise. I need to check what my error budget is for individual sims.
Second unit test.
Replicate f(parameter) for a parameter not in the training set, using the leave-one-out method.
Host Decomposition QA image - Wenke
Eigenvalue decomp sfdm sims - Andrew
Initial conditions code, testing a few different algorithms
<- halos density slices
CD3 Hack Friday
Sept 22, 2023, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
To do in general for data storage: a dedicated folder on idark for downloaded data for general use by IPMU
Data visualization using YT
With baryons
CD3 Hack Friday
“A safe place to experiment and fail”
Sept 15, 2023, 1:30-5pm 3F seminar room
What do you want to explore this afternoon?
Spec x transformer: literature review
Spec-surveys
CD3 Hack Friday
July 28, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
CD3 Hack Friday
July 21, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Mapping galaxy distribution to DM density field
Galaxy density field
DM density field
Reconstructed dm density field without RSD
True
Prediction
Corrcoef = 0.65
Reconstructed dm density field with RSD
True
Prediction
Corrcoef = 0.57
Light injectors (these are highlighted)
Rhea G.
CD3 Hack Friday
July 14, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Boris: Clumps in star-forming galaxies using JWST
CD3 Hack Friday
July 7, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Jia:
Tried to match halos between hydro and dark matter only runs
Based on 6D phase (position + velocity)
Try pos+velocity fit, kinda ok,
with vw=0.01 for now ->
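A rough sketch of what such a 6D match could look like, assuming `vw` is a weight that converts velocities into position units before a nearest-neighbour search. This is a guess at the approach, not the code used here. The catalogues are synthetic, the function name is made up, and the brute-force distance matrix only suits small catalogues without periodic wrapping.

```python
import numpy as np

def match_halos(pos_a, vel_a, pos_b, vel_b, vw=0.01):
    """For each halo in catalogue A, index of the nearest halo in B in 6D.

    Distance^2 = |dx|^2 + vw^2 |dv|^2, so vw converts velocities into
    position units. Brute force; periodic boundaries are ignored.
    """
    phase_a = np.hstack([pos_a, vw * vel_a])
    phase_b = np.hstack([pos_b, vw * vel_b])
    diff = phase_a[:, None, :] - phase_b[None, :, :]
    dist2 = np.sum(diff**2, axis=-1)
    return np.argmin(dist2, axis=1)

# Toy check: catalogue B is a shuffled, slightly perturbed copy of A.
rng = np.random.default_rng(42)
n_halos = 50
pos_dmo = rng.uniform(0.0, 100.0, size=(n_halos, 3))    # e.g. Mpc/h
vel_dmo = rng.normal(0.0, 300.0, size=(n_halos, 3))     # e.g. km/s
shuffle = rng.permutation(n_halos)
pos_hydro = pos_dmo[shuffle] + rng.normal(0.0, 0.01, size=(n_halos, 3))
vel_hydro = vel_dmo[shuffle] + rng.normal(0.0, 1.0, size=(n_halos, 3))

matches = match_halos(pos_dmo, vel_dmo, pos_hydro, vel_hydro, vw=0.01)
print(np.array_equal(shuffle[matches], np.arange(n_halos)))
```

For realistic catalogue sizes a tree-based neighbour search would replace the dense distance matrix, and a mass cut or mutual-nearest-neighbour check would help reject spurious pairs.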
CD3 Hack Friday
June 30, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Jia, change in halo mass function in CAMELS simulation, due to baryonic feedback
Katya: made comparison plots for two supersampling, need to do more plots for luminosity bins
CD3 Hack Friday
Jun 23, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
James
Yona
Had a discussion with Jingjing on the cross-correlation: it’s not that straightforward… but exciting perspectives!
Made a list of cool things to do in Lyon:)
https://docs.google.com/document/d/1fn2nS1RM9Nsiv_SCwj3NR8H2kKIflBcHQY9-jtiSUrA/edit?usp=sharing
No more cake!
Discuss - future improvements to the hack format
CD3 Hack Friday
Jun 16, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Cesar
Predictions agree both for nominal model & for model tuned based on unknown model data (good starting point).
CD3 Hack Friday
Jun 9, 2023
Your goal this afternoon
CD3 Hack Friday
Jun 2, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Figures missing but getting there :)
CD3 Hack Friday
May 26, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Jia: Xiangchong was unavailable
so started learning game theory
Julien
Hideki (1.7 million galaxies -> 37,000 filaments)
Kosuke
CD3 Hack Friday
May 19, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Abhiram’s slides
Pitched 3 problems, generated some interest.
Please note that since I will move on from IPMU soon, it is best to get started asap.
CD3 project initiation
Abhiram Kidambi
I use physical insight to develop new mathematics.
Often, developing new mathematics is restrictive due to the infinitude of mathematical possibilities.
One could use computational techniques to search either for patterns or for a counter-example. Patterns in mathematics usually tend to be less complicated, often leading to a reasonable sanity check of a conjecture.
Outline: three major problems on which I work, and for which the use of ML techniques could be, or is proving to be, a game changer.
Problem 1: Singularities of Feynman Integrals
Path Integrals
Everything that can occur, will occur.
Classical principle of least action: Unique path that minimizes action
Quantum principle of least action: Weighted contribution from all paths that minimize action
Feynman PhD thesis
1. All possible interactions contribute, but the more complicated the path the less it contributes
2. Total contribution (TC) should be finite
3. TC is a power series in physical coupling strength
4. At a given order in the power series (loop order), the set of all possible physical processes can be depicted by a cartoon, called a Feynman Diagram
Question: What is the probability that A -> B, where A and B are sets of particles?
5. Each diagram is basically a pictorial description of an integral known as a Feynman integral. There are “Feynman Rules” to convert diagram <-> Integral.
Paths = virtual particles & interactions
Problem: This makes no sense! Where do such integrals even come from?
Feynman diagrams, perturbative quantum field theory
Arithmetic geometry
Can we predict structures underlying the singular structure of the Feynman integral from some minimal amount of graph data?
Also related to Mandy’s talk from the CD3 opening symposium.
Feynman graphs <-> Singularities
Problem 2: Primes, quantum chaos, and beyond
Zeroes and special values of zeta functions and L functions
The Riemann Zeta function is one of the most beautiful functions in mathematics.
Key function in (probably) the most important unsolved problem in math.
When is $\zeta(s) = 0$?
Conjecture: If $\zeta(s) = 0$, then
either $s = -2n$ ($n = 1, 2, \dots$) or $\mathrm{Re}(s) = 1/2$.
It has been observed that the patterns and statistics of zeroes that lie on the critical line seem to obey some random quantum chaotic system. ��Can we spot patterns that exist between a “System of Zeroes”?
Problem 3:
Classifying solutions to string theory (the Landscape)
10 dimensional quantum* theory
4d Yang-Mills + Einstein Hilbert Gravity
Ex: Standard model
Ex: LCDM,…
“Hide away” 6 dimensions (conjectured internal dimensions of space). How you hide away 6 dimensions tells you what you get out in 4d.
Classifying solutions of string theory = Classifying the 6 dimensional internal spaces (manifolds).
Fundamentally, manifolds are just spaces that locally look as if they are flat.
There are many examples and types of such Calabi-Yau manifolds.
But a particular class of them (which are particularly interesting to study) can be constructed on a computer by studying polytopes in higher dimensions and “slicing” them *appropriately* (if possible). (My PhD Thesis)
Problem: The number of ways you can slice the polytopes depends on the number of vertices of the polytope and the number of polytopes. The “search space” becomes too large.
So the novel hope is to identify patterns in how to slice these polytopes to construct Calabi-Yau manifolds.
The largest example of these 6d spaces are called Calabi-Yau manifolds.
What mathematicians want from ML/AI?
1. Can AI/ML models learn formal mathematics?
2. Can AI/ML models prove/disprove known results/dis-results?
3. Can AI/ML models verify new mathematical statements?
4. Can AI/ML models generate new mathematical statements?
Computational mathematics and physics can flourish
What *I* want from CD3?
If you are interested in any of these topics (learning or contributing) get in touch with me.
Interesting problems to get started
End of AK’s slides
CD3 Hack Friday
May 12, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Katya
Connor was right, the problem with PDR2 data is the reference for zero-point.
This reference is something called ‘FLUXMAG0’ in the header of the cutout fits file.
FLUXMAG0 = fits_file_header['FLUXMAG0']
For the same object in PDR2 and PDR3 this number is different.
PDR2 'FLUXMAG0' = 0.0
PDR3 'FLUXMAG0' = 63095734448.0194
Since zero-point is calculated as:
zp = 2.5 * np.log10(FLUXMAG0)
I have -inf for PDR2.
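One way to keep a zero `FLUXMAG0` from silently propagating `-inf` is to validate it before taking the logarithm. This is a minimal sketch: the helper name `zero_point` and the choice to return `None` are assumptions, and in practice the value would come from the FITS header as in the line above.

```python
import numpy as np

def zero_point(fluxmag0):
    """Return zp = 2.5 * log10(FLUXMAG0), or None if FLUXMAG0 is unusable."""
    if fluxmag0 is None or not np.isfinite(fluxmag0) or fluxmag0 <= 0:
        return None
    return 2.5 * np.log10(fluxmag0)

zp_pdr3 = zero_point(63095734448.0194)  # PDR3 header value quoted above
zp_pdr2 = zero_point(0.0)               # PDR2 header value quoted above
print(zp_pdr3, zp_pdr2)
```

The PDR3 value corresponds to a zero-point of 27.0 mag, so cutouts with a missing `FLUXMAG0` can be flagged and handled explicitly instead of producing `-inf` magnitudes.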
Leander
Jia
Julien
Yona
Daniela
Power spectrum
Thomas
CD3 Hack Friday
April 28, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Julien (NB : plots are wrong! :( correcting that asap))
Fragmentation in high-z galaxies
-Boris
Dense NN on TNG power spectrum - Jia
Goal: predict the power spectrum (~30 numbers) from 6 cosmological parameters. Tried:
To try:
CD3 Hack Friday
April 21, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Cesar: (playing with AutoGPT) https://github.com/Significant-Gravitas/Auto-GPT
Auto-GPT concept (use GPT to run GPT…) (define up to 5 goals, let GPT-4 plan tasks to achieve goal and spawn GPT agents to do them), Can: search the internet, read files, save files…
Running several (~10) tests costed
Leander: learning galaxy halo connection
(Halo Occupation Distribution = HOD)
Idea: maybe the uncertainty in the galaxy halo connection is relatively low dimensional, but with typical HOD/… heuristics we’re missing the simplicity because we don’t include as much physics as is present in a hydro sim
Moving parts:
Antoine : plotting ok
Boris
Jia - discussed with (learning from) Derek new ways of running neutrino simulations
Sunao: simulation based inference for microlensing data
Goal: training P(model params | light curve data)
flux
time
Marginalized posterior
Red: true, Blue: trained posterior
Failed to train, maybe my modeling is bad.
But now I got how to use ML.
You can also take a look at Francois tutorial.
CD3 Hack Friday
April 14, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Jingjing: code that ran successfully on one cluster failed on a new cluster; good documentation tracking package installation is needed
Yona: works! (with different method)
CD3 Hack Friday
April 7, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
Boris
CD3 Hack Friday
March 31, 2023
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
End of hack report
Connor
daniela
Junjie
Maybe 2d histogram instead of scatter plot
Check multiple bands B-V, R-I, G-I to get SF
Match to SDSS DR7 galaxies
Sam
Tried to understand this plot:
Cost to reproduce >~ $3000
Angela’s note
One way to use jupyter notebook on idark (from Connor Bottrell):
“sshuttle_idark”
Boris
Each beer today had 14 g of alcohol, which contributes roughly 0.03% BAC for a 68 kg male
CD3 Hack Friday
March 24, 2023
Introducing Hack Fridays (Jingjing, Cesar)
Your goal this afternoon
End of hack report 4:30pm
Please share 1 slide of your accomplishment / failure..
End of hack report
Daniela
I checked the discrepancies between the two average methods and the impact on the project’s result.
Thanks Jia, Leander and Albert for the discussion!
Sam
Possible improvements