# Generative Models for High-granularity Calorimeter of ILD
We model electromagnetic showers in the central region of the Silicon-Tungsten calorimeter of the proposed ILD. We investigate the use of a new architecture, the Bounded-Information Bottleneck Autoencoder (BIB-AE), alongside WGAN-GP and vanilla GAN approaches. In total, we train three generative models.
This repository contains the ingredients for reproducing *Getting High: High Fidelity Simulation of High Granularity Calorimeters with High Speed* [[`arXiv:2005.05334`](https://arxiv.org/abs/2005.05334)]. A small fraction of the training data is available on [`Zenodo`](https://zenodo.org/record/3826103#.Xrz1RBZS-EI).
## Outline:
* [Data Generation and Preparation](#Data-Generation-and-Preparation)
* [Architectures](#Architectures)
* [Training](#Training)
## Data Generation and Preparation
#### Step 1: ddsim + Geant4
We use the [`iLCsoft`](https://github.com/iLCSoft) ecosystem, which includes `ddsim` and `Geant4`. The generation produces large files, so it is best to write them to a scratch space. For DESY and NAF users: you may want to use DUST storage on **CentOS7** NAF-WGs.
First we need to clone the `ILDConfig` repository and move into its `StandardConfig/production` folder:
```bash
git clone --branch v02-01-pre02 https://github.com/iLCSoft/ILDConfig.git
cd ILDConfig/StandardConfig/production
```
Copy all `.py`, `.sh`, `create_root_tree.xml`, and `gammaGun.mac` files from the `training_data` folder of this repository into the current folder.
We use `singularity` (with a `docker` image) to generate Geant4 showers. Please export the singularity **tmp** and **cache** directories for convenience:
```bash
export SINGULARITY_TMPDIR=/nfs/dust/ilc/user/eren/container/tmp/
export SINGULARITY_CACHEDIR=/nfs/dust/ilc/user/eren/container/cache/
```
Please change these paths to **your** scratch space. Now we can start the instance and run it:
```bash
singularity instance start -H $PWD --bind $(pwd):/home/ilc/data docker://ilcsoft/ilcsoft-centos7-gcc8.2:v02-01-pre instance1
singularity run instance://instance1 ./generateG4-gun.sh instance1
```
This generates **1000 showers**. Play with `gammaGun.mac` if you want to change it.
#### Step 2: Marlin framework to create root files
Now we use the `Marlin` framework from `iLCsoft`. `Marlin` takes the `lcio` file created in the previous step and produces a `root` file:
```bash
## copy create_root_tree.xml and marlin.sh file
singularity run instance://instance1 ./marlin.sh "photon-shower-instance1.slcio"
```
#### Step 3: Conversion to hdf5 files
It is handy to use the `uproot` framework to stream showers from the `root` file and create an `hdf5` file, which our neural network architectures require:
```bash
singularity run -H $PWD docker://engineren/pytorch:latest python create_hdf5.py --ncpu 8 --rootfile testNonUniform.root --branch photonSIM --batchsize 100
```
#### Step 4: Remove staggering effects
Our simulation of the ILD calorimeter is realistic, so the geometry contains irregularities. These cause staggering in the `x` direction, visible as artifacts (i.e. empty lines due to binning). To mitigate this effect, we apply a correction that removes the artifacts:
```bash
singularity run -H $PWD docker://engineren/pytorch:latest python corrections.py --input test_30x32.hdf5 --output showers-1k.hdf5 --batchsize 1000 --minibatch 1
```
Choose the batch size and mini-batch size such that `total showers = batchsize * minibatch` (e.g. 1000 = 1000 * 1).
#### Structure of HDF5 file
The created file `showers-1k.hdf5` has the following structure (a short loading sketch follows the list):
* Group named `30x30`
  * `energy`: Dataset {1000, 1}
  * `layers`: Dataset {1000, 30, 30, 30}
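A minimal sketch of how the resulting file can be read with `h5py` (the file, group, and dataset names follow the structure listed above):

```python
import h5py

# Open the file produced by corrections.py and read the two datasets
with h5py.File("showers-1k.hdf5", "r") as f:
    grp = f["30x30"]
    energy = grp["energy"][:]   # shape (1000, 1): incident photon energies
    layers = grp["layers"][:]   # shape (1000, 30, 30, 30): calorimeter cell energies
    print(energy.shape, layers.shape)
```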
As stated in our paper, we trained our model on 950k showers, which is approximately 200 GB of data. That is why only a small fraction of our training data could be put on [`Zenodo`](https://zenodo.org/record/3826103#.Xrz1RBZS-EI).
## Architectures
The network architectures of generative models have a large number of moving parts and the contributions from various generators, discriminators, and critics need to be carefully orchestrated to achieve good results. Due to the high computational cost of the studies, no systematic tuning of hyperparameters was performed.
### GAN
![GAN](figures/VGAN.png)
Our implementation of the `vanilla` GAN is a baseline model consisting of a generator and a discriminator. The generator network of the GAN consists of 3-dimensional transposed convolution layers with batch normalization. It takes a noise vector of length 100, uniformly distributed from -1 to 1, and the true energy labels E as inputs. The discriminator uses five 3-dimensional convolution layers followed by two fully connected layers with 257 and 128 nodes respectively. We flatten the output of the convolutions and concatenate it with the input energy before passing it to the fully connected layers. Each fully connected layer except the final one uses LeakyReLU (slope: −0.2) as the activation function. The activation in the final layer is a sigmoid. In total, the generator has 1.5M trainable weights and the discriminator has 2.0M weights.
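As an illustration, a minimal PyTorch sketch of the discriminator-side energy conditioning described above could look as follows; the layer counts and sizes are placeholders, not the exact configuration used in the paper:

```python
import torch
import torch.nn as nn

class Discriminator(nn.Module):
    """Illustrative sketch: 3D convolutions, then concatenation with the energy label."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv3d(1, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv3d(32, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv3d(64, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
        )
        self.fc = nn.Sequential(
            nn.Linear(64 * 3 * 3 * 3 + 1, 128), nn.LeakyReLU(0.2),  # +1 for the energy label
            nn.Linear(128, 1), nn.Sigmoid(),                        # real/fake probability
        )

    def forward(self, shower, e):
        # shower: (N, 1, 30, 30, 30) calorimeter image; e: (N, 1) true energy label
        h = self.conv(shower).flatten(start_dim=1)   # (N, 64*3*3*3)
        return self.fc(torch.cat([h, e], dim=1))
```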
### WGAN
![WGAN](figures/WGAN.png)
One alternative to classical GAN training is to use the Wasserstein-1 distance, also known as the earth mover's distance, as a loss function. The WGAN architecture consists of three networks:
* one generator with 3.7M weights,
* one critic with 250k weights,
* one constrainer network with 220k weights.
The critic network starts with four 3D convolution layers with kernel sizes (X,2,2) with `X=10,6,4,4`, which have 32, 64, 128, and 1 filters respectively. LayerNorm layers are sandwiched between the convolutions. After the last convolution, the output is concatenated with the `E` vector required for energy-conditioning. After that, it is flattened and fed into a fully connected network with 91, 100, 200, 100, 75, 1 nodes. Throughout the critic, LeakyReLU (slope: -0.2) is used as the activation function.
The generator network takes a latent vector `z` (normally distributed, with length 100) and the true `E` labels as input and separately passes them through a 3D transposed convolution layer using a `4x4x4` kernel with 128 filters. After that, the outputs are concatenated and processed through a series of four 3D transposed convolution layers (kernel size `4x4x4` with 256, 128, 64, and 32 filters). LayerNorm layers along with ReLU activation functions are used throughout the generator.
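A minimal PyTorch sketch of this dual-path conditioning pattern is shown below; the number of layers, filter counts, and LayerNorm shapes are illustrative and not the exact configuration from the paper:

```python
import torch
import torch.nn as nn

class WGANGenerator(nn.Module):
    """Illustrative: latent vector and energy label pass through separate transposed-conv stems."""
    def __init__(self, nz=100):
        super().__init__()
        # Separate stems for the latent vector z and the energy label E
        self.z_stem = nn.ConvTranspose3d(nz, 128, kernel_size=4)
        self.e_stem = nn.ConvTranspose3d(1, 128, kernel_size=4)
        # Joint upsampling after concatenating the two stems along the channel axis
        self.body = nn.Sequential(
            nn.ConvTranspose3d(256, 128, 4, stride=2, padding=1), nn.LayerNorm([128, 8, 8, 8]), nn.ReLU(),
            nn.ConvTranspose3d(128, 64, 4, stride=2, padding=1), nn.LayerNorm([64, 16, 16, 16]), nn.ReLU(),
            nn.ConvTranspose3d(64, 1, 4, stride=2, padding=2), nn.ReLU(),
        )

    def forward(self, z, e):
        z = z.view(z.size(0), -1, 1, 1, 1)                        # (N, nz, 1, 1, 1)
        e = e.view(e.size(0), 1, 1, 1, 1)                         # (N, 1, 1, 1, 1)
        h = torch.cat([self.z_stem(z), self.e_stem(e)], dim=1)    # (N, 256, 4, 4, 4)
        return self.body(h)                                       # (N, 1, 30, 30, 30)
```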
The energy-constrainer network is similar to the critic: three 3D convolutions with kernel sizes `3x3x3`, `3x3x3`, and `2x2x2` and with 16, 32, and 16 filters are used. The output is then fed into a fully connected network with 2000, 100, and 1 nodes. LayerNorm layers and LeakyReLU activations (slope: -0.2) are sandwiched between the convolutional layers.
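Training the critic with the Wasserstein loss typically adds a gradient penalty (WGAN-GP). Below is a minimal, standard sketch of such a penalty in PyTorch; the names `critic` and `lambda_gp` are placeholders and not taken from this repository:

```python
import torch

def gradient_penalty(critic, real, fake, energy, lambda_gp=10.0):
    """Standard WGAN-GP term: penalize the critic gradient norm along real/fake interpolations."""
    eps = torch.rand(real.size(0), 1, 1, 1, 1, device=real.device)
    interp = (eps * real + (1 - eps) * fake).requires_grad_(True)
    score = critic(interp, energy)
    grads = torch.autograd.grad(outputs=score, inputs=interp,
                                grad_outputs=torch.ones_like(score),
                                create_graph=True, retain_graph=True)[0]
    grad_norm = grads.flatten(start_dim=1).norm(2, dim=1)
    return lambda_gp * ((grad_norm - 1.0) ** 2).mean()

# Critic objective (to minimize):
#   loss_critic = critic(fake, energy).mean() - critic(real, energy).mean()
#                 + gradient_penalty(critic, real, fake, energy)
```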
### BIB-AE and Post Processing
![Bib-AE](figures/BAE_PP.png)
An instructive way to describe the base BIB-AE framework is to take a VAE and expand upon it. A default VAE consists of four general components:
* encoder,
* decoder,
* latent space regularized by the Kullback-Leibler divergence (KLD),
* an L<sub>N</sub>-norm to determine the difference between the original and the reconstructed data.
These components are all present in the BIB-AE setup as well. Additionally, one introduces a GAN-like adversarial network, trained to distinguish between real and reconstructed data, as well as a sampling-based method of regularizing the latent space, such as another adversarial network or a maximum mean discrepancy (MMD, as described in the next section) term. In total, this adds up to four loss terms: the KLD on the latent space, the sampling regularization on the latent space, the L<sub>N</sub> norm on the reconstructed samples, and the adversary on the reconstructed samples. The guiding principle behind this is that the two latent-space and the two reconstruction losses complement each other and, in combination, allow the network to learn a more detailed and more accurate description of the data.
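As a rough illustration, the four loss terms could be combined as sketched below; the weights `beta_*`, the choice of L1 as the L<sub>N</sub> norm, and the critic interfaces are placeholders, not the implementation from the paper:

```python
import torch

def bib_ae_loss(x, x_rec, mu, logvar, z, critic_rec, critic_latent,
                beta_kld=0.1, beta_lat=1.0, beta_rec=1.0, beta_adv=1.0):
    """Schematic combination of the four BIB-AE loss terms."""
    # (1) KLD regularization of the latent space
    kld = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    # (2) Sampling-based latent regularization (here: an adversarial critic on z)
    latent_adv = -critic_latent(z).mean()
    # (3) L_N reconstruction norm (here: L1) between original and reconstructed shower
    rec = torch.nn.functional.l1_loss(x_rec, x)
    # (4) Adversarial term on the reconstructed shower
    rec_adv = -critic_rec(x_rec).mean()
    return beta_kld * kld + beta_lat * latent_adv + beta_rec * rec + beta_adv * rec_adv
```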