Sascha Meykopff (DESY)
Since Lars left us, I am the only maintainer of the parallelized ASTRA. The parallelized version is still under development, and bug reports are welcome. For bug reports and binary download requests, send an email to SASCHA (DOT) MEYKOPFF (AT) DESY (DOT) DE. For information on ASTRA in general, please contact Klaus Flöttmann at DESY. Outdated information and old parallelized ASTRA downloads are still available from the old homepage.
Upstream update: new WAKE namelist
The current patch set supports nearly all parameters from the official ASTRA manual. Currently, the second emission is disabled and the debunch parameter is untested. The patch set supports parallel emission of particles; this does not work with the old parallel version. Sometimes the decision whether the space charge field is scaled or recalculated depends on the number of cores used, and the number of iterations then differs between runs. Decrease the parameter MAX_SCALE if you need to fix this. In most cases this affects only the FFT space charge solver with long-distance tracking. Please request a new version if you need rotated elements.
I ran the 'test1' benchmark from the Photo Injector Test Facility at DESY, Location Zeuthen (PITZ) on the ferrari3 cluster. This test emits 200k particles and tracks them over 10 meters. The space charge calculation is enabled with N_long_in=80 and Nrad=40 (files: test1.in, Generator.in, Cavity, Cavity, Solenoid). The run time of the serial ASTRA version is 429 minutes.

Click on the result times to view the run time log file. The result table includes the loading time of all input files.
| Cores: | 2 | 4 | 8 | 16 | 32 | 64 | 96 | 128 | 160 | 192 | 256 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| R421 (min:s) | 162:11 | 109:34 | 64:02 | 42:21 | 27:18 | 22:26 | 18:14 | 16:50 | 16:28 | 17:31 | 18:49 |
| R57 (min:s) | 179:38 | 118:38 | 57:24 | 41:57 | 26:27 | 19:09 | 16:05 | 14:39 | 13:45 | 13:02 | 12:40 |
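
To make the scaling in the table easier to judge, the run times can be turned into speedup and parallel efficiency relative to the 429-minute serial run. The following is only a minimal sketch: it assumes the table entries are in `minutes:seconds`, and the names `to_minutes`, `scaling` and `fastest` are illustrative helpers, not part of ASTRA.

```python
# Serial ASTRA run time quoted in the text, in minutes.
SERIAL_MINUTES = 429.0

# Core counts and run times copied from the table above.
CORES = [2, 4, 8, 16, 32, 64, 96, 128, 160, 192, 256]
RUNS = {
    "R421": ["162:11", "109:34", "64:02", "42:21", "27:18", "22:26",
             "18:14", "16:50", "16:28", "17:31", "18:49"],
    "R57": ["179:38", "118:38", "57:24", "41:57", "26:27", "19:09",
            "16:05", "14:39", "13:45", "13:02", "12:40"],
}


def to_minutes(stamp):
    """Convert a 'minutes:seconds' string to minutes (assumed format)."""
    minutes, seconds = stamp.split(":")
    return int(minutes) + int(seconds) / 60.0


def scaling(run):
    """Return (cores, speedup, efficiency) tuples for one table row."""
    rows = []
    for cores, stamp in zip(CORES, RUNS[run]):
        speedup = SERIAL_MINUTES / to_minutes(stamp)
        rows.append((cores, speedup, speedup / cores))
    return rows


def fastest(run):
    """Return the core count with the shortest run time for one row."""
    return min(zip(CORES, RUNS[run]), key=lambda pair: to_minutes(pair[1]))[0]


if __name__ == "__main__":
    for run in RUNS:
        for cores, speedup, efficiency in scaling(run):
            print(f"{run} {cores:4d} cores: "
                  f"speedup {speedup:6.2f}, efficiency {efficiency:5.2f}")
        print(f"{run}: fastest at {fastest(run)} cores")
```

Read this way, the R421 row is fastest at 160 cores and slows down again beyond that, while the R57 row keeps improving up to 256 cores.
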

| | 20K particles | 200K particles | 500K particles | 1M particles | 2M particles |
|---|---|---|---|---|---|
| Single CPU run: | | | | | |
| Nrad=18;Nlong=40 | 2:55 | 13:52 | 31:39 | 1:01:42 | 2:05:21 |
| Nrad=30;Nlong=60 | 5:51 | 17:43 | 36:46 | 1:08:08 | 2:11:57 |
| Nrad=40;Nlong=100 | 12:15 | 24:48 | 44:14 | 1:16:47 | 2:21:05 |
| 4 CPU run: | | | | | |
| Nrad=18;Nlong=40 | 1:22 | 4:27 | 9:49 | 0:18:05 | 0:35:23 |
| Nrad=30;Nlong=60 | 3:09 | 6:30 | 12:05 | 0:21:14 | 0:38:15 |
| Nrad=40;Nlong=100 | 7:10 | 10:43 | 16:38 | 0:25:44 | 0:44:51 |
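
The second table mixes two time notations, so comparing the single-CPU and 4-CPU rows by eye is awkward. The sketch below assumes that two-field entries are `minutes:seconds` and three-field entries are `hours:minutes:seconds`; `to_seconds` and `speedup` are hypothetical helper names used only for this illustration.

```python
# Particle counts (column headers) and run times copied from the table above.
PARTICLES = ["20K", "200K", "500K", "1M", "2M"]

SINGLE = {
    "Nrad=18;Nlong=40": ["2:55", "13:52", "31:39", "1:01:42", "2:05:21"],
    "Nrad=30;Nlong=60": ["5:51", "17:43", "36:46", "1:08:08", "2:11:57"],
    "Nrad=40;Nlong=100": ["12:15", "24:48", "44:14", "1:16:47", "2:21:05"],
}
FOUR = {
    "Nrad=18;Nlong=40": ["1:22", "4:27", "9:49", "0:18:05", "0:35:23"],
    "Nrad=30;Nlong=60": ["3:09", "6:30", "12:05", "0:21:14", "0:38:15"],
    "Nrad=40;Nlong=100": ["7:10", "10:43", "16:38", "0:25:44", "0:44:51"],
}


def to_seconds(stamp):
    """Parse 'm:ss' or 'h:mm:ss' into seconds (assumed notations)."""
    total = 0
    for part in stamp.split(":"):
        total = total * 60 + int(part)
    return total


def speedup(grid):
    """Return the single-CPU / 4-CPU time ratio for each particle count."""
    return [to_seconds(one) / to_seconds(four)
            for one, four in zip(SINGLE[grid], FOUR[grid])]


if __name__ == "__main__":
    for grid in SINGLE:
        ratios = ", ".join(f"{count}: {ratio:.2f}"
                           for count, ratio in zip(PARTICLES, speedup(grid)))
        print(f"{grid} -> {ratios}")
```

Under these assumptions, the speedup on 4 CPUs grows with the particle count: for Nrad=18;Nlong=40 it rises from roughly 2.1 at 20K particles to roughly 3.5 at 2M particles.
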