GK: loss cone updater refactored for arbitrary geometry by Maxwell-Rosen · Pull Request #997 · ammarhakim/gkeyll

Maxwell-Rosen · 2026-04-21T15:49:37Z

Hamiltonian-based Classification of Loss and Trapped Orbits in Mirror Geometry

This PR was generated with the assistance of GitHub Copilot for the unit tests and the updater functions. Most of it is LLM-generated, but I've read it all and understand it.

Consider a one-dimensional coordinate $z\in[z_L,z_R]$ aligned with a magnetic field line, with magnetic-field magnitude $B(z)$ and electrostatic potential $\phi(z)$. For a species of mass $m_s$ and charge $q_s$, the guiding-center Hamiltonian in $(z,v_\parallel,\mu)$ coordinates is

$$ H(z,v_\parallel,\mu)=\frac{1}{2}m_s v_\parallel^2+\mu B(z)+q_s\phi(z), $$

where $v_\parallel$ is the parallel velocity and $\mu$ is the magnetic moment. In the absence of collisions, $H$ and $\mu$ are invariants of motion, and particle trajectories lie along contours of constant $H$ in $(z, v_\parallel)$.
Turning points satisfy $v_\parallel=0$, hence

$$H=U(z,\mu), \qquad U(z,\mu)= \mu B(z)+q_s\phi(z), $$

where $U$ is the Yushmanov potential.
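As a small illustration of these definitions, the following Python sketch evaluates $H$ and $U$ at a point and tests whether another location is energetically accessible ($H \ge U$ there). The function names and the normalized sample values are assumptions made for this example only; they are not Gkeyll APIs.

```python
def yushmanov_potential(mu, B, phi, q_s):
    """U = mu*B + q_s*phi at a single location (hypothetical helper)."""
    return mu * B + q_s * phi


def hamiltonian(v_par, mu, B, phi, m_s, q_s):
    """H = (1/2) m_s v_par^2 + U at a single location (hypothetical helper)."""
    return 0.5 * m_s * v_par**2 + yushmanov_potential(mu, B, phi, q_s)


def is_accessible(H, mu, B, phi, q_s):
    """A location can be reached only if H >= U there; H == U is a turning point."""
    return H >= yushmanov_potential(mu, B, phi, q_s)


# Normalized, illustrative values (assumed, not taken from any simulation).
m_s, q_s, mu = 1.0, 1.0, 0.5
H = hamiltonian(v_par=1.0, mu=mu, B=1.0, phi=0.0, m_s=m_s, q_s=q_s)

reach_weak_field = is_accessible(H, mu, B=1.5, phi=0.0, q_s=q_s)    # U = 0.75
reach_strong_field = is_accessible(H, mu, B=3.0, phi=0.0, q_s=q_s)  # U = 1.5
```

With these numbers the particle can reach the location with $B=1.5$ but is reflected before reaching $B=3$.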

Prior work identified the trapped/passing boundary using the loss-cone criterion

$$\mu = \frac{\frac{1}{2}m_s v_\parallel^2 + q_s\left(\phi(z) - \phi_m\right)}{B_m - B(z)}.$$

In that implementation, $B_m$ and $\phi_m$ are evaluated at the peaks of $B$. However, the effective potential experienced by particles is $U$, which combines $B$ and $\phi$, so its maxima need not coincide with the peaks of $B$. This distinction can be essential for identifying the trapped region of phase space induced by angled neutral-beam injection of ions at $45^\circ$, where an off-axis density peak occurs. Furthermore, the previous implementation uses only a single loss criterion per region, whereas several can exist, for instance in tandem mirrors. A Hamiltonian-based region-identification algorithm is more accurate and flexible because it works directly with the Yushmanov potential.
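For reference, the prior loss-cone boundary can be written as a short Python sketch. The function name `loss_cone_mu` and the sample values are assumptions for illustration. Comparing $H$ at the point with $U$ at the peak shows that magnetic moments above this boundary correspond to trapped particles.

```python
def loss_cone_mu(v_par, B, phi, B_m, phi_m, m_s, q_s):
    """Boundary magnetic moment from the single-peak loss-cone criterion.

    Hypothetical helper: B_m and phi_m are the field magnitude and potential
    at the peak of B; B and phi are the values at the particle location.
    """
    if B_m <= B:
        raise ValueError("the criterion requires B_m > B(z)")
    return (0.5 * m_s * v_par**2 + q_s * (phi - phi_m)) / (B_m - B)


# Normalized, illustrative values (assumed).
mu_no_potential = loss_cone_mu(v_par=1.0, B=1.0, phi=0.0, B_m=2.0, phi_m=0.0,
                               m_s=1.0, q_s=1.0)
mu_with_potential = loss_cone_mu(v_par=1.0, B=1.0, phi=0.5, B_m=2.0, phi_m=0.0,
                                 m_s=1.0, q_s=1.0)
```

In this example a potential that is higher at the particle location than at the peak raises the boundary for a positive charge, so fewer particles are classified as trapped.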

Two-Wall Escape Criterion

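One issue with a Hamiltonian-based identifier is that $H$ alone does not uniquely identify the region: the same value of $H$ can occur at several locations along the field line. For instance, consider a symmetric simple mirror without an electric potential, with just two peaks in $B$. The magnetic field is fourfold degenerate at the half-maximum, meaning the same field strength occurs at four values of $z$. The points outside the peaks in $B$ should be labeled passing, while those between the peaks should be labeled trapped.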

A trajectory at fixed $(H,\mu)$ can escape through the left wall at $z_L$ if it can traverse the interval $[z_L,z]$ without violating $H\ge U$. For a phase-space point $(z,v_\parallel,\mu)$, define the escape barrier to the left wall as

$$EB_L(z,\mu)=\max_{s\in[z_L,z]}U(s,\mu), $$

and the escape barrier to the right wall as

$$EB_R(z,\mu)=\max_{s\in[z,z_R]}U(s,\mu). $$

The minimum escape barrier required for escape through at least one wall is

$$EB(z,\mu)=\min\left(EB_L(z,\mu),EB_R(z,\mu)\right). $$

Define the signed loss function

$$\mathcal{F}(z,v_\parallel,\mu)=H(z,v_\parallel,\mu)-EB(z,\mu). $$

$\mathcal{F}\ge 0$ means that no energy barrier lies between this point and at least one wall, indicating a passing orbit. If $\mathcal{F}<0$, the escape barrier exceeds the energy at this point, so the orbit is trapped. The orbiting (trapped) mask is therefore

$$\chi_{\rm orbit}(z, v_\parallel,\mu) = \mathrm{bool}(\mathcal{F}(z,v_\parallel,\mu)<0)$$
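The two-wall criterion can be sketched on a discrete grid using running maxima from each wall. This is a minimal Python illustration under stated assumptions, not the Gkeyll updater: the names `escape_barriers` and `trapped_mask`, the plain-list grid, and the normalized two-peak profile are all invented for the example.

```python
from itertools import accumulate


def escape_barriers(U):
    """Return (EB_L, EB_R, EB) for U sampled on an ordered grid from z_L to z_R."""
    eb_left = list(accumulate(U, max))                   # max of U over [z_L, z]
    eb_right = list(accumulate(reversed(U), max))[::-1]  # max of U over [z, z_R]
    eb = [min(l, r) for l, r in zip(eb_left, eb_right)]
    return eb_left, eb_right, eb


def trapped_mask(H, U):
    """True where F = H - EB < 0, i.e. where the point is classified as trapped."""
    _, _, eb = escape_barriers(U)
    return [h - b < 0.0 for h, b in zip(H, eb)]


# Symmetric simple mirror with no potential: two peaks in B (assumed values).
B = [1.0, 1.5, 2.0, 1.5, 1.0, 1.5, 2.0, 1.5, 1.0]
mu = 1.0
U = [mu * b for b in B]
# Give every grid point the same parallel kinetic energy of 0.25.
H = [u + 0.25 for u in U]

mask = trapped_mask(H, U)
```

The four grid points with $B=1.5$ all have the same $H$, yet the two outside the peaks come out as passing and the two between the peaks come out as trapped, which is the distinction that $H$ alone cannot make.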

Inclusion of Sheath Wall Potentials

The field $\phi(z)$ does not directly represent the sheath drop at the wall; instead, Gkeyll's sheath boundary conditions assume that the wall potential is $\phi=0$. Let $\phi_L^{\mathrm{bc}}$ and $\phi_R^{\mathrm{bc}}$ denote prescribed wall potentials at $z_L$ and $z_R$, respectively, and let $\phi(z_L)$ and $\phi(z_R)$ be the values implied by the loaded field data near the walls. The prescribed wall potentials are left general here, but are taken as zero in the simulations. The effective wall potentials are taken as

$$\begin{aligned} U_L^{\mathrm{wall}}(\mu) &=\max\left(\mu B(z_L)+q_s\phi(z_L),\mu B(z_L)+q_s\phi_L^{\mathrm{bc}}\right), \\ U_R^{\mathrm{wall}}(\mu) &=\max\left(\mu B(z_R)+q_s\phi(z_R),\mu B(z_R)+q_s\phi_R^{\mathrm{bc}}\right). \end{aligned}$$

The corresponding augmented path barriers are

$$\begin{aligned} \widetilde{EB}_L(z,\mu)&=\max\left(EB_L(z,\mu),U_L^{\mathrm{wall}}(\mu)\right), \\ \widetilde{EB}_R(z,\mu)&=\max\left(EB_R(z,\mu),U_R^{\mathrm{wall}}(\mu)\right), \end{aligned}$$

with an augmented escape barrier

$$\widetilde{EB}(z,\mu)=\min\left(\widetilde{EB}_L(z,\mu),\widetilde{EB}_R(z,\mu)\right). $$

Replacing $EB$ by $\widetilde{EB}$ yields a classifier that includes sheath reflection consistently.
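The wall augmentation can be sketched as follows. This Python snippet is an illustration only: `augmented_escape_barrier`, its arguments, and the sample profile are assumptions, with the first and last grid points standing in for $z_L$ and $z_R$.

```python
from itertools import accumulate


def augmented_escape_barrier(B, phi, mu, q_s, phi_bc_left=0.0, phi_bc_right=0.0):
    """Escape barrier including prescribed wall potentials (hypothetical helper).

    B and phi are sampled on an ordered grid whose end points are the walls.
    """
    U = [mu * b + q_s * p for b, p in zip(B, phi)]
    # Effective wall potentials: the larger of the loaded and prescribed values.
    u_wall_left = max(U[0], mu * B[0] + q_s * phi_bc_left)
    u_wall_right = max(U[-1], mu * B[-1] + q_s * phi_bc_right)
    # Path barriers from running maxima, raised to the wall values where needed.
    eb_left = [max(b, u_wall_left) for b in accumulate(U, max)]
    eb_right = [max(b, u_wall_right) for b in accumulate(reversed(U), max)][::-1]
    return [min(l, r) for l, r in zip(eb_left, eb_right)]


# Uniform B with a loaded potential that dips near both walls (assumed values).
B = [1.0, 1.0, 1.0]
phi = [-1.0, 0.0, -1.0]
eb_aug = augmented_escape_barrier(B, phi, mu=0.5, q_s=1.0)
```

Without the wall term, the barrier at the end points would be the local value $U=-0.5$. With zero prescribed wall potentials it is raised to $0.5$ everywhere, so a positively charged particle with $H<0.5$ near the wall is classified as reflected rather than lost.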

Integration into the Gkeyll Workflow

Within Gkeyll, the updater first interpolates the fields to nodal values, computes the escape barrier $EB$, and evaluates the loss function $\mathcal{F}$ at the nodes. If any corner of a cell is not orbiting, that cell is not evolved.
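The corner rule in the last step can be sketched in Python. This is an assumption-laden illustration rather than the actual updater, which operates on Gkeyll arrays: here the nodal classification is a nested list indexed by $(z, v_\parallel)$ node, and `cell_mask_from_nodes` is a hypothetical name.

```python
def cell_mask_from_nodes(node_orbiting):
    """Mark a cell as evolved only if all four of its corner nodes are orbiting.

    node_orbiting is an (nz+1) x (nv+1) nested list of booleans on cell corners.
    Returns an nz x nv nested list of booleans, one per cell.
    """
    nz = len(node_orbiting) - 1
    nv = len(node_orbiting[0]) - 1
    return [
        [
            all(node_orbiting[i + di][j + dj] for di in (0, 1) for dj in (0, 1))
            for j in range(nv)
        ]
        for i in range(nz)
    ]


# 3 x 3 nodes bounding 2 x 2 cells; one corner node is not orbiting (assumed data).
nodes = [
    [True, True, False],
    [True, True, True],
    [True, True, True],
]
cells = cell_mask_from_nodes(nodes)
```

Only the cell that touches the non-orbiting corner is excluded, which makes the mask conservative: a cell is evolved only when it lies entirely inside the orbiting region.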

Community Standards

Documentation has been updated.
My code follows the project's coding guidelines.
Changes to layer/zero should have a unit test, e.g., core/zero.

Testing: (x (yes), blank (no))

Additional notes

Here is a comparison in a beam simulation with the current loss mask and the old one.
This is an R=32 mirror case with Boltzmann electrons, showing the difference in the fdot multiplier, that is, new_mask - old_mask. Red marks cells that are masked only by the new updater, while blue marks cells that were masked by the old updater but are no longer masked. We see the population of small-$\mu$ particles that are trapped due to the off-axis beam injection, which is what we expect.

Here is another view, this time in pyvista

Here is a simulation run with the new loss cone mask. I'm re-running a beam R=32 case, which was used for the paper "Gyrokinetic equilibria of high temperature superconducting magnetic mirrors".

This did decrease the time step during the OAP to 3.4e-7 s, versus 2.6e-6 s. Perhaps a coarser mesh in $\mu$ would help.

…e maximum magnetic field is determined. Bmag max is stored as a gkyl_array. Right now, we only do this for bmag, but we need to store phi as a 1d maximum array. I haven't decided on the final design for how 2x OAP simulations should be accommodated. Perhaps we need some general gkyl_dg_array_reduce methods that take a 2D array and turn it into a 1D array instead of a 0D number. This is a kind of reduction method, but it's not a total reduction. I tested the regression test included for a 2x2v Boltzmann mirror and the output of the magnetic field looks correct. The current implementation evaluates bmag at cell corners, but ideally we should use the corners in Z and the quadrature nodes in psi.

- Introduced a new header file `gkyl_array_dg_find_peaks.h` that defines a structure and functions for finding peaks (local maxima, minima, and boundary values) in DG fields.
- Implemented an internal structure in `gkyl_array_dg_find_peaks_priv.h` to manage peak finding operations, including storage for peak values and coordinates.
- Removed unused initialization and writing of `bmag_max` arrays in `gyrokinetic.c` to streamline the geometry setup process.
- Deleted the `gkyl_gk_geometry_bmag_max_init` and `gkyl_gk_geometry_bmag_max_release` functions from `gk_geometry.c` as they are no longer needed, simplifying the geometry management.

…es another DG array at the peaks of the initialized array. Add appropriate unit tests which pass to ctest_array_dg_find_peaks. Update gk_species_fdot_multiplier to use the project_on_peaks function with phi. Now, everything passed to loss_cone_mask_gyrokinetic is a gkyl_array. The loss_cone_mask is updated accordingly

…ount of computation we need for evaluating phi at its peak in the app. Unit tests pass. Regression tests look fine as well. They're all valgrind clean. I think the right way to do the parallelism is to do the peak finding on a global bmag, just like how it is done for the position_map. Then, when we evaluate phi, all processes evaluate it at this peak, but only one will return a true value. This process will broadcast the array to the rest of the processes

…formance
- Introduced a helper function `mkarr` to streamline array allocation for GPU and CPU.
- Removed the `gkyl_loss_cone_mask_gyrokinetic_Dbmag_quad_wall` function and integrated its logic into the main processing flow.
- Updated the GPU kernel `gkyl_loss_cone_mask_gyrokinetic_Dbmag_quad_cu_ker` to compute `Dbmag_quad` directly from `bmag_peak` instead of `bmag_max`.
- Enhanced tandem mirror support by adding handling for `bmag_peak` and `phi_m` in the GPU kernels.
- Simplified the logic for determining trapped particles in the `gkyl_loss_cone_mask_gyrokinetic_ker` and `gkyl_loss_cone_mask_gyrokinetic_quad_ker` functions.
- Improved readability and maintainability by restructuring conditional checks and variable assignments.

…plier. Add possibility of kinetic electrons and tandem mirrors. The damping regression test is failing, both here and on main, but for different reasons. Main fails because the loss_cone updater has an issue. Here, it fails because it's using scale_by_cell with a multi-component array. I'm not sure of the right way to fix this

… arrays need to be passed to objects like the loss_cone_mask, where it expects these to be GPU arrays. It's just easier to have this module fit the architecture of the rest of the code, rather than doing something different and copying between device and host. It wouldn't interface well. Claude generated most of the CUDA code, with strong guidance from Maxwell

…h compute sanitizer with the array_find_peaks which was causing crashes in the loss cone mask. These issues are fixed. There was some funny business regarding the basis being on the host vs device. Refactor the allocations in the GPU kernels to not be inside the kernels. Instead, it's allocated at init time. The GPU code pulses, which is odd, but it runs. The 2x2v POA regression test runs and is compute sanitizer clean. The other POA tests do not error either on GPU and are compute sanitizer clean.

- Removed unused includes and commented-out code for clarity.
- Simplified the initialization of quadrature values by consolidating the logic into a single function.
- Introduced a new function to compute escape barriers based on the potential and magnetic field.
- Replaced the previous approach of handling quadrature nodes with a more efficient nodal representation.
- Updated the main advance function to utilize the new escape barrier calculations and streamline the trapped particle detection logic.
- Cleaned up memory management by ensuring proper release of allocated arrays.

- Removed unused inline functions for node coordinate matching and field value calculations in loss_cone_mask_gyrokinetic.c.
- Simplified the escape_barriers function by directly integrating its logic into the main kernel.
- Updated the CUDA kernel for loss cone mask gyrokinetic to streamline the computation of escape barriers and Hamiltonian checks.
- Enhanced the handling of configuration and phase ranges in the CUDA implementation for better clarity and performance.
- Adjusted the kernel launch parameters to accommodate the new structure of the code and ensure proper execution.

Maxwell-Rosen added 30 commits December 10, 2025 14:21

Remove some memory allocation during the advance methods and evaluati…

aa4afa0

…on at the peak locations

Remove old elements from gk_geometry. Add a agkyl_array_dg_reduce_dir…

976b6bd

… method, which is just like find_peaks, but it computes the global maximum or minimum.

Merge branch 'main' into gk-oap-2x-multispecies

61fa5ca

Implement kinetic electrons into the POA scheme. The unit tests run a…

3b933bb

…nd regression tests are brought over from another branch. Unit tests for the array mask, loss cone mask, and the regression tests for the kinetic electron POA mirror are valgrind free

Add support for symmetric tandem mirrors. Unit tests pass and are val…

79133e7

…grind clean. Regression test is added and produces reasonable results.

gk_species_damping is working on this branch too now. I needed to rem…

a6bff16

…ove the aspects about the cellwise evaluation and quadrature points because that breaks the array_scale_by_cell method which is used

Add another unit test to the loss cone mask

e2bc28f

Merge branch 'main' into gk-oap-2x-tandem-multispecies

fe0e3ed

Update variable name

1fb0516

GPU unit tests all pass. It's compute sanitizer clean. I will have to…

ce57b6a

… refactor this in the future, but it is just proof of concept for now to make sure it works correctly.

Merge branch 'main' into gk-oap-2x-tandem-multispecies

f3977bf

Fix loss cone mask unit test. Someone didn't run make check when comm…

909d741

…itting code to main and broke a unit test with the geometry enum changes

fdot multiplier works in parallel. Unit test fixed

c689992

Refactor global potential handling in damping modules to use shared p…

eed58bc

…hi_smooth_global array for improved performance and consistency across computations.

Fix missing semicolon in gk_species_damping_release function for prop…

0821a27

…er array release

remove the array_reduce_dir object. This is leftover from an old vers…

dec21d3

…ion of doing the kinetic electron and tandem mirror. The code is built again to make sure nothing is affected.

Merge branch 'main' into gk-oap-2x-tandem-multispecies

a807e79

Uncrustify format the files this PR modifies and also deprecate all p…

6a4931a

…=2 relevant code for the peak finders

Merge branch 'main' into gk-oap-2x-tandem-multispecies

34d6229

Merge branch 'main' into gk-oap-2x-tandem-multispecies

740cf5a

Fix an issue with the non-uniform grids in the 2x2v nonuniform wham r…

062d869

…egression test (and in my production simulations)

Merge branch 'main' into gk-oap-2x-tandem-multispecies

5eb57ae

Maxwell-Rosen added 19 commits March 13, 2026 11:34

Merge branch 'main' into gk-oap-2x-tandem-multispecies

153d7b2

Reduce the number of frames in the regression tests

6d95ed7

Add sprintf to the app name

443c76c

Remove c2p from the loss cone mask, as it shouldn't be there since ev…

122f3be

…erything is done in computational coordinates

Remove c2p context from damping

f8f799b

Merge branch 'main' into gk-oap-2x-tandem-multispecies

92a3c05

Merge branch 'main' into gk-oap-2x-tandem-multispecies

6785b3f

Merge branch 'main' into gk-oap-2x-tandem-multispecies

a3ee0ac

Refactor damping implementation: remove loss cone damping references …

65aff2f

…and switch to user-defined damping type in simulation context.

Refactor position mapping logic: improve limiter line calculations an…

a69b487

…d enhance comments for clarity.

Refactor loss cone mask implementation: streamline bmag handling and …

888f367

…remove unnecessary parameters

Refactor loss cone mask implementation: update bmag handling and enha…

894c31c

…nce communication logic for global data assembly

Add unit test for loss cone mask gyrokinetic implementation: validate…

87f8027

… parallel execution with MPI

Refactor loss cone mask implementation: enhance barrier calculations …

f3f575b

…and streamline node handling logic

Refactor loss cone mask implementation: add GPU/CPU mismatch checks a…

c71d782

…nd enhance mask handling

Refactor loss cone mask implementation: add extern "C" linkage for CU…

aacc2fb

…DA functions

Remove the find_peaks method. Nothing is using it anymore. It is usef…

96d7090

…ul, but let's not clutter the codebase

Maxwell-Rosen marked this pull request as draft April 21, 2026 15:50

Maxwell-Rosen requested review from JunoRavin and manauref April 21, 2026 16:48

Maxwell-Rosen added 2 commits April 21, 2026 13:50

I pushed to the wrong branch. I forgot the GPU flag in this branch too

51a51cd

Cleanup: remove unnecessary blank lines and adjust formatting in loss…

139f14a

… cone mask files

Maxwell-Rosen marked this pull request as ready for review April 24, 2026 21:27

Maxwell-Rosen added 4 commits May 27, 2026 13:57

Merge branch 'main' into gk-lc-mask-yushmanov

1528838

Merge branch 'main' into gk-lc-mask-yushmanov

67be09b

Merge branch 'main' into gk-lc-mask-yushmanov

485ecb4

Adjust spacing

4bea130

manauref mentioned this pull request Jun 22, 2026

Fix fromfile initial conditions #1067

Closed

Maxwell-Rosen wants to merge 60 commits into main from gk-lc-mask-yushmanov
