{
  "revision_id": 4,
  "updated": "2025-02-21T03:34:39.590202+00:00",
  "links": {
    "bibtex": "https://inspirehep.net/api/literature/2825799?format=bibtex",
    "latex-eu": "https://inspirehep.net/api/literature/2825799?format=latex-eu",
    "latex-us": "https://inspirehep.net/api/literature/2825799?format=latex-us",
    "json": "https://inspirehep.net/api/literature/2825799?format=json",
    "json-expanded": "https://inspirehep.net/api/literature/2825799?format=json-expanded",
    "cv": "https://inspirehep.net/api/literature/2825799?format=cv",
    "citations": "https://inspirehep.net/api/literature/?q=refersto%3Arecid%3A2825799"
  },
  "metadata": {
    "citation_count": 4,
    "citation_count_without_self_citations": 4,
    "core": true,
    "titles": [
      {
        "title": "Multiple right hand side multigrid for domain wall fermions with a multigrid preconditioned block conjugate gradient algorithm",
        "source": "arXiv"
      }
    ],
    "$schema": "https://inspirehep.net/schemas/records/hep.json",
    "authors": [
      {
        "uuid": "3b6e1c6a-4473-40fb-9c92-4952a2570d28",
        "record": {
          "$ref": "https://inspirehep.net/api/authors/1015525"
        },
        "full_name": "Boyle, Peter A.",
        "affiliations": [
          {
            "value": "Brookhaven",
            "record": {
              "$ref": "https://inspirehep.net/api/institutions/902689"
            }
          }
        ],
        "raw_affiliations": [
          {
            "value": "Physics Department, Brookhaven National Laboratory, Upton, 11777, NY, USA"
          }
        ]
      }
    ],
    "curated": true,
    "figures": [
      {
        "key": "c1462f6c57fb6daa021077c08765b0e1",
        "url": "https://inspirehep.net/files/c1462f6c57fb6daa021077c08765b0e1",
        "label": "fig:convergence-cg",
        "source": "arxiv",
        "caption": "Convergence of the red-black preconditioned conjugate gradient algorithm with physical quark mass on the Mobius domain wall fermion action with the Schur operator ${\\cal H}$ and on our $1.7$ GeV $48^3\\times 96 \\times 24$ (5.5fm) and $96^3\\times 192 \\times 24$ (11fm) test configurations. The volume of eigenvector data and memory required to deflated the latter configuration is prohibitive as the cost grows as the square of the volume, whereas the multigrid solver setup, cost and footprint is proportional to the volume. The convergence is dictated by the spectrum and since the lattice spacing, quark mass and fermion action are identical between the two volumes (which differ by a factor of 16) the spectral density and convergence history remain almost identical. The system is solved with a polynomial order that is far less than the rank of both systems, which have a dense spectrum, so that each crossing of residual polynomial is covering a large number of eigenvalues in both cases, meaning the worst case bound is a reasonable description.",
        "filename": "convergence_cg.png",
        "material": "preprint"
      },
      {
        "key": "2927c467284a427103db190802097a37",
        "url": "https://inspirehep.net/files/2927c467284a427103db190802097a37",
        "label": "fig:completenessLanczos",
        "source": "arxiv",
        "caption": "On our $48^3\\times 96$ test volume we can assess the completeness of the low mode by computing $ E_i = \\sqrt{|| (1 - PP^\\dagger) | i \\rangle ||}$ and $E_i^{smoothed} = \\sqrt{|| M_{IRS}(1 - PP^\\dagger) | i \\rangle ||}$ for each eigenpair, here with the Filter based subspace setup (left) and Lanczos based setup (right). Post-smoothing reduces the error and may be indicative of a relative cheap way to improve coarse operator based low mode variance reduction. The exact eigenvectors included in the coarse basis (right) are faithfully reproduced to numerical precision, while higher modes have a percent scale error.",
        "filename": "CompletenessFilter.png",
        "material": "preprint"
      },
      {
        "key": "0e6cca61e9fe3b880e366f96d3122db3",
        "url": "https://inspirehep.net/files/0e6cca61e9fe3b880e366f96d3122db3",
        "label": "fig:completenessLanczos",
        "source": "arxiv",
        "caption": "On our $48^3\\times 96$ test volume we can assess the completeness of the low mode by computing $ E_i = \\sqrt{|| (1 - PP^\\dagger) | i \\rangle ||}$ and $E_i^{smoothed} = \\sqrt{|| M_{IRS}(1 - PP^\\dagger) | i \\rangle ||}$ for each eigenpair, here with the Filter based subspace setup (left) and Lanczos based setup (right). Post-smoothing reduces the error and may be indicative of a relative cheap way to improve coarse operator based low mode variance reduction. The exact eigenvectors included in the coarse basis (right) are faithfully reproduced to numerical precision, while higher modes have a percent scale error.",
        "filename": "CompletenessLanczos.png",
        "material": "preprint"
      },
      {
        "key": "f4499acd8e430843500dab7d30800774",
        "url": "https://inspirehep.net/files/f4499acd8e430843500dab7d30800774",
        "label": "fig:exterior-zoom",
        "source": "arxiv",
        "caption": "Spectrum of the lowest 200 eigenmodes of the fine operator ${\\cal H}$ and the coarse operator with both Lanczos (green) and Chebyshev filter (blue) setup schemes with $62$ near null basis vectors on the $48^3$ lattice. A cluster of exactly $n_{basis}=62 $ very low eigenvalues are seen in the coarse operator, corresponding to the maximal diagonalisation of the fine operator within this set of near null vectors, whereas directions that involve a non-trivial coarse coordinate dependence in the coarse eigenvector necessarily incur spectral leakage at the boundaries between blocks and this lifts the coarse eigenvalue by an order of magnitude. The upper eigenvalue of the fine operator is around 89.0 while the upper eigenvalue of the coarse operator is around 37.0. On the right panel we zoom in on the lowest eigenvalues. We see that with the eigenvector setup the lowest eigenvalues are exactly reproduced by the coarse operator. This likely contributes a to a slightly improved convergence rate in section~\\ref{sec:solverresults}.",
        "filename": "Evals_compare.png",
        "material": "preprint"
      },
      {
        "key": "a99754194167e0584dff489225b00538",
        "url": "https://inspirehep.net/files/a99754194167e0584dff489225b00538",
        "label": "fig:exterior-zoom",
        "source": "arxiv",
        "caption": "Spectrum of the lowest 200 eigenmodes of the fine operator ${\\cal H}$ and the coarse operator with both Lanczos (green) and Chebyshev filter (blue) setup schemes with $62$ near null basis vectors on the $48^3$ lattice. A cluster of exactly $n_{basis}=62 $ very low eigenvalues are seen in the coarse operator, corresponding to the maximal diagonalisation of the fine operator within this set of near null vectors, whereas directions that involve a non-trivial coarse coordinate dependence in the coarse eigenvector necessarily incur spectral leakage at the boundaries between blocks and this lifts the coarse eigenvalue by an order of magnitude. The upper eigenvalue of the fine operator is around 89.0 while the upper eigenvalue of the coarse operator is around 37.0. On the right panel we zoom in on the lowest eigenvalues. We see that with the eigenvector setup the lowest eigenvalues are exactly reproduced by the coarse operator. This likely contributes a to a slightly improved convergence rate in section~\\ref{sec:solverresults}.",
        "filename": "Evals_compare_zoom.png",
        "material": "preprint"
      },
      {
        "key": "f2035a2b005478d5a77a9c8f94647bfb",
        "url": "https://inspirehep.net/files/f2035a2b005478d5a77a9c8f94647bfb",
        "label": "fig:48dnsty",
        "source": "arxiv",
        "caption": "Spectral density (modes per bin, bin width $5\\times 10^5$) for the coarse operator on the $48^3$ volume (left) and $96^3$ volume (right). There is a peak of $n_{basis}=62$ modes in the correct physical low mode region, corresponding to the dimension of the input set of near null vectors, with the eigenvectors in practice found by diagonalising mainly within this basis. Directions outside this sub-space necessarily induce upwards spectral leakage leaving this cluster clearly detached from the bulk spectrum with about 2 or 3 modes per bin. If we compare these, we see that the detached cluster again corresponds to the number of basis vectors defining the coarsening, but the density of modes in the higher, bulk spectrum grows linearly in the volume by exactly the expected factor of 16.",
        "filename": "48cube_density.png",
        "material": "preprint"
      },
      {
        "key": "791d28de6f11b24b830a94a317795e37",
        "url": "https://inspirehep.net/files/791d28de6f11b24b830a94a317795e37",
        "label": "fig:48dnsty",
        "source": "arxiv",
        "caption": "Spectral density (modes per bin, bin width $5\\times 10^5$) for the coarse operator on the $48^3$ volume (left) and $96^3$ volume (right). There is a peak of $n_{basis}=62$ modes in the correct physical low mode region, corresponding to the dimension of the input set of near null vectors, with the eigenvectors in practice found by diagonalising mainly within this basis. Directions outside this sub-space necessarily induce upwards spectral leakage leaving this cluster clearly detached from the bulk spectrum with about 2 or 3 modes per bin. If we compare these, we see that the detached cluster again corresponds to the number of basis vectors defining the coarsening, but the density of modes in the higher, bulk spectrum grows linearly in the volume by exactly the expected factor of 16.",
        "filename": "96cube_density.png",
        "material": "preprint"
      },
      {
        "key": "3b54775f30670b886e3f3d865d26a4ae",
        "url": "https://inspirehep.net/files/3b54775f30670b886e3f3d865d26a4ae",
        "label": "fig:convergenceall",
        "source": "arxiv",
        "caption": "(Left) Convergence of mrhs-HDCG and red-black preconditioned Conjugate Gradient on sample $48^3\\times 96$ and $96^3\\times 192$ configurations. (Right) Zoomed comparison between the two volumes on the HDCG convergence history.",
        "filename": "convergence_all.png",
        "material": "preprint"
      },
      {
        "key": "03f3ea4fb52f5bf37638373c61aceb8f",
        "url": "https://inspirehep.net/files/03f3ea4fb52f5bf37638373c61aceb8f",
        "label": "fig:convergenceall",
        "source": "arxiv",
        "caption": "(Left) Convergence of mrhs-HDCG and red-black preconditioned Conjugate Gradient on sample $48^3\\times 96$ and $96^3\\times 192$ configurations. (Right) Zoomed comparison between the two volumes on the HDCG convergence history.",
        "filename": "convergence_hdcg.png",
        "material": "preprint"
      },
      {
        "key": "18fbff715e5e46abf0e83793c13c5b68",
        "url": "https://inspirehep.net/files/18fbff715e5e46abf0e83793c13c5b68",
        "label": "fig:convergenceblockcg",
        "source": "arxiv",
        "caption": "Convergence of Flexible ADEF2 and preconditioned BlockCGrQ on the $48^3$ configurations with 12 right hand sides and 62 basis vectors (either eigenvectors or filtered noise vectors). The Block algorithm substantially reduces the difference between the eigenvector and filtered vector basis creation choices. The Block algorithm appears to better tolerate an imperfect setup than preconditioned CG alone. With 24 right hand sides and the Filter setup there is clearer evidence of the superlinear convergence property of BlockCG, however with the current software implementation the linear algebra overhead is too large to make this beneficial.",
        "filename": "convergence_hdcg_pbcgrq.png",
        "material": "preprint"
      },
      {
        "key": "577bb8f105e346119e351b9a146f383e",
        "url": "https://inspirehep.net/files/577bb8f105e346119e351b9a146f383e",
        "label": "fig:JacksonThetaFunctions",
        "source": "arxiv",
        "caption": "Chebyshev polynomial spectral bandpass filters used to evaluate the power spectrum of different operators entering our multigrid solver. This is an interesting probe and diagnostic of our algorithms.",
        "filename": "BandpassJackson.png",
        "material": "preprint"
      },
      {
        "key": "1ce57de2399e0559f9ec8b2261bf117b",
        "url": "https://inspirehep.net/files/1ce57de2399e0559f9ec8b2261bf117b",
        "label": "fig:sunspot",
        "source": "arxiv",
        "caption": "Profile of the fine grid domain wall operator running on a single node of the ANL Sunspot supercomputer. Intra-node communications are performed using DMA engines programmed via SYCL memcpy instructions and using the Level Zero API shared memory support. After gathering face data (multiple colours), communications (red) and computation (purple) are well overlapped, and the halo exchange  for the PDE stencil is performed concurrently with computation. The additional terms from the surface are added in as a second major kernel call. The performance in single precision is over 18 TF/s per node on large local volumes.",
        "filename": "64.64.32.96_perf.png",
        "material": "preprint"
      },
      {
        "key": "d761d571d662ce75407e45436d4b9e3a",
        "url": "https://inspirehep.net/files/d761d571d662ce75407e45436d4b9e3a",
        "label": "fig:blas",
        "source": "arxiv",
        "caption": "Performance of batched GEMM operations in TF/s as used in mrhs-HDCG across a variety of modern GPUs and some of the most significant current supercomputing platforms. The Matrix dimensions correspond to the application of the coarse grid operator, the projection of data from the fine grid to the coarse grid, the promotion from the coarse grid to the fine grid and the QR rotation that enters the Block conjugate gradient algorithms. These include the Frontier supercomputer at ORNL (four AMD MI250X GPUs and eight logical GCD's), the Sunspot/Aurora supercomputer at ANL (six Intel Pontevecchio GPUs and 12 logical tiles) and the Perlmutter supercomputer at NERSC. Multiple TF/s per logical GPU is easily obtained on most of the relevant matrix ranks, with the exception of the ThinQR factorisation in BlockCG on the fine grid on AMD and Intel libraries. This is relatively easily addressed in our implementation by using a batched call to execute many shorter $K$ matrix multiplications and then summing manually the resulting $12\\times 12$ matrices, and then yields several TF/s per GPU, but the vendor library delivers less than 10 GF/s performance without this approach.",
        "filename": "BlasLog.png",
        "material": "preprint"
      },
      {
        "key": "7f7b02b03167050906948907e406f0eb",
        "url": "https://inspirehep.net/files/7f7b02b03167050906948907e406f0eb",
        "label": "fig:mgridprofile",
        "source": "arxiv",
        "caption": "AMD Rocprof obtained profile from Frontier of the multigrid iteration on 18 nodes on the $48^3$ test problem after careful optimisation. The general Grid software kernels are shown executing along side GEMM kernels, used by projection to coarse, deflation, coarse Chebyshev solver, promotion to fine and then a thinQR rotation. Broadly it is possible to perform almost the entire multigrid preconditioner in BLAS routines using optimised hardware, except for the relatively modest overhead of data layout changes and of course halo-exchange routines.",
        "filename": "MgridIteration2.png",
        "material": "preprint"
      },
      {
        "key": "6119980570af640b2bdfb640bf924742",
        "url": "https://inspirehep.net/files/6119980570af640b2bdfb640bf924742",
        "label": "fig:mgridprofile",
        "source": "arxiv",
        "caption": "AMD Rocprof obtained profile from Frontier of the Coarse Grid operator on 18 nodes on the $48^3$ test problem after careful optimisation. The general Grid software kernels are shown executing along side GEMM kernels. There remains some scope for further optimisation in the coarse space operator as GPU synchronisation overhead remains a 50\\% overhead in that routine by fusing together larger batched operations, perhaps an estimated 10\\% effect on the overall solver performance.",
        "filename": "CoarseOperatorProfileAnnotated.png",
        "material": "preprint"
      }
    ],
    "license": [
      {
        "url": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
        "license": "arXiv nonexclusive-distrib 1.0",
        "material": "preprint"
      }
    ],
    "texkeys": [
      "Boyle:2024pio"
    ],
    "citeable": true,
    "abstracts": [
      {
        "value": "We introduce a class of efficient multiple right-hand side multigrid algorithm for domain wall fermions. The simultaneous solution for a modest number of right hand sides concurrently allows for a significant reduction in the time spent solving the coarse grid operator in a multigrid preconditioner. We introduce a preconditioned block conjuate gradient with a multigrid preconditioner, giving additional algorithmic benefit from the multiple right hand sides. There is also a very significant additional to computation rate benefit to multiple right hand sides. This both increases the arithmetic intensity in the coarse space and increases the amount of work being performed in each subroutine call, leading to excellent performance on modern GPU architectures. Further, the software implementation makes use of vendor linear algebra routines (batched GEMM) that can make use of high throughput tensor hardware on recent Nvidia, AMD and Intel GPUs. The cost of the coarse space is made sub-dominant in this algorithm, and benchmarks from the Frontier supercomputer system show up to a factor of twenty speed up over the standard red-black preconditioned conjugate gradient algorithm on a large system with physical quark masses.",
        "source": "arXiv"
      }
    ],
    "references": [
      {
        "raw_refs": [
          {
            "value": "[1] Peter A. Boyle. Advances in algorithms for solvers and gauge generation. 1 2024.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Advances in algorithms for solvers and gauge generation. 1"
          ],
          "label": "1",
          "authors": [
            {
              "full_name": "Boyle, Peter A."
            }
          ],
          "publication_info": {
            "year": 2024
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/753266"
        },
        "raw_refs": [
          {
            "value": "[2] Martin Luscher. Local coherence and deflation of the low quark modes in lattice QCD. JHEP, 07:081, 2007.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Martin Luscher. Local coherence and deflation of the low quark modes in lattice QCD"
          ],
          "label": "2",
          "publication_info": {
            "year": 2007,
            "artid": "081",
            "page_start": "081",
            "journal_title": "JHEP",
            "journal_volume": "07"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/756769"
        },
        "raw_refs": [
          {
            "value": "[3] J. Brannick, R. C. Brower, M. A. Clark, J. C. Osborn, and C. Rebbi. Adaptive Multigrid Algorithm for Lattice QCD. Phys. Rev. Lett., 100:041601, 2008.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Adaptive Multigrid Algorithm for Lattice QCD"
          ],
          "label": "3",
          "authors": [
            {
              "full_name": "Brannick, J."
            },
            {
              "full_name": "Brower, R.C."
            },
            {
              "full_name": "Clark, M.A."
            },
            {
              "full_name": "Osborn, J.C."
            },
            {
              "full_name": "Rebbi, C."
            }
          ],
          "publication_info": {
            "year": 2008,
            "artid": "041601",
            "journal_title": "Phys.Rev.Lett.",
            "journal_volume": "100"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[4] J. Brannick, R. C. Brower, M. A. Clark, J. C. Osborn, and C. Rebbi. Adaptive Multigrid Algorithm for the QCD Dirac-Wilson Operator. PoS, LATTICE2007:029, 2007.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Adaptive Multigrid Algorithm for the QCD Dirac-Wilson Operator. PoS, LATTICE:029"
          ],
          "label": "4",
          "authors": [
            {
              "full_name": "Brannick, J."
            },
            {
              "full_name": "Brower, R.C."
            },
            {
              "full_name": "Clark, M.A."
            },
            {
              "full_name": "Osborn, J.C."
            },
            {
              "full_name": "Rebbi, C."
            }
          ],
          "publication_info": {
            "year": 2007
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[5] M. A. Clark, J. Brannick, R. C. Brower, S. F. McCormick, T. A. Manteuffel, J. C. Osborn, and C. Rebbi. The Removal of critical slowing down. PoS, LATTICE2008:035, 2008.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "The Removal of critical slowing down. PoS, LATTICE:035"
          ],
          "label": "5",
          "authors": [
            {
              "full_name": "Clark, M.A."
            },
            {
              "full_name": "Brannick, J."
            },
            {
              "full_name": "Brower, R.C."
            },
            {
              "full_name": "McCormick, S.F."
            },
            {
              "full_name": "Manteuffel, T.A."
            },
            {
              "full_name": "Osborn, J.C."
            },
            {
              "full_name": "Rebbi, C."
            }
          ],
          "publication_info": {
            "year": 2008
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[6] Ronald Babich, James Brannick, Richard C. Brower, Michael A. Clark, Saul D. Cohen, James C. Osborn, and Claudio Rebbi. The Role of multigrid algorithms for LQCD. PoS, LAT2009:031, 2009.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Ronald Babich, James Brannick",
            "and Claudio Rebbi. The Role of multigrid algorithms for LQCD. PoS, LAT:031"
          ],
          "label": "6",
          "authors": [
            {
              "full_name": "Brower, Richard C."
            },
            {
              "full_name": "Clark, Michael A."
            },
            {
              "full_name": "Cohen, Saul D."
            },
            {
              "full_name": "Osborn, James C."
            }
          ],
          "publication_info": {
            "year": 2009
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[7] J. C. Osborn, R. Babich, J. Brannick, R. C. Brower, M. A. Clark, S. D. Cohen, and C. Rebbi. Multigrid solver for clover fermions. PoS, LATTICE2010:037, 2010.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Multigrid solver for clover fermions. PoS, LATTICE:037"
          ],
          "label": "7",
          "authors": [
            {
              "full_name": "Osborn, J.C."
            },
            {
              "full_name": "Babich, R."
            },
            {
              "full_name": "Brannick, J."
            },
            {
              "full_name": "Brower, R.C."
            },
            {
              "full_name": "Clark, M.A."
            },
            {
              "full_name": "Cohen, S.D."
            },
            {
              "full_name": "Rebbi, C."
            }
          ],
          "publication_info": {
            "year": 2010
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[8] A. Frommer, K. Kahl, S. Krieg, B. Leder, and M. Rottmann. An adaptive aggregation based domain decomposition multilevel method for the lattice wilson dirac operator: multilevel results. 7 2013.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "An adaptive aggregation based domain decomposition multilevel method for the lattice wilson dirac operator: multilevel results. 7"
          ],
          "label": "8",
          "authors": [
            {
              "full_name": "Frommer, A."
            },
            {
              "full_name": "Kahl, K."
            },
            {
              "full_name": "Krieg, S."
            },
            {
              "full_name": "Leder, B."
            },
            {
              "full_name": "Rottmann, M."
            }
          ],
          "publication_info": {
            "year": 2013
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1222681"
        },
        "raw_refs": [
          {
            "value": "[9] Andreas Frommer, Karsten Kahl, Stefan Krieg, Björn Leder, and Matthias Rottmann. Adaptive Aggregation-Based Domain Decomposition Multigrid for the Lattice Wilson– Dirac Operator. SIAM J. Sci. Comput., 36(4):A1581–A1608, 2014.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Andreas Frommer, Karsten Kahl, Stefan Krieg, Björn Leder, and Matthias Rottmann. Adaptive Aggregation-Based Domain Decomposition Multigrid for the Lattice Wilson- Dirac Operator"
          ],
          "label": "9",
          "publication_info": {
            "year": 2014,
            "page_end": "A1608",
            "page_start": "A1581",
            "journal_title": "SIAM J.Sci.Comput.",
            "journal_volume": "36"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[10] Andreas Frommer, Karsten Kahl, Stefan Krieg, Bjorn Leder, and Matthias Rottmann. Aggregation-based Multilevel Methods for Lattice QCD. PoS, LATTICE2011:046, 2011.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Andreas Frommer, Karsten Kahl, Stefan Krieg, Bjorn Leder, and Matthias Rottmann. Aggregation-based Multilevel Methods for Lattice QCD. PoS, LATTICE:046"
          ],
          "label": "10",
          "publication_info": {
            "year": 2011
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1490670"
        },
        "raw_refs": [
          {
            "value": "[11] Constantia Alexandrou, Simone Bacchio, Jacob Finkenrath, Andreas Frommer, Karsten Kahl, and Matthias Rottmann. Adaptive Aggregation-based Domain Decomposition Multigrid for Twisted Mass Fermions. Phys. Rev. D, 94(11):114509, 2016.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Constantia Alexandrou, Simone Bacchio, Jacob Finkenrath, Andreas Frommer, Karsten Kahl, and Matthias Rottmann. Adaptive Aggregation-based Domain Decomposition Multigrid for Twisted Mass Fermions"
          ],
          "label": "11",
          "publication_info": {
            "year": 2016,
            "artid": "114509",
            "journal_title": "Phys.Rev.D",
            "journal_volume": "94"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1674690"
        },
        "raw_refs": [
          {
            "value": "[12] Constantia Alexandrou, Simone Bacchio, and Jacob Finkenrath. Multigrid approach in shifted linear systems for the non-degenerated twisted mass operator. Comput. Phys. Commun., 236:51–64, 2019.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Constantia Alexandrou, Simone Bacchio, and Jacob Finkenrath. Multigrid approach in shifted linear systems for the non-degenerated twisted mass operator"
          ],
          "label": "12",
          "publication_info": {
            "year": 2019,
            "page_end": "64",
            "page_start": "51",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "236"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[13] Simone G. Bacchio. Simulating maximally twisted fermions at the physical pointa with multigrid methods. PhD thesis, Cyprus U., Wuppertal U., 2019.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Simulating maximally twisted fermions at the physical pointa with multigrid methods. PhD thesis, Cyprus U., Wuppertal U"
          ],
          "label": "13",
          "authors": [
            {
              "full_name": "Bacchio, Simone G."
            }
          ],
          "publication_info": {
            "year": 2019
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[14] Evan S Weinberg, Richard C. Brower, Kate Clark, and Alexei Strelchenko. Progress Report on Staggered Multigrid. PoS, LATTICE2016:273, 2017.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Kate Clark, and Alexei Strelchenko. Progress Report on Staggered Multigrid. PoS, LATTICE:273, 2017"
          ],
          "label": "14",
          "authors": [
            {
              "full_name": "Weinberg, Evan S."
            },
            {
              "full_name": "Brower, Richard C."
            }
          ],
          "publication_info": {
            "year": 2016
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1650136"
        },
        "raw_refs": [
          {
            "value": "[15] Richard C. Brower, M. A. Clark, Alexei Strelchenko, and Evan Weinberg. Multigrid algorithm for staggered lattice fermions. Phys. Rev. D, 97(11):114513, 2018.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Alexei Strelchenko, and Evan Weinberg. Multigrid algorithm for staggered lattice fermions"
          ],
          "label": "15",
          "authors": [
            {
              "full_name": "Brower, Richard C."
            },
            {
              "full_name": "Clark, M.A."
            }
          ],
          "publication_info": {
            "year": 2018,
            "artid": "114513",
            "journal_title": "Phys.Rev.D",
            "journal_volume": "97"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[16] Venkitesh Ayyar, Richard C. Brower, M. A. Clark, Mathias Wagner, and Evan Weinberg. Optimizing Staggered Multigrid for Exascale performance. PoS, LATTICE2022:335, 2023.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Venkitesh Ayyar",
            "Mathias Wagner, and Evan Weinberg. Optimizing Staggered Multigrid for Exascale performance. PoS, LATTICE:335, 2023"
          ],
          "label": "16",
          "authors": [
            {
              "full_name": "Brower, Richard C."
            },
            {
              "full_name": "Clark, M.A."
            }
          ],
          "publication_info": {
            "year": 2022
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[17] Saul D. Cohen, R. C. Brower, M. A. Clark, and J. C. Osborn. Multigrid Algorithms for Domain-Wall Fermions. PoS, LATTICE2011:030, 2011.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Multigrid Algorithms for Domain-Wall Fermions. PoS, LATTICE:030"
          ],
          "label": "17",
          "authors": [
            {
              "full_name": "Cohen, Saul D."
            },
            {
              "full_name": "Brower, R.C."
            },
            {
              "full_name": "Clark, M.A."
            },
            {
              "full_name": "Osborn, J.C."
            }
          ],
          "publication_info": {
            "year": 2011
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[18] P A Boyle. Hierarchically deflated conjugate gradient. 2 2014.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Hierarchically deflated conjugate gradient. 2"
          ],
          "label": "18",
          "authors": [
            {
              "full_name": "Boyle, P.A."
            }
          ],
          "publication_info": {
            "year": 2014
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[19] Azusa Yamaguchi and Peter Boyle. Hierarchically deflated conjugate residual. PoS, LATTICE2016:374, 2016.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Azusa Yamaguchi and Peter Boyle. Hierarchically deflated conjugate residual. PoS, LATTICE:374"
          ],
          "label": "19",
          "publication_info": {
            "year": 2016
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[20] Peter Boyle and Azusa Yamaguchi. Comparison of Domain Wall Fermion Multigrid Methods. 3 2021.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Peter Boyle and Azusa Yamaguchi. Comparison of Domain Wall Fermion Multigrid Methods. 3"
          ],
          "label": "20",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1791487"
        },
        "raw_refs": [
          {
            "value": "[21] Richard C. Brower, M. A. Clark, Dean Howarth, and Evan S. Weinberg. Multigrid for chiral lattice fermions: Domain wall. Phys. Rev. D, 102(9):094517, 2020.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Dean Howarth, and Evan S. Weinberg. Multigrid for chiral lattice fermions: Domain wall"
          ],
          "label": "21",
          "authors": [
            {
              "full_name": "Brower, Richard C."
            },
            {
              "full_name": "Clark, M.A."
            }
          ],
          "publication_info": {
            "year": 2020,
            "artid": "094517",
            "journal_title": "Phys.Rev.D",
            "journal_volume": "102"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[22] Noël M. Nachtigal, Satish C. Reddy, and Lloyd N. Trefethen. How fast are nonsymmetric matrix iterations? SIAM Journal on Matrix Analysis and Applications, 13(3):778–795, 1992.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Noël",
            "How fast are nonsymmetric matrix iterations?",
            "Journal on Matrix"
          ],
          "label": "22",
          "authors": [
            {
              "full_name": "Nachtigal, M."
            },
            {
              "full_name": "Reddy, Satish C."
            },
            {
              "full_name": "Trefethen, Lloyd N."
            }
          ],
          "imprint": {
            "publisher": "SIAM"
          },
          "publication_info": {
            "year": 1992,
            "page_end": "795",
            "page_start": "778",
            "journal_title": "Anal.Appl.",
            "journal_volume": "13"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1631646"
        },
        "raw_refs": [
          {
            "value": "[23] M. A. Clark, Chulwoo Jung, and Christoph Lehner. Multi-Grid Lanczos. EPJ Web Conf., 175:14023, 2018.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Chulwoo Jung, and Christoph Lehner. Multi-Grid Lanczos"
          ],
          "label": "23",
          "authors": [
            {
              "full_name": "Clark, M.A."
            }
          ],
          "publication_info": {
            "year": 2018,
            "artid": "14023",
            "journal_title": "EPJ Web Conf.",
            "journal_volume": "175"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1279923"
        },
        "raw_refs": [
          {
            "value": "[24] Eigo Shintani, Rudy Arthur, Thomas Blum, Taku Izubuchi, Chulwoo Jung, and Christoph Lehner. Covariant approximation averaging. Phys. Rev. D, 91(11):114511, 2015.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Eigo Shintani, Rudy Arthur, Thomas Blum, Taku Izubuchi, Chulwoo Jung, and Christoph Lehner. Covariant approximation averaging"
          ],
          "label": "24",
          "publication_info": {
            "year": 2015,
            "artid": "114511",
            "journal_title": "Phys.Rev.D",
            "journal_volume": "91"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/834740"
        },
        "raw_refs": [
          {
            "value": "[25] Gunnar S. Bali, Sara Collins, and Andreas Schafer. Effective noise reduction techniques for disconnected loops in Lattice QCD. Comput. Phys. Commun., 181:1570–1583, 2010.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Sara Collins, and Andreas Schafer. Effective noise reduction techniques for disconnected loops in Lattice QCD"
          ],
          "label": "25",
          "authors": [
            {
              "full_name": "Bali, Gunnar S."
            }
          ],
          "publication_info": {
            "year": 2010,
            "page_end": "1583",
            "page_start": "1570",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "181"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/682997"
        },
        "raw_refs": [
          {
            "value": "[26] Justin Foley, K. Jimmy Juge, Alan O’Cais, Mike Peardon, Sinead M. Ryan, and Jon-Ivar Skullerud. Practical all-to-all propagators for lattice QCD. Comput. Phys. Commun., 172:145–162, 2005.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Justin",
            "Jimmy Juge, Alan O’Cais, Mike Peardon, Sinead M. Ryan",
            "and Jon-Ivar Skullerud. Practical all-to-all propagators for lattice QCD"
          ],
          "label": "26",
          "authors": [
            {
              "full_name": "Foley, K."
            }
          ],
          "publication_info": {
            "year": 2005,
            "page_end": "162",
            "page_start": "145",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "172"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1329971"
        },
        "raw_refs": [
          {
            "value": "[27] T. Blum et al. Domain wall QCD with physical quark masses. Phys. Rev. D, 93(7):074505, 2016.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Domain wall QCD with physical quark masses"
          ],
          "label": "27",
          "authors": [
            {
              "full_name": "Blum, T."
            }
          ],
          "publication_info": {
            "year": 2016,
            "artid": "074505",
            "journal_title": "Phys.Rev.D",
            "journal_volume": "93"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/353115"
        },
        "raw_refs": [
          {
            "value": "[28] Yigal Shamir. Chiral fermions from lattice boundaries. Nucl. Phys. B, 406:90–106, 1993.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Yigal Shamir. Chiral fermions from lattice boundaries"
          ],
          "label": "28",
          "publication_info": {
            "year": 1993,
            "page_end": "106",
            "page_start": "90",
            "journal_title": "Nucl.Phys.B",
            "journal_volume": "406"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/37941"
        },
        "raw_refs": [
          {
            "value": "[29] Vadim Furman and Yigal Shamir. Axial symmetries in lattice QCD with Kaplan fermions. Nucl. Phys. B, 439:54–78, 1995.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Vadim Furman and Yigal Shamir. Axial symmetries in lattice QCD with Kaplan fermions"
          ],
          "label": "29",
          "publication_info": {
            "year": 1995,
            "page_end": "78",
            "page_start": "54",
            "journal_title": "Nucl.Phys.B",
            "journal_volume": "439"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/659699"
        },
        "raw_refs": [
          {
            "value": "[30] Richard C. Brower, Hartmut Neff, and Kostas Orginos. Mobius fermions: Improved domain wall chiral fermions. Nucl. Phys. B Proc. Suppl., 140:686–688, 2005.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Hartmut Neff, and Kostas Orginos. Mobius fermions: Improved domain wall chiral fermions"
          ],
          "label": "30",
          "authors": [
            {
              "full_name": "Brower, Richard C."
            }
          ],
          "publication_info": {
            "year": 2005,
            "page_end": "688",
            "page_start": "686",
            "journal_title": "Nucl.Phys.B Proc.Suppl.",
            "journal_volume": "140"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/698111"
        },
        "raw_refs": [
          {
            "value": "[31] R. C. Brower, H. Neff, and K. Orginos. Mobius fermions. Nucl. Phys. B Proc. Suppl., 153:191–198, 2006.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Mobius fermions"
          ],
          "label": "31",
          "authors": [
            {
              "full_name": "Brower, R.C."
            },
            {
              "full_name": "Neff, H."
            },
            {
              "full_name": "Orginos, K."
            }
          ],
          "publication_info": {
            "year": 2006,
            "page_end": "198",
            "page_start": "191",
            "journal_title": "Nucl.Phys.B Proc.Suppl.",
            "journal_volume": "153"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1119406"
        },
        "raw_refs": [
          {
            "value": "[32] Richard C. Brower, Harmut Neff, and Kostas Orginos. The Möbius domain wall fermion algorithm. Comput. Phys. Commun., 220:1–19, 2017.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Harmut Neff, and Kostas Orginos. The Möbius domain wall fermion algorithm"
          ],
          "label": "32",
          "authors": [
            {
              "full_name": "Brower, Richard C."
            }
          ],
          "publication_info": {
            "year": 2017,
            "page_end": "19",
            "page_start": "1",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "220"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[33] Peter A. Boyle, Guido Cossu, Azusa Yamaguchi, and Antonin Portelli. Grid: A next generation data parallel C++ QCD library. PoS, LATTICE2015:023, 2016.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Guido Cossu, Azusa Yamaguchi, and Antonin Portelli. Grid: A next generation data parallel C++ QCD library. PoS, LATTICE:023, 2016"
          ],
          "label": "33",
          "authors": [
            {
              "full_name": "Boyle, Peter A."
            }
          ],
          "publication_info": {
            "year": 2015
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[34] Azusa Yamaguchi, Peter Boyle, Guido Cossu, Gianluca Filaci, Christoph Lehner, and Antonin Portelli. Grid: OneCode and FourAPIs. PoS, LATTICE2021:035, 2022.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Azusa Yamaguchi, Peter Boyle, Guido Cossu, Gianluca Filaci, Christoph Lehner, and Antonin Portelli. Grid: OneCode and FourAPIs. PoS, LATTICE:035, 2022"
          ],
          "label": "34",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1189318"
        },
        "raw_refs": [
          {
            "value": "[35] Peter A. Boyle. The BAGEL assembler generation library. Comput. Phys. Commun., 180:2739–2748, 2009.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "The BAGEL assembler generation library"
          ],
          "label": "35",
          "authors": [
            {
              "full_name": "Boyle, Peter A."
            }
          ],
          "publication_info": {
            "year": 2009,
            "page_end": "2748",
            "page_start": "2739",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "180"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[36] P. A. Boyle. The BlueGene/Q supercomputer. PoS, LATTICE2012:020, 2012.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "The BlueGene/Q supercomputer. PoS, LATTICE:020"
          ],
          "label": "36",
          "authors": [
            {
              "full_name": "Boyle, P.A."
            }
          ],
          "publication_info": {
            "year": 2012
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[37] Swagato Mukherjee, Olaf Kaczmarek, Christian Schmidt, Patrick Steinbrecher, and Mathias Wagner. HISQ inverter on Intel® Xeon PhiTM and NVIDIA® GPUs. PoS, LATTICE2014:044, 2015.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Swagato Mukherjee, Olaf Kaczmarek, Christian Schmidt, Patrick Steinbrecher, and Mathias Wagner. HISQ inverter on Intel® Xeon PhiTM and NVIDIA® GPUs. PoS, LATTICE:044, 2015"
          ],
          "label": "37",
          "publication_info": {
            "year": 2014
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[38] O. Kaczmarek, C. Schmidt, P. Steinbrecher, and M. Wagner. Conjugate gradient solvers on Intel Xeon Phi and NVIDIA GPUs. In GPU Computing in High-Energy Physics, pages 157–162, 2015.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Conjugate gradient solvers on Intel Xeon Phi and NVIDIA GPUs. In GPU Computing in High-Energy Physics, pages 157-162"
          ],
          "label": "38",
          "authors": [
            {
              "full_name": "Kaczmarek, O."
            },
            {
              "full_name": "Schmidt, C."
            },
            {
              "full_name": "Steinbrecher, P."
            },
            {
              "full_name": "Wagner, M."
            }
          ],
          "publication_info": {
            "year": 2015
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1632766"
        },
        "raw_refs": [
          {
            "value": "[39] M. A. Clark, Alexei Strelchenko, Alejandro Vaquero, Mathias Wagner, and Evan Weinberg. Pushing Memory Bandwidth Limitations Through Efficient Implementations of Block-Krylov Space Solvers on GPUs. Comput. Phys. Commun., 233:29–40, 2018.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Alexei Strelchenko, Alejandro Vaquero, Mathias Wagner, and Evan Weinberg. Pushing Memory Bandwidth Limitations Through Efficient Implementations of Block-Krylov Space Solvers on GPUs"
          ],
          "label": "39",
          "authors": [
            {
              "full_name": "Clark, M.A."
            }
          ],
          "publication_info": {
            "year": 2018,
            "page_end": "40",
            "page_start": "29",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "233"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1685073"
        },
        "raw_refs": [
          {
            "value": "[40] Philippe de Forcrand and Liam Keegan. Rational hybrid Monte Carlo with block solvers and multiple pseudofermions. Phys. Rev. E, 98(4):043306, 2018.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Philippe de Forcrand and Liam Keegan. Rational hybrid Monte Carlo with block solvers and multiple pseudofermions"
          ],
          "label": "40",
          "publication_info": {
            "year": 2018,
            "artid": "043306",
            "journal_title": "Phys.Rev.E",
            "journal_volume": "98"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[41] J.M. Tang, Reinhard Nabben, C. Vuik, and Y.A. Erlangga. Comparison of two-level preconditioners derived from deflation, domain decomposition and multigrid methods. Journal of Scientific Computing, 39 (3), 2009, 39, 06 2009.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Reinhard Nabben, C",
            "Vuik, and Y.A. Erlangga. Comparison of two-level preconditioners derived from deflation, domain decomposition and multigrid methods",
            "06"
          ],
          "label": "41",
          "authors": [
            {
              "full_name": "Tang, J.M."
            }
          ],
          "publication_info": {
            "year": 2009,
            "artid": "39",
            "page_start": "39",
            "journal_title": "J.Sci.Comput.",
            "journal_volume": "39"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[42] M. Brezina, R. Falgout, S. MacLachlan, T. Manteuffel, S. McCormick, and J. Ruge. Adaptive smoothed aggregation (α sa) multigrid. SIAM Review, 47(2):317–346, 2005.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Adaptive smoothed aggregation (α sa) multigrid"
          ],
          "label": "42",
          "authors": [
            {
              "full_name": "Brezina, M."
            },
            {
              "full_name": "Falgout, R."
            },
            {
              "full_name": "MacLachlan, S."
            },
            {
              "full_name": "Manteuffel, T."
            },
            {
              "full_name": "McCormick, S."
            },
            {
              "full_name": "Ruge, J."
            }
          ],
          "publication_info": {
            "year": 2005,
            "page_end": "346",
            "page_start": "317",
            "journal_title": "SIAM Rev.",
            "journal_volume": "47"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[43] Gene H. Golub and Qiang Ye. Inexact preconditioned conjugate gradient method with inner-outer iteration. SIAM Journal on Scientific Computing, 21(4):1305–1320, 1999.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "and Qiang Ye. Inexact preconditioned conjugate gradient method with inner-outer iteration"
          ],
          "label": "43",
          "authors": [
            {
              "full_name": "Golub, Gene H."
            }
          ],
          "publication_info": {
            "year": 1999,
            "page_end": "1320",
            "page_start": "1305",
            "journal_title": "SIAM J.Sci.Comput.",
            "journal_volume": "21"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[44] Yvan Notay. Flexible conjugate gradients. SIAM J. Sci. Comput., 22:1444–1460, 2000.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Yvan Notay. Flexible conjugate gradients"
          ],
          "label": "44",
          "publication_info": {
            "year": 2000,
            "page_end": "1460",
            "page_start": "1444",
            "journal_title": "SIAM J.Sci.Comput.",
            "journal_volume": "22"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[45] Yong-Chull Jang and Chulwoo Jung. Split Grid and Block Lanczos Algorithm for Efficient Eigenpair Generation. PoS, LATTICE2018:309, 2019.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Yong-Chull Jang and Chulwoo Jung. Split Grid and Block Lanczos Algorithm for Efficient Eigenpair Generation. PoS, LATTICE:309, 2019"
          ],
          "label": "45",
          "publication_info": {
            "year": 2018
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[46] Dianne P. O’Leary. Yet another polynomial preconditioner for the conjugate gradient algorithm. Linear Algebra and its Applications, 154-156:377–388, 1991.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Yet another polynomial preconditioner for the conjugate gradient algorithm"
          ],
          "label": "46",
          "authors": [
            {
              "full_name": "O'Leary, Dianne P."
            }
          ],
          "publication_info": {
            "year": 1991,
            "page_end": "388",
            "page_start": "377",
            "journal_title": "Linear Algebra Appl.",
            "journal_volume": "154"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[47] Christian Schaerer, Daniel Szyld, and Pedro Torres. A posteriori superlinear convergence bounds for block conjugate gradient. ETNA - Electronic Transactions on Numerical Analysis, 58:115–135, 01 2022.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Christian Schaerer, Daniel Szyld, and Pedro",
            "posteriori superlinear convergence bounds for block conjugate gradient. ETNA -",
            "01"
          ],
          "label": "47",
          "authors": [
            {
              "full_name": "A, Torres."
            }
          ],
          "publication_info": {
            "year": 2022,
            "page_end": "135",
            "page_start": "115",
            "journal_title": "Elec.Trans.Numer.Anal.",
            "journal_volume": "58"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[48] Augustin A. Dubrulle. Retooling the method of block conjugate gradients. 2001.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Retooling the method of block conjugate gradients"
          ],
          "label": "48",
          "authors": [
            {
              "full_name": "Dubrulle, Augustin A."
            }
          ],
          "publication_info": {
            "year": 2001
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[49] Birk, Sebastian. Deflated shifted block Krylov subspace methods for Hermitian positive definite matrices. electronic; online, 2015.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Birk, Sebastian. Deflated shifted block Krylov subspace methods for Hermitian positive definite matrices. electronic"
          ],
          "label": "49"
        }
      },
      {
        "raw_refs": [
          {
            "value": "[49] Birk, Sebastian. Deflated shifted block Krylov subspace methods for Hermitian positive definite matrices. electronic; online, 2015.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "online"
          ],
          "label": "49",
          "publication_info": {
            "year": 2015
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[50] Yousef Saad. Iterative methods for sparse linear systems. SIAM, 2003.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Yousef Saad. Iterative methods for sparse linear systems"
          ],
          "label": "50",
          "imprint": {
            "publisher": "SIAM"
          },
          "publication_info": {
            "year": 2003
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[51] P. Concus, G. H. Golub, and D. P. O’Leary. A generalized conjugate gradient method for the numerical solution of elliptic partial differential equations. In J. R. Bunch and D. J. Rose, editors, Sparse Matrix Computations. New York, NY, USA, 1976.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "A generalized conjugate gradient method for the numerical solution of elliptic partial differential equations. In J. R. Bunch and D. J. Rose (eds.)",
            "Sparse Matrix Computations. New York, NY, USA"
          ],
          "label": "51",
          "authors": [
            {
              "full_name": "Concus, P."
            },
            {
              "full_name": "Golub, G.H."
            },
            {
              "full_name": "O'Leary, D.P."
            }
          ],
          "publication_info": {
            "year": 1976
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[52] M. R. Hestenes and E. Stiefel. Methods of conjugate gradients for solving linear systems. 49:409–436, 1952.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Methods of conjugate gradients for solving linear systems. 49:409-436"
          ],
          "label": "52",
          "authors": [
            {
              "full_name": "Hestenes, M.R."
            },
            {
              "full_name": "Stiefel, E."
            }
          ],
          "publication_info": {
            "year": 1952
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[53] Rudy Arthur. Non-perturbative renormalization and low mode averaging with domain wall fermions. PhD thesis, Edinburgh University, 2012.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Rudy Arthur. Non-perturbative renormalization and low mode averaging with domain wall fermions. PhD thesis, Edinburgh University"
          ],
          "label": "53",
          "publication_info": {
            "year": 2012
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[54] D. C. Sorensen. Implicit application of polynomial filters in a k-step arnoldi method. SIAM Journal on Matrix Analysis and Applications, 13(1):357–385, 1992.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Implicit application of polynomial filters in a k-step arnoldi method",
            "Journal on Matrix"
          ],
          "label": "54",
          "authors": [
            {
              "full_name": "Sorensen, D.C."
            }
          ],
          "imprint": {
            "publisher": "SIAM"
          },
          "publication_info": {
            "year": 1992,
            "page_end": "385",
            "page_start": "357",
            "journal_title": "Anal.Appl.",
            "journal_volume": "13"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2625168"
        },
        "raw_refs": [
          {
            "value": "[55] T. Blum et al. Update of Euclidean windows of the hadronic vacuum polarization. Phys. Rev. D, 108(5):054507, 2023.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Update of Euclidean windows of the hadronic vacuum polarization"
          ],
          "label": "55",
          "authors": [
            {
              "full_name": "Blum, T."
            }
          ],
          "publication_info": {
            "year": 2023,
            "artid": "054507",
            "journal_title": "Phys.Rev.D",
            "journal_volume": "108"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/724490"
        },
        "raw_refs": [
          {
            "value": "[56] M. A. Clark and A. D. Kennedy. Accelerating dynamical fermion computations using the rational hybrid Monte Carlo (RHMC) algorithm with multiple pseudofermion fields. Phys. Rev. Lett., 98:051601, 2007.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Accelerating dynamical fermion computations using the rational hybrid Monte Carlo (RHMC) algorithm with multiple pseudofermion fields"
          ],
          "label": "56",
          "authors": [
            {
              "full_name": "Clark, M.A."
            },
            {
              "full_name": "Kennedy, A.D."
            }
          ],
          "publication_info": {
            "year": 2007,
            "artid": "051601",
            "journal_title": "Phys.Rev.Lett.",
            "journal_volume": "98"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1764323"
        },
        "raw_refs": [
          {
            "value": "[57] Anthony Francis, Patrick Fritzsch, Martin Lüscher, and Antonio Rago. Master-field simulations of O(a)-improved lattice QCD: Algorithms, stability and exactness. Comput. Phys. Commun., 255:107355, 2020.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Anthony Francis, Patrick Fritzsch, Martin Lüscher, and Antonio Rago. Master-field simulations of O(a)-improved lattice QCD: Algorithms, stability and exactness"
          ],
          "label": "57",
          "publication_info": {
            "year": 2020,
            "artid": "107355",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "255"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/399281"
        },
        "raw_refs": [
          {
            "value": "[58] R. C. Brower, T. Ivanenko, A. R. Levi, and K. N. Orginos. Chronological inversion method for the Dirac matrix in hybrid Monte Carlo. Nucl. Phys. B, 484:353–374, 1997.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Chronological inversion method for the Dirac matrix in hybrid Monte Carlo"
          ],
          "label": "58",
          "authors": [
            {
              "full_name": "Brower, R.C."
            },
            {
              "full_name": "Ivanenko, T."
            },
            {
              "full_name": "Levi, A.R."
            },
            {
              "full_name": "Orginos, K.N."
            }
          ],
          "publication_info": {
            "year": 1997,
            "page_end": "374",
            "page_start": "353",
            "journal_title": "Nucl.Phys.B",
            "journal_volume": "484"
          }
        }
      }
    ],
    "public_notes": [
      {
        "value": "33 pages",
        "source": "arXiv"
      }
    ],
    "arxiv_eprints": [
      {
        "value": "2409.03904",
        "categories": [
          "hep-lat",
          "cs.DC",
          "cs.NA",
          "math.NA"
        ]
      }
    ],
    "document_type": [
      "article"
    ],
    "preprint_date": "2024-09-05",
    "control_number": 2825799,
    "number_of_pages": 33,
    "inspire_categories": [
      {
        "term": "Lattice",
        "source": "arxiv"
      },
      {
        "term": "Computing",
        "source": "arxiv"
      },
      {
        "term": "Math and Math Physics",
        "source": "arxiv"
      }
    ]
  },
  "id": "2825799",
  "uuid": "9e14057a-2bfb-4b0a-bebf-e6ced53a637a",
  "created": "2024-09-09T04:09:53.460227+00:00"
}