{
  "revision_id": 24,
  "uuid": "f2dd64d2-a538-4c5e-b05a-d552c34a191f",
  "created": "2023-06-21T02:37:42.265601+00:00",
  "links": {
    "bibtex": "https://inspirehep.net/api/literature/2669845?format=bibtex",
    "latex-eu": "https://inspirehep.net/api/literature/2669845?format=latex-eu",
    "latex-us": "https://inspirehep.net/api/literature/2669845?format=latex-us",
    "json": "https://inspirehep.net/api/literature/2669845?format=json",
    "json-expanded": "https://inspirehep.net/api/literature/2669845?format=json-expanded",
    "cv": "https://inspirehep.net/api/literature/2669845?format=cv",
    "citations": "https://inspirehep.net/api/literature/?q=refersto%3Arecid%3A2669845"
  },
  "id": "2669845",
  "updated": "2024-12-02T11:52:02.777857+00:00",
  "metadata": {
    "citation_count_without_self_citations": 7,
    "publication_info": [
      {
        "year": 2024,
        "artid": "114007",
        "material": "publication",
        "journal_title": "J.Phys.Soc.Jap.",
        "journal_volume": "93",
        "pubinfo_freetext": "J. Phys. Soc. Jpn. 93, 114007 (2024"
      }
    ],
    "citation_count": 9,
    "core": true,
    "dois": [
      {
        "value": "10.7566/JPSJ.93.114007",
        "source": "arXiv",
        "material": "publication"
      }
    ],
    "titles": [
      {
        "title": "Self-learning Monte Carlo with equivariant Transformer",
        "source": "arXiv"
      }
    ],
    "$schema": "https://inspirehep.net/schemas/records/hep.json",
    "authors": [
      {
        "uuid": "af82f25d-4e86-446e-af68-18d267a3529d",
        "record": {
          "$ref": "https://inspirehep.net/api/authors/2610859"
        },
        "full_name": "Nagai, Yuki",
        "affiliations": [
          {
            "value": "JAERI, Tokai",
            "record": {
              "$ref": "https://inspirehep.net/api/institutions/903713"
            }
          }
        ],
        "raw_affiliations": [
          {
            "value": "CCSE, Japan Atomic Energy Agency, 178-4-4, Wakashiba, Kashiwa, Chiba 277-0871, Japan"
          }
        ]
      },
      {
        "uuid": "9bced88c-69d6-45c5-9443-d264ac54631a",
        "record": {
          "$ref": "https://inspirehep.net/api/authors/1394499"
        },
        "full_name": "Tomiya, Akio",
        "affiliations": [
          {
            "value": "Osaka Inst. Tech.",
            "record": {
              "$ref": "https://inspirehep.net/api/institutions/908067"
            }
          }
        ],
        "raw_affiliations": [
          {
            "value": "Faculty of Technology and Science, International Professional University of Technology, 3-3-1, Umeda, Kita-ku, Osaka, 530-0001, Osaka, Japan"
          }
        ]
      }
    ],
    "curated": true,
    "figures": [
      {
        "key": "cf4f6b89e23ac220b3e284fe6eae67e9",
        "url": "https://inspirehep.net/files/cf4f6b89e23ac220b3e284fe6eae67e9",
        "label": "fig:attention",
        "source": "arxiv",
        "caption": "(\\textit{Left}) Construction of effective Hamiltonian with the use of the equivariant Transformer with three of attention layers. Yellow blocks are defined by Eq. \\eqref{eq:normalization_in_transformer}. We call purple blocks the attention layers. (\\textit{Right}) Blue blocks are equivariant attention block (See main text).",
        "filename": "Fig1.png",
        "material": "preprint"
      },
      {
        "key": "0eb6c48d7ae3644ff758d69298d5420a",
        "url": "https://inspirehep.net/files/0eb6c48d7ae3644ff758d69298d5420a",
        "label": "fig:SLMCresults",
        "source": "arxiv",
        "caption": "Magnitude of average magnetization and staggered magnetization for a two-dimensional system with $6 \\times 6 = 36$ lattice sites. For each temperature, we generate $2 \\times 10^5$ samples using exact diagonalization (red circles), $5000$ samples using SLMC with the linear model (green triangles) and the effective model with 3layer attention (blue squares).",
        "filename": "ms_N6_SLMC_MMS.png",
        "material": "preprint"
      },
      {
        "key": "72901384d69b71f44f6ea4181411e43c",
        "url": "https://inspirehep.net/files/72901384d69b71f44f6ea4181411e43c",
        "label": "fig:SLMCautocorrelation",
        "source": "arxiv",
        "caption": "(Color online) Autocorrelation for magnitude of the staggered magnetization for a two-dimensional system with $6 \\times 6 = 36$ lattice sites at $T = 0.01t$.",
        "filename": "AC_T001.png",
        "material": "preprint"
      },
      {
        "key": "d95aaa379d987176ff0046b0f2793674",
        "url": "https://inspirehep.net/files/d95aaa379d987176ff0046b0f2793674",
        "label": "fig:layerdep",
        "source": "arxiv",
        "caption": "Acceptance ratio in the SLMC with the effective model. We only consider the nearest neighbors in the last layer ($m_J = 1$). A blue square indicate the acceptance ratio for the linear model. Red circles indicate models with attention blocks, with $L = 1, 2, 3, 4, 5, 6$ from the left.",
        "filename": "Fig3.png",
        "material": "preprint"
      },
      {
        "key": "d34e14fa8a0699326bf0b5ee019daa8c",
        "url": "https://inspirehep.net/files/d34e14fa8a0699326bf0b5ee019daa8c",
        "label": "fig:effresults",
        "source": "arxiv",
        "caption": "(Color online) Magnitude of average magnetization and staggered magnetization for a two-dimensional system with $6 \\times 6 = 36$ lattice sites without using the SLMC. For each temperature, we generate $2 \\times 10^5$ samples using exact diagonalization (red circles), the linear model (green triangles) and the effective model with 3layer attention (blue squares).",
        "filename": "ms_N6_eff_MMS.png",
        "material": "preprint"
      },
      {
        "key": "166b62cebc7fe902faa7eff9badba581",
        "url": "https://inspirehep.net/files/166b62cebc7fe902faa7eff9badba581",
        "label": "fig:scaling",
        "source": "arxiv",
        "caption": "Estimated MSE as a functional of the number of trainable parameters. We only consider the nearest neighbors in the last layer ($m_J = 1$). Blue square corresponds to the estimated MSE for the linear model. Red circles indicate different models with $L=1,2,3,4,5,6$ from the left. Only points for $L\\geq2$ are fitted and the point for the linear model is not included.",
        "filename": "Fig4.png",
        "material": "preprint"
      }
    ],
    "license": [
      {
        "url": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
        "license": "arXiv nonexclusive-distrib 1.0",
        "material": "preprint"
      }
    ],
    "texkeys": [
      "Nagai:2023fxt"
    ],
    "citeable": true,
    "keywords": [
      {
        "value": "model: linear",
        "schema": "INSPIRE"
      },
      {
        "value": "model: exchange",
        "schema": "INSPIRE"
      },
      {
        "value": "dimension: 2",
        "schema": "INSPIRE"
      },
      {
        "value": "Monte Carlo",
        "schema": "INSPIRE"
      },
      {
        "value": "acceptance",
        "schema": "INSPIRE"
      },
      {
        "value": "machine learning",
        "schema": "INSPIRE"
      },
      {
        "value": "lattice",
        "schema": "INSPIRE"
      },
      {
        "value": "scaling",
        "schema": "INSPIRE"
      },
      {
        "value": "quality",
        "schema": "INSPIRE"
      }
    ],
    "abstracts": [
      {
        "value": "Machine learning and deep learning have revolutionized computational physics, particularly the\nsimulation of complex systems. Equivariance is essential for simulating physical systems because\nit imposes a strong inductive bias on the probability distribution described by a machine learning\nmodel. However, imposing symmetry on the model can sometimes lead to poor acceptance rates in\nself-learning Monte Carlo (SLMC). Here, we introduce a symmetry equivariant attention mechanism\nfor SLMC, which can be systematically improved. We evaluate our architecture on a spin-fermion\nmodel (i.e., double exchange model) on a two-dimensional lattice. Our results show that the pro-\nposed method overcomes the poor acceptance rates of linear models and exhibits a similar scaling\nlaw to large language models, with model quality monotonically increasing with the number of\nlayers. Our work paves the way for the development of more accurate and efficient Monte Carlo\nalgorithms with machine learning for simulating complex physical systems",
        "source": "arXiv"
      }
    ],
    "references": [
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2733869"
        },
        "raw_refs": [
          {
            "value": "[1] J. Liu, Y. Qi, Z. Y. Meng, and L. Fu, Self-learning monte carlo method, Physical Review B 95, 10.1103/physrevb.95.041101 (2017).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1103/physrevb.95.041101"
          ],
          "misc": [
            "Self-learning monte carlo method, Physical Review B",
            "95"
          ],
          "label": "1",
          "authors": [
            {
              "full_name": "Liu, J."
            },
            {
              "full_name": "Qi, Y."
            },
            {
              "full_name": "Meng, Z.Y."
            },
            {
              "full_name": "Fu, L."
            }
          ],
          "publication_info": {
            "year": 2017,
            "artid": "041101",
            "journal_title": "Phys.Rev.B",
            "journal_volume": "95"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1603904"
        },
        "raw_refs": [
          {
            "value": "[2] J. Liu, H. Shen, Y. Qi, Z. Y. Meng, and L. Fu, Selflearning monte carlo method and cumulative update in fermion systems, Phys. Rev. B 95, 241104 (2017).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Selflearning monte carlo method and cumulative update in fermion systems"
          ],
          "label": "2",
          "authors": [
            {
              "full_name": "Liu, J."
            },
            {
              "full_name": "Shen, H."
            },
            {
              "full_name": "Qi, Y."
            },
            {
              "full_name": "Meng, Z.Y."
            },
            {
              "full_name": "Fu, L."
            }
          ],
          "publication_info": {
            "year": 2017,
            "artid": "241104",
            "journal_title": "Phys.Rev.B",
            "journal_volume": "95"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2853739"
        },
        "raw_refs": [
          {
            "value": "[3] H. Shen, J. Liu, and L. Fu, Self-learning monte carlo with deep neural networks, Phys. Rev. B 97, 205140 (2018).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Self-learning monte carlo with deep neural networks"
          ],
          "label": "3",
          "authors": [
            {
              "full_name": "Shen, H."
            },
            {
              "full_name": "Liu, J."
            },
            {
              "full_name": "Fu, L."
            }
          ],
          "publication_info": {
            "year": 2018,
            "artid": "205140",
            "journal_title": "Phys.Rev.B",
            "journal_volume": "97"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[4] H. Kohshiro and Y. Nagai, Effective Ruderman– Kittel–Kasuya–Yosida-like interaction in diluted doubleexchange model: Self-learning monte carlo approach, J. Phys. Soc. Jpn. 90, 034711 (2021).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Effective Ruderman- Kittel-Kasuya-Yosida-like interaction in diluted doubleexchange model: Self-learning monte carlo approach"
          ],
          "label": "4",
          "authors": [
            {
              "full_name": "Kohshiro, H."
            },
            {
              "full_name": "Nagai, Y."
            }
          ],
          "publication_info": {
            "year": 2021,
            "artid": "034711",
            "journal_title": "J.Phys.Soc.Jap.",
            "journal_volume": "90"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[5] Y. Nagai, H. Shen, Y. Qi, J. Liu, and L. Fu, Self-learning monte carlo method: Continuous-time algorithm, Phys. Rev. B 96, 161102 (2017).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Self-learning monte carlo method: Continuous-time algorithm"
          ],
          "label": "5",
          "authors": [
            {
              "full_name": "Nagai, Y."
            },
            {
              "full_name": "Shen, H."
            },
            {
              "full_name": "Qi, Y."
            },
            {
              "full_name": "Liu, J."
            },
            {
              "full_name": "Fu, L."
            }
          ],
          "publication_info": {
            "year": 2017,
            "artid": "161102",
            "journal_title": "Phys.Rev.B",
            "journal_volume": "96"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[6] Y. Nagai, M. Okumura, K. Kobayashi, and M. Shiga, Self-learning hybrid monte carlo: A first-principles approach, Phys. Rev. B 102, 041124 (2020).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Self-learning hybrid monte carlo: A first-principles approach"
          ],
          "label": "6",
          "authors": [
            {
              "full_name": "Nagai, Y."
            },
            {
              "full_name": "Okumura, M."
            },
            {
              "full_name": "Kobayashi, K."
            },
            {
              "full_name": "Shiga, M."
            }
          ],
          "publication_info": {
            "year": 2020,
            "artid": "041124",
            "journal_title": "Phys.Rev.B",
            "journal_volume": "102"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1785003"
        },
        "raw_refs": [
          {
            "value": "[7] Y. Nagai, M. Okumura, and A. Tanaka, Self-learning monte carlo method with Behler-Parrinello neural networks, Phys. Rev. B 101, 115111 (2020).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Self-learning monte carlo method with Behler-Parrinello neural networks"
          ],
          "label": "7",
          "authors": [
            {
              "full_name": "Nagai, Y."
            },
            {
              "full_name": "Okumura, M."
            },
            {
              "full_name": "Tanaka, A."
            }
          ],
          "publication_info": {
            "year": 2020,
            "artid": "115111",
            "journal_title": "Phys.Rev.B",
            "journal_volume": "101"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[8] K. Kobayashi, Y. Nagai, M. Itakura, and M. Shiga, Self-learning hybrid monte carlo method for isothermalisobaric ensemble: Application to liquid silica, J. Chem. Phys. 155, 034106 (2021).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Self-learning hybrid monte carlo method for isothermalisobaric ensemble: Application to liquid silica"
          ],
          "label": "8",
          "authors": [
            {
              "full_name": "Kobayashi, K."
            },
            {
              "full_name": "Nagai, Y."
            },
            {
              "full_name": "Itakura, M."
            },
            {
              "full_name": "Shiga, M."
            }
          ],
          "publication_info": {
            "year": 2021,
            "artid": "034106",
            "journal_title": "J.Chem.Phys.",
            "journal_volume": "155"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1824408"
        },
        "raw_refs": [
          {
            "value": "[9] Y. Nagai, A. Tanaka, and A. Tomiya, Self-learning monte carlo for non-abelian gauge theory with dynamical fermions, Phys. Rev. D (2023).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Self-learning monte carlo for non-abelian gauge theory with dynamical fermions, Phys. Rev. D"
          ],
          "label": "9",
          "authors": [
            {
              "full_name": "Nagai, Y."
            },
            {
              "full_name": "Tanaka, A."
            },
            {
              "full_name": "Tomiya, A."
            }
          ],
          "publication_info": {
            "year": 2023
          }
        },
        "curated_relation": true
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1852799"
        },
        "raw_refs": [
          {
            "value": "[10] Y. Nagai and A. Tomiya, Gauge covariant neural network for 4 dimensional non-abelian gauge theory, (2021), arXiv:2103.11965 [hep-lat].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Gauge covariant neural network for 4 dimensional non-abelian gauge theory"
          ],
          "label": "10",
          "authors": [
            {
              "full_name": "Nagai, Y."
            },
            {
              "full_name": "Tomiya, A."
            }
          ],
          "arxiv_eprint": "2103.11965",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1731778"
        },
        "raw_refs": [
          {
            "value": "[11] M. Albergo, G. Kanwar, and P. Shanahan, Flow-based generative models for markov chain monte carlo in lattice field theory, Physical Review D 100, 10.1103/physrevd.100.034515 (2019).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1103/physrevd.100.034515"
          ],
          "misc": [
            "Flow-based generative models for markov chain monte carlo in lattice field theory, Physical Review D",
            "100"
          ],
          "label": "11",
          "authors": [
            {
              "full_name": "Albergo, M."
            },
            {
              "full_name": "Kanwar, G."
            },
            {
              "full_name": "Shanahan, P."
            }
          ],
          "publication_info": {
            "year": 2019
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1785309"
        },
        "raw_refs": [
          {
            "value": "[12] G. Kanwar, M. S. Albergo, D. Boyda, K. Cranmer, D. C. Hackett, S. Racanière, D. J. Rezende, and P. E. Shanahan, Equivariant flow-based sampling for lattice gauge theory, Physical Review Letters 125, 10.1103/physrevlett.125.121601 (2020).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1103/physrevlett.125.121601"
          ],
          "misc": [
            "S. Racanière, D. J. Rezende, and P. E. Shanahan",
            "Equivariant flow-based sampling for lattice gauge theory, Physical Review Letters 125"
          ],
          "label": "12",
          "authors": [
            {
              "full_name": "Kanwar, G."
            },
            {
              "full_name": "Albergo, M.S."
            },
            {
              "full_name": "Boyda, D."
            },
            {
              "full_name": "Cranmer, K."
            },
            {
              "full_name": "Hackett, D.C."
            }
          ],
          "publication_info": {
            "year": 2020
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1811378"
        },
        "raw_refs": [
          {
            "value": "[13] D. Boyda, G. Kanwar, S. Racanière, D. J. Rezende, M. S. Albergo, K. Cranmer, D. C. Hackett, and P. E. Shanahan, Sampling using SU(N) gauge equivariant flows, Physical Review D 103, 10.1103/physrevd.103.074504 (2021).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1103/physrevd.103.074504"
          ],
          "misc": [
            "S. Racanière, D. J. Rezende, M. S. Albergo, K. Cranmer, D. C. Hackett, and P. E. Shanahan",
            "Sampling using SU(N) gauge equivariant flows, Physical Review D",
            "103"
          ],
          "label": "13",
          "authors": [
            {
              "full_name": "Boyda, D."
            },
            {
              "full_name": "Kanwar, G."
            }
          ],
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1867962"
        },
        "raw_refs": [
          {
            "value": "[14] M. S. Albergo, G. Kanwar, S. Racanière, D. J. Rezende, J. M. Urban, D. Boyda, K. Cranmer, D. C. Hackett, and P. E. Shanahan, Flow-based sampling for fermionic lattice field theories, Physical Review D 104, 10.1103/physrevd.104.114507 (2021).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1103/physrevd.104.114507"
          ],
          "misc": [
            "S. Racanière, D. J. Rezende, J. M. Urban, D. Boyda, K. Cranmer, D. C. Hackett, and P. E. Shanahan",
            "Flow-based sampling for fermionic lattice field theories, Physical Review D",
            "104"
          ],
          "label": "14",
          "authors": [
            {
              "full_name": "Albergo, M.S."
            },
            {
              "full_name": "Kanwar, G."
            }
          ],
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1875561"
        },
        "raw_refs": [
          {
            "value": "[15] D. C. Hackett, C.-C. Hsieh, M. S. Albergo, D. Boyda, J.-W. Chen, K.-F. Chen, K. Cranmer, G. Kanwar, and P. E. Shanahan, Flow-based sampling for multimodal distributions in lattice field theory (2021), arXiv:2107.00734 [hep-lat].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Flow-based sampling for multimodal distributions in lattice field theory"
          ],
          "label": "15",
          "authors": [
            {
              "full_name": "Hackett, D.C."
            },
            {
              "full_name": "Hsieh, C.-C."
            },
            {
              "full_name": "Albergo, M.S."
            },
            {
              "full_name": "Boyda, D."
            },
            {
              "full_name": "Chen, J.-W."
            },
            {
              "full_name": "Chen, K.-F."
            },
            {
              "full_name": "Cranmer, K."
            },
            {
              "full_name": "Kanwar, G."
            },
            {
              "full_name": "Shanahan, P.E."
            }
          ],
          "arxiv_eprint": "2107.00734",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2037638"
        },
        "raw_refs": [
          {
            "value": "[16] M. S. Albergo, D. Boyda, K. Cranmer, D. C. Hackett, G. Kanwar, S. Racanière, D. J. Rezende, F. RomeroLópez, P. E. Shanahan, and J. M. Urban, Flow-based sampling in the lattice schwinger model at criticality, Physical Review D 106, 10.1103/physrevd.106.014514 (2022).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1103/physrevd.106.014514"
          ],
          "misc": [
            "S. Racanière, D. J. Rezende",
            "F. RomeroLópez, P. E. Shanahan, and J. M. Urban",
            "Flow-based sampling in the lattice schwinger model at criticality, Physical Review D",
            "106"
          ],
          "label": "16",
          "authors": [
            {
              "full_name": "Albergo, M.S."
            },
            {
              "full_name": "Boyda, D."
            },
            {
              "full_name": "Cranmer, K."
            },
            {
              "full_name": "Hackett, D.C."
            },
            {
              "full_name": "Kanwar, G."
            }
          ],
          "publication_info": {
            "year": 2022
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2116899"
        },
        "raw_refs": [
          {
            "value": "[17] R. Abbott, M. S. Albergo, D. Boyda, K. Cranmer, D. C. Hackett, G. Kanwar, S. Racanière, D. J. Rezende, F. Romero-López, P. E. Shanahan, B. Tian, and J. M. Urban, Gauge-equivariant flow models for sampling in lattice field theories with pseudofermions, Physical Review D 106, 10.1103/physrevd.106.074506 (2022).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1103/physrevd.106.074506"
          ],
          "misc": [
            "S. Racanière, D. J. Rezende",
            "F. Romero-López, P. E. Shanahan, B. Tian, and J. M. Urban",
            "Gauge-equivariant flow models for sampling in lattice field theories with pseudofermions, Physical Review D",
            "106"
          ],
          "label": "17",
          "authors": [
            {
              "full_name": "Abbott, R."
            },
            {
              "full_name": "Albergo, M.S."
            },
            {
              "full_name": "Boyda, D."
            },
            {
              "full_name": "Cranmer, K."
            },
            {
              "full_name": "Hackett, D.C."
            },
            {
              "full_name": "Kanwar, G."
            }
          ],
          "publication_info": {
            "year": 2022
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2133198"
        },
        "raw_refs": [
          {
            "value": "[18] R. Abbott, M. S. Albergo, A. Botev, D. Boyda, K. Cranmer, D. C. Hackett, G. Kanwar, A. G. D. G. Matthews, S. Racanière, A. Razavi, D. J. Rezende, F. RomeroLópez, P. E. Shanahan, and J. M. Urban, Sampling qcd field configurations with gauge-equivariant flow models (2022), arXiv:2208.03832 [hep-lat].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "S. Racanière, A. Razavi, D. J. Rezende",
            "F. RomeroLópez, P. E. Shanahan, and J. M. Urban",
            "Sampling qcd field configurations with gauge-equivariant flow models"
          ],
          "label": "18",
          "authors": [
            {
              "full_name": "Abbott, R."
            },
            {
              "full_name": "Albergo, M.S."
            },
            {
              "full_name": "Botev, A."
            },
            {
              "full_name": "Boyda, D."
            },
            {
              "full_name": "Cranmer, K."
            },
            {
              "full_name": "Hackett, D.C."
            },
            {
              "full_name": "Kanwar, G."
            },
            {
              "full_name": "Matthews, A.G.D.G."
            }
          ],
          "arxiv_eprint": "2208.03832",
          "publication_info": {
            "year": 2022
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2656559"
        },
        "raw_refs": [
          {
            "value": "[19] R. Abbott, M. S. Albergo, A. Botev, D. Boyda, K. Cranmer, D. C. Hackett, G. Kanwar, A. G. D. G. Matthews, S. Racanière, A. Razavi, D. J. Rezende, F. RomeroLópez, P. E. Shanahan, and J. M. Urban, Normalizing flows for lattice gauge theory in arbitrary space-time dimension (2023), arXiv:2305.02402 [hep-lat].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "S. Racanière, A. Razavi, D. J. Rezende",
            "F. RomeroLópez, P. E. Shanahan, and J. M. Urban",
            "Normalizing flows for lattice gauge theory in arbitrary space-time dimension"
          ],
          "label": "19",
          "authors": [
            {
              "full_name": "Abbott, R."
            },
            {
              "full_name": "Albergo, M.S."
            },
            {
              "full_name": "Botev, A."
            },
            {
              "full_name": "Boyda, D."
            },
            {
              "full_name": "Cranmer, K."
            },
            {
              "full_name": "Hackett, D.C."
            },
            {
              "full_name": "Kanwar, G."
            },
            {
              "full_name": "Matthews, A.G.D.G."
            }
          ],
          "arxiv_eprint": "2305.02402",
          "publication_info": {
            "year": 2023
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2138819"
        },
        "raw_refs": [
          {
            "value": "[20] A. Tomiya and S. Terasaki, GomalizingFlow.jl: A Julia package for Flow-based sampling algorithm for lattice field theory, (2022), arXiv:2208.08903 [hep-lat].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "GomalizingFlow.jl: A Julia",
            "package for Flow-based sampling algorithm for lattice field theory"
          ],
          "label": "20",
          "authors": [
            {
              "full_name": "Tomiya, A."
            },
            {
              "full_name": "Terasaki, S."
            }
          ],
          "arxiv_eprint": "2208.08903",
          "publication_info": {
            "year": 2022
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2702854"
        },
        "raw_refs": [
          {
            "value": "[21] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, Attention is all you need (2017), arXiv:1706.03762 [cs.CL].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Attention is all you need"
          ],
          "label": "21",
          "authors": [
            {
              "full_name": "Vaswani, A."
            },
            {
              "full_name": "Shazeer, N."
            },
            {
              "full_name": "Parmar, N."
            },
            {
              "full_name": "Uszkoreit, J."
            },
            {
              "full_name": "Jones, L."
            },
            {
              "full_name": "Gomez, A.N."
            },
            {
              "full_name": "Kaiser, L."
            },
            {
              "full_name": "Polosukhin, I."
            }
          ],
          "arxiv_eprint": "1706.03762",
          "publication_info": {
            "year": 2017
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2729542"
        },
        "raw_refs": [
          {
            "value": "[22] A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, and N. Houlsby, An image is worth 16x16 words: Transformers for image recognition at scale (2021), arXiv:2010.11929 [cs.CV].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "An image is worth 16x16 words: Transformers for image recognition at scale"
          ],
          "label": "22",
          "authors": [
            {
              "full_name": "Dosovitskiy, A."
            },
            {
              "full_name": "Beyer, L."
            },
            {
              "full_name": "Kolesnikov, A."
            },
            {
              "full_name": "Weissenborn, D."
            },
            {
              "full_name": "Zhai, X."
            },
            {
              "full_name": "Unterthiner, T."
            },
            {
              "full_name": "Dehghani, M."
            },
            {
              "full_name": "Minderer, M."
            },
            {
              "full_name": "Heigold, G."
            },
            {
              "full_name": "Gelly, S."
            },
            {
              "full_name": "Uszkoreit, J."
            },
            {
              "full_name": "Houlsby, N."
            }
          ],
          "arxiv_eprint": "2010.11929",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[23] J. Jumper, R. Evans, A. Pritzel, et al., Highly accurate protein structure prediction with alphafold, Nature 596, 583 (2021).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Highly accurate protein structure prediction with alphafold"
          ],
          "label": "23",
          "authors": [
            {
              "full_name": "Jumper, J."
            },
            {
              "full_name": "Evans, R."
            },
            {
              "full_name": "Pritzel, A."
            }
          ],
          "publication_info": {
            "year": 2021,
            "artid": "583",
            "page_start": "583",
            "journal_title": "Nature",
            "journal_volume": "596"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[24] A. Radford, K. Narasimhan, T. Salimans, and I. Sutskever, Improving language understanding by generative pre-training, (2018).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Improving language understanding by generative pre-training"
          ],
          "label": "24",
          "authors": [
            {
              "full_name": "Radford, A."
            },
            {
              "full_name": "Narasimhan, K."
            },
            {
              "full_name": "Salimans, T."
            },
            {
              "full_name": "Sutskever, I."
            }
          ],
          "publication_info": {
            "year": 2018
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[25] A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, and I. Sutskever, Language models are unsupervised multitask learners, (2019).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Language models are unsupervised multitask learners"
          ],
          "label": "25",
          "authors": [
            {
              "full_name": "Radford, A."
            },
            {
              "full_name": "Wu, J."
            },
            {
              "full_name": "Child, R."
            },
            {
              "full_name": "Luan, D."
            },
            {
              "full_name": "Amodei, D."
            },
            {
              "full_name": "Sutskever, I."
            }
          ],
          "publication_info": {
            "year": 2019
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2729461"
        },
        "raw_refs": [
          {
            "value": "[26] T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan, R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin, S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, and D. Amodei, Language models are few-shot learners (2020), arXiv:2005.14165 [cs.CL].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Language models are few-shot learners"
          ],
          "label": "26",
          "authors": [
            {
              "full_name": "Brown, T.B."
            },
            {
              "full_name": "Mann, B."
            },
            {
              "full_name": "Ryder, N."
            },
            {
              "full_name": "Subbiah, M."
            },
            {
              "full_name": "Kaplan, J."
            },
            {
              "full_name": "Dhariwal, P."
            },
            {
              "full_name": "Neelakantan, A."
            },
            {
              "full_name": "Shyam, P."
            },
            {
              "full_name": "Sastry, G."
            },
            {
              "full_name": "Askell, A."
            },
            {
              "full_name": "Agarwal, S."
            },
            {
              "full_name": "Herbert-Voss, A."
            },
            {
              "full_name": "Krueger, G."
            },
            {
              "full_name": "Henighan, T."
            },
            {
              "full_name": "Child, R."
            },
            {
              "full_name": "Ramesh, A."
            },
            {
              "full_name": "Ziegler, D.M."
            },
            {
              "full_name": "Wu, J."
            },
            {
              "full_name": "Winter, C."
            },
            {
              "full_name": "Hesse, C."
            },
            {
              "full_name": "Chen, M."
            },
            {
              "full_name": "Sigler, E."
            },
            {
              "full_name": "Litwin, M."
            },
            {
              "full_name": "Gray, S."
            },
            {
              "full_name": "Chess, B."
            },
            {
              "full_name": "Clark, J."
            },
            {
              "full_name": "Berner, C."
            },
            {
              "full_name": "McCandlish, S."
            },
            {
              "full_name": "Radford, A."
            },
            {
              "full_name": "Sutskever, I."
            },
            {
              "full_name": "Amodei, D."
            }
          ],
          "arxiv_eprint": "2005.14165",
          "publication_info": {
            "year": 2020
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2798025"
        },
        "raw_refs": [
          {
            "value": "[27] OpenAI, Gpt-4 technical report (2023), arXiv:2303.08774 [cs.CL].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "OpenAI, Gpt-4 technical report"
          ],
          "label": "27",
          "arxiv_eprint": "2303.08774",
          "publication_info": {
            "year": 2023
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2853835"
        },
        "raw_refs": [
          {
            "value": "[28] J. Kaplan, S. McCandlish, T. Henighan, T. B. Brown, B. Chess, R. Child, S. Gray, A. Radford, J. Wu, and D. Amodei, Scaling laws for neural language models (2020), arXiv:2001.08361 [cs.LG].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Scaling laws for neural language models"
          ],
          "label": "28",
          "authors": [
            {
              "full_name": "Kaplan, J."
            },
            {
              "full_name": "McCandlish, S."
            },
            {
              "full_name": "Henighan, T."
            },
            {
              "full_name": "Brown, T.B."
            },
            {
              "full_name": "Chess, B."
            },
            {
              "full_name": "Child, R."
            },
            {
              "full_name": "Gray, S."
            },
            {
              "full_name": "Radford, A."
            },
            {
              "full_name": "Wu, J."
            },
            {
              "full_name": "Amodei, D."
            }
          ],
          "arxiv_eprint": "2001.08361",
          "publication_info": {
            "year": 2020
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[29] T. Lin, Y. Wang, X. Liu, and X. Qiu, A survey of transformers (2021), arXiv:2106.04554 [cs.LG].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "A survey of transformers"
          ],
          "label": "29",
          "authors": [
            {
              "full_name": "Lin, T."
            },
            {
              "full_name": "Wang, Y."
            },
            {
              "full_name": "Liu, X."
            },
            {
              "full_name": "Qiu, X."
            }
          ],
          "arxiv_eprint": "2106.04554",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[30] Y.-H. Cao and J. Wu, A random cnn sees objects: One inductive bias of cnn and its applications (2021), arXiv:2106.09259 [cs.CV].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "A random cnn sees objects: One inductive bias of cnn and its applications"
          ],
          "label": "30",
          "authors": [
            {
              "full_name": "Cao, Y.-H."
            },
            {
              "full_name": "Wu, J."
            }
          ],
          "arxiv_eprint": "2106.09259",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[31] M. Koyama, K. Fukumizu, K. Hayashi, and T. Miyato, Neural fourier transform: A general approach to equivariant representation learning (2023), arXiv:2305.18484 [stat.ML].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Neural fourier transform: A general approach to equivariant representation learning"
          ],
          "label": "31",
          "authors": [
            {
              "full_name": "Koyama, M."
            },
            {
              "full_name": "Fukumizu, K."
            },
            {
              "full_name": "Hayashi, K."
            },
            {
              "full_name": "Miyato, T."
            }
          ],
          "arxiv_eprint": "2305.18484",
          "publication_info": {
            "year": 2023
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2728115"
        },
        "raw_refs": [
          {
            "value": "[32] T. S. Cohen and M. Welling, Group equivariant convolutional networks (2016), arXiv:1602.07576 [cs.LG].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Group equivariant convolutional networks"
          ],
          "label": "32",
          "authors": [
            {
              "full_name": "Cohen, T.S."
            },
            {
              "full_name": "Welling, M."
            }
          ],
          "arxiv_eprint": "1602.07576",
          "publication_info": {
            "year": 2016
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2853764"
        },
        "raw_refs": [
          {
            "value": "[33] K. Barros and Y. Kato, Efficient langevin simulation of coupled classical fields and fermions, Phys. Rev. B 88, 235101 (2013).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Efficient langevin simulation of coupled classical fields and fermions"
          ],
          "label": "33",
          "authors": [
            {
              "full_name": "Barros, K."
            },
            {
              "full_name": "Kato, Y."
            }
          ],
          "publication_info": {
            "year": 2013,
            "artid": "235101",
            "journal_title": "Phys.Rev.B",
            "journal_volume": "88"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[34] G. Stratis, P. Weinberg, T. Imbiriba, P. Closas, and A. E. Feiguin, Sample generation for the spin-fermion model using neural networks, Phys. Rev. B 106, 205112 (2022).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Sample generation for the spin-fermion model using neural networks"
          ],
          "label": "34",
          "authors": [
            {
              "full_name": "Stratis, G."
            },
            {
              "full_name": "Weinberg, P."
            },
            {
              "full_name": "Imbiriba, T."
            },
            {
              "full_name": "Closas, P."
            },
            {
              "full_name": "Feiguin, A.E."
            }
          ],
          "publication_info": {
            "year": 2022,
            "artid": "205112",
            "journal_title": "Phys.Rev.B",
            "journal_volume": "106"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/531193"
        },
        "raw_refs": [
          {
            "value": "[35] J. Alonso, L. Fernández, F. Guinea, V. Laliena, and V. Martı́n-Mayor, Hybrid monte carlo algorithm for the double exchange model, Nuclear Physics B 596, 587 (2001).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "L. Fernández, F. Guinea, V. Laliena",
            "and V. Martı́n-Mayor, Hybrid monte carlo algorithm for the double exchange model"
          ],
          "label": "35",
          "authors": [
            {
              "full_name": "Alonso, J."
            }
          ],
          "publication_info": {
            "year": 2001,
            "artid": "587",
            "page_start": "587",
            "journal_title": "Nucl.Phys.B",
            "journal_volume": "596"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[36] N. Furukawa, Y. Motome, and H. Nakata, Monte carlo algorithm for the double exchange model optimized for parallel computations, Computer Physics Communications 142, 410 (2001), conference on Computational Physics 2000: ”New Challenges for the New Millenium”.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Monte carlo algorithm for the double exchange model optimized for parallel computations",
            "conference on Computational Physics 2000:"
          ],
          "label": "36",
          "title": {
            "title": "New Challenges for the New Millenium"
          },
          "authors": [
            {
              "full_name": "Furukawa, N."
            },
            {
              "full_name": "Motome, Y."
            },
            {
              "full_name": "Nakata, H."
            }
          ],
          "publication_info": {
            "year": 2001,
            "artid": "410",
            "page_start": "410",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "142"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[37] N. Furukawa and Y. Motome, Order n monte carlo algorithm for fermion systems coupled with fluctuating adiabatical fields, Journal of the Physical Society of Japan 73, 1482 (2004), https://doi.org/10.1143/JPSJ.73.1482.",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1143/JPSJ.73.1482"
          ],
          "misc": [
            "Order n monte carlo algorithm for fermion systems coupled with fluctuating adiabatical fields"
          ],
          "label": "37",
          "authors": [
            {
              "full_name": "Furukawa, N."
            },
            {
              "full_name": "Motome, Y."
            }
          ],
          "publication_info": {
            "year": 2004,
            "artid": "1482",
            "page_start": "1482",
            "journal_title": "J.Phys.Soc.Jap.",
            "journal_volume": "73"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[38] G. Alvarez, C. Şen, N. Furukawa, Y. Motome, and E. Dagotto, The truncated polynomial expansion monte carlo method for fermion systems coupled to classical fields: a model independent implementation, Computer Physics Communications 168, 32 (2005).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "C. Şen, N. Furukawa, Y. Motome, and E. Dagotto",
            "The truncated polynomial expansion monte carlo method for fermion systems coupled to classical fields: a model independent implementation"
          ],
          "label": "38",
          "authors": [
            {
              "full_name": "Alvarez, G."
            }
          ],
          "publication_info": {
            "year": 2005,
            "artid": "32",
            "page_start": "32",
            "journal_title": "Comput.Phys.Commun.",
            "journal_volume": "168"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[39] G. Alvarez, P. K. V. V. Nukala, and E. D’Azevedo, Fast diagonalization of evolving matrices: application to spinfermion models, Journal of Statistical Mechanics: Theory and Experiment 2007, P08007 (2007).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Fast diagonalization of evolving matrices: application to spinfermion models"
          ],
          "label": "39",
          "authors": [
            {
              "full_name": "Alvarez, G."
            },
            {
              "full_name": "Nukala, P.K.V.V."
            },
            {
              "full_name": "D'Azevedo, E."
            }
          ],
          "publication_info": {
            "year": 2007,
            "artid": "P08007",
            "journal_title": "J.Stat.Mech.",
            "journal_volume": "2007"
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2732463"
        },
        "raw_refs": [
          {
            "value": "[40] M. A. Ruderman and C. Kittel, Indirect exchange coupling of nuclear magnetic moments by conduction electrons, Phys. Rev. 96, 99 (1954).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Indirect exchange coupling of nuclear magnetic moments by conduction electrons"
          ],
          "label": "40",
          "authors": [
            {
              "full_name": "Ruderman, M.A."
            },
            {
              "full_name": "Kittel, C."
            }
          ],
          "publication_info": {
            "year": 1954,
            "artid": "99",
            "page_start": "99",
            "journal_title": "Phys.Rev.",
            "journal_volume": "96"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[41] T. Kasuya, A theory of metallic ferro- and antiferromagnetism on zener’s model, Progr. Theoret. Phys. 16, 45 (1956).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "A theory of metallic ferro- and antiferromagnetism on zener’s model"
          ],
          "label": "41",
          "authors": [
            {
              "full_name": "Kasuya, T."
            }
          ],
          "publication_info": {
            "year": 1956,
            "artid": "45",
            "page_start": "45",
            "journal_title": "Prog.Theor.Phys.",
            "journal_volume": "16"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[42] K. Yosida, Magnetic properties of Cu-Mn alloys, Phys. Rev. 106, 893 (1957).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Magnetic properties of Cu-Mn alloys"
          ],
          "label": "42",
          "authors": [
            {
              "full_name": "Yosida, K."
            }
          ],
          "publication_info": {
            "year": 1957,
            "artid": "893",
            "page_start": "893",
            "journal_title": "Phys.Rev.",
            "journal_volume": "106"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[43] J. Xu, X. Tang, Y. Zhu, J. Sun, and S. Pu, SGMNet: Learning rotation-invariant point cloud representations via sorted gram matrix, in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (IEEE, 2021).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "SGMNet: Learning rotation-invariant point cloud representations via sorted gram matrix, in"
          ],
          "label": "43",
          "authors": [
            {
              "full_name": "Xu, J."
            },
            {
              "full_name": "Tang, X."
            },
            {
              "full_name": "Zhu, Y."
            },
            {
              "full_name": "Sun, J."
            },
            {
              "full_name": "Pu, S."
            }
          ],
          "imprint": {
            "publisher": "IEEE"
          },
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[43] J. Xu, X. Tang, Y. Zhu, J. Sun, and S. Pu, SGMNet: Learning rotation-invariant point cloud representations via sorted gram matrix, in 2021 IEEE/CVF International Conference on Computer Vision (ICCV) (IEEE, 2021).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "/CVF International Conference on Computer Vision (ICCV)"
          ],
          "label": "43",
          "authors": [
            {
              "full_name": "Xu, J."
            },
            {
              "full_name": "Tang, X."
            },
            {
              "full_name": "Zhu, Y."
            },
            {
              "full_name": "Sun, J."
            },
            {
              "full_name": "Pu, S."
            }
          ],
          "imprint": {
            "publisher": "IEEE"
          },
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[44] S. Assaad, C. Downey, R. Al-Rfou, N. Nayakanti, and B. Sapp, VN-Transformer: Rotation-Equivariant attention for vector neurons, (2022), arXiv:2206.04176 [cs.CV].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "VN-Transformer: Rotation-Equivariant attention for vector neurons"
          ],
          "label": "44",
          "authors": [
            {
              "full_name": "Assaad, S."
            },
            {
              "full_name": "Downey, C."
            },
            {
              "full_name": "Al-Rfou, R."
            },
            {
              "full_name": "Nayakanti, N."
            },
            {
              "full_name": "Sapp, B."
            }
          ],
          "arxiv_eprint": "2206.04176",
          "publication_info": {
            "year": 2022
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[45] C. Deng, O. Litany, Y. Duan, A. Poulenard, A. Tagliasacchi, and L. Guibas, Vector neurons: A general framework for SO(3)-Equivariant networks, (2021), arXiv:2104.12229 [cs.CV].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Vector neurons: A general framework for SO(3)-Equivariant networks"
          ],
          "label": "45",
          "authors": [
            {
              "full_name": "Deng, C."
            },
            {
              "full_name": "Litany, O."
            },
            {
              "full_name": "Duan, Y."
            },
            {
              "full_name": "Poulenard, A."
            },
            {
              "full_name": "Tagliasacchi, A."
            },
            {
              "full_name": "Guibas, L."
            }
          ],
          "arxiv_eprint": "2104.12229",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[46] P. Thölke and G. De Fabritiis, TorchMD-NET: Equivariant transformers for neural network based molecular potentials, (2022), arXiv:2202.02541 [cs.LG].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "P. Thölke and G. De Fabritiis, TorchMD-NET: Equivariant transformers for neural network based molecular potentials"
          ],
          "label": "46",
          "arxiv_eprint": "2202.02541",
          "publication_info": {
            "year": 2022
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[47] S. Batzner, A. Musaelian, L. Sun, M. Geiger, J. P. Mailoa, M. Kornbluth, N. Molinari, T. E. Smidt, and B. Kozinsky, E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials, Nat. Commun. 13, 2453 (2022).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials"
          ],
          "label": "47",
          "authors": [
            {
              "full_name": "Batzner, S."
            },
            {
              "full_name": "Musaelian, A."
            },
            {
              "full_name": "Sun, L."
            },
            {
              "full_name": "Geiger, M."
            },
            {
              "full_name": "Mailoa, J.P."
            },
            {
              "full_name": "Kornbluth, M."
            },
            {
              "full_name": "Molinari, N."
            },
            {
              "full_name": "Smidt, T.E."
            },
            {
              "full_name": "Kozinsky, B."
            }
          ],
          "publication_info": {
            "year": 2022,
            "artid": "2453",
            "page_start": "2453",
            "journal_title": "Nature Commun.",
            "journal_volume": "13"
          }
        }
      },
      {
        "raw_refs": [
          {
            "value": "[48] M. Innes, E. Saba, K. Fischer, D. Gandhi, M. C. Rudilosso, N. M. Joy, T. Karmali, A. Pal, and V. Shah, Fashionable modelling with flux (2018), arXiv:1811.01457 [cs.PL].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Fashionable modelling with flux"
          ],
          "label": "48",
          "authors": [
            {
              "full_name": "Innes, M."
            },
            {
              "full_name": "Saba, E."
            },
            {
              "full_name": "Fischer, K."
            },
            {
              "full_name": "Gandhi, D."
            },
            {
              "full_name": "Rudilosso, M.C."
            },
            {
              "full_name": "Joy, N.M."
            },
            {
              "full_name": "Karmali, T."
            },
            {
              "full_name": "Pal, A."
            },
            {
              "full_name": "Shah, V."
            }
          ],
          "arxiv_eprint": "1811.01457",
          "publication_info": {
            "year": 2018
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2728636"
        },
        "raw_refs": [
          {
            "value": "[49] J. Bezanson, A. Edelman, S. Karpinski, and V. B. Shah, Julia: A fresh approach to numerical computing (2015), arXiv:1411.1607 [cs.MS].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Julia: A fresh approach to numerical computing"
          ],
          "label": "49",
          "authors": [
            {
              "full_name": "Bezanson, J."
            },
            {
              "full_name": "Edelman, A."
            },
            {
              "full_name": "Karpinski, S."
            },
            {
              "full_name": "Shah, V.B."
            }
          ],
          "arxiv_eprint": "1411.1607",
          "publication_info": {
            "year": 2015
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2729484"
        },
        "raw_refs": [
          {
            "value": "[50] I. Loshchilov and F. Hutter, Decoupled weight decay regularization (2019), arXiv:1711.05101 [cs.LG].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Decoupled weight decay regularization"
          ],
          "label": "50",
          "authors": [
            {
              "full_name": "Loshchilov, I."
            },
            {
              "full_name": "Hutter, F."
            }
          ],
          "arxiv_eprint": "1711.05101",
          "publication_info": {
            "year": 2019
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1865610"
        },
        "raw_refs": [
          {
            "value": "[51] L. D. Debbio, J. M. Rossney, and M. Wilson, Efficient modeling of trivializing maps for lattice ϕ4 theory using normalizing flows: A first look at scalability, Physical Review D 104, 10.1103/physrevd.104.094507 (2021).",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "dois": [
            "10.1103/physrevd.104.094507"
          ],
          "misc": [
            "Efficient modeling of trivializing maps for lattice ϕ4 theory using normalizing flows: A first look at scalability, Physical Review D",
            "104"
          ],
          "label": "51",
          "authors": [
            {
              "full_name": "Debbio, L.D."
            },
            {
              "full_name": "Rossney, J.M."
            },
            {
              "full_name": "Wilson, M."
            }
          ],
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/1998859"
        },
        "raw_refs": [
          {
            "value": "[52] L. D. Debbio, J. M. Rossney, and M. Wilson, Machine learning trivializing maps: A first step towards understanding how flow-based samplers scale up (2021), arXiv:2112.15532 [hep-lat].",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Machine learning trivializing maps: A first step towards understanding how flow-based samplers scale up"
          ],
          "label": "52",
          "authors": [
            {
              "full_name": "Debbio, L.D."
            },
            {
              "full_name": "Rossney, J.M."
            },
            {
              "full_name": "Wilson, M."
            }
          ],
          "arxiv_eprint": "2112.15532",
          "publication_info": {
            "year": 2021
          }
        }
      },
      {
        "record": {
          "$ref": "https://inspirehep.net/api/literature/2620105"
        },
        "raw_refs": [
          {
            "value": "[53] J. Komijani and M. K. Marinkovic, Generative models for scalar field theories: how to deal with poor scaling? (2023), arXiv:2301.01504 [hep-lat]. SUPPLEMENTAL MATERIAL ATTENTION AND TRANSFORMER Here we briefly review Transformer and Attention block [21], which have large model capacity [28]. Please see [29] for detail and recent development. The attention layer is essential component of the transformer neural networks. The input consists of queries, keys, and values of dimension d. In the conventional attention layer, so-called scaled dot-product attention layer, we compute the dot products of the query with keys, divide each by √ d and apply the activation function to obtain the weights of the values. According to the Ref. [21], the conventional attention layer is defined as Attention(Q, K, V ) = softmax \u0012 QKT √ d \u0013 V. (45) Here, Q,K and V are tensors whose size depend on system. The self-attention layer defined as SelfAttention(x) = Attention(WQ x, WK x, WV x), (46)",
            "schema": "text",
            "source": "arXiv"
          }
        ],
        "reference": {
          "misc": [
            "Generative models for scalar field theories: how to deal with poor scaling?",
            "SUPPLEMENTAL MATERIAL ATTENTION AND TRANSFORMER Here we briefly review Transformer and Attention block [21], which have large model capacity [28]. Please see [29] for detail and recent development. The attention layer is essential component of the transformer neural networks. The input consists of queries, keys, and values of dimension d. In the conventional attention layer, so-called scaled dot-product attention layer, we compute the dot products of the query with keys, divide each by √ d and apply the activation function to obtain the weights of the values. According to the Ref. [21], the conventional attention layer is defined as Attention(Q, K, V ) = softmax QKT √ d V. (45) Here, Q",
            "K and V are tensors whose size depend on system. The self-attention layer defined as SelfAttention(x) = Attention(WQ x, WK x, WV x), (46)"
          ],
          "label": "53",
          "authors": [
            {
              "full_name": "Komijani, J."
            },
            {
              "full_name": "Marinkovic, M.K."
            }
          ],
          "arxiv_eprint": "2301.01504",
          "publication_info": {
            "year": 2023
          }
        }
      }
    ],
    "public_notes": [
      {
        "value": "10 pates, 6 figures, Full paper version",
        "source": "arXiv"
      }
    ],
    "arxiv_eprints": [
      {
        "value": "2306.11527",
        "categories": [
          "cond-mat.str-el",
          "cond-mat.dis-nn",
          "hep-lat"
        ]
      }
    ],
    "document_type": [
      "article"
    ],
    "preprint_date": "2023-06-20",
    "control_number": 2669845,
    "number_of_pages": 11,
    "inspire_categories": [
      {
        "term": "Condensed Matter",
        "source": "arxiv"
      },
      {
        "term": "Lattice",
        "source": "arxiv"
      }
    ]
  }
}