# Optimizing the domain wall fermion Dirac operator using the R\-Stream source\-to\-source compiler

**INSPIRE:** [1408296](https://inspirehep.net/literature/1408296)
**arXiv:** [1512\.01542](https://arxiv.org/abs/1512.01542)
**DOI:** [10\.22323/1\.251\.0022](https://doi.org/10.22323/1.251.0022)

**Authors:** Lin, Meifeng, Papenhausen, Eric, Langston, M\. Harper, Meister, Benoit, Baskaran, Muthu, Izubuchi, Taku, Jung, Chulwoo

**Submitted:** 4 December 2015

**Subjects:**
- hep\-lat
- physics\.comp\-ph
- Lattice
- Computing

**Journal reference:** PoS LATTICE2015 022 \(2016\)

## Abstract

The application of the Dirac operator on a spinor field, the Dslash operation, is the most computation\-intensive part of the lattice QCD simulations\. It is often the key kernel to optimize to achieve maximum performance on various platforms\. Here we report on a project to optimize the domain wall fermion Dirac operator in Columbia Physics System \(CPS\) using the R\-Stream source\-to\-source compiler\. Our initial target platform is the Intel PC clusters\. We discuss the optimization strategies involved before and after the automatic code generation with R\-Stream and present some preliminary benchmark results\.
