2025

SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale

Qi Li, Kun Li, Haozhi Han, Honghui Shang, Xinfu He, Yunquan Zhang, Hong An, Ting Cao, Mao Yang

Under review.

SparStencil: Retargeting Sparse Tensor Cores to Scientific Stencil Computations via Structured Sparsity Transformation

Qi Li, Kun Li, Haozhi Han, Liang Yuan, Junshi Chen, Yunquan Zhang, Yifeng Chen, Hong An, Ting Cao, Mao Yang

The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2025. Best Student Paper Award Finalist. CCF-A.

FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units

Haozhi Han, Kun Li, Wei Cui, Donglin Bai, Yiwei Zhang, Liang Yuan, Yifeng Chen, Yunquan Zhang, Ting Cao, Mao Yang

ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2025. CCF-A.

Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers

Yiwei Zhang, Kun Li, Liang Yuan, Haozhi Han, Yunquan Zhang, Ting Cao, Mao Yang

ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2025. CCF-A.
