2025

Pushing a Single GPU to Its Limits and Scaling to Tens of Thousands: RL-Guided, Physically Consistent KMC for Nuclear Materials Simulation
Pushing a Single GPU to Its Limits and Scaling to Tens of Thousands: RL-Guided, Physically Consistent KMC for Nuclear Materials Simulation

Haozhi Han*, Qi Li*,, Ruge Zhang*,, Haipeng Jia,, Yunquan Zhang,, Yifeng Chen,, Ting Cao,, Yunxin Liu,, Kun Li (* equal contribution)

ISC High Performance 2026 (ISC) 2026

Pushing a Single GPU to Its Limits and Scaling to Tens of Thousands: RL-Guided, Physically Consistent KMC for Nuclear Materials Simulation

Haozhi Han*, Qi Li*,, Ruge Zhang*,, Haipeng Jia,, Yunquan Zhang,, Yifeng Chen,, Ting Cao,, Yunxin Liu,, Kun Li (* equal contribution)

ISC High Performance 2026 (ISC) 2026

SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale
SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale

Qi Li, Kun Li, Haozhi Han, Honghui Shang, Xinfu He, Yunquan Zhang, Hong An, Ting Cao, Mao Yang

Under review.

SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale

Qi Li, Kun Li, Haozhi Han, Honghui Shang, Xinfu He, Yunquan Zhang, Hong An, Ting Cao, Mao Yang

Under review.

Matrix Is All You Need: Rearchitecting Quantum Chemistry to Scale on AI Accelerators
Matrix Is All You Need: Rearchitecting Quantum Chemistry to Scale on AI Accelerators

Haozhi Han, Kun Li, Fusong Ju, Qi Li, Yifeng Chen, Yunquan Zhang, Ting Cao, Mao Yang

The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC) 2025 CCF-A

Matrix Is All You Need: Rearchitecting Quantum Chemistry to Scale on AI Accelerators

Haozhi Han, Kun Li, Fusong Ju, Qi Li, Yifeng Chen, Yunquan Zhang, Ting Cao, Mao Yang

The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC) 2025 CCF-A

SparStencil: Retargeting Sparse Tensor Cores to Scientific Stencil Computations via Structured Sparsity Transformation
SparStencil: Retargeting Sparse Tensor Cores to Scientific Stencil Computations via Structured Sparsity Transformation

Qi Li, Kun Li, Haozhi Han, Liang Yuan, Junshi Chen, Yunquan Zhang, Yifeng Chen, Hong An, Ting Cao, Mao Yang

The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC) 2025 Best Student Paper Award Finalist CCF-A

SparStencil: Retargeting Sparse Tensor Cores to Scientific Stencil Computations via Structured Sparsity Transformation

Qi Li, Kun Li, Haozhi Han, Liang Yuan, Junshi Chen, Yunquan Zhang, Yifeng Chen, Hong An, Ting Cao, Mao Yang

The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC) 2025 Best Student Paper Award Finalist CCF-A

FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units
FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units

Haozhi Han, Kun Li, Wei Cui, Donglin Bai, Yiwei Zhang, Liang Yuan, Yifeng Chen, Yunquan Zhang, Ting Cao, Mao Yang

ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) 2025 CCF-A

FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units

Haozhi Han, Kun Li, Wei Cui, Donglin Bai, Yiwei Zhang, Liang Yuan, Yifeng Chen, Yunquan Zhang, Ting Cao, Mao Yang

ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) 2025 CCF-A

Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers
Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers

Yiwei Zhang, Kun Li, Liang Yuan, Haozhi Han, Yunquan Zhang, Ting Cao, Mao Yang

ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) 2025 CCF-A

Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers

Yiwei Zhang, Kun Li, Liang Yuan, Haozhi Han, Yunquan Zhang, Ting Cao, Mao Yang

ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) 2025 CCF-A