I am currently an Associate Researcher with the College of Intelligence and Computing, Tianjin University, Tianjin, China. And I am a member of TANK Lab, led by Prof.Keqiu Li. From 2022 to 2024, I was a Postdoctoral Researcher in the Storage Research Group (advised by Prof.Jiwu Shu and Youyou Lu), Department of Computer Science and Technology, Tsinghua University, Beijing, China. I received the PhD degree from the School of Computer Science and Technology, Shandong University, Qingdao, China, in June 2022. My research interest includes storage system and AI system.

🔥 News

  • 🎉2024.07:   Our paper entitled “Achieving Wire-Latency Storage Systems by Exploiting Hardware ACKs” has been accepted by NSDI’25.
  • 🎉2024.07:   Our paper entitled “Deft: A Scalable Tree Index for Disaggregated Memory” has been accepted by Eurosys’25.
  • 🎉2024.07:   Our paper entitled “Ares-Flash: Efficient Parallel Integer Arithmetic Operations Using NAND Flash Memory” has been accepted by MICRO’24.
  • 🎉2024.05:   Our paper entitled “Full Lifecycle Data Analysis on a Large-scale and Leadership Supercomputer: What Can We Learn from It?” has been accepted by USENIX ATC’24.

đź’¬ Research Topics

Storage system:

  • Distributed Storage System with Fast/Smart CXL/Network Devices (e.g., RDMA/Smart NIC).
  • Near Data Processing System (i.e., Processing-in-memory and Computing-in-Storage) with Emerging Devices (e.g., DIMM PIM Modules and Smart SSD).

AI system:

  • Task Scheduling and Memory/Storage Management System for Large Scale AI Models.
  • High Performance Networking System for Large Scale AI Models.

đź“ť Publications

Check my full publication list on google scholar.

  • Achieving Wire-Latency Storage Systems by Exploiting Hardware ACKs.[PDF]
    Qing Wang, Jiwu Shu, Jing Wang, Yuhao Zhang.
    The 22nd USENIX Symposium on Networked Systems Design and Implementation (NSDI’25), 2025, CCF-A.

  • Deft: A Scalable Tree Index for Disaggregated Memory.[PDF]
    Jing Wang, Qing Wang, Yuhao Zhang, Jiwu Shu.
    The 20th European Conference on Computer Systems (Eurosys’25), 2025, CCF-A.

  • Ares-Flash: Efficient Parallel Integer Arithmetic Operations Using NAND Flash Memory.[PDF]
    Jian Chen, Congming Gao, Youyou Lu, Yuhao Zhang, Jiwu Shu.
    57th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO’24), 2024, CCF-A.

  • Full Lifecycle Data Analysis on a Large-scale and Leadership Supercomputer: What Can We Learn from It?[PDF]
    Bin Yang, Hao Wei, Wenhao Zhu, Yuhao Zhang, Weiguo Liu, Wei Xue.
    USENIX Annual Technical Conference (USENIX ATC’24), 2024, CCF-A.

  • A Semantic-integrated LSM-tree based Key-Value Storage Engine for Blockchain Systems.[PDF]
    Qian Wei, Zehao Chen, Yuhao Zhang, Xiaojun Cai, Zhiping Jia, Zhaoyan Shen, Yi Wang, Zili Shao, Bingzhe Li.
    IEEE Transactions on Computer-Aided Design of Integrated Circuits And System (TCAD), 2024, CCF-A.

  • Towards High-throughput Neural Network Inference with Computational BRAM on Nonvolatile FPGAs.[PDF]
    Hao Zhang, Mengying Zhao, Huichuan Zheng, Yuqing Xiong, Yuhao Zhang, Zhaoyan Shen.
    Design, Automation & Test in Europe (DATE’24), 2024, CCF-B.

  • Perseid: A Secondary Indexing Mechanism for LSM-based Storage Systems.[PDF]
    Jing Wang, Youyou Lu, Qing Wang, Yuhao Zhang, Jiwu Shu.
    ACM Transactions on Storage (TOS), 2024, CCF-A.

  • ASHL: An Adaptive Multi-stage Distributed Deep Learning Training Scheme for Heterogeneous Environments.[PDF]
    Zhaoyan Shen, Qingxiang Tang, Tianren Zhou, Yuhao Zhang, Zhiping Jia, Dongxiao Yu, Zhiyong Zhang, Bingzhe Li.
    IEEE Transactions on Computers (TC), 2024, CCF-A.

  • Static Scheduling of Weight Programming for DNN Acceleration with Resource Constrained PIM.[PDF]
    Xin Gao, Hongyue Wang, Yiyan Chen, Yuhao Zhang, Zhaoyan Shen, Lei Ju.
    Transactions on Embedded Computing Systems (TECS), 2023, CCF-B.

  • Revisiting Secondary Indexing in LSM-based Storage Systems with Persistent Memory.[PDF]
    Jing Wang, Youyou Lu, Qing Wang, Yuhao Zhang, Jiwu Shu.
    USENIX Annual Technical Conference (USENIX ATC’23), 2023, CCF-A.

  • PQ-PIM: A Pruning-Quantization Joint Optimization Framework for ReRAM-Based Processing-in-Memory DNN Accelerator.[PDF]
    Yuhao Zhang, Xinyu Wang, Xikun Jiang, YuhanYang, Zhaoyan Shen, Zhiping Jia.
    Journal of Systems Architecture (JSA), 2022, CCF-B.

  • An Efficient Highly Parallelized ReRAM-based Architecture for Motion Estimation of HEVC.[PDF]
    Yuhao Zhang, Bing Liu, Zhiping Jia, Renhai Chen, Zhaoyan Shen.
    Journal of Systems Architecture (JSA), 2021, CCF-B.

  • A Practical Highly Paralleled ReRAM-based DNN Accelerator by Reusing Weight Pattern Repetitions.[PDF]
    Yuhao Zhang, Zhiping Jia, Hongchao Du, Runzhen Xue, Zhaoyan Shen, Zili Shao.
    IEEE Transactionson Computer-Aided Design of Integrated Circuits And System (TCAD), 2021, CCF-A.

  • PattPIM: A Practical ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions.[PDF]
    Yuhao Zhang, Zhiping Jia, Yungang Pan, Hongchao Du, Zhaoyan Shen, Mengying Zhao, Zili Shao.
    Design Automation Conference (DAC’20), 2020, CCF-A.

🎖 Honors and Awards

  • Outstanding Graduate, Shandong University, 2022.

  • Outstanding Academic Achievement Award for Graduate Students, Shandong University, 2022.(1 PhD students from the the the school of computer science and technology were selected).

  • National Scholarship for Ph.D. Graduate Students, Shandong University, 2021. (2 PhD students from the the the school of computer science and technology were selected)