Publications

(2025). RpcNIC: Enabling Efficient Datacenter RPC Offloading on PCIe-attached SmartNICs. 2025 IEEE International Symposium on High Performance Computer Architecture (HPCA).
(2025). Hyperion: Optimizing ssd access is all you need to enable cost-efficient out-of-core gnn training. 2025 IEEE 41st International Conference on Data Engineering (ICDE).
(2025). CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access. 2025 IEEE 41st International Conference on Data Engineering (ICDE).
(2024). Understanding Routable PCIe Performance for Composable Infrastructures. 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24).
(2024). DmRPC: Disaggregated Memory-aware Datacenter RPC for Data-intensive Applications. 2024 IEEE 40th International Conference on Data Engineering (ICDE).
(2024). Demystifying Datapath Accelerator Enhanced Off-path SmartNIC. 2024 IEEE 32nd International Conference on Network Protocols (ICNP).
(2023). SparseACC: A Generalized Linear Model Accelerator for Sparse Datasets. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.
(2023). SmartDS: Middle-Tier-centric SmartNIC Enabling Application-aware Message Split for Disaggregated Block Storage. Proceedings of the 50th Annual International Symposium on Computer Architecture (ISCA).
(2023). P4SGD: Programmable Switch Enhanced Model-Parallel Training on Generalized Linear Models on Distributed FPGAs. IEEE Transactions on Parallel and Distributed Systems.
(2023). Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training. 2023 USENIX Annual Technical Conference (USENIX ATC 23).