Google Scholar | CV
I am a postdoc at UC Berkeley, advised by Ion Stoica. I am equally interested in core systems and ML systems research, including efficient machine learning, datacenter far memory, distributed transactions and consensus, and network stack design. Short Bio.
I will be joining UC Davis CS as an Assistant Professor, starting July 2025.
I have multiple PhD openings at UC Davis (apply here by Dec 15, 2024). Feel free to drop me an email if you are interested.
I finished my PhD in Computer Science at Harvard University in 2024, advised by Minlan Yu and James Mickens. I received my B.S. in Computer Science from Peking University in 2018, where I was advised by Tong Yang on probabilistic data structures and streaming algorithms. I was supported by a Google PhD Fellowship in Systems and Networking (see my application materials).
NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference
Xuanlin Jiang, Yang Zhou, Shiyi Cao, Ion Stoica, Minlan Yu
[arxiv]
BlendServe: Optimizing Offline Inference for Auto-regressive Large Models with Resource-aware Batching
Yilong Zhao*, Shuo Yang*, Kan Zhu, Lianmin Zheng, Baris Kasikci, Yang Zhou, Jiarong Xing, Ion Stoica
[arxiv]
eTran: Extensible Kernel Transport with eBPF
Zhongjie Chen, Qingkai Meng, ChonLam Lao, Yifan Liu, Fengyuan Ren, Minlan Yu, Yang Zhou
NSDI 2025. USENIX Symposium on Networked Systems Design and Implementation
SmartNIC Security Isolation in the Cloud with S-NIC
Yang Zhou, Mark Wilkening, James Mickens, Minlan Yu
EuroSys 2024. European Conference on Computer Systems
[paper]
[slides]
[code]
DINT: Fast In-Kernel Distributed Transactions with eBPF
Yang Zhou*, Xingyu Xiang*, Matthew Kiley, Sowmya Dharanipragada, Minlan Yu
NSDI 2024. USENIX Symposium on Networked Systems Design and Implementation
[paper]
[slides]
[talk]
[code]
Electrode: Accelerating Distributed Protocols with eBPF
Yang Zhou*, Zezhou Wang*, Sowmya Dharanipragada, Minlan Yu
NSDI 2023. USENIX Symposium on Networked Systems Design and Implementation
[paper]
[slides]
[talk]
[code]
Carbink: Fault-Tolerant Far Memory
Yang Zhou, Hassan Wassel, Sihang Liu, Jiaqi Gao, James Mickens, Minlan Yu, Chris Kennelly, Paul Turner, David Culler, Hank Levy, Amin Vahdat
OSDI 2022. USENIX Symposium on Operating Systems Design and Implementation
[paper]
[slides]
[talk]
Evolvable Network Telemetry at Facebook
Yang Zhou, Ying Zhang, Minlan Yu, Guangyu Wang, Dexter Cao, Eric Sung, Starsky Wong
NSDI 2022. USENIX Symposium on Networked Systems Design and Implementation
[paper]
[slides]
[talk]
Cold Filter: A Meta-Framework for Faster and More Accurate Stream Processing.
Yang Zhou, Tong Yang, Jie Jiang, Bin Cui, Minlan Yu, Xiaoming Li, Steve Uhlig
SIGMOD 2018. ACM SIGMOD International Conference on Management of Data
[paper]
[slides]
[code]
Elastic Sketch: Adaptive and Fast Network-wide Measurements.
Tong Yang, Jie Jiang, Peng Liu, Qun Huang, Junzhi Gong, Yang Zhou, Rui Miao, Xiaoming Li, Steve Uhlig
SIGCOMM 2018. ACM SIGCOMM International Conference on Data Communications
[paper]
[slides]
[talk]
[code]
A Comparison of Performance and Accuracy of Measurement Algorithms in Software.
Omid Alipourfard, Masoud Moshref, Yang Zhou, Tong Yang, Minlan Yu
SOSR 2018. ACM Symposium on SDN Research
[paper]
Single Hash: Use One Hash Function to Build Faster Hash Based Data Structures.
Xiangyang Gou, Chenxingyu Zhao, Tong Yang, Lei Zou, Yang Zhou, Yibo Yan, Xiaoming Li, Bin Cui
BigComp 2018. IEEE International Conference on Big Data and Smart Computing
[paper]
Pyramid Sketch: a Sketch Framework for Frequency Estimation of Data Streams.
Tong Yang, Yang Zhou, Hao Jin, Shigang Chen, Xiaoming Li
VLDB 2017. International Conference on Very Large Data Bases
[paper]
[code]
One Memory Access Sketch: a More Accurate and Faster Sketch for Per-flow Measurement.
Yang Zhou, Peng Liu, Hao Jin, Tong Yang, Shoujiang Dang, Xiaoming Li
Globecom 2017. IEEE Global Communications Conference
[paper]
[code]
ABC: a Practicable Sketch Framework for Non-uniform Multisets.
Junzhi Gong, Tong Yang, Yang Zhou, Dongsheng Yang, Shigang Chen, Bin Cui, Xiaoming Li
BigData 2017. IEEE International Conference on Big Data
[paper]
On the Evolutionary of Bloom Filter False Positives - An Information Theoretical Approach to Optimizing Bloom Filter Parameters
Zhuochen Fan, Gang Wen, Zhipeng Huang, Yang Zhou, Qiaobin Fu, Tong Yang, Alex X. Liu, Bin Cui
IEEE Transactions on Knowledge and Data Engineering (TKDE) 2022
[paper] [code]
Pyramid Family: Generic Frameworks for Accurate and Fast Flow Size Measurement
Yuanpeng Li, Xiang Yu, Yilong Yang, Yang Zhou, Tong Yang, Zhuo Ma, Shigang Chen
IEEE/ACM Transactions on Networking (TON) 2021
[paper]
[code]
Adaptive Measurements using One Elastic Sketch.
Tong Yang, Jie Jiang, Peng Liu, Qun Huang, Junzhi Gong, Yang Zhou, Rui Miao, Xiaoming Li, Steve Uhlig
IEEE/ACM Transactions on Networking (TON) 2019
[paper]
[code]
Fast and Accurate Stream Processing by Filtering the Cold.
Tong Yang, Jie Jiang, Yang Zhou, Long He, Jinyang Li, Bin Cui, Steve Uhlig, Xiaoming Li
VLDB Journal 2019
[paper]
[code]
Accelerating Network Measurement in Software.
Yang Zhou, Omid Alipourfard, Minlan Yu, Tong Yang
SIGCOMM CCR 2018 July issue, ACM SIGCOMM Computer Communication Review
[paper]
[code]
*: co-primary authors
Last updated Dec 11, 2024