Zhichao Cao
Mail code: 8809
Campus: Tempe
Zhichao Cao is an assistant professor in the School of Computing and Augmented Intelligence at Arizona State University. He leads the Intelligent Data Infrastructure (IDI) research lab, where he conducts research in the areas of database systems (e.g., key-value stores, graph databases, and time-series databases), storage systems (e.g., file systems, cloud storage, and deduplication systems), and next-generation data infrastructure (e.g., disaggregated infrastructure, computing-in-X, and wireless datacenters). His research interests also lie in the design and development of data management systems for new storage technologies, such as SMR, IMR, NVM, ZNS, and DNA. Prof. Cao's research also encompasses big data systems, with a focus on the development of query engines for large-scale scientific computing in HPC and storage solutions for AI/ML platforms.
Prior to joining ASU, Prof. Cao worked as a research scientist at Facebook, where he contributed to storage and database research from 2018 to 2021. He earned his bachelor's degree in Automation from Tsinghua University in 2013 and his doctoral degree in Computer Science from the University of Minnesota, Twin Cities, in 2020.
[Important Notice] I'm now recruiting highly motivated graduate and undergraduate students (including 2 Ph.D. students starting in Spring or Fall 2024). If you are interested in databases, storage systems, data infrastructure, or new data persistence techniques, please send your CV and statements to Zhichao.Cao@asu.edu.
- Ph.D. Computer Science, University of Minnesota, Twin Cities, 2020
- M.S. Computer Science, University of Minnesota, Twin Cities, 2019
- B.S. Automation, Tsinghua University, China, 2013
Data Infrastructure: key-value store (RocksDB, LevelDB, and HBase); NoSQL database; data deduplication; backup and archive system; file system; hierarchical storage system; distributed storage system; compute-storage disaggregation
Storage for Big Data: cloud storage; object storage; storage systems for big graph; AI/ML for storage system; storage systems for AI/ML; storage system in IoT
New Storage Devices: Non-Volatile Memory (NVM); Shingled Magnetic Recording (SMR); Interlaced Magnetic Recording (IMR); Zoned Namespace SSD (ZNS); DNA- and glass-based storage
[ICCD'23] Zhichao Cao, Hao Wen, Fenggang Wu, David H.C. Du. "SMRTS: A Performance and Cost-Effectiveness Optimized SSD-SMR Tiered File System with Data Deduplication." The 41st IEEE International Conference on Computer Design (to appear) (Acceptance rate: 28%).
[ICCD'23] Hao Wen, Zhichao Cao, Bingzhe Li, David Du, Ayman Abouelwafa, Doug Voigt, Shiyong Liu, Jim Diehl, and Fenggang Wu. "K8sES: Optimizing Kubernetes with Enhanced Storage Service-Level Objectives." The 41st IEEE International Conference on Computer Design (to appear) (Acceptance rate: 28%).
[TOS’22] Zhichao Cao, Huibing Dong, Yixun Wei, Shiyong Liu, and David H.C. Du. “IS-HBase: An In-Storage Computing Optimized HBase with I/O Offloading and Self-Adaptive Caching in Compute-Storage Disaggregated Infrastructure.” ACM Transactions on Storage, Volume 18, Issue 2, May 2022.
[TOS’22] Hiwot Tadese Kassa, Jason Akers, Mrinmoy Ghosh, Zhichao Cao, Vaibhav Gogte, Ronald Dreslinski. “Power-optimized Deployment of Key-value Stores Using Storage Class Memory.” ACM Transactions on Storage, Volume 18, Issue 2, May 2022.
[TOS’22] Xiongzi Ge, Zhichao Cao, and David H.C. Du. “HintStor: A Framework to Study I/O Hints in Heterogeneous Storage.” ACM Transactions on Storage, Volume 18, Issue 2, May 2022.
[ATC’21] Hiwot Tadese Kassa, Jason Akers, Mrinmoy Ghosh, Zhichao Cao, Vaibhav Gogte, Ronald Dreslinski. “Improving Performance of Flash Based Key-Value Stores Using Storage Class Memory as a Volatile Memory Extension.” 2021 USENIX Annual Technical Conference, 2021 (Acceptance rate: 64/341=19%).
[FAST’20] Zhichao Cao, Siying Dong, Sagar Vemuri, and David H.C. Du. “Characterizing, Modeling, and Benchmarking RocksDB Key-Value Workloads at Facebook.” 18th USENIX Conference on File and Storage Technologies, 2020 (Acceptance rate: 23/138=17% as Full Paper).
[FAST’19] Zhichao Cao, Shiyong Liu, Fenggang Wu, Guohua Wang, Bingzhe Li, and David H.C. Du. “Sliding Look-Back Window Assisted Data Chunk Rewriting for Improving Deduplication Restore Performance.” 17th USENIX Conference on File and Storage Technologies, 2019 (Acceptance rate: 26/145=18% as Full Paper).
[TOS’19] Zhichao Cao, Hao Wen, Xiongzi Ge, and David H.C. Du. “TDDFS: A Tier-aware Data Deduplication based File System.” ACM Transactions on Storage, 2019.
[FAST’18] Zhichao Cao, Hao Wen, Fenggang Wu, and David H.C. Du. “ALACC: Accelerating Restore Performance of Data Deduplication Systems Using Adaptive Look Ahead Window Assisted Chunk Caching.” 16th USENIX Conference on File and Storage Technologies, 2018 (Acceptance rate: 23/139=17% as Full Paper).
Courses
2023 Fall
Course Number | Course Title |
---|---|
CSE 792 | Research |
CSE 580 | Practicum |
CSE 792 | Research |
CSE 330 | Operating Systems |
CSE 330 | Operating Systems |
CSE 330 | Operating Systems |
CSE 330 | Operating Systems |
CSE 330 | Operating Systems |
2023 Summer
Course Number | Course Title |
---|---|
CSE 584 | Internship |
CSE 584 | Internship |
CSE 599 | Thesis |
2023 Spring
Course Number | Course Title |
---|---|
CSE 599 | Thesis |
CSE 580 | Practicum |
CSE 511 | Data Processing at Scale |
2022 Fall
Course Number | Course Title |
---|---|
CSE 580 | Practicum |
CSE 330 | Operating Systems |
CSE 330 | Operating Systems |
CSE 330 | Operating Systems |
CSE 330 | Operating Systems |
CSE 330 | Operating Systems |
2022 Spring
Course Number | Course Title |
---|---|
CSE 511 | Data Processing at Scale |
Spring 2022: CSE 520
- Program Committee of VLDB 2024
- Program Committee of ICPP 2023
- Program Committee of ACM HotStorage 2023
- Proceedings Co-Chair of ACM SIGMOD 2023
- Program Committee of ACM SIGMOD 2023
- Virtual Chair of ACM HotStorage 2022
- Program Committee of IEEE NAS 2022
- Program Committee of ACM APSys 2022
- Reviewer of ACM Transactions on Storage (TOS), 2023
- Reviewer of IEEE Transactions on Computers (TC), 2022
- Reviewer of IEEE/ACM Transactions on Networking (TON), 2022, 2023
- Reviewer of IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2022
- Reviewer of International Journal of Future Generation Computer Systems, 2021
- Reviewer of Transactions on Cloud Computing, 2022
- Reviewer of Computer Communications, 2022
- Reviewer of IEEE Intelligent Systems, 2022
- Volunteer of International Conference on Parallel Processing (ICPP’14)
- Research Scientist, Facebook, Oct. 2019 - Dec. 2021
- Research Collaborator, Facebook, Sep. 2018 - Sep. 2019
- Research Intern, Facebook, Jun. 2018 - Aug. 2018
- Research Intern, Veritas, Jun. 2016 - Aug. 2016
- Research Intern, Hewlett-Packard (HPE), Jun. 2015 - Aug. 2015
- Research Intern, Hewlett-Packard (HPE), Jun. 2014 - Aug. 2014