Jia Zou
- Mail code: 8809
- Campus: Tempe
Jia Zou is a tenure-track Assistant Professor in the School of Computing and Augmented Intelligence at Arizona State University, Tempe, a position she started in fall 2019. She is also the director of the CACTUS data-intensive systems lab, founded in the summer of 2020. Before that, she was a Research Scientist in the Department of Computer Science at Rice University, Houston, TX, and earlier a researcher at IBM Research - China. She received her Ph.D. in Computer Science from Tsinghua University, China.
Jia Zou received an NSF CAREER Award in 2022, an Amazon Research Award in 2023, and an IBM faculty award in 2021. Her research interests include database systems, AI/ML in databases, applying AI/ML to databases, and federated data management. She has published more than 20 papers in venues including VLDB, SIGMOD, The VLDB Journal, ICDCS, and ICDM, and has been granted 15 patents. Her work received the VLDB 2019 Best Paper Honorable Mention and the SIGMOD 2020 Research Highlight Award.
More project information is here.
Research Interests
1. Machine Learning Systems
2. Database Systems
3. Applying Deep Learning to Database Systems
4. Federated Data Management and Data Integration
[New!!!] We are now recruiting highly motivated graduate and undergraduate students. Please send your CV to jia.zou@asu.edu.
- Ph.D. Department of Computer Science, Tsinghua University, China
1. Database Systems for Big Data Analytics and Machine Learning
2. Applying Deep Learning to Database Systems
3. Federated Data Management and Data Integration
Please find group information here.
Publications
(Supervised students are marked with *)
2022
Lixi Zhou*, Jiaqing Chen*, Amitabh Das*, Hong Min, Lei Yu, Ming Zhao, and Jia Zou. "Serving Deep Learning Models with Deduplication from Relational Databases." VLDB 2022. [14 pages][PDF]
Lixi Zhou*, Arindam Jain*, Zijie Wang*, Amitabh Das*, Yingzhen Yang, and Jia Zou, "Benchmark of DNN Model Search at Deployment Time." SSDBM'22. [12 pages] [PDF]
2021
Jia Zou, Amitabh Das*, Pratik Barhate*, Arun Iyengar, Binhang Yuan, Dimitrije Jankov, and Chris Jermaine. "Lachesis: Automated Generation of Persistent Partitionings for UDF-Centric Analytics." arXiv:2006.16529 [cs.DB] VLDB 2021 [14 pages]
Binhang Yuan, Dimitrije Jankov, Jia Zou, Yuxin Tang, Daniel Bourgeois, and Chris Jermaine. "Tensor Relational Algebra for Machine Learning System Design." arXiv:2009.00524 [cs.DB] VLDB 2021 [13 pages]
Jia Zou, "Using Deep Learning Models to Replace Large Materialized Views in Relational Database", CIDR 2021 (Abstract) [1 page]
2020
Jia Zou, Arun Iyengar, and Chris Jermaine. "Architecture of a distributed storage that combines file system, memory and computation in a single layer." The VLDB Journal 29(5) (2020): 1049-1073. [25 pages][PDF]
Dimitrije Jankov, Shangyu Luo, Binhang Yuan, Zhuhua Cai, Jia Zou, Chris Jermaine, Zekai J. Gao. Declarative recursive computation on an RDBMS, or, why you should use a database for distributed machine learning, SIGMOD Record, Volume 49 No. 1. [8 pages] [PDF] (Invited)
Zijie Wang*, Lixi Zhou*, Jia Zou. "Integration of Fast-Evolving Data Sources Using A Deep Learning Approach." SFDI 2020, workshop co-located with VLDB 2020 (Accepted) (14 pages)
Jia Zou, Ming Zhao, Juwei Shi and Chen Wang. "WATSON: A Workflow-based Data Storage Optimizer for Analytics." MSST 2020 (Accepted) (14 pages)
2019 and before
Dimitrije Jankov, Shangyu Luo, Binhang Yuan, Zhuhua Cai, Jia Zou, Chris Jermaine, Zekai J. Gao. Declarative recursive computation on an RDBMS, or, why you should use a database for distributed machine learning, VLDB 2019, PVLDB Volume 12 Issue 7. [14 pages] (PDF) (Honorable Mention, VLDB 2019 Best Paper Award runner-up, 2020 SIGMOD Research Highlight Award)
Jia Zou, Arun Iyengar, Chris Jermaine, Pangea: Monolithic Distributed Storage for Data Analytics, VLDB 2019, PVLDB Volume 12 Issue 6. [14 pages] (PDF)
Jia Zou, R Matthew Barnett, Tania Lorido-Botran, Shangyu Luo, Carlos Monroy, Sourav Sikdar, Kia Teymourian, Binhang Yuan, Chris Jermaine, PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development, SIGMOD 2018. [16 pages] (PDF)
Jia Zou, Juwei Shi, Tongping Liu, Zhao Cao, Chen Wang, Foreseer: Workload-aware Data Storage for MapReduce, ICDCS 2015. [2 pages]
Lanjun Wang, Oktie Hassanzadeh, Shuo Zhang, Juwei Shi, Limei Jiao, Jia Zou, Chen Wang, Schema Management for Document Stores, VLDB 2015, PVLDB Volume 8 Issue 9. [12 pages]
Juwei Shi, Jia Zou, Jiaheng Lu, Zhao Cao, Shiqiang Li, Chen Wang, MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs, VLDB 2014, PVLDB Volume 7 Issue 13. [12 pages]
Jia Zou, Gong Su, Arun Iyengar, Yu Yuan, Yi Ge, Design and Analysis of a Distributed Multi-leg Stock Trading System, ICDCS 2011. [12 pages]
Jia Zou, Jing Xiao, Rui Hou, Yanqi Wang, Frequent Instruction Sequential Pattern Mining in Hardware Sample Data, ICDM 2010. [6 pages]
Jia Zou, Zhiyong Liang, Yiqi Dai, Scalability Evaluation and Optimization of Multi-core SIP Proxy Server, ICPP 2008. [8 pages]
Jianguo Hao, Jia Zou, Yiqi Dai, A real-time payment scheme for SIP service based on hash chain, ICEBE 2008. [8 pages]
Jia Zou, Wei Xue, Zhiyong Liang, Yixin Zhao, Bo Yang and Ling Shao, SIP Parsing Offload: Design and Performance, GLOBECOM 2007. [6 pages]
Jia Zou, Yiqi Dai, Motivating and Modeling SIP Offload, ICCCN 2007. [6 pages]
Granted Patents
1. with Juwei Shi, Chen Wang, et al. Method and Apparatus for Generating Schema of Non-Relational Database. US Patent 10002142 B2, 2018
2. with Li Li, Juwei Shi, et al. Resource management in MapReduce architecture and architectural system. US Patent 9582334 B2, 2017
3. with Zhao Cao, Juwei Shi, et al. Scheduling and execution of tasks based on resource availability. US Patent 9495206 B2, 2016
4. with Heng Cao, Juwei Shi, et al. Determining location of a user of a mobile device. US Patent 9374800 B2, 2016
5. with Xiaotao Chang, Fei Chen, et al. Method and system for allocating FPGA resources. US Patent 9389915 B2, 2016
6. with Kun Wang, Tianyi Wang, et al. Data processing method, data query method in a database, and corresponding device. US Patent 9471612 B2, 2016
7. with Bo Yang, Juwei Shi, et al. Method and apparatus for processing database data in distributed database system. US Patent 10140351 B2, 2016
8. with Arun Iyengar, Su Gong, et al. Methods and systems for highly available coordinated transaction processing. US Patent 9146944 B2, 2015
9. with Stephen Heisig, Yanqi Wang, et al. Computer system performance analysis. US Patent 8639697 B2, 2014
10. with Arun Iyengar, Su Gong, et al. Systems and methods for multi-leg transaction processing. US Patent 8601479 B2, 2013
Courses
2025 Spring
Course Number | Course Title |
---|---|
CSE 599 | Thesis |
CSE 792 | Research |
CSE 795 | Continuing Registration |
CSE 799 | Dissertation |
CSE 590 | Reading and Conference |
CSE 790 | Reading and Conference |
CSE 584 | Internship |
CEN 580 | Practicum |
CSE 580 | Practicum |
CEN 792 | Research |
CEN 799 | Dissertation |
CEN 795 | Continuing Registration |
CSE 598 | Special Topics |
2024 Fall
Course Number | Course Title |
---|---|
CSE 595 | Continuing Registration |
CSE 599 | Thesis |
CSE 792 | Research |
CSE 795 | Continuing Registration |
CSE 499 | Individualized Instruction |
CSE 590 | Reading and Conference |
CSE 580 | Practicum |
CEN 792 | Research |
CEN 599 | Thesis |
CEN 580 | Practicum |
CEN 590 | Reading and Conference |
CEN 795 | Continuing Registration |
CSE 790 | Reading and Conference |
CSE 412 | Database Management |
DSE 792 | Research |
CSE 599 | Thesis |
CSE 412 | Database Management |
2024 Summer
Course Number | Course Title |
---|---|
CSE 584 | Internship |
CSE 595 | Continuing Registration |
CSE 795 | Continuing Registration |
CSE 792 | Research |
CSE 584 | Internship |
CEN 584 | Internship |
CEN 795 | Continuing Registration |
CEN 792 | Research |
DSE 792 | Research |
2024 Spring
Course Number | Course Title |
---|---|
CSE 599 | Thesis |
CSE 792 | Research |
CSE 795 | Continuing Registration |
CSE 799 | Dissertation |
CSE 590 | Reading and Conference |
CSE 790 | Reading and Conference |
CSE 584 | Internship |
CEN 580 | Practicum |
CSE 580 | Practicum |
CSE 792 | Research |
CEN 792 | Research |
CEN 799 | Dissertation |
CEN 795 | Continuing Registration |
CSE 598 | Special Topics |
DSE 792 | Research |
2023 Fall
Course Number | Course Title |
---|---|
CSE 595 | Continuing Registration |
CSE 599 | Thesis |
CSE 792 | Research |
CSE 795 | Continuing Registration |
CSE 499 | Individualized Instruction |
CSE 590 | Reading and Conference |
CSE 580 | Practicum |
CEN 792 | Research |
CEN 599 | Thesis |
CEN 580 | Practicum |
CEN 590 | Reading and Conference |
CEN 795 | Continuing Registration |
CSE 790 | Reading and Conference |
CSE 412 | Database Management |
DSE 792 | Research |
CSE 599 | Thesis |
CSE 412 | Database Management |
2023 Summer
Course Number | Course Title |
---|---|
CSE 595 | Continuing Registration |
CSE 584 | Internship |
CSE 792 | Research |
CSE 584 | Internship |
CEN 584 | Internship |
CEN 792 | Research |
2023 Spring
Course Number | Course Title |
---|---|
CSE 599 | Thesis |
CSE 792 | Research |
CSE 799 | Dissertation |
CSE 590 | Reading and Conference |
CSE 790 | Reading and Conference |
CEN 580 | Practicum |
CSE 580 | Practicum |
CEN 792 | Research |
CEN 799 | Dissertation |
CSE 598 | Special Topics |
2022 Fall
Course Number | Course Title |
---|---|
CSE 595 | Continuing Registration |
CSE 599 | Thesis |
CSE 792 | Research |
CSE 499 | Individualized Instruction |
CSE 590 | Reading and Conference |
CSE 580 | Practicum |
CEN 792 | Research |
CEN 599 | Thesis |
CSE 580 | Practicum |
CEN 580 | Practicum |
CEN 590 | Reading and Conference |
CSE 790 | Reading and Conference |
CSE 799 | Dissertation |
CSE 412 | Database Management |
2022 Summer
Course Number | Course Title |
---|---|
CSE 584 | Internship |
CEN 584 | Internship |
CEN 792 | Research |
2022 Spring
Course Number | Course Title |
---|---|
CSE 599 | Thesis |
CSE 792 | Research |
CSE 790 | Reading and Conference |
CEN 580 | Practicum |
CEN 792 | Research |
CSE 598 | Special Topics |
CSE 412 | Database Management |
2021 Fall
Course Number | Course Title |
---|---|
CSE 595 | Continuing Registration |
CSE 599 | Thesis |
CSE 792 | Research |
CSE 590 | Reading and Conference |
CSE 580 | Practicum |
CEN 792 | Research |
CEN 599 | Thesis |
CEN 580 | Practicum |
CEN 590 | Reading and Conference |
CSE 412 | Database Management |
2021 Summer
Course Number | Course Title |
---|---|
CSE 595 | Continuing Registration |
CEN 792 | Research |
2021 Spring
Course Number | Course Title |
---|---|
CSE 599 | Thesis |
CSE 792 | Research |
CEN 580 | Practicum |
CEN 792 | Research |
CSE 598 | Special Topics |
2020 Fall
Course Number | Course Title |
---|---|
CSE 599 | Thesis |
CSE 590 | Reading and Conference |
CSE 580 | Practicum |
CEN 792 | Research |
CEN 599 | Thesis |
CEN 580 | Practicum |
CEN 590 | Reading and Conference |
CSE 412 | Database Management |
2020 Summer
Course Number | Course Title |
---|---|
CEN 792 | Research |
2020 Spring
Course Number | Course Title |
---|---|
CSE 598 | Special Topics |
2023 Amazon Research Award ($70,000 + $50,000 Cloud credit)
2022 DOE SHI Fellowship ($58,800)
2022 NSF CAREER Award ($547,584)
2022 WSDM Outstanding Service Award
2021 IBM Global University Program Academic Award ($40,000)
2020 SIGMOD Research Highlight Award
2019 VLDB Best Paper Runner-up, Honorable Mention Award
- Program Committee Member of VLDB 2024
- Program Committee Member of the Advanced Data Science Track of KDD 2023
- New Researcher Symposium Co-Chair and Program Committee Member of SIGMOD 2023
- Program Committee Member of SSDBM 2023
- Program Committee Member of the Advanced Data Science Track of ECML/PKDD 2023
- Local Arrangement Chair of ACM WSDM 2022
- Program Committee Member of VLDB 2022
- Program Committee Member of SSDBM 2022
- Program Vice Chair of IEEE Big Data 2020
- Organization Committee Member of IEEE Service Hackathon 2020
- Program Committee Member of IEEE SmartDataServices 2020
- Program Committee Member of CIKM 2019
- Program Committee Member of IEEE Cluster 2018, 2019
- Program Committee Member of HotData I, the First International Workshop on Hot Topics in Big Data and Networking, in conjunction with ICCCN 2014
- Reviewer for IEEE Transactions on Parallel and Distributed Systems (TPDS), IEEE Transactions on Knowledge and Data Engineering (TKDE), Journal of Supercomputing, etc. (https://publons.com/researcher/1466438/jia-zou/)
- Co-lead of IBM GTO Topic (subtopic: Future of Data Management), 2011
- Technical Assistant to Director of IBM Research - China, 2011
- Researcher, IBM Research - China, 2008-2014