Yuanjian Liu

Hi there! I am a fourth-year Ph.D. student in Computer Science at the University of Chicago, interested in high-performance computing and autonomous laboratory research. I am a member of Globus Labs where I am co-advised by Ian Foster and Kyle Chard. I completed my Bachelors in Computer Science at the Zhejiang University and previously worked at Google and Alibaba.

RESEARCH

Autonomous Laboratory:

The science discovery can be slowed down by tedious assembly and tricky mannual operations. The autonomous laboratory project aims to replace the tasks traditionally performed by human researchers by automated systems and intelligent algorithms. I currently work with Ian Foster and Chibueze Amanchukwu to build an autonomous laboratory to manufacture coin-cell batteries. We propose the development of generative AI models to identify candiate electrolyte solvents with desired properties (high ionic conductivity, oxidative stability, and Coulombic efficiencies) and the deployment of self-driving labs for electrolyte synthesis and battery fabrication and testing.

SZ3 Lossy Compression:

Modern simulations (e.g. particle simulation, climate simulation) can produce huge amount of data every day. Lossy compression can significantly reduce the data size while preserving important information for analysis. I work with Sheng Di in the compression project. We explore lossy compression on scientific datasets, especially the datasets consisting of floating-point numbers. The data files are usually planar (e.g. CESM dataset 1800x3600) or cubic (e.g. Nyx dataset 512x512x512). Some extremely large single file can be over 900 GB (e.g. Turbulent Channel Flow 10240x7680x1536). Other datasets may contain thousands of smaller files. The goal of this project is to provide a friendly program for users to compress, transfer and store these huge datasets.

PUBLICATIONS

Optimizing Scientific Data Transfer on Globus with Error-Bounded Lossy Compression [ ]
Yuanjian Liu, Sheng Di, Kyle Chard, Ian Foster, Franck Cappello
ICDCS 2023
TLDR | URL | Code | Slides | BibTex | PDF
FastqZip: An Improved Reference-Based Genome Sequence Lossy Compression Framework [ ]
Yuanjian Liu, Huihao Luo, Zhijun Han, Yao Hu, Yehui Yang, Kyle Chard, Sheng Di, Ian Foster, Jiesheng Wu
preprint
TLDR | PDF
Optimizing Error-Bounded Lossy Compression for Scientific Data With Diverse Constraints [ ]
Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello
TPDS 2022
TLDR | URL | Code | Slides | BibTex | PDF
Optimizing Multi-Range based Error-Bounded Lossy Compression for Scientific Datasets [ ]
Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello
HiPC 2021
TLDR | URL | BibTex | PDF
Understanding Effectiveness of Multi-Error-Bounded Lossy Compression for Preserving Ranges of Interest in Scientific Analysis [ ]
Yuanjian Liu, Sheng Di, Kai Zhao, Sian Jin, Cheng Wang, Kyle Chard, Dingwen Tao, Ian Foster, Franck Cappello
DRBSD-7 2021
TLDR | URL | BibTex | PDF

© Yuanjian Liu. Some rights reserved.

Stay passionate about your life because it is awesome!