Date: 2024/05/14 10:30-11:30
Location: CSIE R210
Speaker: Prof. Xian-He Sun, (孫賢和), Illinois Tech
Host: Prof. Pang-Feng Liu
Abstract:
Big data, AI, and other data-driven applications generate massive amounts of data and create new data-discovery demands. These applications have fundamentally transformed the computing landscape, making it increasingly data-centric and data-driven. However, the performance improvement of disk-based storage systems has lagged behind that of computing and memory, resulting in a significant I/O performance gap. Simultaneously, data discovery necessitates new forms of data, further straining existing memory and storage systems. In this talk, we first address the challenges and solutions related to I/O systems for high-performance computing (HPC). We then delve into the difficulties and potential solutions for managing active data in a distributed cloud environment. Lastly, we discuss the challenge of handling metadata—the data that manages other data—leading to a broader conversation about new challenges in representing information in learning and data discovery. To illustrate the state of the art, for HPC I/O systems, we showcase Hermes, an intelligent, multi-tiered, dynamic, and distributed I/O buffering system. Hermes has been released as open source under the widely used HDF5 library. Additionally, we touch upon active data issues through our NSF CSSI ChronoLog project, and finally, we explore enriched metadata for scientific and knowledge insights via introducing our DoE ASCR project, Coeus. We also reflect on the challenges and opportunities presented by the AI and big data era.
Biography:
Dr. Xian-He Sun is a University Distinguished Professor, the Ron Hochsprung Endowed Chair of Computer Science, and the director of the Gnosis Research Center for accelerating data-driven discovery at the Illinois Institute of Technology (Illinois Tech). Before joining Illinois Tech, he worked at DoE Ames National Laboratory, at ICASE, NASA Langley Research Center, at Louisiana State University, Baton Rouge, and was an ASEE fellow at Navy Research Laboratories. Dr. Sun is an IEEE fellow and is known for his memory-bounded speedup model, also called Sun-Ni’s Law, for scalable computing. His research interests include high-performance data processing, memory and I/O systems, and performance evaluation and optimization. He has over 300 publications and 6 patents in these areas and is currently leading multiple large software development projects in HPC I/O systems. Dr. Sun is the Editor-in-Chief of the IEEE Transactions on Parallel and Distributed Systems, and a former department chair of the Computer Science Department at Illinois Tech. He received the Golden Core award from IEEE CS society in 2017, the ACM Karsten Schwan Best Paper Award from ACM HPDC in 2019, the Ron Hocksprung Endowed Chair from Illinois Tech in 2020, and the first prize best paper award from ACM/IEEE CCGrid in 2021. More information about Dr. Sun can be found at his web site www.cs.iit.edu/~sun/.