Skip to main content

Showing 1–28 of 28 results for author: Lee, P P C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.17963  [pdf, other

    cs.DC

    The Design and Implementation of a High-Performance Log-Structured RAID System for ZNS SSDs

    Authors: Jinhong Li, Qiuping Wang, Shujie Han, Patrick P. C. Lee

    Abstract: Zoned Namespace (ZNS) defines a new abstraction for host software to flexibly manage storage in flash-based SSDs as append-only zones. It also provides a Zone Append primitive to further boost the write performance of ZNS SSDs by exploiting intra-zone parallelism. However, making Zone Append effective for reliable and scalable storage, in the form of a RAID array of multiple ZNS SSDs, is non-trivi… ▽ More

    Submitted 6 February, 2025; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 35 pages

    ACM Class: C.4; C.5.0

  2. arXiv:2205.14555  [pdf, other

    cs.IT

    Two New Piggybacking Designs with Lower Repair Bandwidth

    Authors: Zhengyi Jiang, Hanxu Hou, Yunghsiang S. Han, Patrick P. C. Lee, Bo Bai, Zhongyi Huang

    Abstract: Piggybacking codes are a special class of MDS array codes that can achieve small repair bandwidth with small sub-packetization by first creating some instances of an $(n,k)$ MDS code, such as a Reed-Solomon (RS) code, and then designing the piggyback function. In this paper, we propose a new piggybacking coding design which designs the piggyback function over some instances of both $(n,k)$ MDS cod… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

  3. arXiv:2205.11753  [pdf, other

    cs.PF

    Efficient LSM-Tree Key-Value Data Management on Hybrid SSD/HDD Zoned Storage

    Authors: Jinhong Li, Qiuping Wang, Patrick P. C. Lee

    Abstract: Zoned storage devices, such as zoned namespace (ZNS) solid-state drives (SSDs) and host-managed shingled magnetic recording (HM-SMR) hard-disk drives (HDDs), expose interfaces for host-level applications to support fine-grained, high-performance storage management. Combining ZNS SSDs and HM-SMR HDDs into a unified hybrid storage system is a natural direction to scale zoned storage at low cost, yet… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  4. arXiv:2203.10766  [pdf, other

    cs.DC

    An In-Depth Comparative Analysis of Cloud Block Storage Workloads: Findings and Implications

    Authors: Jinhong Li, Qiuping Wang, Patrick P. C. Lee, Chao Shi

    Abstract: Cloud block storage systems support diverse types of applications in modern cloud services. Characterizing their I/O activities is critical for guiding better system designs and optimizations. In this paper, we present an in-depth comparative analysis of production cloud block storage workloads through the block-level I/O traces of billions of I/O requests collected from two production systems, Al… ▽ More

    Submitted 19 November, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: 30 pages. Accepted by ACM Transactions on Storage

  5. arXiv:2110.04785  [pdf, ps, other

    cs.IT

    A Generalization of Array Codes with Local Properties and Efficient Encoding/Decoding

    Authors: Hanxu Hou, Yunghsiang S. Han, Patrick P. C. Lee, You Wu, Guojun Han, Mario Blaum

    Abstract: A maximum distance separable (MDS) array code is composed of $m\times (k+r)$ arrays such that any $k$ out of $k+r$ columns suffice to retrieve all the information symbols. Expanded-Blaum-Roth (EBR) codes and Expanded-Independent-Parity (EIP) codes are two classes of MDS array codes that can repair any one symbol in a column by locally accessing some other symbols within the column, where the numbe… ▽ More

    Submitted 12 September, 2022; v1 submitted 10 October, 2021; originally announced October 2021.

  6. arXiv:2107.05954  [pdf, other

    cs.NI

    MVPipe: Enabling Lightweight Updates and Fast Convergence in Hierarchical Heavy Hitter Detection

    Authors: Lu Tang, Qun Huang, Patrick P. C. Lee

    Abstract: Finding hierarchical heavy hitters (HHHs) (i.e., hierarchical aggregates with exceptionally huge amounts of traffic) is critical to network management, yet it is often challenged by the requirements of fast packet processing, real-time and accurate detection, as well as resource efficiency. Existing HHH detection schemes either incur expensive packet updates for multiple aggregation levels in the… ▽ More

    Submitted 28 June, 2023; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: 14 pages. Accepted by IEEE/ACM Transactions on Networking

  7. arXiv:2104.12425  [pdf, other

    cs.DC

    Separating Data via Block Invalidation Time Inference for Write Amplification Reduction in Log-Structured Storage

    Authors: Qiuping Wang, Jinhong Li, Patrick P. C. Lee, Tao Ouyang, Chao Shi, Lilong Huang

    Abstract: Log-structured storage has been widely deployed in various domains of storage systems, yet its garbage collection incurs write amplification (WA) due to the rewrites of live data. We show that there exists an optimal data placement scheme that minimizes WA using the future knowledge of block invalidation time (BIT) of each written block, yet it is infeasible to realize in practice. We propose a no… ▽ More

    Submitted 10 February, 2022; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: 19 pages. Accepted by the 20th USENIX Conference on File and Storage Technologies (FAST '22)

  8. arXiv:1912.09722  [pdf, other

    cs.LG cs.DC stat.ML

    Robust Data Preprocessing for Machine-Learning-Based Disk Failure Prediction in Cloud Production Environments

    Authors: Shujie Han, Jun Wu, Erci Xu, Cheng He, Patrick P. C. Lee, Yi Qiang, Qixing Zheng, Tao Huang, Zixi Huang, Rui Li

    Abstract: To provide proactive fault tolerance for modern cloud data centers, extensive studies have proposed machine learning (ML) approaches to predict imminent disk failures for early remedy and evaluated their approaches directly on public datasets (e.g., Backblaze SMART logs). However, in real-world production environments, the data quality is imperfect (e.g., inaccurate labeling, missing data samples,… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 12 pages, 9 figures

  9. arXiv:1910.10441  [pdf, other

    cs.NI

    A Fast and Compact Invertible Sketch for Network-Wide Heavy Flow Detection

    Authors: Lu Tang, Qun Huang, Patrick P. C. Lee

    Abstract: Fast detection of heavy flows (e.g., heavy hitters and heavy changers) in massive network traffic is challenging due to the stringent requirements of fast packet processing and limited resource availability. Invertible sketches are summary data structures that can recover heavy flows with small memory footprints and bounded errors, yet existing invertible sketches incur high memory access overhead… ▽ More

    Submitted 22 July, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: 15 pages. Accepted by IEEE/ACM Transactions on Networking

  10. arXiv:1908.01527  [pdf, other

    cs.DC

    Repair Pipelining for Erasure-Coded Storage: Algorithms and Evaluation

    Authors: Xiaolu Li, Zuoru Yang, Jinhong Li, Runhui Li, Patrick P. C. Lee, Qun Huang, Yuchong Hu

    Abstract: We propose repair pipelining, a technique that speeds up the repair performance in general erasure-coded storage. By carefully scheduling the repair of failed data in small-size units across storage nodes in a pipelined manner, repair pipelining reduces the single-block repair time to approximately the same as the normal read time for a single block in homogeneous environments. We further design d… ▽ More

    Submitted 20 November, 2020; v1 submitted 5 August, 2019; originally announced August 2019.

    Comments: 28 pages. Accepted by ACM Transactions on Storage

  11. arXiv:1907.08938  [pdf, other

    cs.IT

    Multi-Layer Transformed MDS Codes with Optimal Repair Access and Low Sub-Packetization

    Authors: Hanxu Hou, Patrick P. C. Lee, Yunghsiang S. Han

    Abstract: An $(n,k)$ maximum distance separable (MDS) code has optimal repair access if the minimum number of symbols accessed from $d$ surviving nodes is achieved, where $k+1\le d\le n-1$. Existing results show that the sub-packetization $α$ of an $(n,k,d)$ high code rate (i.e., $k/n>0.5$) MDS code with optimal repair access is at least $(d-k+1)^{\lceil\frac{n}{d-k+1}\rceil}$. In this paper, we propose a c… ▽ More

    Submitted 22 July, 2019; v1 submitted 21 July, 2019; originally announced July 2019.

  12. arXiv:1904.05736  [pdf, other

    cs.CR cs.DC

    Information Leakage in Encrypted Deduplication via Frequency Analysis: Attacks and Defenses

    Authors: Jingwei Li, Patrick P. C. Lee, Chufeng Tan, Chuan Qin, Xiaosong Zhang

    Abstract: Encrypted deduplication combines encryption and deduplication to simultaneously achieve both data security and storage efficiency. State-of-the-art encrypted deduplication systems mainly build on deterministic encryption to preserve deduplication effectiveness. However, such deterministic encryption reveals the underlying frequency distribution of the original plaintext chunks. This allows an adve… ▽ More

    Submitted 9 October, 2019; v1 submitted 11 April, 2019; originally announced April 2019.

    Comments: 31 pages, Accepted by ACM Transactions on Storage

  13. arXiv:1811.10000  [pdf, other

    cs.DB

    Enabling Efficient Updates in KV Storage via Hashing: Design and Performance Evaluation

    Authors: Yongkun Li, Helen H. W. Chan, Patrick P. C. Lee, Yinlong Xu

    Abstract: Persistent key-value (KV) stores mostly build on the Log-Structured Merge (LSM) tree for high write performance, yet the LSM-tree suffers from the inherently high I/O amplification. KV separation mitigates I/O amplification by storing only keys in the LSM-tree and values in separate storage. However, the current KV separation design remains inefficient under update-intensive workloads due to its h… ▽ More

    Submitted 17 June, 2019; v1 submitted 25 November, 2018; originally announced November 2018.

    Comments: 28 pages. Accepted by ACM Transactions on Storage

  14. arXiv:1811.04570  [pdf, other

    cs.DC

    On the Performance and Convergence of Distributed Stream Processing via Approximate Fault Tolerance

    Authors: Zhinan Cheng, Qun Huang, Patrick P. C. Lee

    Abstract: Fault tolerance is critical for distributed stream processing systems, yet achieving error-free fault tolerance often incurs substantial performance overhead. We present AF-Stream, a distributed stream processing system that addresses the trade-off between performance and accuracy in fault tolerance. AF-Stream builds on a notion called approximate fault tolerance, whose idea is to mitigate backup… ▽ More

    Submitted 12 August, 2019; v1 submitted 12 November, 2018; originally announced November 2018.

    Comments: 25 pages. Accepted by The VLDB Journal

  15. arXiv:1809.04380  [pdf, ps, other

    cs.IT

    Binary MDS Array Codes with Optimal Repair

    Authors: Hanxu Hou, Patrick P. C. Lee

    Abstract: Consider a binary maximum distance separable (MDS) array code composed of an $m\times (k+r)$ array of bits with $k$ information columns and $r$ parity columns, such that any $k$ out of $k+r$ columns suffice to reconstruct the $k$ information columns. Our goal is to provide {\em optimal repair access} for binary MDS array codes, meaning that the bandwidth triggered to repair any single failed infor… ▽ More

    Submitted 28 August, 2019; v1 submitted 12 September, 2018; originally announced September 2018.

  16. arXiv:1802.07891  [pdf, other

    cs.IT

    A New Design of Binary MDS Array Codes with Asymptotically Weak-Optimal Repair

    Authors: Hanxu Hou, Yunghsiang Han, Patrick P. C. Lee, Yuchong Hu, Hui Li

    Abstract: Binary maximum distance separable (MDS) array codes are a special class of erasure codes for distributed storage that not only provide fault tolerance with minimum storage redundancy but also achieve low computational complexity. They are constructed by encoding $k$ information columns into $r$ parity columns, in which each element in a column is a bit, such that any $k$ out of the $k+r$ columns s… ▽ More

    Submitted 20 June, 2019; v1 submitted 21 February, 2018; originally announced February 2018.

  17. arXiv:1802.04031  [pdf, other

    cs.IT

    Rack-Aware Regenerating Codes for Data Centers

    Authors: Hanxu Hou, Patrick P. C. Lee, Kenneth W. Shum, Yuchong Hu

    Abstract: Erasure coding is widely used for massive storage in data centers to achieve high fault tolerance and low storage redundancy. Since the cross-rack communication cost is often high, it is critical to design erasure codes that minimize the cross-rack repair bandwidth during failure repair. In this paper, we analyze the optimal trade-off between storage redundancy and cross-rack repair bandwidth spec… ▽ More

    Submitted 25 February, 2019; v1 submitted 12 February, 2018; originally announced February 2018.

  18. arXiv:1704.03696  [pdf, other

    cs.DC

    Optimal Repair Layering for Erasure-Coded Data Centers: From Theory to Practice

    Authors: Yuchong Hu, Xiaolu Li, Mi Zhang, Patrick P. C. Lee, Xiaoyang Zhang, Pan Zhou, Dan Feng

    Abstract: Repair performance in hierarchical data centers is often bottlenecked by cross-rack network transfer. Recent theoretical results show that the cross-rack repair traffic can be minimized through repair layering, whose idea is to partition a repair operation into inner-rack and cross-rack layers. However, how repair layering should be implemented and deployed in practice remains an open issue. In th… ▽ More

    Submitted 15 September, 2017; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: 24 pages. Accepted by ACM Transactions on Storage

  19. arXiv:1701.08084  [pdf, ps, other

    cs.DB cs.DC

    Erasure Coding for Small Objects in In-Memory KV Storage

    Authors: Matt M. T. Yiu, Helen H. W. Chan, Patrick P. C. Lee

    Abstract: We present MemEC, an erasure-coding-based in-memory key-value (KV) store that achieves high availability and fast recovery while keeping low data redundancy across storage servers. MemEC is specifically designed for workloads dominated by small objects. By encoding objects in entirety, MemEC is shown to incur 60% less storage redundancy for small objects than existing replication- and erasure-codi… ▽ More

    Submitted 21 May, 2017; v1 submitted 27 January, 2017; originally announced January 2017.

    Comments: Accepted by SYSTOR 2017

  20. arXiv:1607.08388  [pdf, other

    cs.DC cs.CR

    The Design and Implementation of a Rekeying-aware Encrypted Deduplication Storage System

    Authors: Chuan Qin, Jingwei Li, Patrick P. C. Lee

    Abstract: Rekeying refers to an operation of replacing an existing key with a new key for encryption. It renews security protection, so as to protect against key compromise and enable dynamic access control in cryptographic storage. However, it is non-trivial to realize efficient rekeying in encrypted deduplication storage systems, which use deterministic content-derived encryption keys to allow deduplicati… ▽ More

    Submitted 19 December, 2016; v1 submitted 28 July, 2016; originally announced July 2016.

    Journal ref: ACM Transactions on Storage (2017)

  21. arXiv:1502.05110  [pdf, other

    cs.CR cs.DC

    CDStore: Toward Reliable, Secure, and Cost-Efficient Cloud Storage via Convergent Dispersal

    Authors: Mingqiang Li, Chuan Qin, Patrick P. C. Lee

    Abstract: We present CDStore, which disperses users' backup data across multiple clouds and provides a unified multi-cloud storage solution with reliability, security, and cost-efficiency guarantees. CDStore builds on an augmented secret sharing scheme called convergent dispersal, which supports deduplication by using deterministic content-derived hashes as inputs to secret sharing. We present the design of… ▽ More

    Submitted 29 May, 2015; v1 submitted 17 February, 2015; originally announced February 2015.

  22. arXiv:1406.5282  [pdf, other

    cs.IT

    STAIR Codes: A General Family of Erasure Codes for Tolerating Device and Sector Failures

    Authors: Mingqiang Li, Patrick P. C. Lee

    Abstract: Practical storage systems often adopt erasure codes to tolerate device failures and sector failures, both of which are prevalent in the field. However, traditional erasure codes employ device-level redundancy to protect against sector failures, and hence incur significant space overhead. Recent sector-disk (SD) codes are available only for limited configurations. By making a relaxed but practical… ▽ More

    Submitted 23 June, 2014; v1 submitted 20 June, 2014; originally announced June 2014.

    Comments: An earlier version of this work was presented at USENIX FAST '14. This extended version presents new reliability analysis for STAIR codes and is submitted for journal review

    ACM Class: B.3.2; B.8.1; C.4; D.4.2; E.4

  23. arXiv:1405.5661  [pdf, ps, other

    cs.DC cs.DB

    Efficient Hybrid Inline and Out-of-line Deduplication for Backup Storage

    Authors: Yan Kit Li, Min Xu, Chun Ho Ng, Patrick P. C. Lee

    Abstract: Backup storage systems often remove redundancy across backups via inline deduplication, which works by referring duplicate chunks of the latest backup to those of existing backups. However, inline deduplication degrades restore performance of the latest backup due to fragmentation, and complicates deletion of ex- pired backups due to the sharing of data chunks. While out-of-line deduplication addr… ▽ More

    Submitted 22 May, 2014; originally announced May 2014.

    ACM Class: D.4.2; D.5.1

  24. arXiv:1304.1863  [pdf, ps, other

    cs.PF cs.DC

    Stochastic Analysis on RAID Reliability for Solid-State Drives

    Authors: Yongkun Li, Patrick P. C. Lee, John C. S. Lui

    Abstract: Solid-state drives (SSDs) have been widely deployed in desktops and data centers. However, SSDs suffer from bit errors, and the bit error rate is time dependent since it increases as an SSD wears down. Traditional storage systems mainly use parity-based RAID to provide reliability guarantees by striping redundancy across multiple devices, but the effectiveness of RAID in SSDs remains debatable as… ▽ More

    Submitted 6 April, 2013; originally announced April 2013.

    Comments: 12 pages

  25. arXiv:1303.4816  [pdf, ps, other

    cs.PF

    Stochastic Modeling of Large-Scale Solid-State Storage Systems: Analysis, Design Tradeoffs and Optimization

    Authors: Yongkun Li, Patrick P. C. Lee, John C. S. Lui

    Abstract: Solid state drives (SSDs) have seen wide deployment in mobiles, desktops, and data centers due to their high I/O performance and low energy consumption. As SSDs write data out-of-place, garbage collection (GC) is required to erase and reclaim space with invalid data. However, GC poses additional writes that hinder the I/O performance, while SSD blocks can only endure a finite number of erasures. T… ▽ More

    Submitted 20 March, 2013; v1 submitted 19 March, 2013; originally announced March 2013.

    Comments: 14 pages, Sigmetrics 2013

  26. arXiv:1302.3344  [pdf, ps, other

    cs.DC

    CORE: Augmenting Regenerating-Coding-Based Recovery for Single and Concurrent Failures in Distributed Storage Systems

    Authors: Runhui Li, Jian Lin, Patrick P. C. Lee

    Abstract: Data availability is critical in distributed storage systems, especially when node failures are prevalent in real life. A key requirement is to minimize the amount of data transferred among nodes when recovering the lost or unavailable data of failed nodes. This paper explores recovery solutions based on regenerating codes, which are shown to provide fault-tolerant storage and minimum recovery ban… ▽ More

    Submitted 5 June, 2013; v1 submitted 14 February, 2013; originally announced February 2013.

    Comments: 25 pages

  27. arXiv:1302.0621  [pdf, ps, other

    cs.DC cs.OS

    RevDedup: A Reverse Deduplication Storage System Optimized for Reads to Latest Backups

    Authors: Chun-Ho Ng, Patrick P. C. Lee

    Abstract: Scaling up the backup storage for an ever-increasing volume of virtual machine (VM) images is a critical issue in virtualization environments. While deduplication is known to effectively eliminate duplicates for VM image storage, it also introduces fragmentation that will degrade read performance. We propose RevDedup, a deduplication system that optimizes reads to latest VM image backups using an… ▽ More

    Submitted 27 June, 2013; v1 submitted 4 February, 2013; originally announced February 2013.

    Comments: A 7-page version appeared in APSys'13

  28. arXiv:1208.2787  [pdf, ps, other

    cs.IT

    Analysis and Construction of Functional Regenerating Codes with Uncoded Repair for Distributed Storage Systems

    Authors: Yuchong Hu, Patrick P. C. Lee, Kenneth W. Shum

    Abstract: Modern distributed storage systems apply redundancy coding techniques to stored data. One form of redundancy is based on regenerating codes, which can minimize the repair bandwidth, i.e., the amount of data transferred when repairing a failed storage node. Existing regenerating codes mainly require surviving storage nodes encode data during repair. In this paper, we study functional minimum storag… ▽ More

    Submitted 21 January, 2013; v1 submitted 14 August, 2012; originally announced August 2012.

    Comments: 9 pages; IEEE INFOCOM (2013)