Conference In: scopus, orcid

On the use of suffix arrays for memory-efficient Lempel-Ziv data compression

Data Compression Conference Proceedings

Ferreira, A.; Oliveira, A.; Figueiredo, M.2009IEEE

Key information

Authors:

Published in

12/01/2009

Abstract

The Lempel-Ziv 77 (LZ77) and LZ-Storer-Szymanski (LZSS) text compression algorithms use a sliding window over the sequence of symbols, with two sub-windows: the dictionary (symbols already encoded) and the look-ahead-buffer (LAB) (symbols not yet encoded). Binary search trees and suffix trees (ST) have been used to speedup the search of the LAB over the dictionary, at the expense of high memory usage [1]. A suffix array (SA) is a simpler, more compact data structure which uses (much) less memory [2,3] to hold the same information. The SA for a length m string is an array of integers ([1], ...[k], ...a[m]) that stores the lexicographic order of suffix k of the string; sub-string searching, as used in LZ77/LZSS, is done by searching the SA.

Publication details

Publisher

IEEE

Title of the publication container

Data Compression Conference Proceedings

Location of the conference

Snowbird, UT, USA

First page or article number

444

Last page

444

Volume

48

ISBN

9780769535920

Fields of Science and Technology (FOS)

computer-and-information-sciences - Computer and information sciences

Publication language (ISO code)

eng - English

Rights type:

Only metadata available