Lempel–Ziv Factorization Using Less Time & Space

Gang Chen; Simon J. Puglisi; W. F. Smyth

doi:10.1007/s11786-007-0024-4

Source

Mathematics in Computer Science, 2008, Volume 1, Issue 4, pp. 605-623

Abstract

For 30 years the Lempel–Ziv factorization LZ_x of a string x = x[1..n] has been a fundamental data structure of string processing, especially valuable for string compression and for computing all the repetitions (runs) in x. Traditionally the standard method for computing LZ_x was based on Θ(n)-time (or, depending on the measure used, O(n log n)-time) processing of the suffix tree ST_x of x. Recently Abouelhoda et al. proposed an efficient Lempel–Ziv factorization algorithm based on an “enhanced” suffix array – that is, a suffix array SA_x together with supporting data structures, principally an “interval tree”. In this paper we introduce a collection of fast space-efficient algorithms for LZ factorization, also based on suffix arrays, that in theory as well as in many practical circumstances are superior to those previously proposed; one family out of this collection achieves true Θ(n)-time alphabet-independent processing in the worst case by avoiding tree structures altogether.
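To make the object LZ_x concrete, the following is a minimal sketch of a naive quadratic-time factorization. It is not one of the suffix-array algorithms the paper introduces, whose details are not given in this abstract. It assumes one common variant of the definition: each factor is either a letter that has not occurred before, or the longest prefix of the remaining text that also starts at some earlier position (overlap with the current position allowed). The function name `lz_factorize` and the 0-based output format are illustrative choices, not taken from the paper.

```python
def lz_factorize(x):
    """Naive LZ factorization of x (assumed self-referential variant).

    Returns a list of (factor, prev_pos) pairs, where prev_pos is the
    0-based start of an earlier occurrence of the factor, or None when
    the factor is a single letter not seen before.
    """
    n = len(x)
    factors = []
    i = 0
    while i < n:
        best_len, best_pos = 0, None
        # Try every earlier starting position j < i and extend the match.
        for j in range(i):
            k = 0
            while i + k < n and x[j + k] == x[i + k]:
                k += 1
            if k > best_len:
                best_len, best_pos = k, j
        if best_len == 0:
            # New letter: it forms a factor of length 1 on its own.
            factors.append((x[i], None))
            i += 1
        else:
            factors.append((x[i:i + best_len], best_pos))
            i += best_len
    return factors


print(lz_factorize("abab"))  # [('a', None), ('b', None), ('ab', 0)]
print(lz_factorize("aaaa"))  # [('a', None), ('aaa', 0)]
```

The inner scan over all earlier positions is what makes this sketch quadratic in the worst case; structures such as the suffix tree or suffix array mentioned in the abstract are used to avoid that scan.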

Identifiers

Journal ISSN: 1661-8270
Journal e-ISSN: 1661-8289
DOI: 10.1007/s11786-007-0024-4

Authors

Gang Chen

McMaster University, Department of Computing & Software, Hamilton, Canada

Simon J. Puglisi

RMIT University, School of Computer Science & Information Technology, Melbourne, Australia

W. F. Smyth

Curtin University of Technology, Digital Ecosystems & Business Intelligence Institute, Perth, Australia
McMaster University, Algorithms Research Group, Department of Computing & Software, Hamilton, Canada

Keywords

Lempel–Ziv factorization; suffix array; suffix tree; LZ factorization

Additional information

Publication languages: English

Data set: Springer

Publisher

Springer International Publishing
