The Wavelet Tree data structure introduced in Grossi, Gupta, and Vitter [11] is a space-efficient technique for rank and select queries that generalizes from binary symbols to an arbitrary multisymbol alphabet. Over the last two decades, it has become a pivotal tool in modern full-text indexing and data compression because of its properties and capabilities in compressing and indexing data, with many applications to information retrieval, genome analysis, data mining, and web search. In this paper, we survey the fascinating history and impact of Wavelet Trees; no doubt many more developments are yet to come. Our survey borrows some content from the authors' earlier works. This paper is divided into two parts: The first part gives a brief history of Wavelet Trees, including its varieties and practical implementations, which appears in the Festschrift dedicated to Roberto Grossi [4]; the second part (this one) deals with Wavelet Tree-based text indexing and is included in the Festschrift dedicated to Giovanni Manzini.

Ferragina, P., Giancarlo, R., Grossi, R., Rosone, G., Venturini, R., Vitter, J.S. (2025). Wavelet Tree, Part II: Text Indexing. In P. Ferragina, T. Gagie, G. Navarro (a cura di), The Expanding World of Compressed Data: A Festschrift for Giovanni Manzini's 60th Birthday. Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing [10.4230/OASIcs.Manzini.2025.4].

Wavelet Tree, Part II: Text Indexing

Giancarlo R.;
2025-08-01

Abstract

The Wavelet Tree data structure introduced in Grossi, Gupta, and Vitter [11] is a space-efficient technique for rank and select queries that generalizes from binary symbols to an arbitrary multisymbol alphabet. Over the last two decades, it has become a pivotal tool in modern full-text indexing and data compression because of its properties and capabilities in compressing and indexing data, with many applications to information retrieval, genome analysis, data mining, and web search. In this paper, we survey the fascinating history and impact of Wavelet Trees; no doubt many more developments are yet to come. Our survey borrows some content from the authors' earlier works. This paper is divided into two parts: The first part gives a brief history of Wavelet Trees, including its varieties and practical implementations, which appears in the Festschrift dedicated to Roberto Grossi [4]; the second part (this one) deals with Wavelet Tree-based text indexing and is included in the Festschrift dedicated to Giovanni Manzini.
ago-2025
Settore INFO-01/A - Informatica
978-3-95977-390-4
Ferragina, P., Giancarlo, R., Grossi, R., Rosone, G., Venturini, R., Vitter, J.S. (2025). Wavelet Tree, Part II: Text Indexing. In P. Ferragina, T. Gagie, G. Navarro (a cura di), The Expanding World of Compressed Data: A Festschrift for Giovanni Manzini's 60th Birthday. Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing [10.4230/OASIcs.Manzini.2025.4].
File in questo prodotto:
File Dimensione Formato  
OASIcs.Manzini.4.pdf

accesso aperto

Tipologia: Versione Editoriale
Dimensione 856.79 kB
Formato Adobe PDF
856.79 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/691780
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact