Institutional Research Archive of the Università degli Studi di Palermo

Bernardini G., Chen H., Fici G., Loukides G., Pissis S.P. (2021). Reverse-Safe Text Indexing. ACM JOURNAL OF EXPERIMENTAL ALGORITHMICS, 26, 1-26 [10.1145/3461698].

Reverse-Safe Text Indexing

Bernardini G.; Chen H.; Fici G.; Loukides G.; Pissis S.P.

2021-01-01

Abstract

We introduce the notion of reverse-safe data structures. These are data structures that prevent the reconstruction of the data they encode (i.e., they cannot be easily reversed). A data structure D is called z-reverse-safe when there exist at least z datasets with the same set of answers as the ones stored by D. The main challenge is to ensure that D stores as many answers to useful queries as possible, is constructed efficiently, and has size close to the size of the original dataset it encodes. Given a text of length n and an integer z, we propose an algorithm that constructs a z-reverse-safe data structure (z-RSDS) that has size O(n) and answers decision and counting pattern matching queries of length at most d optimally, where d is maximal for any such z-RSDS. The construction algorithm takes O(n^ω log d) time, where ω is the matrix multiplication exponent. We show that, despite the n^ω factor, our engineered implementation takes only a few minutes to finish for million-letter texts. We also show that plugging our method in data analysis applications gives insignificant or no data utility loss. Furthermore, we show how our technique can be extended to support applications under realistic adversary models. Finally, we show a z-RSDS for decision pattern matching queries, whose size can be sublinear in n. A preliminary version of this article appeared in ALENEX 2020.
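
The z-reverse-safe condition in the abstract can be illustrated on toy inputs with a brute-force sketch. This is not the article's construction algorithm: it enumerates every candidate string and is exponential in the text length, so it only works for very short texts. It also assumes that the candidate "datasets" are strings of the same length over the same alphabet as the text, and that the "answers" are the occurrence counts of all patterns of length at most d. The function names `substring_counts`, `consistent_texts`, and `max_safe_d` are hypothetical and chosen for illustration.

```python
from collections import Counter
from itertools import product


def substring_counts(text: str, d: int) -> Counter:
    """Occurrence count of every substring of length 1..d in text."""
    counts = Counter()
    for k in range(1, d + 1):
        for i in range(len(text) - k + 1):
            counts[text[i:i + k]] += 1
    return counts


def consistent_texts(text: str, d: int) -> int:
    """Brute-force count of strings that give the same answers as text.

    Assumption: candidates have the same length and alphabet as text.
    """
    target = substring_counts(text, d)
    alphabet = sorted(set(text))
    return sum(
        1
        for cand in product(alphabet, repeat=len(text))
        if substring_counts("".join(cand), d) == target
    )


def max_safe_d(text: str, z: int) -> int:
    """Largest d such that at least z strings share all answers (0 if none)."""
    best = 0
    for d in range(1, len(text) + 1):
        if consistent_texts(text, d) >= z:
            best = d
        else:
            break  # a larger d only adds constraints, so the count cannot grow
    return best


if __name__ == "__main__":
    # Under the stated assumptions, "abaab" and "aabab" agree on all
    # counts of patterns of length at most 2.
    print(consistent_texts("abaab", 2))
    print(max_safe_d("abaab", 2))
```

In this toy setting, raising z forces a smaller d, which mirrors the trade-off in the abstract between privacy (more consistent datasets) and utility (longer answerable patterns).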

Date: 2021
Journal title: ACM JOURNAL OF EXPERIMENTAL ALGORITHMICS
DOI: https://dx.doi.org/10.1145/3461698
Publisher URL (open access where possible): https://dl.acm.org/doi/10.1145/3461698
Citation: Bernardini G., Chen H., Fici G., Loukides G., Pissis S.P. (2021). Reverse-Safe Text Indexing. ACM JOURNAL OF EXPERIMENTAL ALGORITHMICS, 26, 1-26 [10.1145/3461698].
Appears in types: 1.01 Journal article

Files in this item:

File	Access	Type	Size	Format
Reverse-Safe Text Indexing.pdf	Open access	Publisher's version	1.55 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10447/565538
