POLITECNICO DI BARI - Catalogo dei prodotti della Ricerca

Image mining consists of the procedures that allow to access, search and explore very large databases of data. Institutions like spatial agencies have to manage huge archives of Earth Observation (EO) images and need solutions to make data available to users from both the algorithmic and the infrastructural point of views. On the other side, users would need to explore the variety of images not just based on metadata, like time of acquisition or sensor parameters, but also by getting knowledge of their content. In this contribution, we investigate methodologies for content-based EO image retrieval via example-based queries. In particular, we present a procedure for the indexing of large-scale unstructured archives, built on top of a cluster analytics framework, Apache Spark. The procedure is based on a hierarchical and scalable implementation of a space partitioning algorithm and allows O(log n) response query times. Scalability analyses are conducted on polarimetric data from NASA/JPL archives, by using virtualized computing resources distributed over the Internet. In particular, the effects of the cluster size and of the hardware scale-up are demonstrated. The results also reveal the applicative potential of using on-demand cloud-based resources.

Optimised data structures for large scale content-based geo-indexing / Mascolo, L., Quartulli, M., Nico, G., Guccione, P., Olaizola, I.g.. - STAMPA. - (2015), pp. 1488-1491. (IEEE International Geoscience and Remote Sensing Symposium (IGARSS) Milano, Italy July 26-31, 2015) [10.1109/IGARSS.2015.7326061].

Optimised data structures for large scale content-based geo-indexing

Mascolo, L;Quartulli, M;Nico, G;Guccione, P;Olaizola, IG

2015

Abstract

Image mining consists of the procedures that allow to access, search and explore very large databases of data. Institutions like spatial agencies have to manage huge archives of Earth Observation (EO) images and need solutions to make data available to users from both the algorithmic and the infrastructural point of views. On the other side, users would need to explore the variety of images not just based on metadata, like time of acquisition or sensor parameters, but also by getting knowledge of their content. In this contribution, we investigate methodologies for content-based EO image retrieval via example-based queries. In particular, we present a procedure for the indexing of large-scale unstructured archives, built on top of a cluster analytics framework, Apache Spark. The procedure is based on a hierarchical and scalable implementation of a space partitioning algorithm and allows O(log n) response query times. Scalability analyses are conducted on polarimetric data from NASA/JPL archives, by using virtualized computing resources distributed over the Internet. In particular, the effects of the cluster size and of the hardware scale-up are demonstrated. The results also reveal the applicative potential of using on-demand cloud-based resources.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2015
			
	Titolo del convegno
	
				IEEE International Geoscience and Remote Sensing Symposium (IGARSS)
			
	Codice ISBN
	
				978-1-4799-7929-5
			
	Codice DOI
	
				https://dx.doi.org/10.1109/IGARSS.2015.7326061
			
	Citazione
	
				Optimised data structures for large scale content-based geo-indexing / Mascolo, L., Quartulli, M., Nico, G., Guccione, P., Olaizola, I.g.. - STAMPA. - (2015), pp. 1488-1491. (IEEE International Geoscience and Remote Sensing Symposium (IGARSS) Milano, Italy July 26-31, 2015) [10.1109/IGARSS.2015.7326061].
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/159358

Citazioni

0

0

social impact