The assessment of class frequency in soil map legends is affected by uncertainty, especially at small scales, where generalization is larger. The aim of this study was to test the hypothesis that data mining or geostatistic techniques provide better estimation of class frequency than traditional deterministic pedology in a national soil map. In the map of Italian soil regions compiled at 1:5,000,000 reference scale, soil classes were the WRB Reference Soil Groups (RSGs). Different data mining techniques, namely neural networks, random forests, boosted tree, classification and regression tree, supported vector machine (SVM), were tested and the last one gave the best RSGs predictions, using selected auxiliary variables and 22,015 classified soil profiles. Given the categorical target variable, the multi-collocated indicator cokriging was the algorithm chosen for the geostatistic approach. The first five more frequent RSGs resulting from the three methods were compared. The outcomes were validated with a Bayesian approach on a subset of 10% of geographically representative profiles, kept out before the elaborations. The most frequent classes were uniformly predicted by the three methods, which instead differentiated notably for the classes with a lower occurrence. The Bayesian validation indicated that the SVM method was as reliable as the multi-collocated indicator cokriging, and both more than the deterministic pedological approach. An advantage of the SVM was the possibility to use numeric and categorical variable in the same elaboration, without any previous transformation, which notably reduced the processing time.

Lorenzetti, R., Barbetti, B., Fantappiè, M., L'Abate, G., Costantini, E. (2014). Comparing Different Approaches - Data Mining, Geostatistic, and Deterministic Pedology - to Assess the Frequency of WRB Reference Soil Groups in the Italian Soil Regions. In Proceedings of the 20th WORLD CONGRESS OF SOIL SCIENCE, Soils Embrace Life and Universe [10.13140/2.1.4977.6969].

Comparing Different Approaches - Data Mining, Geostatistic, and Deterministic Pedology - to Assess the Frequency of WRB Reference Soil Groups in the Italian Soil Regions

FANTAPPIE', Maria;
2014-01-01

Abstract

The assessment of class frequency in soil map legends is affected by uncertainty, especially at small scales, where generalization is larger. The aim of this study was to test the hypothesis that data mining or geostatistic techniques provide better estimation of class frequency than traditional deterministic pedology in a national soil map. In the map of Italian soil regions compiled at 1:5,000,000 reference scale, soil classes were the WRB Reference Soil Groups (RSGs). Different data mining techniques, namely neural networks, random forests, boosted tree, classification and regression tree, supported vector machine (SVM), were tested and the last one gave the best RSGs predictions, using selected auxiliary variables and 22,015 classified soil profiles. Given the categorical target variable, the multi-collocated indicator cokriging was the algorithm chosen for the geostatistic approach. The first five more frequent RSGs resulting from the three methods were compared. The outcomes were validated with a Bayesian approach on a subset of 10% of geographically representative profiles, kept out before the elaborations. The most frequent classes were uniformly predicted by the three methods, which instead differentiated notably for the classes with a lower occurrence. The Bayesian validation indicated that the SVM method was as reliable as the multi-collocated indicator cokriging, and both more than the deterministic pedological approach. An advantage of the SVM was the possibility to use numeric and categorical variable in the same elaboration, without any previous transformation, which notably reduced the processing time.
Settore AGR/14 - Pedologia
12-giu-2014
20th WORLD CONGRESS OF SOIL SCIENCE, Soils Embrace Life and Universe
Jeju, Korea
08-13/06/2014
20
2014
1
https://www.researchgate.net/publication/271132326_Comparing_Different_approaches_-_Data_mining_Geostatistic_and_Deterministic_pedology_-_to_assess_the_Frequency_of_WRB_reference_soil_groups_in_the_Italian_Soil_Regions
Lorenzetti, R., Barbetti, B., Fantappiè, M., L'Abate, G., Costantini, E. (2014). Comparing Different Approaches - Data Mining, Geostatistic, and Deterministic Pedology - to Assess the Frequency of WRB Reference Soil Groups in the Italian Soil Regions. In Proceedings of the 20th WORLD CONGRESS OF SOIL SCIENCE, Soils Embrace Life and Universe [10.13140/2.1.4977.6969].
Proceedings (atti dei congressi)
Lorenzetti, R; Barbetti, B; Fantappiè, M; L'Abate, G; Costantini, EAC
File in questo prodotto:
File Dimensione Formato  
20WCSS_Proceedings_LORENZETTI_ET_AL_2014.pdf

accesso aperto

Descrizione: Articolo principale
Dimensione 4.53 MB
Formato Adobe PDF
4.53 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/106041
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact