Identifying new, useful patterns in data is a core process for intelligent systems, and the problem of information overload is directly related to it. In this work we propose a knowledge discovery methodology to retrieve useful and novel information from raw data stored in a DBMS. We used ALSDB, a database built specifically to provide access to structured information obtained from the questionnaires produced in the Linguistic Atlas of Sicily (ALS) project. The ALS project is a decade-long joint effort led by researchers at the Dipartimento di Scienze Filologiche e Linguistiche of the University of Palermo, whose purpose is to track and study the geo-linguistic and lexicographic processes concerning the function and usage of the Sicilian dialect. The main goal of the work described in this paper is to develop an information retrieval methodology that incorporates the directions of linguistic investigation embedded in the ALS questionnaire into a querying tool that abstracts away from the intricacies of SQL or XML query constructs. We do this by setting up a methodology and a data retrieval tool that are scalable and general enough, first, to allow linguists to evaluate hypotheses about regional language and dialect evolution in space and time and, second, to help discover new directions of investigation. This work presents the process of knowledge discovery. Starting from the conceptualization of a few basic ideas, concepts have been extracted from the DBMS through an XML-based mapping and used as building blocks for further investigations. The interaction with users is intuitive, and the results are incrementally and automatically proposed to the researchers, who may decide either to keep them as new knowledge for further use or to discard them.
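The abstract does not give the mapping format, so the following is only a minimal sketch of the general idea: named concepts declared in XML are translated into SQL, so that the user composes concepts instead of writing queries. Everything in it is an assumption made for illustration and is not taken from the paper or from ALSDB: the XML schema, the `answers` table and its columns, the concept names, the toy rows, and the helper functions `load_concepts`, `build_query`, `run_concepts` and `demo_connection`.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical mapping file: each <concept> is a named condition on one column.
MAPPING_XML = """
<mapping table="answers">
  <concept name="young_speaker" column="age" op="lt" value="30" type="int"/>
  <concept name="from_palermo" column="town" op="eq" value="Palermo"/>
</mapping>
"""

# Symbolic operators allowed in the mapping, translated to SQL operators.
OPS = {"lt": "<", "gt": ">", "eq": "="}


def load_concepts(xml_text):
    """Parse the mapping and return (table name, {concept: (column, sql_op, value)})."""
    root = ET.fromstring(xml_text)
    concepts = {}
    for node in root.findall("concept"):
        raw = node.get("value")
        value = int(raw) if node.get("type") == "int" else raw
        concepts[node.get("name")] = (node.get("column"), OPS[node.get("op")], value)
    return root.get("table"), concepts


def build_query(table, concepts, selected):
    """Combine the selected concepts with AND into one parameterized query."""
    clauses, params = [], []
    for name in selected:
        column, sql_op, value = concepts[name]
        # Table and column names come from the (trusted) mapping; values are bound.
        clauses.append(f"{column} {sql_op} ?")
        params.append(value)
    sql = f"SELECT variant FROM {table} WHERE " + " AND ".join(clauses)
    return sql + " ORDER BY informant", params


def run_concepts(conn, table, concepts, selected):
    """Run the query for the selected concepts and return the matching variants."""
    sql, params = build_query(table, concepts, selected)
    return [row[0] for row in conn.execute(sql, params)]


def demo_connection():
    """In-memory database with invented toy rows (not real ALS data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE answers (informant TEXT, age INTEGER, town TEXT, variant TEXT)")
    conn.executemany(
        "INSERT INTO answers VALUES (?, ?, ?, ?)",
        [("A", 25, "Palermo", "variant_1"),
         ("B", 62, "Palermo", "variant_2"),
         ("C", 28, "Catania", "variant_2")],
    )
    return conn


if __name__ == "__main__":
    conn = demo_connection()
    table, concepts = load_concepts(MAPPING_XML)
    print(run_concepts(conn, table, concepts, ["young_speaker", "from_palermo"]))
```

In this sketch a researcher would only pick concept names such as `young_speaker`; the SQL stays hidden behind the mapping, which is the kind of abstraction the abstract describes.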

Pirrone, R., Gentile, A., Cannella, V., Russo, G. (2009). XML-based Knowledge Discovery for Linguistic Atlas of Sicily (ALS) Project. In International Conference on Complex, Intelligent and Software Intensive Systems 2009 (CISIS 2009) (pp. 98-104). Los Alamitos, CA: IEEE Computer Society [10.1109/CISIS.2009.151].

XML-based Knowledge Discovery for Linguistic Atlas of Sicily (ALS) Project

PIRRONE, Roberto; GENTILE, Antonio; CANNELLA, Vincenzo; RUSSO, Giuseppe
2009-01-01

mar-2009
International Conference on Complex, Intelligent and Software Intensive Systems 2009 (CISIS 2009)
Fukuoka, Japan
March 16-19, 2009
2009
7
Proceedings (conference proceedings)


Use this identifier to cite or link to this document: https://hdl.handle.net/10447/76929