The identification of new useful patterns in data is a core process for intelligent systems. Information overflow is directly related to this problem. In this work we propose a knowledge discovery methodology to retrieve useful and novel information from raw data stored in a DBMS. We used ALSDB, a database that has been built suitably to access structured information obtained from the questionnaires produced in the Linguistic Atlas of Sicily (ALS) project. The ALS project is a decennal joint effort led by researchers at the Dipartimento di Scienze Filologiche e Linguistiche of the University of Palermo that has the purpose to track and study the geo-linguistic and lexicographic processes about the function and usage of the Sicilian dialect. The main goal of the work described in this paper is to develop an information retrieval methodology that incorporates the directions of linguistic investigation embedded into the ALS questionnaire into a querying tool abstratcing away from the intricacies of SQL or XML query constructs. We do this setting up a methodology and data retrieval tool that is scalable and gen- eral enough to allow, firstly, evaluation of linguistics’ hypotheses about regional language and dialect evolution in space and time, and, secondly, to help discover new directions of investigation. This works presents the process of knowledge discovery. Starting from conceptualization of few basic ideas, concepts have been extracted from the DBMS through an XML-based mapping and used as building blocks for further investigations. The interaction with users is very intuitive, and the results are incrementally and automatically proposed to the researchers, who may determine to use them as new knowledge to maintain for further use or discard them.
Pirrone, R., Gentile, A., Cannella, V., Russo, G. (2009). XML-based Knowledge Discovery for Linguistic Atlas of Sicily (ALS) Project. In International Conference on Complex, Intelligent and Software Intensive Systems 2009 (CISIS 2009) (pp.98-104). Los Alamitos, CA : IEEE Computer Society [10.1109/CISIS.2009.151].
XML-based Knowledge Discovery for Linguistic Atlas of Sicily (ALS) Project
PIRRONE, Roberto;GENTILE, Antonio;CANNELLA, Vincenzo;RUSSO, Giuseppe
2009-01-01
Abstract
The identification of new useful patterns in data is a core process for intelligent systems. Information overflow is directly related to this problem. In this work we propose a knowledge discovery methodology to retrieve useful and novel information from raw data stored in a DBMS. We used ALSDB, a database that has been built suitably to access structured information obtained from the questionnaires produced in the Linguistic Atlas of Sicily (ALS) project. The ALS project is a decennal joint effort led by researchers at the Dipartimento di Scienze Filologiche e Linguistiche of the University of Palermo that has the purpose to track and study the geo-linguistic and lexicographic processes about the function and usage of the Sicilian dialect. The main goal of the work described in this paper is to develop an information retrieval methodology that incorporates the directions of linguistic investigation embedded into the ALS questionnaire into a querying tool abstratcing away from the intricacies of SQL or XML query constructs. We do this setting up a methodology and data retrieval tool that is scalable and gen- eral enough to allow, firstly, evaluation of linguistics’ hypotheses about regional language and dialect evolution in space and time, and, secondly, to help discover new directions of investigation. This works presents the process of knowledge discovery. Starting from conceptualization of few basic ideas, concepts have been extracted from the DBMS through an XML-based mapping and used as building blocks for further investigations. The interaction with users is very intuitive, and the results are incrementally and automatically proposed to the researchers, who may determine to use them as new knowledge to maintain for further use or discard them.File | Dimensione | Formato | |
---|---|---|---|
5066774.pdf
Solo gestori archvio
Descrizione: Articolo principale
Dimensione
445.08 kB
Formato
Adobe PDF
|
445.08 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
CISIS 2009 Title Pirrone.pdf
Solo gestori archvio
Descrizione: Copertina atti
Dimensione
794.01 kB
Formato
Adobe PDF
|
794.01 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
CISIS 2009 TOC Pirrone.pdf
Solo gestori archvio
Descrizione: Indice atti
Dimensione
197.88 kB
Formato
Adobe PDF
|
197.88 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.