Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

We propose a framework for the representation of visual knowledge in a robotic agent, with special attention to the understanding of dynamic scenes. According to our approach, understanding involves the generation of a high level, declarative description of the perceived world. Developing such a description requires both bottom-up, data driven processes that associate symbolic knowledge representation structures with the data coming out of a vision system, and top-down processes in which high level, symbolic information is in its turn employed to drive and further refine the interpretation of a scene. On the one hand, the computer vision community approached this problem in terms of 2D/3D shape reconstruction and of estimation of motion parameters. On the other, the AI community developed rich and expressive systems for the description of processes, events, actions and, in general, of dynamic situations. Nevertheless, these two approaches evolved separately and concentrated on different kinds of problems. We propose an architecture that integrates these two traditions in a principled way. Our assumption is that a link is missing between the two classes of representations mentioned above. In order to fill this gap, we adopt the notion of conceptual space (CS - Gaerdenfors (2000)), a representation where information is characterized in terms of a metric space. A CS acts as an intermediate representation between subconceptual (i.e., not yet conceptually categorized) information, and symbolically organized knowledge. The concepts of process and action have immediate characterizations in terms of structures in the conceptual space. The architecture is illustrated with reference to an experimental setup based on a vision system operating in a scenario with moving and interacting people.

Chella, A., Frixione, M., Gaglio, S. (2000). Understanding dynamic scenes. ARTIFICIAL INTELLIGENCE, 123(1-2), 89-132 [10.1016/S0004-3702(00)00048-5].

Understanding dynamic scenes

Chella, A.;Frixione, M.;Gaglio, S.

2000-01-01

Abstract

We propose a framework for the representation of visual knowledge in a robotic agent, with special attention to the understanding of dynamic scenes. According to our approach, understanding involves the generation of a high level, declarative description of the perceived world. Developing such a description requires both bottom-up, data driven processes that associate symbolic knowledge representation structures with the data coming out of a vision system, and top-down processes in which high level, symbolic information is in its turn employed to drive and further refine the interpretation of a scene. On the one hand, the computer vision community approached this problem in terms of 2D/3D shape reconstruction and of estimation of motion parameters. On the other, the AI community developed rich and expressive systems for the description of processes, events, actions and, in general, of dynamic situations. Nevertheless, these two approaches evolved separately and concentrated on different kinds of problems. We propose an architecture that integrates these two traditions in a principled way. Our assumption is that a link is missing between the two classes of representations mentioned above. In order to fill this gap, we adopt the notion of conceptual space (CS - Gaerdenfors (2000)), a representation where information is characterized in terms of a metric space. A CS acts as an intermediate representation between subconceptual (i.e., not yet conceptually categorized) information, and symbolically organized knowledge. The concepts of process and action have immediate characterizations in terms of structures in the conceptual space. The architecture is illustrated with reference to an experimental setup based on a vision system operating in a scenario with moving and interacting people.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
			2000
		
	Settore scientifico disciplinare del contributo
	
			Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni
		
	Titolo del periodico 
DATO PREVISTO SU LOGINMIUR
	
			ARTIFICIAL INTELLIGENCE
		
	DOI del contributo 
DATO PREVISTO SU LOGINMIUR
	
			https://dx.doi.org/10.1016/S0004-3702(00)00048-5
		
	Citazione
	
			Chella, A., Frixione, M., Gaglio, S. (2000). Understanding dynamic scenes. ARTIFICIAL INTELLIGENCE, 123(1-2), 89-132 [10.1016/S0004-3702(00)00048-5].
		
	Appare nelle tipologie:
	
			1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
article.pdf Solo gestori archvio Dimensione 1.4 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.4 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/287376

Citazioni

ND

69

45

social impact