Post on 10-Feb-2016
description
transcript
1Francesco Guerra – DBGroup@unimo
DOTTORATO DI RICERCA ININGEGNERIA DELL’INFORMAZIONE
XVI ciclo di dottorato - II ciclo Nuova Serie
Dai Dati all’Informazione:Dai Dati all’Informazione:il sistema MOMISil sistema MOMIS
dott. ing. Francesco Guerratutore: prof. Sonia Bergamaschi
2Francesco Guerra – DBGroup@unimo
OutlineOutline
Intelligent Integration of Information Matching The MOMIS system
MOMIS in the Semantic Web MOMIS as the basis of a virtual marketplace MOMIS to manage collaborative processes (the
WINK project) MOMIS as a semantic search engine (the SEWASIE
project)
3Francesco Guerra – DBGroup@unimo
Intelligent Integration of InformationIntelligent Integration of Information
Distinguishing elements: Kinds of managed sources The Global-as-View vs. the Local-as-View
approach Data Model Building the Global View Querying the Global View Description Logics techniques Updating the Global View
4Francesco Guerra – DBGroup@unimo
Intelligent Integration of InformationIntelligent Integration of Informationthe systemsthe systems
5Francesco Guerra – DBGroup@unimo
Intelligent Integration of InformationIntelligent Integration of Informationthe systemsthe systems
6Francesco Guerra – DBGroup@unimo
Matching comparisonMatching comparison
Distinguishing elements: Different kinds of mappings representation
(granularity, cardinality)
Mappings extraction (structure-instances analysis, lexical analysis, external tools exploitation)
7Francesco Guerra – DBGroup@unimo
Matching comparisonMatching comparison
Extended from : E. Rahm and P.A. Bernstein. A survey of approaches to automatic schema matching, VLDB Extended from : E. Rahm and P.A. Bernstein. A survey of approaches to automatic schema matching, VLDB Journal, 10(4):334-350,2001Journal, 10(4):334-350,2001
8Francesco Guerra – DBGroup@unimo
Matching comparisonMatching comparison
9Francesco Guerra – DBGroup@unimo
The MOMIS SystemThe MOMIS System
MOMIS (Mediator envirOnment for Multiple Information Sources) is a framework to perform information extraction and integration from both structured and semistructured data sources. An object-oriented language, with an underlying Description Logic, called
ODL-I3, derived from the standard ODMG is introduced for information extraction. Information integration is then performed in a semi-automatic way, by exploiting the knowledge in a Common Thesaurus and ODL-I3 descriptions of source schemas with a combination of clustering techniques and Description Logics. This integration process gives rise to a virtual integrated view of the underlying sources (the Global Virtual View) for which mapping rules and integrity constraints are specified to handle heterogeneity.
The MOMIS system, based on a conventional wrapper/mediator architecture, provides methods and open tools for data management in Internet-based information systems by using a CORBA-2 interface. MOMIS was developed as a joint collaboration between the University of Modena and Reggio Emilia and University of Milano and Brescia.
10Francesco Guerra – DBGroup@unimo
The MOMIS SystemThe MOMIS System
Distributed information stored in multiple, heterogeneous sources
• Sources integration provides a Global Schema (which is a virtual view)• the Global Schema allows the user to send a query and get a unified answer
from all the involved sources (transparently)• All information in http://www.dbgroup.unimo.it• INTERDATA (1999-2000); D2I (from Data to Information) (2001-2002) –
“Programmi di ricerca scientifica di rilevante interesse nazionale”; WINK (Web-linked Integration of Network-based Knowledge) (2002-2003); SEWASIE (Semantic Webs and AgentS in Integrated Economies) (2002-2005)
11Francesco Guerra – DBGroup@unimo
The MOMIS System- ArchitectureThe MOMIS System- Architecture
12Francesco Guerra – DBGroup@unimo
SYNSET2
SYNSET#
SYNSET4
SYNSET1
MANUALANNOTATION
SEMI-AUTOMATICANNOTATION
INFERRED RELATIONSHIPS
LEXICON DERIVEDRELATIONSHIPS
SCHEMA DERIVEDRELATIONSHIPS
CommonThesaurus
COMMON THESAURUSGENERATION
USER SUPPLIEDRELATIONSHIPS
ODLI3LOCAL SCHEMA N
WRAPPING
ODLI3LOCAL SCHEMA 1
…
…
GVV GENERATION
MAPPING TABLES
GLOBAL CLASSES
13Francesco Guerra – DBGroup@unimo
Local sources annotationLocal sources annotation
The integration designer has to manually choose the appropriate WordNet (www.cogsci.princeton.edu/~wn/) meaning for each element of the conceptual schema provided by wrappers.
The annotation phase is composed of two steps:1. Word Form choice. The WordNet morphologic processor aids
the designer by suggesting a word form corresponding to the given term.
2. Meaning choice. The designer can choose to map an element on zero, one or more senses. Notice that the user can choose a sense among the existing ones in WordNet and he can add new senses in the DB.
Motivations of the annotation:1. Exploiting semantics associated with the names of the
schemas/structures of the information sources2. Having a well-known meaning for each term of the sources
14Francesco Guerra – DBGroup@unimo
Global Virtual View annotationGlobal Virtual View annotation The GVV has to be annotated to become ”exportable knowledge”.
Annotating a GVV means to provide Global Classes with a name and with meanings.
By starting from annotations of local sources and mappings between the GVV and the local ontologies, we have developed a semi-automatic methodology to generate the annotations of the GVV.
15Francesco Guerra – DBGroup@unimo
GVV annotationGVV annotation
UNI.Article NT CS.Publication
CS.Essay NT CS.Publication
CS.Essay=<essay, {essay#1}>CS.Publication=<publication,{publication#2}>UNI.Article=<article,{article#1}>
GlobalClass1 = <publication, {essay#1,publication#2, article#1}>GlobalClass1 = <publication, {essay#1,publication#2, article#1}>The annotated Global classThe annotated Global class
Annotated Local classesAnnotated Local classes
The CT relationshipsThe CT relationships
namename
meaningsmeanings
essay#1 = an analytic or interpretive literary compositionessay#1 = an analytic or interpretive literary composition publication#2 = a copy of a printed work offered for distributionpublication#2 = a copy of a printed work offered for distribution article#1 = nonfictional prose forming an independent part of a publicationarticle#1 = nonfictional prose forming an independent part of a publication
Wordnet meaningsWordnet meanings
GlobalClass1 CS.Essay CS.Publication UNI.Article
A Global classA Global class
broadest meaningbroadest meaning
BLCBLCGCGC={LC={LCGC| GC| y y GC, (LC NT y ) v (y BT LC)}GC, (LC NT y ) v (y BT LC)}
16Francesco Guerra – DBGroup@unimo
Updating the GVVUpdating the GVV
A created GVV can change:
1) By adding a new source on the system2) By updating an existing data source schema3) By deleting a previously integrated source
Adding a new source: two possible scenarios
• Integration from scratch: the integration process is applied again; in this case only the Common Thesaurus of the previously GVV can be exploited.
• Integration with the GVV: the process exploits the “automatically annotated” GVV and the Common Thesaurus.
17Francesco Guerra – DBGroup@unimo
Adding a new sourceAdding a new source
CommonThesaurus
Clustergeneration
XML
AnnotatedAnnotatedGVVGVV
OODB
RDBSources’sSchema
ODLI3
GloblalClass3
GloblalClass1
GloblalClass2
Mapping Global schema/Local schema
New GVVNew GVV
Common ThesaurusCommon Thesaurus– intra/inter schema relationshipsintra/inter schema relationships (only new sources)(only new sources)– lexicon relationshipslexicon relationships (GVV e new sources annotated)(GVV e new sources annotated)– relationships inserted by userrelationships inserted by user– inferred relationshipsinferred relationships
NewNew
NewNew
18Francesco Guerra – DBGroup@unimo
Adding a new sourceAdding a new source
Three scenarions: A new global class is composed of only one old global class and
one or more new local classes
A global class of the new integrated schema is composed of only new local classes
A global class of the new integrated schema is composed of more than one global class of the old GVV and at least one local class of the new source
19Francesco Guerra – DBGroup@unimo
GVV- integrated ontologyGVV- integrated ontology A GVV may be thought of as a domain ontology for the integrated
sources; the usual approach in the Semantic Web is based on “a priori” existence of an ontology connected by means of semantic markups to the sources
Ontology
Ontology Builder
Semantic WebMOMIS
20Francesco Guerra – DBGroup@unimo
GVV- integrated ontologyGVV- integrated ontology
The MOMIS ontology is composed of the following components:
Global Virtual View Mapping Rules Integrity constraint rules Intensional and extensional inter and intra-schema
relationships (Common Thesaurus)
We express the ontology by using the ODLI3 language or an OWL file.
21Francesco Guerra – DBGroup@unimo
Using the MOMIS system Using the MOMIS system
The MOMIS system was exploited: To create a virtual marketplace To support collaborative processes within the
European Wink project To build an advanced semantic search engine
within the European SEWASIE project (under development)
22Francesco Guerra – DBGroup@unimo
SEWASIESEWASIE
SEWASIE (SEmantic Webs and AgentS in Integrated Economies) is SEWASIE (SEmantic Webs and AgentS in Integrated Economies) is a research project funded by EU on action line Semantic Web (May a research project funded by EU on action line Semantic Web (May 2002/April 2005)2002/April 2005)
The consortium detailsThe consortium details Università degli Studi di Modena e Reggio EmiliaUniversità degli Studi di Modena e Reggio Emilia (ITALY) (ITALY) CNA SERVIZI Modena s.c.a.r.l.CNA SERVIZI Modena s.c.a.r.l. (ITALY) (ITALY) Università degli Studi di Roma “La Sapienza”Università degli Studi di Roma “La Sapienza” (ITALY) (ITALY) Rheinisch Westfaelische Technische Hochschule AachenRheinisch Westfaelische Technische Hochschule Aachen ( (GERMANYGERMANY)) Libera UniversitLibera Università di Bolzano (ITALY)à di Bolzano (ITALY) Thinking Networks AGThinking Networks AG ( (GERMANYGERMANY)) IBM Italia SPAIBM Italia SPA (ITALY) (ITALY) Fraunhofer-Gesellschaft Institut Angewandte Informationstechnik Fraunhofer-Gesellschaft Institut Angewandte Informationstechnik
((GERMANYGERMANY))
23Francesco Guerra – DBGroup@unimo
SEWASIE ObjectivesSEWASIE Objectives
The SEWASIE project pursues the following aims:
To develop an agent-based secure, scalable and distributed system architecture for semantic search (based on ontologies) and for structured web-based communication.To develop a general framework for query management and information reconciliation based on a semantically enriched data and trusted agent structure. To develop an information brokering component which includes methods for collecting, contextualizing and visualizing data. To provide the end-user with an efficient interface for formulating queries using a graphical representation and for intelligent navigation through the semantically information space.
The SEWASIE project aims to develop an advanced search engine enabling intelligent access to heterogeneous data sources on the web, via semantic enrichment, to provide the basis for structured web-based communication.
24Francesco Guerra – DBGroup@unimo
The SEWASIE architecture The SEWASIE architecture
The SEWASIE system realizes a virtual network, the SEWASIE Virtual Network (SVN), whose nodes are SEWASIE Information Nodes (SINodes), multi-database mediator-based systems, each including a Virtual Data Store, an Ontology Builder, and a Query Manager
Brokering Agents maintain the knowledge related to the SEWASIE Virtual Network and the user profiles.
In query solving phase, starting from a specified SINode, a Query Agent accesses other SINodes and thus collects partial answers.
To select SINodes useful to solve a query, a Query Agent interacts with a/several Brokering Agents.
25Francesco Guerra – DBGroup@unimo
The SEWASIE architectureThe SEWASIE architecture
Other users
Query Results
CommunicationTool
QueryAgent
QueryAgent
QueryAgent
Comm.Agent
useruser
user
BrokeringAgent (BA)Ontology
maps
MonitoringAgent (MA)
User Interface
Comm. InterfaceQuery InterfaceMetadata Interface
Visualisation
UserProfile
OLAPToolOLAP
Reports
Monitoring Interface
MonitorProfiles
SINode
…
Ontology
StructuredDatabases
RDBs
WrapperSemanticEnrichment
QueryMetadataRepository
Semi-Databases
WrapperSemanticEnrichment
<XML><DATA>...</DATA>
Wrapper
UnstructuredText documents
<HTML>...
Virtual Data Store
Ontology
StructuredDatabases
RDBs
WrapperSemanticEnrichment
QueryManagerMetadata
Repository
Databases
Wrapper
SemanticEnrichment
<XML><DATA>...</DATA>
Wrapper
<HTML>...
Ontology
builder
Virtual Data Store
The u
ser
The u
ser
inter
face l
ayer
inter
face l
ayer
SEWASIE Interconnectio
ninfrastructure
Ontologymaps
BrokeringAgent (BA)
BA
BA
BABA The
Theinform
atioinform
ationn
layerlayer
TheThe intermediaries
intermediarieslayerlayer
Structured
Wrapper HTML
XML Wrap
HTML→XML
26Francesco Guerra – DBGroup@unimo
Future WorkFuture Work
–Ontology evolution within an SINodeOntology evolution within an SINodeUpdate of existing sourcesDeletion of previously integrated sources
–Extending WordNetExtending WordNetIf a source description element has no correspondent
concept in WordNet, the designer may add a new meaning and proper relationships connecting them to existing meanings.
–Multilingual functionalitiesMultilingual functionalitiesSEWASIE multilingual technologies will allow users to share
information and resources available all over the world, but also to preserve their original local qualities.
Enrichment of multi-lingual lexicon ontology with the aid of statistical analysis techniques for multilingual text corpora (for example with techniques for the generation of multilingual dictionaries).
27Francesco Guerra – DBGroup@unimo
Partecipazione a progetti di ricerca di Partecipazione a progetti di ricerca di carattere nazionale ed europeocarattere nazionale ed europeo
• progetto D2I (From Data to Information) finanziato dal MIUR: “Programma di ricerca scientifica di rilevante interesse nazionale (2000-2001)”;
• progetto “Agenti software e commercio elettronico: profili giuridici, tecnologici e psico-sociali”, finanziato dal MIUR “Programma di ricerca scientifica di rilevante interesse nazionale” (2001-2002)
• progetto “Tecnologie per arricchire e fornire accesso a contenuti” finanziato con il Fondo Speciale Innovazione 2000 (2001-2002)
• progetto SEWASIE (SEmantic Web and AgentS in Integrated Economies) finanziato dalla Comunità Europea (2002-2005)
• progetto WINK (Web-linked Integration of Network-based Knowledge) finanziato dalla Comunità Europea (cluster EUTIST-AMI). (2002-2003)
28Francesco Guerra – DBGroup@unimo
PubblicazioniPubblicazioniRiviste Internazionali (RI) e Capitoli in libri Internazionali (CLI)Riviste Internazionali (RI) e Capitoli in libri Internazionali (CLI)
[[RIRI1] S. Bergamaschi, G. Cabri, F. Guerra, L. Leonardi, M. Vincini, F. Zambonelli, 1] S. Bergamaschi, G. Cabri, F. Guerra, L. Leonardi, M. Vincini, F. Zambonelli, Exploiting Agents Exploiting Agents to Support Information Integration, to Support Information Integration, Special Issue of the International Journal on Cooperative Special Issue of the International Journal on Cooperative Information Systems vol. 11(3-4): 293-314, 2002, ISSN 0218-8430Information Systems vol. 11(3-4): 293-314, 2002, ISSN 0218-8430
[[RIRI2] I. Benetti, D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini,2] I. Benetti, D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini, An Information Integration An Information Integration Framework for E-Commerce,Framework for E-Commerce, IEEE Intelligent Systems Magazine, Jan/Feb 2002, pp. 18-25, IEEE Intelligent Systems Magazine, Jan/Feb 2002, pp. 18-25,
[[RIRI3] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini,3] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini, Synthesizing an Integrated Ontology, Synthesizing an Integrated Ontology, IEEE Internet Computer, September-October 2003, 42-51, ISSN 1089-7801IEEE Internet Computer, September-October 2003, 42-51, ISSN 1089-7801
[[RIRI4] I. Benetti, S. Bergamaschi, F. Guerra, M. Vincini, 4] I. Benetti, S. Bergamaschi, F. Guerra, M. Vincini, Soap-enabled web services for knowledge Soap-enabled web services for knowledge managementmanagement to appear in Int. J. Web Engineering and Technology, InderScience Publishers. to appear in Int. J. Web Engineering and Technology, InderScience Publishers.
[[RI5RI5] D. Beneventano, F. Guerra, S. Magnani, M. Vincini ] D. Beneventano, F. Guerra, S. Magnani, M. Vincini A Web Service based framework for the A Web Service based framework for the semantic mapping between product classification schemassemantic mapping between product classification schemas, to appear in Journal of Electronic , to appear in Journal of Electronic Commerce Research, ISSN 15266133.Commerce Research, ISSN 15266133.
[[CLICLI1] D. Beneventano, S. Bergamaschi, J. Gelati, F. Guerra, M. Vincini: 1] D. Beneventano, S. Bergamaschi, J. Gelati, F. Guerra, M. Vincini: MIKS: an agent framework MIKS: an agent framework supporting information access and integrationsupporting information access and integration, Intelligent Information Agents - The AgentLink , Intelligent Information Agents - The AgentLink Perspective, (editor S. Bergamaschi, M. Klusch, P. Edwards, P. Petta) - March 2003, Lecture Notes in Perspective, (editor S. Bergamaschi, M. Klusch, P. Edwards, P. Petta) - March 2003, Lecture Notes in Computer Science N. 2586 - Springer Verlag, pp. 22-49 ISSN 0302-9743 ISBN 3-540-00759-8Computer Science N. 2586 - Springer Verlag, pp. 22-49 ISSN 0302-9743 ISBN 3-540-00759-8
Riviste Nazionali (RN)Riviste Nazionali (RN)[[RNRN1] G. Gelati, F. Guerra, M. Vincini,1] G. Gelati, F. Guerra, M. Vincini, Agents Supporting Information Integration: the MIKS Agents Supporting Information Integration: the MIKS FrameworkFramework,, AI*IA Notizie, Periodico dell’Associazione Italiana per l’Intelligenza Artificiale, AnnoXIV, AI*IA Notizie, Periodico dell’Associazione Italiana per l’Intelligenza Artificiale, AnnoXIV, N.4, Dicembre 2001N.4, Dicembre 2001
29Francesco Guerra – DBGroup@unimo
PubblicazioniPubblicazioniCongressi Internazionali (CI)Congressi Internazionali (CI)
[[CICI1] D. Beneventano, S. Bergamaschi, I. Benetti, A. Corni, F. Guerra, G. Malvezzi, 1] D. Beneventano, S. Bergamaschi, I. Benetti, A. Corni, F. Guerra, G. Malvezzi, SI-Designer: a SI-Designer: a tool for intelligent integration of informationtool for intelligent integration of information, 34th Annual Hawaii International Conference on , 34th Annual Hawaii International Conference on System Sciences (HICSS-34), January 3-6, 2001, Maui, Hawaii - Track 9. System Sciences (HICSS-34), January 3-6, 2001, Maui, Hawaii - Track 9. IEEE Computer SocietyIEEE Computer Society
[[CICI2] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini2] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini ,The Momis approach to Information ,The Momis approach to Information IntegrationIntegration, IEEE and AAAI International Conference on Enterprise Information Systems (ICEIS01), , IEEE and AAAI International Conference on Enterprise Information Systems (ICEIS01), Setùbal, Portugal, 7-10 July 2001, pp.194-198, ISBN 972-98050-2-4Setùbal, Portugal, 7-10 July 2001, pp.194-198, ISBN 972-98050-2-4
[[CICI3] I. Benetti, D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini,3] I. Benetti, D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini, SI-Designer: an Integration SI-Designer: an Integration Framework for E-CommerceFramework for E-Commerce, IJCAI*01 Workshop on E-Business & the Intelligent Web Seattle, , IJCAI*01 Workshop on E-Business & the Intelligent Web Seattle, USA * August 5 2001 USA * August 5 2001
[[CICI4] S. Bergamaschi, G. Cabri, F. Guerra, L. Leonardi, M. Vincini, F. Zambonelli, 4] S. Bergamaschi, G. Cabri, F. Guerra, L. Leonardi, M. Vincini, F. Zambonelli, Supporting Supporting information integration with autonomous agentsinformation integration with autonomous agents, Fifth International Workshop CIA-2001 on , Fifth International Workshop CIA-2001 on COOPERATIVE INFORMATION AGENTS September 6 - 8, 2001 Modena, Italy pp, 88-99.COOPERATIVE INFORMATION AGENTS September 6 - 8, 2001 Modena, Italy pp, 88-99.
[[CICI5] D. Calvanese, S. Castano, F. Guerra, D. Lembo, M. Melchiori, G. Terracina, D. Ursino, M. 5] D. Calvanese, S. Castano, F. Guerra, D. Lembo, M. Melchiori, G. Terracina, D. Ursino, M. Vincini, Vincini, Towards a comprehensive methodological framework for integrationTowards a comprehensive methodological framework for integration, 8th International , 8th International Workshop on Knowledge Representation meets Databases (KRDB-2001), Roma, Italy, 2001Workshop on Knowledge Representation meets Databases (KRDB-2001), Roma, Italy, 2001
[[CICI6] S. Bergamaschi, F. Guerra, M. Vincini, 6] S. Bergamaschi, F. Guerra, M. Vincini, A Data Integration Framework for E-commerce A Data Integration Framework for E-commerce product classificationproduct classification, 1st International Semantic Web Conference (ISWC2002), Sardegna, Italy, 9-, 1st International Semantic Web Conference (ISWC2002), Sardegna, Italy, 9-12 June 2002, LNCS 2342 Springer 2002, ISBN 3-540-43760-6, pp. 379-393, ISBN 3-540-43760-612 June 2002, LNCS 2342 Springer 2002, ISBN 3-540-43760-6, pp. 379-393, ISBN 3-540-43760-6
30Francesco Guerra – DBGroup@unimo
PubblicazioniPubblicazioni[[CICI7] S. Bergamaschi, F. Guerra,7] S. Bergamaschi, F. Guerra, Peer to Peer Paradigm for a Semantic Search EnginePeer to Peer Paradigm for a Semantic Search Engine, , in proceedings in proceedings of the International Workshop on Agents and Peer-to-Peer Computing, Bologna, 15 July 2002, LNCS 2530, of the International Workshop on Agents and Peer-to-Peer Computing, Bologna, 15 July 2002, LNCS 2530, Springer ISBN 3-540-40538-0Springer ISBN 3-540-40538-0
[[CICI8] S. Bergamaschi, F. Guerra, M. Vincini, 8] S. Bergamaschi, F. Guerra, M. Vincini, Product Classification Integration for E-CommerceProduct Classification Integration for E-Commerce, Second , Second International Workshop on Electronic Business Hubs - WEBH 2002 in conjunction with DEXA 2002, International Workshop on Electronic Business Hubs - WEBH 2002 in conjunction with DEXA 2002, September 2-6 2002, Aix En Provence, France, published by IEEE Computer Society, Los Alanitos (CA), September 2-6 2002, Aix En Provence, France, published by IEEE Computer Society, Los Alanitos (CA), ISBN 0-7695-1668-8, pp. 861-867ISBN 0-7695-1668-8, pp. 861-867
[[CICI9] 9] D. Beneventano, S. Bergamaschi, S. Castano, V. De Antonellis, A. Ferrara, F. Guerra, F. Mandreoli, D. Beneventano, S. Bergamaschi, S. Castano, V. De Antonellis, A. Ferrara, F. Guerra, F. Mandreoli, G. Ornetti, M. Vincini,G. Ornetti, M. Vincini, Semantic Integration and Query Optimization of Heterogeneous Data Sources, Semantic Integration and Query Optimization of Heterogeneous Data Sources, 1st Int.l Workshop on Efficient Web-based Information Systems (EWIS), 2002, Montpellier, France, pp.154-1st Int.l Workshop on Efficient Web-based Information Systems (EWIS), 2002, Montpellier, France, pp.154-165.165.
[[CICI10] S. Bergamaschi, F. Guerra, M. Vincini, 10] S. Bergamaschi, F. Guerra, M. Vincini, A peer-to-peer information system for the semantic web,A peer-to-peer information system for the semantic web, in proceedings of the International Workshop on Agents and Peer-to-Peer Computing, in AAMAS 2003 in proceedings of the International Workshop on Agents and Peer-to-Peer Computing, in AAMAS 2003 Melbourne, Australia, July 14, 2003Melbourne, Australia, July 14, 2003
[[CICI11]11] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini:D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini: Building an Ontology with MOMISBuilding an Ontology with MOMIS, , in in proceedings of the proceedings of the Semantic Integration Workshop within the Second International Semantic Web within the Second International Semantic Web Conference, October 20, 2003 Sundial Resort, Sanibel Island, Florida, USA.Conference, October 20, 2003 Sundial Resort, Sanibel Island, Florida, USA.
[[CICI12] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini, 12] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini, Building an integrated Ontology within Building an integrated Ontology within SEWASIE system, SEWASIE system, in proceedings of the First International Workshop on Semantic Web and Databases, in proceedings of the First International Workshop on Semantic Web and Databases, Co-located with Co-located with VLDB 2003 Berlin, Germany, (2003) Berlin, Germany, (2003)
[[CICI13] S. Bergamaschi, G.Gelati, F. Guerra, M. Vincini, 13] S. Bergamaschi, G.Gelati, F. Guerra, M. Vincini, WINK: a Web-based Enterprise System for WINK: a Web-based Enterprise System for Collaborative Project Management in Virtual Enterprises, Collaborative Project Management in Virtual Enterprises, 4th International Conference on Web 4th International Conference on Web Information Systems Engineering, Roma Italy, 10-12 December 2003 Information Systems Engineering, Roma Italy, 10-12 December 2003
31Francesco Guerra – DBGroup@unimo
PubblicazioniPubblicazioniCongressi Nazionali (CN)Congressi Nazionali (CN)[[CNCN1] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini, 1] D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini, Exploiting extensional knowledge for Exploiting extensional knowledge for query reformulation and object fusion in a data integration systemquery reformulation and object fusion in a data integration system , Proceedings of SEBD2001, , Proceedings of SEBD2001, Venezia, 27-29 June, 2001, pp. 257-271Venezia, 27-29 June, 2001, pp. 257-271
[[CNCN2] G. Gelati, F. Guerra, M. Vincini,2] G. Gelati, F. Guerra, M. Vincini, Agents Supporting Information Integration: the MIKS Agents Supporting Information Integration: the MIKS FrameworkFramework,, Proc. AIIA and TABOO Workshop: From Object to Agents, Pitagora Editrice, Bologna, Proc. AIIA and TABOO Workshop: From Object to Agents, Pitagora Editrice, Bologna, ISBN 88-371-1272-6, September 2001ISBN 88-371-1272-6, September 2001
[[CNCN3] D. Beneventano, S. Bergamaschi, D. Bianco, F. Guerra, M. Vincini, 3] D. Beneventano, S. Bergamaschi, D. Bianco, F. Guerra, M. Vincini, SI-Web: a Web based SI-Web: a Web based interface for the MOMIS projectinterface for the MOMIS project, Proceedings of SEBD2002, 19-22 June, 2002, pp. 407-411, Proceedings of SEBD2002, 19-22 June, 2002, pp. 407-411
[[CNCN4] D. Beneventano, S. Bergamaschi, D. Gazzotti, G.Gelati, F. Guerra, M. Vincini,4] D. Beneventano, S. Bergamaschi, D. Gazzotti, G.Gelati, F. Guerra, M. Vincini, The WINK Project The WINK Project for Virtual Enterprise Networking and Integrationfor Virtual Enterprise Networking and Integration,, Proceedings of SEBD2002, 2002, pp. 283-290Proceedings of SEBD2002, 2002, pp. 283-290
[[CNCN5] D. Beneventano, S. Bergamaschi, M.Felice, D. Gazzotti, G.Gelati, F. Guerra, M. Vincini,.5] D. Beneventano, S. Bergamaschi, M.Felice, D. Gazzotti, G.Gelati, F. Guerra, M. Vincini,. An An Agent framework for Supporting the MIKS IntegrationAgent framework for Supporting the MIKS Integration,, Proc. AIIA and TABOO Workshop: From Proc. AIIA and TABOO Workshop: From Object to Agents, 18-19 Novembre 2002, Milano Università BicoccaObject to Agents, 18-19 Novembre 2002, Milano Università Bicocca
[[CNCN6] D. Beneventano, S. Bergamaschi, A. Fergnani, F. Guerra, M. Vincini, D. Montanari, 6] D. Beneventano, S. Bergamaschi, A. Fergnani, F. Guerra, M. Vincini, D. Montanari, A Peer-to-A Peer-to-Peer Agent-Based Semantic Search EnginePeer Agent-Based Semantic Search Engine, Proceedings of SEBD2003, Cetraro (CS),2003, pp.283-, Proceedings of SEBD2003, Cetraro (CS),2003, pp.283-290290
[[CNCN7] S. Bergamaschi, G. Gelati, F. Guerra, M. Vincini, 7] S. Bergamaschi, G. Gelati, F. Guerra, M. Vincini, A Experiencing AUML for the WINK Milti- A Experiencing AUML for the WINK Milti-Agent SystemAgent System,, Proc. AIIA and TABOO Workshop: From Object to Agents, 10-11 Settembre 2003, Proc. AIIA and TABOO Workshop: From Object to Agents, 10-11 Settembre 2003, Villasimius (CA)Villasimius (CA)
32Francesco Guerra – DBGroup@unimo
Global Instance ComputationGlobal Instance Computation
For the definition of a Global Class we have to define the following elements: Mapping Table: define the mapping between the global
class attributes and the local classes attributes Join condition: we assume that there is a Join Condition
between each pair of overlapping relations to identify tuples corresponding to the same object and fuse them
Full disjunction: the GC contains a unique tuple containing a unique tuple resulting from the merge of all different tuples representing the same real world object.
33Francesco Guerra – DBGroup@unimo
Global Instance ComputationGlobal Instance Computation
S(l1)= (firstn, lastn, year, e_mail)S(l2)= (name, e_mail, dept_code, s_code)
Name E_mail Section Year Dept L1 firstn and lastn E_mail null year null L2 name E_mail S_code null dept code
Two functions: Global function: renaming the attributes of the
local classes into attributes of the global class Local Function: converting a tuple of elements of
a local classby suitable functions such as string concatenations ….
34Francesco Guerra – DBGroup@unimo
Global Instance ComputationGlobal Instance Computation
Semantic Homogeneity property condition
Join AttributeJoin Attribute Join AttributeJoin Attribute
Full DisjunctionFull Disjunction
35Francesco Guerra – DBGroup@unimo
Global Instance ComputationGlobal Instance Computation
Semantic Homogeneity property condition not verified: Resolution functions:
Random Priority User defined function
39Francesco Guerra – DBGroup@unimo
ExampleExample
Department(dept_code,dept_name,budget) Research_Staff(name,e_mail,dept-code,s_code)
FK dept_code REF Department, s_code REF SectionSchool_Member(name,school,year,e_mail)Section(s_code,section_name,length,room_code)
FK room_code REF Department, s_code REF RoomRoom(room_code,seats_number,notes)
<!ELEMENT ListOfStudent (Student*)><!ELEMENT Student (name,s_code,school_name,e_mail,tax_fee)><!ELEMENT name (#PCDATA)>
University source (relational)
Tax_Position source (XML)
40Francesco Guerra – DBGroup@unimo
CS_Person(first_name,last_name) Professor:CS_Person(belongs_to:Division,rank)Student:CS_Person(year,takes:set<Course>,rank,e_mail)Division(description,address:Location)Location(city,street,number,country)Course(course_name,tought_by:Professor)
Computer_Science source (object)
ExampleExample
41Francesco Guerra – DBGroup@unimo
Source Acquisition ModuleSource Acquisition Module
42Francesco Guerra – DBGroup@unimo
Common Thesaurus (Domain Ontology)Common Thesaurus (Domain Ontology)
Set of terminological relationships between classes and attributes names (terms)
expresses both intra-schema and inter-schema knowledge
Relationships added to Common Thesaurus:(1) schema derived(2) lexicon derived(3) designer supplied(4) inferred exploiting ODB-Tools capabilities
43Francesco Guerra – DBGroup@unimo
Schema-derived relationshipsSchema-derived relationships
Terminological and extensional intra-schema relationships
RT relationships derived from foreign keys in a relational schema
UNI.Section RT UNI.Department
BT/NT relationships derived from inheritance relationships in a object-oriented
schema or integrity constraints in relational schema
CS.Student NT CS.CS_Person CS.Professor NT CS.CS_Person
44Francesco Guerra – DBGroup@unimo
Schema Derived RelationshipsSchema Derived Relationships
45Francesco Guerra – DBGroup@unimo
Lexicon-derived relationshipsLexicon-derived relationships
Extracted from WordNet lexical database (Princeton Un.)129625 lemma organized in 99759 synonym set (synset)
SynonymyPolysemy
Tax_position_xml.Student.name SYN University.School_member.name
CS.Professor NT CS.CS_Person
46Francesco Guerra – DBGroup@unimo
Lexicon Derived RelationshipsLexicon Derived Relationships
47Francesco Guerra – DBGroup@unimo
Lexicon Derived RelationshipsLexicon Derived Relationships
48Francesco Guerra – DBGroup@unimo
Lexicon Derived RelationshipsLexicon Derived Relationships
49Francesco Guerra – DBGroup@unimo
Lexicon Derived RelationshipsLexicon Derived Relationships
50Francesco Guerra – DBGroup@unimo
Lexicon Derived RelationshipsLexicon Derived Relationships
51Francesco Guerra – DBGroup@unimo
Inferred relationshipsInferred relationships
Exploiting Description Logics techniques (ODB-Tools system) a new set of terminological relationships are inferred
University.Research_Staff RT CS.Course
52Francesco Guerra – DBGroup@unimo
Common ThesaurusCommon Thesaurus
53Francesco Guerra – DBGroup@unimo
Mediator global schemaMediator global schema
Global schema generation (interaction with ARTEMIS module):
Affinity calculationCluster generationGlobal attributes and mapping table
generationA global class gci is generated for each cluster Cli
SI-Designer builds the attributes set to be associated to the cluster:
–Union of the attributes of all classes belonging to the cluster–Fusion of “similar attributes”
54Francesco Guerra – DBGroup@unimo
Affinity tree and ClusterAffinity tree and Cluster
55Francesco Guerra – DBGroup@unimo
Affinity tree and ClusterAffinity tree and Cluster
56Francesco Guerra – DBGroup@unimo
Affinity tree and ClusterAffinity tree and Cluster
57Francesco Guerra – DBGroup@unimo
Mapping table exampleMapping table example
– each global class includes mapping rules between global and local attributes (and/or relationships, default/null values)
– a mapping is generated for each global class gci
58Francesco Guerra – DBGroup@unimo
Mapping tableMapping table
59Francesco Guerra – DBGroup@unimo
Mapping tableMapping table