Nuova ricerca

PAOLO TIBERIO

PROFESSORE EMERITO presso: Dipartimento di Ingegneria "Enzo Ferrari"


Home | Curriculum(pdf) |


Pubblicazioni

2010 - Toward a Flexible Data Management Middleware for Wireless Sensor Networks [Relazione in Atti di Convegno]
Haider, Razia; Mandreoli, Federica; Martoglia, Riccardo; Sassatelli, Simona; Tiberio, Paolo
abstract

In this paper we present the research activity we are carrying out in the "Mobile Semantic Self-Organizing Wireless Sensor Networks" Project at the Department of Information Engineering of the University of Modena and Reggio Emilia. In this context, the main aim of our research is to study solutions for the flexible querying of distributed data collected by heterogeneous devices providing measurement readings. To this end, we propose a middleware for wireless sensor networks which is able to autonomously configure the communication and the operations required to each device in order to reduce energy and temporal costs.


2010 - Toward an Effective and Efficient Query Processing in the NeP4B Project [Relazione in Atti di Convegno]
C., Gennaro; Mandreoli, Federica; Martoglia, Riccardo; M., Mordacchini; S., Orlando; W., Penzo; Sassatelli, Simona; Tiberio, Paolo
abstract

In this paper we present our main current research activity in the Italian co-funded FIRB Project NeP4B (Networked Peers for Business). In particular, we provide an overview of our P2P query routing approach which combines semantics and multimedia aspects in order to make query processing effective and efficient.


2008 - Ontology-Based Personalization of E-Government Services [Capitolo/Saggio]
F., Grandi; Mandreoli, Federica; Martoglia, Riccardo; Ronchetti, Enrico; M. R., Scalas; Tiberio, Paolo
abstract

While the World Wide Web user is suffering form the disease caused by information overload, for which personalization is one of the treatments which work, the citizen who gets ready to use the e-Government services which are made available on the Web is not immune from contagion. This seems a good reason to try to prescribe a personalization treatment also to the e-Government user. Hence, we introduce the design and implementation of Web information systems supporting personalized access to multi-version resources in an e-Government scenario. Personalization is supported by means of Semantic Web techniques and relies on an ontology-based profiling of users (citizens). Resources we consider are collections of norm documents (laws, decrees, regulations, etc.) in XML format but can also be generic Web pages and portals or e-Government transactional services. We introduce a reference infrastructure, describe the organization and present performance figures of a prototype system we have developed.


2006 - A Native Extensible XML Query Processor Towards Efficient and Effective MPEG-7 Querying [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo; M., Righini
abstract

In recent years the production of massive amounts of visual information has led to the arrival of very large multimedia Digital Libraries (DLs). The key to support efficient search and management operations in such repositories is to exploit metadata information for digital media, such as MPEG-7 based ones, which seem to be the most widely accepted. The underlying XML syntax, together with the high versatility of the provided constructs, make it easy to specify significant and complex queries, however executing them efficiently on huge quantities of data is not a trivial task. In this paper we provide an overview of the XSiter system, a native and extensible XML query processor providing very high performance in general XML querying settings and whose flexible architecture can be easily enhanced to better support the peculiarities of retrieving multimedia objects through MPEG-7 annotation metadata. Further, we consider possible "use-cases" and tasks related to multimedia and video DLs querying and management which our system can successfully accomplish.


2006 - An eGovernment system for temporal- and semantic-aware access to norms [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo; F., Grandi; M. R., Scalas; E., Ronchetti
abstract

In this paper, we present the results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a semantic-aware system supporting efficient and personalized access to a multi-version repository of norma- tive texts. The research activity is entitled “Semantic web techniques for the management of digital identity and the ac- cess to norms”. In the context of a complete and modular in- frastructure, we defined a multi-version XML data model and developed a temporal and semantical XML query processor supporting both temporal versioning –essential in normative systems– and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applica- ble to their personal case. The whole infrastructure, which we plan to complete in the near future, will integrate the query- ing component with several auxiliary services, including au- tomatic citizen identification and classification and assisted update of the repository data


2006 - EXTRA: a system for example-based translation assistance [Articolo su rivista]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo
abstract

Nowadays we are witnessing the need to translate ever increasing quantities of texts, with an ever increasing quality. The expertise and skill of professional translators is not alone entirely sufficient in order to achieve highly effective and efficient translation performance. The best way to translate very large quantities of documents, while ensuring optimal translation time and costs, is to exploit Example-Based Machine Translation (EBMT), which is devised in the aim of achieving better quality and quantity in less time, while preserving and treasuring the richness and accuracy that only human translation can achieve. In this paper we present EXTRA (EXample-based TRanslation Assistant), the EBMT system we have developed over the last few years to support the translation of texts written in Western languages. EXTRA is able to propose effective translation suggestions by relying on syntactic analysis of the text and on a rigorous, language-independent measure; the search is performed efficiently in large amounts of bilingual texts thanks to its advanced retrieval techniques. Furthermore, EXTRA does not use external knowledge requiring the intervention of users and is completely customizable and portable as it has been implemented on top of a standard DataBase Management System (DBMS). In the paper we also provide a thorough evaluation of both the effectiveness and the e±ciency of our system. In particular, in order to quantify the benefits offered by EXTRA assisted translation over manual translation, we introduce a simulator implementing specifically devised statistical, process-oriented, discrete-event models.


2006 - Semantic Web Techniques for Personalization of eGovernment Services [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo; F., Grandi; M. R., Scalas; E., Ronchetti
abstract

In this paper, we present the results of an ongoing research involving the design and implementation of systems supporting personalized access to multi-version resources in an eGovernment scenario. Personalization is supported by means of Semantic Web techniques and is based on an ontology-based profiling of users (citizens). Resources we consider are collections of norm documents in XML format but can also be generic Web pages and portals or eGovernment services. We introduce a reference infrastructure, describe the organization and present performance figures of a prototype system we have developed.


2006 - Using Semantic Mappings for Query Routing in a PDMS Environment [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Sassatelli, Simona; Tiberio, Paolo; W., Penzo
abstract

In this paper we present the current achievement of our research activity in the WISDOM project, whose aim is the definition of intelligent techniques enabling e®ective and e±cient information search in a distributed and decentralized PDMS scenario. We focus on the query routing problem and we define a new routing mechanism, which we call routing by mapping, in which the query is sent to the peers whose subnetworks best approximate the concepts required. In order to select the best subnetworks, the peer receiving the query exploits information about the semantic approximation of the query concepts, when moving towards each neighbour. This information is computed starting from the semantic mappings established with the peer's neighbours and it is maintained into specifically devised data structures called Semantic Routing Indices (SRIs), whose update we propose specific algorithms and protocols for. The effectiveness of the achieved results has been experimentally proved through a series of exploratory tests.


2005 - Accesso Personalizzato a Documenti Multiversione per Applicazioni nel Settore dell’E-Government [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo; E., Ronchetti; F., Grandi; M. R., Scalas
abstract

In questo lavoro viene presentata l’attività di ricerca concernente la realizzazione di sistemi prototipali per la gestione efficiente di documenti XML multiversione in uno scenario di e-Government. Lo scopo applicativo di tali sistemi è di permettere al cittadino l’accesso a versioni personalizzate di risorse quali testi normativi e informazioni rese disponibili sul WEB dalle Pubbliche Amministrazioni. Per rappresentare l’evoluzione delle norme nel tempo e il conseguente “versionamento” si sono usate quattro dimensioni temporali e un’ulteriore dimensione semantica per rappresentare l’applicabilità delle norme a differenti classi di cittadini, in accordo alla loro identità digitale. La classificazione dei cittadini è basata sulla gestione di un’ontologia e l’adozione di tecniche di Semantic WEB. L’attuale implementazione, evoluzione di un approccio di tipo “stratum” (sviluppato on top di una piattaforma RDBMS), è basata su un approccio “nativo” consistente in un query processor XML sviluppato ad-hoc. Una sperimentazione preliminare ha evidenziato nel nuovo sistema buoni livelli di prestazioni e scalabilità.


2005 - Enhanced access to eGovernment services: temporal and semantics-aware retrieval of norms [Relazione in Atti di Convegno]
F., Grandi; Mandreoli, Federica; Martoglia, Riccardo; E., Ronchetti; M. R., Scalas; Tiberio, Paolo
abstract

In this paper, we summarize the results of an ongoing research involving the design and implementation of a multi-version repository of norm texts supporting efficient and personalized access in an eGovernment scenario. The research activity is entitled "Semantic web techniques for the management of digital identity and the access to norms". In the context of a complete and modular infrastructure, we defined a multiversion XML data model and developed an XML query processor supporting both temporal and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. The whole infrastructure, which we plan to complete in the near future, will integrate the query answering component with several auxiliary services, including automatic citizen identification and classification and computer-aided update of the repository data.


2005 - Personalized access to multi-version XML documents in an eGovernment scenario [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo; F., Grandi; M. R., Scalas; E., Ronchetti
abstract

In this paper, we present some results of an ongoing research involving the design and implementation, in an eGovernment scenario, of a multiversion repository of norm texts supporting efficient and personalized access. In particular we defined a multi-version XML data model supporting both temporal versioning –essential in normative systems– and semantic versioning. Semantic versioning is based on the applicability of different norm parts to different classes of citizens and allows users to retrieve personalized norm versions only containing provisions which are applicable to their personal case. We describe the organization and present preliminary performance figures of a prototype system we developed.


2005 - Temporal modelling and management of normative documents in XML format [Articolo su rivista]
F., Grandi; Mandreoli, Federica; Tiberio, Paolo
abstract

In this paper, we present the results of a research project concerning the temporal management of normative texts in XML format. In particular, four temporal dimensions (publication, validity, efficacy and transaction times) are used to correctly represent the evolution of norms in time and their resulting versioning. Hence, we introduce a multiversion data model based on XML schema and define basic mechanisms for the maintenance and retrieval of multiversion norm texts. Finally, we describe a prototype management system which has been implemented and evaluated.


2005 - Text Clustering as a Mining Task [Capitolo/Saggio]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo
abstract

In this chapter we introduce readers to the various aspects of cluster analysis performed on textual data in a mining framework. We first provide a brief overview on the techniques and the background notions on general clustering. Then, we focus on the importance and on the goals of clustering in a text mining scenario, analyzing and describing the issues which are specific to this particular field. Effective information extraction from highly dimensional textual data, clustering algorithms specifically designed to efficiently work on very large unstructured and, possibly, hyperlinked data sets, and comprehension of the clustering output are among the covered topics.


2004 - A Document Comparison Scheme for Secure Duplicate Detection [Articolo su rivista]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo
abstract

The ever-growing amounts of textual information coming from different sources have fostered the development of digital libraries, making digital contents readily accessible but also easy for malicious users to plagiarize, thus giving rise to security problems. In this paper, we introduce a duplicate detection scheme that is able to determine, with a particularly high accuracy, how much a document is similar to another. Our pairwise document comparison scheme detects the resemblance between the content of documents by considering document chunks, representing contexts of words selected from the text. The resulting duplicate detection technique presents a good level of security in the protection of intellectual property, while improving the availability of the data stored in the digital library and the correctness of the search results. Finally, the paper addresses efficiency and scalability issues by introducing new data reduction techniques.


2004 - Approximate Query Answering for a Heterogeneous XML Document Base [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo
abstract

In this paper, we deal with the problem of effective search and query answering in heterogeneous web document bases containing documents in XML format of which the schemas are available. We propose a new solution for the structural approximation of the submitted queries which, in a preliminary schema matching process, is able to automatically identify the similarities between the involved schemas and to use them in the query processing phase, when a query written on a source schema is automatically rewritten in order to be compatible with the other useful XML documents. The proposed approach has been implemented in a web service and can deliver middleware rewriting services in any open-architecture XML repository system offering advanced search capabilities.


2004 - Management of the Citizen's Digital Identity and Access to Multi-version Norm Texts on the Semantic Web [Relazione in Atti di Convegno]
Mandreoli, Federica; Tiberio, Paolo; F., Grandi; M. R., Scalas
abstract

This paper describes an ongoing research project involving the implementation of e-Government services on the Semantic Web. In particular, the project is aimed at managing the “digital identity” of citizens on the Internet, enabling them to benefit from “personalized” versions of the online services offeredby the Public Administration, which can improve and optimize their involvement in the e-Governance process. The kind of service we will consider is the selective access to norm texts available on Web repositories. The project requires the definitionand maintenance of a citizen’s ontology, the semantic markup and versioning of the stored norm texts which takes into account the actual applicability to different classes of citizens, the definition and enactment of Web services for the reconstruction of the citizen’s digital identity and its classification with respectto the ontology, the design and implementation of a legal document management system for the selective access to personalized norm versions.


2003 - A temporal data model and management system for normative texts in XML format [Relazione in Atti di Convegno]
Mandreoli, Federica; Grandi, F; Bergonzini, M; Tiberio, Paolo
abstract

In this paper, we present the results of an on-going researchactivity concerning the temporal management of normativetexts in XML format. In particular, four temporal dimen-sions (publication, validity, e±cacy and transaction times)are used to correctly represent the evolution of norms intime and their resulting versioning. Hence, we introduce amultiversion data model based on XML schema and de¯nebasic mechanisms for the management of norm texts. Fi-nally, we describe a prototype management system whichhas been implemented and evaluated.


2003 - A temporal data model and system architecture for the management of normative texts [Relazione in Atti di Convegno]
Mandreoli, Federica; Tiberio, Paolo; F., Grandi; M., Bergonzini
abstract

In this paper, we present the preliminary results of an ongoingresearch activity concerning the temporal management of normative texts in XML format. In particular, four temporal dimensions (publication, validity, efficacy and transaction times) are used to correctly represent the evolution of norms in time and their resulting versioning. Hence, we introduce a multiversion data model based on XML schema and define three basic operators for the management of norm texts. Finally, we describe the architecture of a management system prototype which is being implemented.


2003 - Exploiting multi-lingual text potentialities in EBMT systems [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo
abstract

Translating documents from a source to a target language is a repetitive activity. The attempt to automate such a difficult task has been a long-term scientific dream. Among the several types of approaches in Machine Translation (MT), one of the most promising paradigms is Example-Based Machine Translation (EBMT). An EBMT system translates by analogy, using past translations to translate other, similar source-language material into the target language. In this paper we introduce EXTRA (EXample-based TRanslation Assistant), a complete EBMT system that exploits some innovative ideas in information retrieval and multilingual text management to effectively and efficiently extract useful suggestions from past translations and present them to the translator. This work has been developed as a joint work with the LOGOS group, a worldwide leader in multilingual document translation.


2003 - Un Metodo per il Riconoscimento di Duplicati in Collezioni di Documenti [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo
abstract

I recenti avanzamenti nella potenza di calcolo e nelle telecomunicazioni hanno creato le giuste condizioni per la diffusione globale di enormi moli di informazioni elettroniche e di nuovi strumenti per l’analisi del loro contenuto, sollevando problemi di information overload e, in particolare, di duplicate detection. I duplicati, cioe' documenti molto simili che contengono approssimativamente le stesse informazioni, degradano l’efficacia e l’efficienza delle ricerche e, spesso, costituiscono anche violazioni di copyright. In questo articolo introduciamo DANCER (Document ANalysis and Comparison ExpeRt), un sistema completo di duplicate detection che sfrutta idee innovative nell’ambito dell’information retrieval per l’identificazione dei documenti duplicati, utilizzando algoritmi e misure di similarita' inedite in questo campo e sufficientemente fini da ottenere una buona efficacia nella maggior parte delle applicazioni. Inoltre, il sistema propone diverse nuove tecniche di data reduction che permettono di ridurre sia il tempo di esecuzione che lo spazio richiesto per la memorizzazione dei dati, senza compromettere la buona qualita' dei risultati.


2002 - A syntactic approach for searching similarities within sentences [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo
abstract

Textual data is the main electronic form of knowledge representation. Sentences, meant as logic units of meaningful word sequences, can be considered its backbone. In this paper, we propose a solution based on a purely syntactic approach for searching similarities within sentences, named approximate sub2sequence matching. This process being very time consuming, efficiency in retrieving the most similar parts available in large repositories of textual data is ensured by making use of new filtering techniques. As far as the design of the system is concerned, we chose a solution that allows us to deploy approximate sub2sequence matching without changing the underlying database.


2002 - Searching Similar (Sub)Sentences for Example-Based Machine Translation [Relazione in Atti di Convegno]
Mandreoli, Federica; Martoglia, Riccardo; Tiberio, Paolo
abstract

Translation is a repetitive activity. The attempt to automate such a difficult task has been a long-term scientific dream; in the past years research in this field has acquired a growing interest, making some forms of Machine Translation (MT) a reality. Among the several types of approaches in MT, one of the most promising paradigms is MAHT and, in particular, example-Based Machine Translation (EBMT). An EBMT system translates by analogy, using past translations to translate other, similar sourcelanguage sentences into the target language. The basic premise is that, if a previously translated sentence occurs again, the same translation is likely to be correct. In this paper, we propose a solution based on a purely syntactic approach for searching similar sentences and parts of them in an EBMT system; the underlying similarity measure is based on the similarity between sequence of terms such that the sentences most close to a given one are those who maintain most of the original form and contents. The system efficiently retrieves and ranks the most similar sentences available and, when no useful suggestion exists, it proceeds with the retrieval of similar parts. We opted for a design that would require minimal changes to existing databases and whose similarity measure and search algorithms are completely independent from the involved languages. This work has been developed as a joint work with LOGOS S.p.A., a worldwide leader in multilingual document translation.


1999 - Distributed Database Support for Data-Intensive Workflow Application [Relazione in Atti di Convegno]
Bergamaschi, Sonia; S., Castano; C., Sartori; Tiberio, Paolo; Vincini, Maurizio
abstract

Venice, Italy