1 Research Background

Citizens are increasingly aware of the influence of environmental and meteorological conditions on their quality of life. One consequence of this awareness is the demand for high-quality environmental information that is tailored to one's specific context and background (e.g., health conditions, travel preferences), i.e., that is personalized. Personalized environmental information may need to cover a variety of aspects (such as meteorology, air quality, pollen, and traffic) and take into account a number of personal attributes of the user (health, age, allergies, etc.), as well as the intended use of the information. For instance, a pollen-allergic person planning outdoor activities may want to be notified whether the pollen situation in the area may trigger symptoms, or whether the temperature is too high for physical exercise, while a city administrator needs to be informed whether the current air quality situation requires urgent action.

So far, only a few approaches have been proposed that address how such information can be provided in technical terms. All of them focus on a single environmental aspect, and only very few address the problem of information personalization [2, 7, 9]. We aim to address the above task in its full complexity.

In this work, carried out in the context of the PESCaDO EU project, we take advantage of the fact that the World Wide Web already hosts a great range of services (i.e., websites that provide environmental information) offering data on each of the above aspects, such that, in principle, the required basic data are available. The challenge is threefold: first, to discover and orchestrate these services; second, to process the obtained data in accordance with the needs of the user; and, third, to communicate the gained information in the user's preferred mode.

The demonstration aims, in particular, at showing how semantic web technologies are exploited to address these challenges in PESCaDO.

2 The PESCaDO Platform: Main Modules and Key Semantic Technologies Used

The challenges mentioned in Sect. 1 require the involvement of a large number of rather heterogeneous applications addressing various complex tasks: discovery of the environmental service nodes on the web, distillation of the data from webpages, orchestration of the environmental service nodes, fusion of environmental data, assessment of the data with respect to the needs of the addressee, selection of user-relevant content and its delivery to the addressee, and, finally, interaction with the user. Thus, in PESCaDO we developed a service-based infrastructure to integrate all these applications.

For a general overview of the running PESCaDO service platform and the type of information produced, see: http://www.youtube.com/watch?v=c1Ym7ys3HCg. In this section, we focus on presenting three tasks we addressed by applying semantic web technologies.

The backbone of the PESCaDO service platform, exploited in each of these three tasks, is an ontology-based knowledge base, the PESCaDO Knowledge Base (PKB), in which all the information relevant to a user request is dynamically instantiated. The ontology, partially built by exploiting automatic key-phrase extraction techniques [8], formalizes a variety of aspects related to the application context: environmental data, environmental nodes, user requests, user profiles, warnings and recommendations triggered by environmental conditions, logico-semantic relations (e.g., cause, implication) between facts, and so on. The current version of the ontology consists of 241 classes, 672 individuals, 151 object properties, and 43 datatype properties.

2.1 Discovery of Environmental Nodes

The first step towards the extraction and indexing of environmental information is the discovery of environmental nodes, which can be considered a problem of domain-specific search. To this end, we implemented a node discovery framework that builds upon state-of-the-art domain-specific search techniques, advanced content distillation, ontologies, and supervised machine learning. The framework consists of three main parts: (a) web search, (b) post-processing, and (c) indexing and storage. Web search is realized with the aid of a general-purpose search engine that accesses large web indices; in this implementation we employ the Yahoo! Search BOSS API. In order to generate domain-specific queries, we apply two complementary techniques. First, we use the ontology of the PKB to extract concepts and instances referring to types of environmental data (e.g., temperature, birch pollen, PM\(_{10}\)) and combine them with city names automatically retrieved from geographical resources. In addition, the queries are expanded with keyword spices [6], which are domain-specific keywords extracted from environmental websites with the aid of machine learning techniques.
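
To illustrate the query generation step, the following minimal sketch combines environmental concepts with city names and a keyword spice. All concrete values, as well as the class and method names, are illustrative rather than taken from the actual implementation.

```java
import java.util.ArrayList;
import java.util.List;

/**
 * Minimal sketch of domain-specific query generation. In practice the
 * concepts come from the PKB ontology, the cities from a geographical
 * resource, and the keyword spice from offline learning; all values
 * here are invented for illustration.
 */
public class QueryGenerator {

    public static List<String> generateQueries(List<String> concepts,
                                               List<String> cities,
                                               String keywordSpice) {
        List<String> queries = new ArrayList<>();
        for (String concept : concepts) {
            for (String city : cities) {
                // Combine an environmental concept, a place name, and the
                // learned keyword spice into one search-engine query.
                queries.add(concept + " " + city + " " + keywordSpice);
            }
        }
        return queries;
    }

    public static void main(String[] args) {
        List<String> concepts = List.of("temperature", "birch pollen", "PM10");
        List<String> cities = List.of("Helsinki", "Valladolid");
        // A keyword spice is a learned disjunction of domain-specific terms.
        String spice = "(forecast OR \"air quality\" OR pollen)";
        generateQueries(concepts, cities, spice).forEach(System.out::println);
    }
}
```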

During the post-processing step, we perform supervised classification with Support Vector Machines (SVMs) to separate relevant from irrelevant nodes, and we crawl each website to further expand the search iteratively. The relevance and categorization of the nodes are determined by a classifier that operates on a weight-based vector of key phrases and concepts drawn from the content and structure of the webpages. Subsequently, we parse the body and the metadata of the relevant webpages in order to extract the structure and the clues that reveal the information presented.
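
The relevance decision itself can be pictured as a decision function over the weighted feature vector. The sketch below assumes a linear SVM whose weights have been learned offline; the feature names, weight values, and class structure are hypothetical.

```java
import java.util.Map;

/**
 * Illustrative relevance decision on a weight-based vector of key phrases
 * and concepts, assuming a linear SVM trained offline. All features and
 * weights are invented for the example.
 */
public class NodeRelevanceClassifier {

    private final Map<String, Double> svmWeights; // learned feature weights
    private final double bias;                    // learned bias term

    public NodeRelevanceClassifier(Map<String, Double> svmWeights, double bias) {
        this.svmWeights = svmWeights;
        this.bias = bias;
    }

    /** True if the page's feature vector falls on the relevant side of the
     *  linear decision boundary. */
    public boolean isRelevant(Map<String, Double> pageFeatures) {
        double score = bias;
        for (Map.Entry<String, Double> f : pageFeatures.entrySet()) {
            score += svmWeights.getOrDefault(f.getKey(), 0.0) * f.getValue();
        }
        return score > 0.0;
    }

    public static void main(String[] args) {
        NodeRelevanceClassifier clf = new NodeRelevanceClassifier(
                Map.of("air quality", 1.4, "pollen", 1.1, "login", -0.8), -0.5);
        Map<String, Double> page = Map.of("air quality", 2.0, "pollen", 1.0);
        System.out.println("relevant = " + clf.isRelevant(page)); // true
    }
}
```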

Finally, the information obtained for each relevant node is indexed in a Sensor Observation Service (SOS) [5] compliant repository, from which it can be retrieved by the system when a user request is submitted.
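
As an illustration of how such a repository can be queried, the sketch below builds a KVP GetObservation request, assuming an SOS 2.0 GET binding; the endpoint URL and the offering and property identifiers are placeholders, not the actual PESCaDO values.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

/**
 * Hedged sketch of querying an SOS-compliant repository, assuming an
 * SOS 2.0 KVP (GET) binding. Endpoint and identifiers are placeholders.
 */
public class SosClient {

    public static String getObservations(String endpoint, String offering,
                                         String observedProperty) throws Exception {
        String url = endpoint
                + "?service=SOS&version=2.0.0&request=GetObservation"
                + "&offering=" + URLEncoder.encode(offering, StandardCharsets.UTF_8)
                + "&observedProperty=" + URLEncoder.encode(observedProperty, StandardCharsets.UTF_8);
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url)).GET().build();
        // The response is an Observations & Measurements (O&M) XML document.
        return client.send(request, HttpResponse.BodyHandlers.ofString()).body();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(getObservations(
                "http://example.org/sos",            // placeholder endpoint
                "urn:example:offering:airquality",   // placeholder offering id
                "urn:example:property:PM10"));       // placeholder property id
    }
}
```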

The whole discovery procedure is automatic; however, an administrative user can intervene through an interactive user interface in order to select geographic regions of interest for the discovery, optimize the selection of keyword spices, and parameterize the training of the classifiers.

2.2 Processing Raw Environmental Data to Obtain Content

The user interface of the PESCaDO system guides the user in formulating a request, which is instantiated in all its details (e.g., type of request, user profile, time period, geographic location) in the PKB. By applying Description Logics (DL) reasoning to the PKB, the system determines from the request description which types of environmental data constitute the raw content necessary to fulfil the user's needs. A specific component of the system is then responsible for selecting from the SOS repository the actual values (observed, forecasted, historical) of the selected types of environmental data and for instantiating them appropriately in the PKB.
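
The following sketch illustrates how such a DL-reasoning step could look with the OWL API and HermiT, assuming the PKB is serialized as an OWL file and that a hypothetical class RequiredDataType classifies the data types a request needs; all IRIs and file names are placeholders.

```java
import java.io.File;

import org.semanticweb.HermiT.ReasonerFactory;
import org.semanticweb.owlapi.apibinding.OWLManager;
import org.semanticweb.owlapi.model.IRI;
import org.semanticweb.owlapi.model.OWLClass;
import org.semanticweb.owlapi.model.OWLDataFactory;
import org.semanticweb.owlapi.model.OWLNamedIndividual;
import org.semanticweb.owlapi.model.OWLOntology;
import org.semanticweb.owlapi.model.OWLOntologyManager;
import org.semanticweb.owlapi.reasoner.OWLReasoner;

/**
 * Sketch of the DL-reasoning step with the OWL API and HermiT. The class
 * RequiredDataType and all IRIs are hypothetical stand-ins for the PKB
 * vocabulary.
 */
public class RequiredDataTypes {

    public static void main(String[] args) throws Exception {
        OWLOntologyManager manager = OWLManager.createOWLOntologyManager();
        OWLOntology pkb = manager.loadOntologyFromOntologyDocument(new File("pkb.owl"));
        OWLDataFactory df = manager.getOWLDataFactory();

        // HermiT classifies the request description instantiated in the PKB.
        OWLReasoner reasoner = new ReasonerFactory().createReasoner(pkb);
        OWLClass required = df.getOWLClass(
                IRI.create("http://example.org/pescado#RequiredDataType")); // placeholder IRI

        // Individuals inferred to belong to the class are the environmental
        // data types whose values must be fetched from the SOS repository.
        for (OWLNamedIndividual ind : reasoner.getInstances(required, false).getFlattened()) {
            System.out.println(ind.getIRI());
        }
    }
}
```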

At this stage, the raw data retrieved from the environmental nodes are processed to derive additional personalized content, such as data aggregations, qualitative scalings of numerical data, and user-tailored recommendations and warnings triggered by the environmental data relevant to the specific user query. Logico-semantic relations are also instantiated at this stage, for instance to represent that a certain pollen concentration value triggers a recommendation to the user because of the user's sensitivity to that pollen.

The computation of this inferred content is performed by the decision support service of the PESCaDO Platform by combining complementary reasoning strategies, including DL reasoning and rule-based reasoning. A two-layer reasoning infrastructure is currently in place: the first layer exploits the HermiT reasoner for the OWL DL reasoning services; the second layer, stacked on top of the first, uses the Jena RETE rule engine to perform the rule-based computation.
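
The rule-based layer can be illustrated with a small Jena example. The rule below, which infers a pollen warning for a sensitive user, is a simplified stand-in for the actual PESCaDO rules; the namespace, property names, and threshold are invented for the example.

```java
import org.apache.jena.rdf.model.InfModel;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.rdf.model.ModelFactory;
import org.apache.jena.rdf.model.Resource;
import org.apache.jena.reasoner.rulesys.GenericRuleReasoner;
import org.apache.jena.reasoner.rulesys.Rule;

/**
 * Simplified stand-in for a decision-support rule, run with Jena's rule
 * engine in forward-RETE mode. The vocabulary is hypothetical.
 */
public class RecommendationRules {

    public static void main(String[] args) {
        String ns = "http://example.org/pescado#"; // placeholder namespace

        // If the user is sensitive to a pollen type and an observation of
        // that pollen exceeds a threshold, link a warning to the user.
        String rules =
            "[pollenWarning: (?u <" + ns + "sensitiveTo> ?p) " +
            " (?obs <" + ns + "observes> ?p) (?obs <" + ns + "value> ?v) " +
            " greaterThan(?v, 50) -> (?u <" + ns + "receivesWarning> ?obs)]";

        Model data = ModelFactory.createDefaultModel();
        Resource user = data.createResource(ns + "user1");
        Resource birch = data.createResource(ns + "birchPollen");
        Resource obs = data.createResource(ns + "obs42");
        data.add(user, data.createProperty(ns + "sensitiveTo"), birch);
        data.add(obs, data.createProperty(ns + "observes"), birch);
        data.add(obs, data.createProperty(ns + "value"), data.createTypedLiteral(80));

        GenericRuleReasoner reasoner = new GenericRuleReasoner(Rule.parseRules(rules));
        reasoner.setMode(GenericRuleReasoner.FORWARD_RETE);
        InfModel inf = ModelFactory.createInfModel(reasoner, data);

        // Prints the inferred (user1, receivesWarning, obs42) statement.
        inf.listStatements(user, inf.getProperty(ns + "receivesWarning"), obs)
           .forEachRemaining(System.out::println);
    }
}
```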

2.3 Generating User Information from Content

As is common in Natural Language Generation, our information generator is divided into two major modules: the text planning module and the linguistic generation module (with the latter taking as input the text plan produced by the former).

Text Planning. The text planning module is divided into a content selection module and a discourse structuring module. As is common in report generation, our content selection is schema- (or template-)based. Accordingly, the ontology of the PKB introduced above defines a class Schema with an \(n\)-ary schema component object property whose range can be any individual of the PKB itself.

Similarly to [1], we assume the output of the discourse structuring module to be a well-formed text plan consisting of (i) elementary discourse units (EDUs) that group together individuals of the PKB, (ii) discourse relations between EDUs and/or individuals of the PKB, and (iii) precedence relations between EDUs. This structure translates into two top classes of the ontology of the PKB: EDU, with an \(n\)-ary EDU component relation and a linear precedence property, and Discourse Relation, with nucleus and satellite relations. A set of SPARQL queries is defined to instantiate the various concepts and relations.
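
As an illustration, one such rule could be rendered as a SPARQL Update run over the PKB with Jena, as sketched below; the p: vocabulary is a hypothetical rendering of the EDU and Discourse Relation classes just described, not the exact PKB terms.

```java
import org.apache.jena.rdf.model.Model;
import org.apache.jena.rdf.model.ModelFactory;
import org.apache.jena.update.UpdateAction;

/**
 * Sketch of a discourse-structuring rule as a SPARQL Update, executed with
 * Jena over the instantiated PKB. Vocabulary is hypothetical.
 */
public class DiscourseStructuring {

    static final String RULE =
        "PREFIX p: <http://example.org/pescado#> " +
        "INSERT { _:rel a p:DiscourseRelation ; " +
        "               p:nucleus ?edu1 ; p:satellite ?edu2 . } " +
        "WHERE { " +
        // A logico-semantic 'cause' link between individuals grouped in two
        // EDUs is mapped to a discourse relation between those EDUs.
        "  ?edu1 a p:EDU ; p:eduComponent ?x . " +
        "  ?edu2 a p:EDU ; p:eduComponent ?y . " +
        "  ?x p:cause ?y . }";

    public static void main(String[] args) {
        Model pkb = ModelFactory.createDefaultModel(); // the instantiated PKB would be loaded here
        UpdateAction.parseExecute(RULE, pkb);
    }
}
```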

Content Selection (CS) operates on the output of the decision support service. It selects the content to be included in the report and groups it by topic, instantiating a number of schemas for each topic. The inclusion of a given individual in a schema can be subject to restrictions defined in the queries; for example, if the minimum and maximum air quality index (AQI) values are identical, or if the maximum AQI value triggers a user recommendation or warning, then only the maximum AQI value is selected and the minimum is omitted.
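
This restriction can be paraphrased as a simple selection function, sketched below; in the platform the check is expressed inside the SPARQL queries, so the Java rendering and the method name are purely illustrative.

```java
import java.util.List;

/**
 * Minimal paraphrase of the AQI content-selection restriction described
 * above. Method name and types are illustrative only.
 */
public class AqiContentSelection {

    static List<Double> selectAqiValues(double minAqi, double maxAqi,
                                        boolean maxTriggersWarning) {
        // If min and max coincide, or the max triggers a recommendation or
        // warning, reporting only the maximum is sufficient.
        if (minAqi == maxAqi || maxTriggersWarning) {
            return List.of(maxAqi);
        }
        return List.of(minAqi, maxAqi);
    }

    public static void main(String[] args) {
        System.out.println(selectAqiValues(3.0, 5.0, true));  // [5.0]
        System.out.println(selectAqiValues(2.0, 4.0, false)); // [2.0, 4.0]
    }
}
```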

Discourse structuring is carried out by a pipeline of three rule-based submodules: (i) Elementary Discourse Unit (EDU) Determination, which groups topically related PKB individuals into propositional units starting from the schemas determined during CS; (ii) Mapping logico-semantic relations to discourse relations; and (iii) EDU Ordering, which introduces a precedence relation between EDUs using a number of heuristics derived from interviews with domain communication experts.

Linguistic Generation. Our linguistic generation module is based on a multilevel linguistic model in the sense of the Meaning-Text Theory (MTT) [4], such that generation consists of a series of mappings between structures of adjacent strata (from the conceptual stratum to the linguistic surface stratum): Conceptual Structure (ConStr) \(\Rightarrow \) Semantic Structure (SemStr) \(\Rightarrow \) Deep-Syntactic Structure (DSyntStr) \(\Rightarrow \) Surface-Syntactic Structure (SSyntStr) \(\Rightarrow \) Deep-Morphological Structure (DMorphStr) \(\Rightarrow \) Surface-Morphological Structure (SMorphStr) \(\Rightarrow \) Text. Starting from the conceptual stratum, for each pair of adjacent strata \(\mathcal{S}_i\) and \(\mathcal{S}_{i+1}\), a transition grammar \(\mathcal{G}^i_{i+1}\) is defined; see [3].
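
Schematically, the generator can be viewed as a composition of transition functions, one per pair of adjacent strata, as in the sketch below; the string-based structures are stand-ins, since in MTT each stratum is a graph or tree and each grammar a set of mapping rules.

```java
import java.util.function.Function;

/**
 * Schematic view of the stratified generation pipeline: each transition
 * grammar G_i^{i+1} is modelled as a function from one stratum's structure
 * to the next. Structures here are string wrappers for illustration only.
 */
public class GenerationPipeline {

    record ConStr(String content) {}
    record SemStr(String content) {}
    record DSyntStr(String content) {}
    record Text(String content) {}

    public static void main(String[] args) {
        Function<ConStr, SemStr> conToSem =
                c -> new SemStr("sem(" + c.content() + ")");
        Function<SemStr, DSyntStr> semToDSynt =
                s -> new DSyntStr("dsynt(" + s.content() + ")");
        // Surface-syntactic and morphological strata elided for brevity.
        Function<DSyntStr, Text> dsyntToText =
                d -> new Text("text(" + d.content() + ")");

        // The generator is the composition of the transition grammars.
        Function<ConStr, Text> generate =
                conToSem.andThen(semToDSynt).andThen(dsyntToText);
        System.out.println(generate.apply(new ConStr("pollenWarning")).content());
    }
}
```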

The ConStr is derived from the text plan produced by the text planning component. In a sense, a ConStr can thus be considered a projection of selected fragments of the ontologies onto a linguistically motivated structure. ConStrs are language-independent and thus an ideal starting point for multilingual generation.

3 System Demonstration

The system demonstration will show how the PKB is instantiated and exploited by the different services composing the PESCaDO Platform in the context of two application scenarios: one about health-safety decision support for end users and one about administrative decision support. In particular, demo attendees will have the chance to see how raw environmental data are dynamically processed with ontology-based techniques to obtain reports. Furthermore, we will demonstrate how to use and set up the tool for environmental node discovery.