TMR: A Semantic Recommender System Using Topic Maps on the Items’ Descriptions

Garrido, Angel Luis; Ilarri, Sergio

doi:10.1007/978-3-319-11955-7_21

Angel Luis Garrido⁷ &
Sergio Ilarri⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8798))

Included in the following conference series:

European Semantic Web Conference

1835 Accesses
2 Citations

Abstract

Recommendation systems have become increasingly popular these days. Their utility has been proved to filter and to suggest items archived at web sites to the users. Even though recommendation systems have been developed for the past two decades, existing recommenders are still inadequate to achieve their objectives and must be enhanced to generate appealing personalized recommendations effectively. In this paper we present TMR, a context-independent tool based on topic maps that works with item’s descriptions and reviews to provide suitable recommendations to users. TMR takes advantage of lexical and semantic resources to infer users’ preferences and thus the recommender is not restricted by the syntactic constraints imposed on some existing recommenders. We have verified the correctness of TMR using a popular benchmark dataset.

You have full access to this open access chapter, Download conference paper PDF

Semantics-Aware Content-Based Recommender Systems

SJORS: A Semantic Recommender System for Journalists

Article Open access 21 December 2023

Pattern-based hybrid book recommendation system using semantic relationships

Article Open access 06 March 2023

Keywords

1 Introduction

Machine learning, information retrieval, data mining, natural language processing, and probabilistic models have been adopted for developing systems that recommend items like books, songs, and movies, for example. Our proposed system, TMR (Topic Map Recommender), is a semantic, ontological, and linguistic enhanced recommendation system, which takes advantage of natural language processing (NLP) and semantic tools to provide personalized item suggestions tailored to the preferences of individual users. Unlike its counterparts, TMR examines the “meaning” of textual item metadata, such as content descriptions and reviews on items to be recommended, considered during the recommendation process, as opposed to simply syntactically analyse the words in the texts.

There are already some semantic and ontological approaches such as [1, 2]. TMR differs from them in the way the system generates abstractions of themes and subject areas from items and user profiles. For this purpose, the system uses topic maps, a kind of diagram that shows relationships between concepts within a context. As a representation of a conceptualization corresponds to the definition of an ontology [3], we can use techniques and methodologies from ontological engineering to model these representations and work with them. Furthermore, unlike its ontology-based counterparts, TMR does not depend on the availability of a domain ontology, since it is not domain-dependent: ontologies in the form of topic maps are automatically built by the system.

2 Our Proposed Methodology

The main idea is to represent both the user’s likes and dislikes and the items. We will use topic maps to represent all this information, and we will compare the corresponding topic maps in order to evaluate the degree of similarity between the likes/dislikes of the user and the items. The more similar the representation of an element is with respect to the representation of the profile of what a user likes/disklikes, the more likely we are to recommend it/not to recommend it.

We use text descriptions of the items (the items that can be recommended to the user, as well as the items that the user valued positively –likes– and negatively –dislikes–) and other user’s reviews. All this information is contained in natural language texts, so we need an information extraction tool to exploit it. To obtain the relevant data in order to build the topic maps from text, TMR adopts TM-Gen [4], which is a tool that extracts information from any number of texts and represents them in a topic map format.

TM-Gen scans the texts to find the most important keywords and the main named entities [5]. It divides the text into sentences and assigns them a relevance score, in order to find those that are most important in the text. Afterwards, TM-Gen analyzes syntactically the sentences to find the best candidates to be a topic, and then it establishes associations between them, creating the relations. We have adapted this method to analyse the items’ descriptions in TMR.

TMR examines the descriptions using Freeling^{Footnote 1}, an NLP tool. The system then proceeds to extract concepts and the corresponding relationships among them using the aforementioned techniques from TM-Gen. The different topic maps obtained are merged into a single one using SIM (Subject Identity Measure) [6], an existing approach that describes the relationships among two subjects or topics. As part of the topic map generation process, TMR performs a semantic analysis of the topic map and simplifies it if the system finds redundancies, incompatible associations, or ambiguities, using for this purpose lexical databases (i.e., WordNet), Linked Data resources like DBPedia, and a disambiguation engine [7] (similar to that used in [8]).

TMR analyzes also each item review to find relevant information, which is used to enrich the topic map of the item. As the language used in the reviews is usually much less formal than the one employed in item descriptions, it is more difficult to use parsers to extract information. For this reason, TMR lemmatizes the texts in the reviews and extracts the most frequent keywords and named entities using the well-known TF-IDF algorithm [9]. These extracted keywords and named entities are incorporated into the topic map as new elements either as topics or as relationships, by using Freeling’s morphological analyzer.

The next step in the TMR’s recommendation process is to construct a profile of the user which captures his/her preferences, by examining the ratings that he/she has previously assigned to other items. In doing so, TMR generates two different topic maps: one for the likes (TMlikes), and another one for the dislikes (TMdislikes). The texts used to build those topic maps are the ones describing the corresponding data items.

The last step applied by TMR in making suggestions involves predicting the degree to which a user will like (or not) a new item. TMR evaluates the degree of similarity between the topic map of an item and each of the topic maps that capture the likes and dislikes of the user. To calculate the similarity between topic maps, TMR employs an algorithm we developed that evaluates the resemblance between the topics of any two topic maps. This algorithm is based on two measures introduced in [10]: lexical similarity and relation overlap; while the first measure calculates the lexical overlap between strings, the second one quantifies the degree to which the relations of two concepts in an ontology match. Using Eq. 1, TMR yields a score for an item on a [1, 10] range.

$$\begin{aligned} Rate(Item)=Norm[(Sim(TMlikes) - Sim(TMdislikes))] \end{aligned}$$

(1)

where Sim captures the degree of similarity between the corresponding topic map of likes and dislikes and the one corresponding to the item, and $Norm$ is a function that maps the differences in similarity scores from a [$-1$, 1] range to a [1, 10] range.

3 Experiments

To evaluate the performance of TMR, we have used the BookCrossing dataset as a test case. BookCrossing is a popular benchmark dataset commonly-used to assess the performance of book recommendation systems. We apply the popular five-fold cross validation protocol. For each one of the five repetitions, $85\,\%$ of the books rated by a user $U$ in a set of users $BX$ were used to model $U$’s likes/dislikes (i.e., $U_{train}$) and the remaining $15\,\%$ ($U_{test}$) were used for actual testing.

In our empirical study, we quantified the performance of a recommender system $R$ using the Root Mean Squared Error (RMSE), as shown in Eq. 2, which is a de facto metric for evaluating predictive recommendation systems.

$$\begin{aligned} RMSE(R) = \frac{\sum _{U \in BX}\sqrt{\frac{\sum _{b \in U_{test}} |R_{U,b}-r_{U,b}|}{|U_{test}|}}}{|BX|} \end{aligned}$$

(2)

where $R_{U,b}$ denotes the rating $predicted$ by $R$ for a book $b$ ($\in U_{test}$) given the corresponding user $U$, and $r_{U,b}$ is the $actual$ rating given to $b$ by $U$.

We executed each experiment five times, and the overall RMSE score is the average of the RMSE scores computed for each repetition. In our experiments, the RMSE score generated using TMR is 1.25. Its performance, in terms of RMSE, is much higher than some baseline recommenders like SVD++ [11] (4.67) and Bias-SVD [11] (3.94). If we compare TMR with other state-of-the-art recommenders like fLDA [12] (1.31), RLMF [12] (1.32), and uLDA [12] (1.35), we find that our results are very promising, given the significant difference obtained with respect to its counterparts.

4 Conlusions and Future Work

In this paper we have presented TMR, a domain-independent recommender that combines semantic and ontological techniques with NLP tools and lexical resources to made recommendations suitable to the preferences/interests of each individual user. In principle, TMR can work in any context where a textual description and textual reviews of the data items are available. We conducted an empirical study with the BookCrossing dataset and obtained positive results.

Our intention now is to verify the generality of our solution. For this purpose, we will evaluate the performance of TMR using other datasets to prove that our system is indeed context-independent. Comparing the proposal with other domain-specific recommenders in different contexts is also a relevant task of future work, as we can expect a trade-off between the generality of the proposal and its performance, that needs to be quantified.

Notes

1.
http://nlp.lsi.upc.edu/freeling/

References

Cantador, I., Bellogin, A., Castells, P.: A multilayer ontology-based hybrid recommendation model. AI Commun. 21(2–3), 203–210 (2008)
MathSciNet MATH Google Scholar
IJntema, W., Goossen, F., Frasincar, F., Hogenboom, F.: Ontology-based News Recommendation. In: ACM EDBT/ICDT Workshops, Article 16, 6p (2010)
Google Scholar
Gruber, T.: A translation approach to portable ontology specifications. Knowl. Acquis. 5(2), 199–220 (1993). (Academic Press Ltd.)
Article Google Scholar
Garrido, A.L., Buey, M.G., Escudero, S., Ilarri, S., Mena, E., Silveira, S.: TM-Gen: a topic map generator from text documents. In: IEEE ICTAI, pp. 735–740 (2013)
Google Scholar
Sekine, S., Ranchhod, E.: Named Entities: Recognition, Classification and Use. John Benjamins, Amsterdam (2009)
Book Google Scholar
Maicher, L., Witschel, H.: Merging of distributed topic maps based on the subject identity measure (SIM) approach. Berliner XML-Tage 4, 301–307 (2004)
Google Scholar
Navigli, R.: Word sense disambiguation: a survey. ACM Comput. Surv. 41(2), 1–69 (2009)
Article Google Scholar
Granados Buey, M., Luis Garrido, Á., Escudero, S., Trillo, R., Ilarri, S., Mena, E.: SQX-Lib: developing a semantic query expansion system in a media group. In: de Rijke, M., Kenter, T., de Vries, A.P., Zhai, C.X., de Jong, F., Radinsky, K., Hofmann, K. (eds.) ECIR 2014. LNCS, vol. 8416, pp. 780–783. Springer, Heidelberg (2014)
Chapter Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)
Article Google Scholar
Maedche, A., Staab, S.: Measuring similarity between ontologies. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, p. 251. Springer, Heidelberg (2002)
Chapter Google Scholar
Koren, Y.: Factorization meets the neighborhood: a multifaceted collaborative filtering model. In: ACM SIGKDD, pp. 426–434 (2008)
Google Scholar
Agarwal, D., Chen, B.: fLDA: matrix factorization through latent dirichlet allocation. In: ACM WSDM, pp. 91–100 (2010)
Google Scholar

Download references

Acknowledgment

Supported by CICYT project TIN2010-21387-C02-02 and DGA-FSE. Thank you to Maria Soledad Pera, Maria G. Buey, Sandra Escudero and Alvaro Peiro.

Author information

Authors and Affiliations

IIS Department, University of Zaragoza, Zaragoza, Spain
Angel Luis Garrido & Sergio Ilarri

Authors

Angel Luis Garrido
View author publications
You can also search for this author in PubMed Google Scholar
Sergio Ilarri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Angel Luis Garrido or Sergio Ilarri .

Editor information

Editors and Affiliations

ISTC-CNR, Rome, Italy
Valentina Presutti
Linköping University, Linköping, Sweden
Eva Blomqvist
EURECOM, Biot, France
Raphael Troncy
Hasso-Plattner-Institut, Potsdam, Brandenburg, Germany
Harald Sack
Ionian University, Corfu, Greece
Ioannis Papadakis
Elsevier B.V., Amsterdem, The Netherlands
Anna Tordai

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Garrido, A.L., Ilarri, S. (2014). TMR: A Semantic Recommender System Using Topic Maps on the Items’ Descriptions. In: Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I., Tordai, A. (eds) The Semantic Web: ESWC 2014 Satellite Events. ESWC 2014. Lecture Notes in Computer Science(), vol 8798. Springer, Cham. https://doi.org/10.1007/978-3-319-11955-7_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-11955-7_21
Published: 16 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11954-0
Online ISBN: 978-3-319-11955-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

TMR: A Semantic Recommender System Using Topic Maps on the Items’ Descriptions

Abstract

Similar content being viewed by others

Semantics-Aware Content-Based Recommender Systems

SJORS: A Semantic Recommender System for Journalists

Pattern-based hybrid book recommendation system using semantic relationships

Keywords

1 Introduction

2 Our Proposed Methodology

3 Experiments

4 Conlusions and Future Work

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

TMR: A Semantic Recommender System Using Topic Maps on the Items’ Descriptions

Abstract

Similar content being viewed by others

Semantics-Aware Content-Based Recommender Systems

SJORS: A Semantic Recommender System for Journalists

Pattern-based hybrid book recommendation system using semantic relationships

Keywords

1 Introduction

2 Our Proposed Methodology

3 Experiments

4 Conlusions and Future Work

Notes

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation