1 Introduction

With the rapid development of Internet and Web technologies, the volume of online information is increasing across various domains, such as e-commerce, e-government, and e-learning applications. Recommender Systems (RSs) have emerged as effective tools for assisting users in quickly and efficiently finding the information they need within the vast amount of content available online. In recent years, Recommender systems have gained significant importance as a crucial research area within information retrieval and decision support systems. Recommender systems employ information filtering techniques, primarily focused on predicting whether a specific user will be interested in a particular item or identifying a set of items that align with a user’s preferences [1,2,3,4].

Numerous filtering recommendation techniques have been proposed in the literature to enhance Recommendation Systems. Major recommendation techniques include Content-Based Filtering (CB), Collaborative Filtering (CF), Knowledge-Based Filtering, and Hybrid Recommendation, which is a combination of two or more recommendation techniques [1, 2]. Although CB and CF approaches are more popular in practical applications, both suffer from some limitations. For instance, the CB filtering approach tends to result in overspecialization with which the diversity in the recommendation results eventually vanishes, while the CF approach suffers from data sparsity and scalability problems. Moreover, both approaches encounter a significant challenge known as the ‘cold-start problem’. This problem arises when making recommendations to new users and/or items for which the available information is limited. As a result, the recommendations offered in such cases tend to be of poor quality and lack usefulness. On the other hand, hybrid recommendation approaches can enhance recommendation quality and address the primary limitations of traditional approaches. Most hybrid recommendation approaches involve combining CF with other recommendation approaches [2, 3, 5, 6].

The quality of recommendations heavily relies on the similarity measures used to compare items or users. With respect to early recommendation techniques, similarity measures primarily relied on examining the content descriptions and user profiles, often ignoring to consider meaningful aspects that had the potential to greatly improve the accuracy of recommendations [6,7,8,9,10].

In recent recommendation research works, the incorporation of semantic enhancement based on semantic similarity has received increasing attention, with research findings showing promising results [4, 11,12,13,14,15]. Semantic-enhanced recommendation techniques have the potential to provide more accurate recommendations by integrating semantic information from user and/or item profiles into the recommendation process. These techniques mainly utilize Semantic Web technologies, including ontologies and semantic reasoning, to improve the similarity assessments used in traditional CB and CF approaches.

To address the gap between early and recent recommendation research, four primary hybridization techniques can be identified: (i) augmenting domain knowledge represented in ontologies with item-based CF to enhance similarity estimations [14, 16], (ii) enhancing the conventional user-based CF approach by incorporating semantic information from ontologies [17,18,19,20], (iii) combining user-based and/or item-based CF with semantic-enhanced Content-Based filtering [9, 13, 21], and (iv) integrating semantic user-based and/or item-based CF with semantic-enhanced CB filtering [12, 22].

Through an extensive literature review, it was observed that existing semantic similarity measures employed to enhance CF recommendations primarily rely on direct hierarchical (i.e., taxonomical) relationships among items and their semantic descriptions, as defined by ontological data type and object properties. While progress has been made in developing efficient strategies for semantic similarity estimations, limited attention has been given to reasoning about indirect relationships between ontological instances (i.e. items). Such indirect relationships have the potential to uncover hidden connections between seemingly disparate entities, enabling a richer semantic representation. This, in turn, facilitates semantic analysis of heterogeneous content, revealing valuable insights into the similarities and differences between ontological instances.

In this paper, the primary focus is the integration of semantic knowledge, represented within an ontology, into the item-based CF recommendation approach to enhance the item similarity estimations and recommendations quality. Consequently, a new ontology-based semantic similarity measure and semantic enhanced hybrid recommendation approach are proposed. The proposed approaches aim to bridge the gap between Semantic Web technologies and their practical implementation in recommendation systems. It contributes to the existing literature and offers direction for developing more effective and practical semantic-based recommendation systems.

The rest of this paper is organized as follows. Section 2 briefs the background and related work. Section 3 presents the new ontology-based semantic similarity measures along with an illustrative example. Section 4 presents the proposed semantic-based CF approach using the new SS measure. Section 5 outlines the experimental design and procedure. Section 6 demonstrates the experimental scenario results and discusses the main findings. Section 7 concludes and highlights particular future work directions.

2 Background and Literature Review

The notion of an ontology, as defined by Gruber [23], refers to an “explicit specification of a conceptualization.” Subsequently, different definitions of ontology are presented, Borst [24] defined an ontology as a “formal specification of a shared conceptualization”. Ontology is also defined as a “knowledge domain conceptualization into a computer processable format, which axioms” [25]. The use of ontologies allows for semantic access to information in complex environments, providing an effective means of representing and describing information at a high level of abstraction [23].

Ontologies serve as valuable tools for formally modeling the structures of domain systems based on observed relationships. One of the main goals of using ontologies is to model the information at the semantic level [26]. Consequently, ontology has a growing interest in different applications such as semantic information retrieval, information integration, semantic web service discovery and matching, text mining, e-learning as well as web-based personalized applications and recommendation systems [10, 15, 27,28,29].

Recommendation systems highlight the significance of ontologies as they provide a structured and clear representation of knowledge in a precisely defined format. By utilizing this valuable semantic knowledge, decision-making processes can be significantly improved. Furthermore, this approach enables us to provide more informed and precise recommendations. In essence, it empowers decision-makers with richer information, facilitating better informed choices and more accurate recommendations. [7, 11, 12, 27, 29]. Integrating ontological semantic knowledge into the recommendation process takes various forms, with a common emphasis on evaluating the semantic similarity between items or users in order to enhance recommendation accuracy [9, 12,13,14, 16, 18, 20, 28, 30, 31]. In the subsequent section, a comprehensive review of research studies exploring semantic similarity strategies, in alignment with the utilization of a domain ontology, is presented.

2.1 Semantic Similarity Using Ontology

Based on an extensive literature review, semantic similarity approaches for evaluating ontological concepts can be classified into three main categories: distance-based, information-theoretic (IC), and feature-based approaches [32,33,34,35].

Distance-based approaches assess concept similarity within a specific taxonomy by considering the distance or edge length between them. One well-known measure is the shortest path approach, where shorter paths indicate greater similarity [36]. Wu and Palmer [37] proposed a measure that incorporates the relative depth of concepts within a reference taxonomy. Subsequently, several distance-based similarity measures have been developed to enhance similarity calculations [32, 35, 38, 39].

Although distance-based metrics offer simplicity and computational efficiency due to their reliance on taxonomy construction, they are subject to limitations reported in the literature [29, 38,39,40,41,42]. Distance-based metrics heavily depend on the specific taxonomy and are influenced by subjective decisions during taxonomy engineering. Moreover, distance-based approaches exhibit limited reliability in distance calculation due to their assumption of uniform distances between edges in a taxonomy structure [43, 44].

Resnik [45] proposed the Information Content (IC) approach as an alternative to distance-based methods, aiming to address the limitations and unreliability of such metrics. The IC approach relies less on detailed taxonomy structure and instead measures conceptual similarity based on shared information between two concepts. The more shared information, the greater the similarity. Resnik [45] suggests that the concept of the Least Common Ancestor, encompassing both concepts in the ontology’s taxonomy, can effectively represent this shared information. Building on Resnik’s work, Jiang and Conrath [46] refined the measure by combining the IC approach with a distance-based methodology. Their approach surpasses the semantic similarity measures proposed by Resnik [45] and Rada et al. [36].

Lin [47] expanded upon Resnik’s Information Content (IC) measure by introducing the concept of commonality information. Lin’s enhancement suggests that the similarity between two concepts is determined by the extent to which they share information. In this approach, Lin incorporated not only the IC of the Least Common Ancestor of the compared concepts but also included the IC values of the concepts themselves in the measurement. Conventional IC methods combine hierarchical structure and usage statistics to compute IC values, which can be computationally expensive. Seco et al. [41] proposed an intrinsic measure that relies solely on the hierarchical structure (taxonomy) without involving statistics. This approach experimentally outperforms traditional IC similarity methods.

In feature-based approaches, the representation of concepts to be compared is done through a set of features. These features encompass various information elements such as the concept’s neighborhood (ancestors, descendants, and siblings), synonyms, attributes, functions, and more. The utilization of such measures effectively addresses the limitations observed in previous measures [33, 43, 44, 48, 49].

In recent research endeavors, particularly in the field of recommender system applications, an increasing number of studies have embraced the integration of multiple semantic similarity approaches. The objective behind this integration is to enhance both the accuracy and efficiency of the systems. This type of combination is commonly referred to as hybrid semantic similarity. The resultant composite similarity measure typically combines two or more combinations of distance-based methodologies, IC-based approaches, and feature-based techniques, leveraging domain-specific data in diverse ways [27, 29, 30, 32, 42, 50]. A study conducted by Lastra-Díaz et al. [32] and Hussain et al. [44] provides a detailed categorization of semantic similarity measures.

2.2 Semantic Collaborative Filtering Recommendation

In the domain of recommendation systems, collaborative filtering is the most successful technique [1,2,3,4, 51]. It is based on the assumption that finding people with similar interests and recommending items that they like in an effective way to provide interesting content. Collaborative filtering works with user-item historical data, where each user rates items based on their preferences. Collaborative filtering methods require past user ratings to predict and recommend items. The prediction process involves calculating similarities between users and items using distance measures. Collaborative filtering can be categorized into user-based and item-based approaches. Both approaches use user-item rating data to compute similarity for making recommendations. However, item-based algorithms focus on exploring relationships between items rather than relationships between users, resulting in distinct prediction computations compared to the user-based approach. Despite being widely used in practice, collaborative filtering (CF) approaches have several limitations such as data sparsity and the cold-start problem for new items and users. These limitations negatively impact the quality and accuracy of recommendation predictions.

Researchers have frequently chosen to employ hybrid recommendation filtering to achieve improved recommendation quality while also mitigating the limitations associated with traditional Collaborative Filtering (CF) recommendation approaches. One stream of hybrid recommendation filtering focuses on combining CF filtering algorithms with other RSs techniques, specifically content-based filtering. Another stream of hybrid recommendation research has emerged, called semantic-based hybrid filtering, in which it focuses on incorporating the underlying semantics of users or items in the recommendation process [4, 8, 15]. Such hybrid filtering utilizes semantic Web technologies and features, specifically ontologies, as means that can allow RSs to extract useful knowledge about users or items, which ultimately leads to producing effective predictions. The focus of this study is on utilizing semantic knowledge extracted from oncology along with item-based CF, thus a review of related research studies has been conducted within this scope.

Ontology has demonstrated its effectiveness in addressing certain limitations of collaborative filtering algorithms. Its implementation has shown promising results in improving recommendation system accuracy and alleviating the challenges posed by new item cold-start problems and sparsity.

To improve the accuracy and reliability of collaborative filtering recommendations within e-commerce, Martín-Vicente et al. [17] presented a new strategy based on an ontology, which encapsulates the semantic representation of commercial goods. The exploitation of semantics enables reasoning about the data stored in the users’ profiles and inferring new knowledge. Such new knowledge enhances recommendations in two ways: by refining neighborhood selection based on candidate trustworthiness and by adjusting the prediction process to value contributions from more experienced users.

Martín-Vicente, et al. [52] introduced an improved hybrid approach for recommending products in the e-commerce domain. Their hybrid approach mainly avoids selecting fake neighborhoods in collaborative filtering recommender systems with diverse products. The authors adopted a new strategy, which implements a semantic procedure for calculating the similarity between users, based on a measure of the semantic similarity between the products they have rated, aiming to enhance flexibility in similarity detection and reduce the sparsity of CF. By using the hierarchical structure of an ontology, the strategy intelligently categorizes products and determines their similarity based on how close they are to the hierarchy and how many common ancestors they have. This structure also helps to choose the best level of abstraction needed to form neighborhoods.

Tarus et al. [13] proposed a hybrid knowledge-based recommendation approach, which integrates ontology and sequential learning patterns for recommending learning resources to learners. Ontology encapsulates knowledge about learners and resources, while a sequential pattern mining algorithm is applied to discover the learner’s historical sequential learning patterns. Collaborative filtering is employed to compute similarities of ratings and make predictions for a target learner. The findings of this approach show its effectiveness in addressing the cold-start problem using ontology-based domain knowledge. Additionally, it overcomes sparsity by leveraging learners’ sequential patterns, ensuring accurate predictions even with a few ratings.

Kermany and Alizadeh [22] proposed a novel hybrid recommendation approach that combines ontological semantic CF filtering with user demographics in multi-criteria recommendations for movie recommendation purposes. The Jaccard metric and cosine similarity are used to compute user-based and item-based similarities for accurate predictions. The semantic similarity is computed by considering taxonomy-based relations between movie items. The data sparsity problem is tackled by blending these similarities with optimized weights obtained through a gradient descent algorithm. Nilashi et al. [14] proposed a hybrid recommendation approach by integrating CF and dimensionality reduction. An ontology is used in order to describe the semantic relations between concepts. Only the taxonomy-based relations are considered to compute semantic similarity between items. Their approach was tested in the movie recommendation domain, showing enhanced recommendation accuracy.

A new hybrid recommendation approach proposed by Bagherifard et al. [12] uses ontology to improve new item suggestions. It combines collaborative and content-based filtering, refining ontology structure and semantic similarity measurement for better accuracy. The refined semantic similarity measure is enhanced by removing uniformity in hierarchical relationships between concepts in item ontology. The similarity is computed by Enhanced clustering driven by ontology reduces search needs for similar clusters and users.

According to Chew et al. [21], they discovered that incorporating ontology significantly boosted the accuracy of the matrix factorization model. This enhancement was achieved by merging item-based and user-based collaborative filtering techniques to enrich the user-item matrix with additional data. This enriched dataset combined semantic similarity with user ratings, addressing the “cold start” problem in recommender systems, where there’s limited data for new users/items.

The noticeable features so far of related research studies are focused on incorporating semantic knowledge along with CF in different aspects. Some research studies combine knowledge of content (i.e. items) with the traditional item-based CF approach as in [16, 17, 19]. Other research studies incorporate semantic knowledge of items with the user-based CF approach [19, 20, 22, 30, 52, 53]. Moreover, combining user-based or item-based CF approaches with semantic-enhanced CB filtering was adopted as effective research in recommender systems [9, 12]. Lastly, other streams of studies focus on combining semantic knowledge along with CF and data mining models [13, 14, 21].

While the incorporation of semantic knowledge has certainly enhanced the recommendation techniques previously mentioned, its utility remains relatively limited and the research in this area is still open. This is mainly because the ontological connections between instances are not always effectively addressed, and only a few research endeavors have tackled this issue.

While existing studies have incorporated semantic knowledge with CF in various aspects [16, 17, 19], and different streams of research explore combinations of user-based or item-based CF with semantic-enhanced content-based filtering [9, 12], our work introduces a novel fusion-based semantic similarity measure. This measure, integrated with item-based CF, aims to evaluate item similarities within a specific domain ontology, addressing gaps in the current literature regarding the effective handling of ontological relationships between instances. The proposed approach enhances the depth of semantic assessment, focusing on explicit hierarchical relationships among items, shared attributes, and implicit item relationships.

Our research contributes to the state-of-the-art by introducing an innovative fusion-based semantic similarity measure and integrating it with the standard item-based Collaborative Filtering (CF) similarity. By leveraging refined semantic similarity measures offered by ontology, our hybrid recommendation approach aims to provide more accurate and high-quality recommendations.

In the following sections, we will provide a thorough explanation of both the semantic similarity measure and the hybrid recommendation approach.

3 The Proposed Semantic Similarity Measure

This section presents the proposed semantic similarity measure, motivated by Resnick’s 1995 assumption of semantic similarity, which suggests: “the more information two concepts share in common, the more similar they are”. To achieve this assumption, the proposed semantic similarity measures focus on exploring all possible shared information between items that are defined in the considered domain ontology, to effectively estimate their similarity. The proposed measure, entitled “fusion-based semantic similarity measure”, takes into account three kinds of semantic knowledge: (i) taxonomy-based knowledge, (ii) content-based knowledge, and (iii) inference-based (i.e., implicit) knowledge. The taxonomy-based knowledge is utilized to compute the similarity of items based on their hierarchical positions in the considered taxonomy of specific domain ontology. The content-based knowledge is utilized to compute the similarity of items based on their common semantic description as defined in the considered domain ontology using OWL datatype properties. The implicit-based knowledge is utilized to compute the similarity of items through reasoning extensively their semantic description (i.e. object properties) to infer hidden relationships that might be useful for computing similarity.

Beyond these kinds of semantic knowledge, the proposed semantic similarity measure consists of three types of similarity measures including (i) hierarchical-based similarity (ii) attribute-based similarity, and (iii) inferred-based similarity. The proposed semantic similarity measure is termed a fusion-based semantic similarity measure.

To illustrate the proposed fusion-based semantic similarity measure and its comprised metrics, this section first introduces a formal description of the ontology model and definition and then describes the proposed semantic similarity measure.

3.1 Ontology Model Definition

An ontology is defined as “a set of representational primitives that are relevant for modeling a domain of knowledge or discourse. These primitives typically consist of a set of concepts or entities within a domain, relationships among these concepts, and attributes that distinguish each concept” [23]. A formal definition of ontology structure, as introduced by Maedche and Zacharias [31]:

Definition 1

(Ontology) An ontology structure is a six-tuple \(O:=\)< C, P, A, \({H}^{c}\), prop, att > , where C represents the concept set defined in O; P is a set of relationships defined in O, each \((p\in P)\) has a domain and range which are at least one concept of the set C; A is a set of attributes defined in O; \({H}^{c}\) is a directed transitive relation \({H}^{c}\subset C\times C\) which is also called concept taxonomy, \({H}^{c}\left({c}_{2},{c}_{1}\right)\) means c2is-ac1, or c2 is a sub-concept of c1; prop is a function, i.e. \(prop: P\to C \times C\), that relates concepts non-taxonomically, e.g. the function \(prop\left( {p}_{1}\right)=({c}_{1},{c}_{2})\) means that the concept \({c}_{1}\) is related to the concept \({c}_{2}\) through \({p}_{1}\); and att is a function, i.e. \(att: A \to C\), that relates concepts with literal values such as string, integer, Boolean, etc.

In OWL domain ontology, concepts are linked either directly through asserted relationships or indirectly through implicit relationships. Asserted relationships are explicitly defined by ontology developers within a specific domain. Implicit relationships, on the other hand, are generated through reasoning based on asserted relationships, aiming to formulate valuable knowledge about the domain. Asserted relationships encompass two primary types: taxonomical (hierarchical) and non-taxonomical relations. Taxonomical relations denoted as HC in Definition 1, establish hierarchical links among all concepts within a particular domain. These relations are often represented in a hierarchical tree structure, where one class is identified as a subclass of another. Conversely, non-taxonomical relations are explicitly established using OWL properties for each concept, describing their various characteristics, attributes, interconnections, restrictions, and other logical assertions available within the OWL ontology.

A detailed description of semantic similarity measures is presented in the subsections below.

3.2 Hierarchical-Based Similarity

The hierarchical-based Similarity aims to compute the similarity between instances in a domain of ontology with respect to the concepts that they belong to in the hierarchical structure \({H}^{c}\). The hierarchical structure \({H}^{c}\) is generated using semantic reasoning. In this study, Protégé editor is employed as a platform to develop OWL ontology for the considered domain (i.e. recommending movies). Under this platform, Pellet descriptive logic reasoner is used to generate class hierarchy by automatically classifying whether or not one class is a subclass of another class, it uses the description of the classes to determine if a super-class/subclass relationship exists between them [54].

Hierarchical-based similarity computes the similarity between ontology instances based on their positions within the taxonomy structure (\({H}^{c}\)) representing concepts. This study incorporates the similarity metric proposed by [55], which was originally extended based on the work done by Seco, et al. [41], to assess instances’ similarity. Thus, for two instances \({I}_{x}\) and \({I}_{y}\), their hierarchical-based similarity, denoted as \({Sim}_{H}\left({I}_{x},{I}_{y}\right),\) is calculated as follows:

$${Sim}_{H}\left({I}_{x},{I}_{y}\right)= IC\left({LCA}_{{I}_{x},{I}_{y}}\right),$$
(1)

where \(IC({LCA}_{{I}_{x},{I}_{y}})\) denotes intrinsic IC of given two instances \({I}_{x} {\text{and}} {I}_{y}\), which is obtained with regard to their Least Common Ancestor (LCA) of concepts that subsumes them in the considered \({H}^{c}\). Considering the fact that the instances in OWL ontology may have more than one parent concept, we define \({LCA}_{{I}_{x},{I}_{y}}\) as the most informative LCA for \({I}_{x} {\text{and}} {I}_{y}\), which is the pair of parent concepts that has the highest IC. For example, if the parent set of two given instances \({I}_{x} {\text{and}} {I}_{y}\) is \(\left\{{c}_{1},{c}_{2,}\right\} \mathrm{and }\left\{{c}_{3}\right\}\), respectively, the \({LCA}_{{I}_{x},{I}_{y}}\) can be expressed as follows:

$$max\left(IC\left({LCA}_{{c}_{1},{c}_{3}}\right),IC\left({LCA}_{{c}_{2},{c}_{3}}\right)\right),$$
(2)

The IC of a concept can be calculated as follows:

$$IC\left(c\right)=1-\frac{{\text{log}}(hypo\left(c\right)+1)}{{\text{log}}({max}_{cons})}, 0\le IC(c)\le 1$$
(3)

where c is a concept in \({H}^{c}\), hypo is a function that returns the number of hyponymsFootnote 1 of a given concept c and \({max}_{cons}\) is the number of concepts that exist in the taxonomy under consideration \({H}^{c}\).

3.3 Attribute-Based Semantic Similarity Metric

Attribute-based semantic similarity measures the similarity among ontology instances based on their shared attributes, defined using OWL datatype properties. These properties connect individuals to data values like strings, integers, Booleans, times, and dates. For example, in the Movie dataset ontology, we have properties like “movie’s name,” “movie’s audience,” and “movie’s actors.”

In OWL ontologies, instances can have literal datatype values, which come in various types like interval-scaled, binary, nominal, ordinal, ratio, or mixed. Each data type requires a different approach when calculating similarity. In this study, detailed similarity metrics for each data type are explored, and the Jaccard coefficient, as outlined in [56], is employed to compute the attribute-based semantic similarity.

For two instances \({I}_{x}\) and \({I}_{y}\), N is the set of their common datatype properties, their attribute-based similarity, denoted as \({Sim}_{attr}\left({I}_{x},{I}_{y}\right)\), is defined formally as follows:

$${Sim}_{attr}\left({I}_{x},{I}_{y}\right)=\frac{{\sum }_{{p}_{i\in N}}{ AttrSim}_{{p}_{i}}\left({I}_{x},{I}_{y}\right)}{N},$$
(4)

where \({p}_{i}\in N\) denotes each datatype property \({p}_{i}\) belongs to the set of common datatype properties \(N\), \({AttrSim}_{{p}_{i}}({I}_{x},{I}_{y})\) function represents the attribute similarity between instances \({I}_{x}\) and \({I}_{y}\) concerning the datatype property \({p}_{i}\). It is important to note that formula (4) yields a result of zero when both instances do not have any shared datatype properties.

The similarity function in Formula (4), \({AttrSim}_{{p}_{i}}\left({I}_{x},{I}_{y}\right)\), varies based on the data type of the property \({p}_{i}\). In the considered case study, which is the Movie dataset, we focus on nominal (categorical) datatype properties, which generalize binary data by allowing multiple values. Thus, we employ the Jaccard coefficient [56], a commonly used measure for asymmetric information on binary and non-binary variables, to calculate attribute-based similarity between instances sharing common datatype properties.

3.4 Inferred Attribute-Based Semantic Similarity Metric

Inferred attribute-based semantic similarity aims to measure similarity between instances within a specific domain ontology by examining their shared OWL object properties. Object properties, as described in Definition 1, establish connections between instances of different concepts. These connections create chains of linked instances within a given OWL domain ontology. Starting with a particular instance in a chain, we can infer other instances by following the links along the entire chain. Consequently, exploring object properties among instances along the entire chain helps reveal implicit semantic relationships, which can effectively contribute to computing semantic similarity between instances. For example, in the considered Movie domain ontology, it is important to note that tracing any movie, as illustrated in, using any of its object properties could lead to the discovery of new movies that may share common information (i.e., data and object properties).

As an example, the similarity between “The Good Dinosaur” with “Rio2” movies is computed by comparing their datatype and object properties. Regarding the datatype property, the two movies are compared by comparing their values of each datatype property using Formula 4. Accordingly, three datatype properties will be compared, these properties include “hasAudience”, “hasExperience” and hasPlotAspect”. While object properties of two movies are compared based on their instances that are linked through the same object property. As shown in Fig. 1, three object properties are common between “The Good Dinosaur” with “Rio2” movies, including “isDirectedBy”, “hasStar” and “hasGenre” object properties. As can be seen in Fig. 1, both movies have in common “hasStar” object property, “Steve Zahan”, who is a star in “The Good Dinosaur” movie also a star in “Escape from Planet Earth” movie. While “George Lopez” is a star in “Rio” movie and he is also a star in “Escape from Planet Earth” movie. This indicates that new information, in this case, the “Escape from Planet Earth” movie, extracted by tracing the object property named “hasStar” could contribute to the total similarity between “The Good Dinosaur” and “Rio” movies. Further comparison between “The Good Dinosaur” movie and “Rio” movie will be carried out for the other object properties including the “isDirectedBy” and “hasGenre” to calculate the overall object properties similarities.

Fig. 1
figure 1

Sample of the utilized the MovieLens ontology

The inferred-based semantic similarity, denoted as \({Sim}_{infer}\left({I}_{x},{I}_{y}\right)\), for given two instances \({I}_{x}\) and \({I}_{y}\), is defined formally as follows:

$${Sim}_{infer}\left({I}_{x},{I}_{y}\right)=\frac{\sum_{i=1}^{M}{Sim}_{{objpr}_{i}}\left({I}_{x},{I}_{y}\right)}{M}$$
(5)

where M denotes the number of common object properties that are shared by \({I}_{x}\) and \({I}_{y}\), \({Sim}_{{objpr}_{i}}\left({I}_{x},{I}_{y}\right)\) is the similarity between \({I}_{x}\) and \({I}_{y}\) with regards to the ith object property (i = 1,2, …, M) and is calculated as follows:

$${Sim}_{{objpr}_{i}}\left({I}_{x},{I}_{y}\right)=$$
$$\frac{1}{{L_{i} }}\mathop \sum \limits_{{I_{{x_{l} }} \in R\left( {obj\Pr_{i} , I_{x} } \right), l = 1}}^{{l = L_{i} }} \left( { \frac{1}{{K_{i} }}\mathop \sum \limits_{{I_{{y_{k} }} \in R\left( {obj\Pr_{j} ,I_{y} } \right), k = 1}}^{{k = K_{i} }} \left( {Sim_{H} \left( {I_{{x_{l} }}^{i} ,I_{{y_{k} }}^{j} } \right) + Sim_{att} \left( {I_{{x_{l} }}^{i} ,I_{{y_{k} }}^{j} } \right) + Sim_{\inf er} \left( {I_{{x_{l} }}^{i} ,I_{{y_{k} }}^{j} } \right)} \right) /3} \right),$$
(6)

where \(R\left({objPr}_{i},{ I}_{x}\right)\) and \(R\left({objPr}_{i},{ I}_{y}\right)\) denote the set of instances (i.e. object property range) for \({I}_{x}\) and\({I}_{y}\), respectively. \({L}_{i}\) denotes the number of instances in the range of \({I}_{x}\) ( \(R\left({objPr}_{i},{ I}_{x}\right) )\) and \({K}_{i}\) is the number of instances in the range of \({I}_{y}\) (\(R\left({objPr}_{i},{ I}_{y}\right)\)).

The calculation procedure of computing Inferred-attribute-based SS between a given two instances in specific domain ontology consists of the following steps:

Step 1: retrieve the instances that connected with \({I}_{x}\) and \({I}_{y}\) via object property range \({objPr}_{i}\), Suppose \(\left\{{I}_{x}^{1},{I}_{x}^{2},...,{I}_{x}^{l}\right\}\) and \(\left\{{I}_{y}^{1},{I}_{y}^{2},...,{I}_{y}^{k}\right\}\) sets are retrieved as \(R\left({objPr}_{i},{ I}_{x}\right)\) and \(R\), respectively. Then, a set of elements, denoted as “pair-instances-set”, is formed. Each element in this set represents a pair of corresponding instances from both mentioned sets, i.e. \(\left\{{(I}_{x}^{1},{I}_{y}^{1}),\left({I}_{x}^{1},{I}_{y}^{2}\right),...,\left({{I}_{x}^{1},I}_{y}^{k}\right),\dots , {{(I}_{x}^{l},I}_{y}^{1}),{{(I}_{x}^{l},I}_{y}^{2}),\dots ,({{I}_{x}^{l},I}_{y}^{k})\right\}\).

Step 2: evaluate each pair of instances that formed the “pair-instances-set”. For example, starting with the first pair of instances \({(I}_{x}^{1},{I}_{y}^{1})\), the evaluation is performed by computing their hierarchical-based similarity and attribute-based similarity, as well as further evaluation is performed recursively on their corresponding common object properties. The evaluation of the respective pair, i.e. \(\left({I}_{x}^{1},{I}_{y}^{1}\right)\), continues till identical instances or instances with no common object properties are reached (i.e. to prevent infinite computation a maximum recursion depth is defined).

Step 3: The resulting computed similarity measures of the pair of instances \(\left({I}_{x}^{1},{I}_{y}^{1}\right)\) are aggregated, as shown in Formula 7, along with the inferred similarity result of each traversed element in the “pair-instances-set”.

Step 4: Repeat steps 1, 2 and 3 for all M common object properties between the instances \({I}_{x}\) and \({I}_{y}\), as defined in Formula 6.

Step 5: calculate the final result of the Inferred-based SS measure between \({I}_{x}\) and \({I}_{y}\), using Formula 6, as the average of inferred similarity measures obtained as a result of traversing all common object properties.

The inferred attribute-based similarity measure depends on a semantic reasoning strategy in which it recursively traverses each common object property between two compared instances. Such reasoning leads to infer all possible instances that might share similar properties; the more inferred instances the more similar the compared instances would be.

3.5 Fusion-Based Semantic Similarity Measure

In this study, a Fusion-based semantic similarity measure is developed to effectively evaluate similarity between instances. It aims to capture rich semantic knowledge of instances by exploiting all possible shared semantics between instances in a given domain ontology, in line with Resnik’s principle that suggests "the more shared information between concepts, the greater their similarity”. Accordingly, the Fusion-based semantic similarity measure integrates all the utilized semantic similarity metrics for calculating the semantic similarity between instances. Thus, it targets significant sources of semantic knowledge to define the similarity between instances, including their structural information, direct content knowledge that is defined using datatype properties, and implicit knowledge that is inferred using object properties.

The Fusion-based semantic similarity measure between a given two instances,\({I}_{x}\) and \({I}_{y}\) is computed as the proportional rate of three similarity measures including hierarchical-based similarity, attribute-based similarity, and inferred attribute-based similarity to a variable ℱ named weight factor. The weight factor ℱ indicates how much the similarity of each involved measure contributes to the similarity measure. The Fusion-based semantic similarity between \({I}_{x}\) and is defined as follows::

$$FusionSim\left( {I_{x} ,I_{y} } \right) = \frac{{\gamma Sim_{H} \left( {I_{x} ,I_{y} } \right) + \beta Sim_{attr} \left( {I_{x} ,I_{y} } \right) + \lambda Sim_{\inf er} \left( {I_{x} ,I_{y} } \right)}}{{\mathcal{F}}},$$
(7)

where γ, β, and λ are semantic parameters determined by the contribution of each involved similarity measure in total similarity using Formula 7. Semantic parameters γ, β, and λ are set to a constant value one. The weight factor ℱ is assigned to summation values of semantic parameters γ, β, and λ. The \({Sim}_{Hier}({I}_{x},{I}_{y})\) measure denotes the hierarchical-based similarity and is calculated using Formula 1, \({Sim}_{attr}\left({I}_{x},{I}_{y}\right)\) measure denotes the attribute-based similarity and is calculated using Formula 4 and \({Sim}_{infer}({I}_{x},{I}_{y})\) measure denotes the inferred attribute-based similarity and is calculated using Formula 6. The fusion similarity of two instances ranges from 0 to 1, with higher similarity yielding values closer to 1.

4 The Proposed Semantic-Based CF Recommendation Approach

In this paper, we correspondingly proposed a new hybrid recommendation method called the “Fusion-based Semantic CF” approach. This approach combines traditional item-based collaborative filtering with fusion-based semantic similarity measures to enhance recommendation quality. This integration offers several advantages; it improves item similarity by incorporating ontology-based semantic knowledge alongside item ratings. It also addresses common issues in collaborative filtering, such as sparsity and cold-start items. Ultimately, this approach holds the promise of significantly boosting prediction accuracy in recommendations.

The framework of the proposed semantic-based improved collaborative filtering approach is presented in Fig. 2. It consists of four main phases. The first phase constructs the ontology repository of the considered domain. The other three phases form the computational procedure that is required to generate top-N recommendations. The procedure phases of the proposed Fusion-based semantic collaborative filtering are demonstrated in detail as follows:

Fig. 2
figure 2

The introduced framework of the proposed semantic-based improved collaborative filtering approach

4.1 Phase 1: Ontology Construction and Items Ratings Extraction

To empower a recommendation system, ontology construction and item ratings extraction must be undertaken. Using the Protégé editor, ontology repository, a structured knowledge base encompassing related concepts, relationships, and attributes, are constructed. We acknowledge the inherent complexity involved in constructing ontologies, and its feasibility within this context relies on well-defined domain ontology. This includes the utilization of appropriate tools, understanding dataset characteristics, and aligning with the primary goal of enhancing the recommendation system. The involvement of domain experts becomes principal in ensuring the accurate representation of intricate relationships within the ontology. The knowledge repository, shaped by domain expertise, plays a pivotal role in enabling our system to comprehend the intricate connections within its designated domain. This process lays a robust foundation essential for generating context-aware recommendations.

Furthermore, the extraction of item ratings from the dataset under consideration is a crucial step in our proposed framework. This process involves creating users’ ratings on items, which are then organized into a user-item rating matrix R[m × n] with m users and n items. These ratings play a pivotal role in collaborative filtering calculations, serving as a foundational component of our recommendation framework.

4.2 Phase 2: Calculate the Similarity of Items Including Semantic Similarity

This includes:

First, calculate item similarity using the Pearson correlation coefficient

In this step, we compute pairwise item similarities for all items in the dataset. In item-based CF, the similarity between items is computed according to users’ ratings on items R \([m\times n]\). This study uses the Pearson Correlation PC coefficient to compute items’ similarities, as it demonstrates its superiority in performance compared to other similarity measures. Suppose \(U=\left\{{u}_{1},{u}_{2},\dots ,{u}_{m}\right\}\) be the set of all users,\(I=\left\{{I}_{1},{I}_{2},\dots ,{I}_{n}\right\}\) be the set of all items, and \({U}_{ij}\) be the set of users who rated items \({I}_{i}\) and \({I}_{j}\) together. The item-based CF similarity between a pair of items \({I}_{i}\) and \({I}_{j}\) using Pearson Correlation, denoted as \({Sim}_{PC}\left({I}_{i},{I}_{j}\right):I\times I\to [-\mathrm{1,1}]\), is given in the following formula:

$$Sim_{PC} \left( {I_{i} , I_{j} } \right) = \frac{{\mathop \sum \nolimits_{u = 1}^{{|U_{ij} |}} (r_{{u, I_{i} }} - \overline{r}_{{I_{i} }} )(r_{{u, I_{j} }} - \overline{r}_{{I_{j} }} )}}{{\sqrt {\mathop \sum \nolimits_{u = 1}^{{|U_{ij} |}} (r_{{u, I_{i} }} - \overline{r}_{{I_{i} }} )^{2} } \sqrt {\mathop \sum \nolimits_{u = 1}^{{|U_{ij} |}} (r_{{u, I_{j} }} - \overline{r}_{{I_{j} }} )^{2} } }},$$
(8)

where \({r}_{u, {I}_{i}}{\text{and}} {r}_{u, {I}_{j}}\) represent the rating of user u on items i and j separately, and \({\overline{r} }_{{I}_{i}} {\text{and}} {\overline{r} }_{{I}_{j}}\) is the average ratings of all users \(u \in {U}_{ij}\) on the \({i}^{th} \mathrm{and }{j}^{th}\) item, respectively. Thus, this step computes the similarity of each pair of items using users’ opinions (ratings) and stores items’ similarities resulting in a similarity matrix named \(SimPC[m\times m]\).

Second, calculate semantic similarity with the proposed fusion-based SS measure

Unlike the first step which is used to find the similarity of each pair of items with regard to users’ opinions (i.e. ratings), this step aims to compute similarity using the proposed Fusion-based semantic similarity measure. The semantic similarity degrees between each pair of items are calculated using Formula 7. The obtained similarities of pairwise items are stored in the item-item semantic similarity matrix denoted as \(SimSem[m\times m]\).

4.3 Phase 3: Integrate Semantic Similarity with Pearson Correlation

In this step, the obtained similarities from the previous two steps of each pair of items are integrated linearly to create a combined similarity. An integrated similarity matrix, called \(SemPCSim\left[{\text{m}}\times {\text{m}}\right],\) is created to store the combined similarity results of pairwise items. The integrated similarity of a pair of items \({I}_{i}\) and, denoted as \(TotalSim\left({I}_{i},{I}_{j}\right):I\times I\to [-\mathrm{1,1}]\), is defined in the following formula:

$$IntegrSim\left( {I_{i} ,I_{j} } \right) = \alpha \times FusionSim\left( {I_{i} ,I_{j} } \right) + \left( {1 - \alpha } \right) \times Sim_{PC} (I_{i} ,I_{j} ),$$
(9)

where \(\alpha\) is a semantic combination parameter specifying the weight of similarity measure in the integrated total similarity. If \(\alpha =0\), then the total similarity between \({I}_{i}\) and \({I}_{j}\) is attributed to the PC similarity, whereas, if \(\alpha =1\), then the total similarity between \({I}_{i}\) and \({I}_{j}\) is attributed to semantic similarity. Determining the optimal \(\alpha\) value is dataset-dependent, and a sensitivity analysis is needed to select the best \(\alpha\) value.

4.4 Phase 4: Generate Recommendations

This phase aims to suggest the most relevant items for users’ interests and preferences. In recommendation systems, several prediction methods are used to compute predictions of users’ ratings for unseen items (i.e. rating values between 1 and 5). However, the weighted sum method is employed in this study for computing the prediction of items’ ratings, since it is commonly used in recommender systems research.

Considering the weighted sum method, the prediction of a user’s rating for each un-rated item is calculated first by selecting the most similar items set to a target item \({I}_{i}\). Next, the intersection set between the set of rated items by the active user \({u}_{a}\) and the most similar items set to the target item \({I}_{i}\) is retrieved, this set is denoted as \({K}_{i}\). Finally, the prediction of a rating value is computed by summing up the ratings given by the active user to all those items in the intersection set, where each rating is weighted by the corresponding similarity between the target item \({I}_{i}\) (which is un-rated) and the others rated items in the intersection set. The prediction value of an active user \({u}_{a}\) on a target item denoted as \({P}_{{u}_{a},{I}_{i}}\), is defined using the weighted sum method as shown in the following Formula:

$${P}_{{u}_{a},{I}_{i}}=\frac{\sum_{q=1}^{{K}_{i}}{r}_{{u}_{a},{I}_{q}}\times TotalSim({I}_{i},{I}_{q})}{\sum_{q=1}^{{K}_{i}}TotalSim({I}_{i},{I}_{q})} ,$$
(10)

where the item \({I}_{q}\in {K}_{i}\), \({K}_{i}\) denotes the set of the most similar items to the target item \({I}_{i}\) and is rated also by the active user \({u}_{a}\), \({r}_{u,{I}_{q}}\) denotes the rating of an item \({I}_{q}\) by the user \({u}_{a}\), \(TotalSim({I}_{i},{I}_{q})\) denotes the combined similarity value of the target item \({I}_{i}\) and \({I}_{q}\) which is calculated by Formula 9. The predicted rating values of unseen items for the user \({u}_{a}\) are stored as a vector in the prediction matrix \({P}_{{u}_{a}}[1\times n]\).

Based on the matrix \({P}_{{u}_{a}}[1\times n]\), all unseen items will be sorted in descending order according to their predicted rating values and then the top-N items will be generated as the most relevant recommendations for the given user.

5 Experimental Design and Procedures

To evaluate the proposed Fusion-based Semantic CF approach, we present the experimental design, which includes details about the utilized dataset, evaluation metric employed, and parameter settings. Moreover, the experimental procedures are illustrated.

5.1 Utilized Dataset

This study employs the MovieLens benchmark dataset, accessible at https://grouplens.org/datasets/movielens/. Specifically, the academic and research dataset is utilized, comprising a randomly selected sample. This sample consists of a user-item rating matrix with 600 rows representing users and 1000 columns representing movie items, totaling approximately 35,000 ratings. The density of the user-item rating matrix of the utilized dataset is around 5%. It indicates the proportion of non-zero entries that are actually rated by users to the total entries in the utilized Movielens dataset. Users included in this dataset have provided a minimum of 20 ratings within this matrix, each user’s entry represents a rating score on a scale from 1 to 5, with 5 indicating high interest and 1 indicating low interest. Entries may also be null, indicating that the corresponding users have not rated those items. This resulting matrix is subsequently employed for computing item similarity using Pearson Correlation Formula 8 and making predictions for unrated items using Formula 10. Benchmark IMdb dataset (https://datasets.imdbws.com/) is used to construct the semantic description of movie items (i.e. instances) and their properties values. In this regard, a preprocessing of the Movielens enriched with the IMdb dataset was performed, where the required data of movie features and user ratings are firstly extracted from the available dataset files and then imported into protégé. The OWL ontology, named Movie Recommender (MovieRec) ontology, is constructed to represent semantic knowledge for the considered domain. The MovieRec ontology provides a detailed semantic description of movie-related concepts and their corresponding instances. It includes a formalized description of a set of concepts (i.e. classes), e.g. movies, genres, actors, actresses, etc., with their hierarchal relationships and properties. Such a formalized description is then utilized to meet the purpose of this study. In this regard, the semantic knowledge embedded in the constructed MovieRec ontology is exploited for computing movie item similarities using the proposed semantic similarity measures.

5.2 Evaluation Metric

This study employs the Mean Absolute Error metric (MAE) to evaluate the proposed recommendation approach, as it is one of the most commonly used metrics in recommendation research [19, 57, 58]. The MAE basically measures the accuracy of generated recommendations. It measures the deviation of prediction values of items from their actual rating values over all predicted items in the test dataset:

$$MAE = \frac{{\mathop \sum \nolimits_{i,j}^{n} \left| {a_{i,j} - p_{i,j} } \right|}}{n},$$
(11)

where n is the total number of predicted ratings over all users, \({p}_{i,j}\) is the predicted rating for user i on item j, and \({a}_{i,j}\) is the actual rating for user i on item j. The lower the MAE, the more accurate the prediction made by the recommendation approach.

5.3 Parameters Settings

In this study, we must emphasize two critical parameters that significantly impact prediction accuracy: neighborhood size (K) and the semantic combination parameter (α). These parameters play pivotal roles in shaping the performance and outcomes of the evaluation and thus warrant careful consideration and adjustment during the experimentation process. The neighborhood size K parameter represents the number of the most similar movie items for a given movie item as presented in phase 4, Sect. 4. It is utilized to compute the prediction values of unseen movie items. On the other hand, the semantic combination parameter \(\alpha\) used to specify the weight of semantic similarity in the combined similarity measure as described in Phase 3, Sect. 4.

On the other hand, the data density, defined by the training/test ratio, significantly impacts prediction accuracy, as discussed by [59]. For our subsequent experiments aimed at determining optimal values for parameters ‘K’ and ‘α’, we select a training/test ratio of 0.9. To ensure statistical robustness, we conducted tenfold cross-validations, randomly partitioning the user ratings dataset into 90% for training and 10% for testing in each fold. All reported results in this study represent averages over these 10 folds. It is worth noting that the training dataset was utilized to construct movie item-item similarity models, while the testing dataset was used to evaluate predicted accuracy for unseen items.

5.4 Experimental Procedure

In conducting the experimental evaluation, we implemented the utilized semantic similarity measures and their integration, both individually and alongside the item-based CF approach, using the Java NetBeans platform. Additionally, we included other competing approaches for evaluation purposes. To facilitate communication between the OWL ontology of the MovieLens dataset (generated as an OWL file by the Protégé editor) and the Java NetBeans platform, we employed OWLModel and Jena OntModel.

The effectiveness of the proposed semantic similarity measures in enhancing the performance of item-based CF is investigated. The evaluation involves three key comparison scenarios. First, we individually evaluate the influence of each semantic similarity measure on the recommendation accuracy in conjunction with the item-based CF approach, and the obtained results are analyzed. Second, we examine the impact of the proposed Fusion-based SS measure on recommendation accuracy when used alongside item-based CF. Finally, we investigate the highest recommendation accuracy attained by the aforementioned experimental scenarios relative to other benchmark methods, focusing on predictive accuracy, sparsity, and new item problems.

6 Experimental Results and Discussion

6.1 First Experimental Scenario

The objective of this experimental Scenario is to investigate how the proposed semantic similarity measures influence the accuracy of recommendations. To achieve this goal, each of the proposed semantic similarity measures is incorporated into the standard item-based CF approach, resulting in an integrated formula for computing item similarity. This integrated similarity measure is then applied to compute predictions using Formula 10. Table 1 presents the semantic similarity (SS) measures under consideration, which encompass the Pearson correlation coefficient, resultant integrated similarity formulas, and the associated semantic combination parameters.

Table 1 The integrated SS measures with item-based CF

The sensitivity of semantic combination parameters, presented in Table 1, is investigated in order to determine the impact of each integrated similarity measure with the item-based CF approach on prediction accuracy. The MAE metric (Formula 11) is used to compute the accuracy of prediction. Accordingly, a number of experiments were performed when the values of \(\gamma , \beta \mathrm{and }\lambda\) varied from 0 to 1, with an increment of 0.1. For each distinct value of these parameters, further experiments were run over different values of K. The neighborhood parameter, i.e. K, takes values between 10 and 80 with an increment of 10. Based on the obtained results, the minimum value of MAE has been conducted for each involved approach as well as the corresponding values of each semantic parameter and K parameter, as shown in Table 2. For instance, the integrated hierarchical-based SS with item-based CF approach achieved the lowest MAE when semantic parameter \(\gamma\) equals 0.4 and K parameter equals 60. As can be seen in Table 2, the best-achieved result of MAE (i.e. 0.77823) is obtained by the integrated Inferred-based SS with Item-based CF approach; this result is achieved when the K parameter equals 60 and semantic parameter \(\lambda\) equals 0.4.

Table 2 Parameters values that achieved the best prediction accuracy results

6.2 Second Experimental Scenario

This experimental Scenario aims to investigate the impact of integrating the proposed Fusion-based semantic similarity measure with item-based CF on recommendation accuracy. It also aims to validate its effectiveness against other semantic similarity measures listed in Table 2.

Accordingly, a set of experiments has been conducted to examine the sensitivity of semantic combination parameter α on the prediction accuracy. Items similarities are computed using the proposed integrated similarity as in Formula 9. The prediction accuracy has been computed using the prediction metric (Formula 10). The experiments were performed when the value of \(\alpha\) varied from 0 to 1, in an increment of 0.1, and parameter K takes values from 10 to 80 in an increment of 10. Each experiment was run at a distinct value of \(\alpha\) and different values of K. For each different combination of K parameter and \(\alpha\) parameter, the MAE is computed. Figure 3 shows the MAE of the proposed integrated Fusion-based semantic similarity with the Item-based CF approach. As can be seen in Fig. 3, each plotted value of the MAE represents the minimum achieved value at specific K (i.e. nearest neighbors of items) for each distinct value of parameter α. To sum up, the minimum value of MAE is (0.766433) and is achieved when parameter \(\alpha\) is 0.6 and the size of the K parameter is 60. Note that the obtained results for using different values of K are not reported here due to space availability.

Fig. 3
figure 3

Prediction accuracy of the integrated Fusion-based SS with Item-based CF approach

As shown in Fig. 3, note that when \(\alpha =0\), the prediction accuracy of the proposed recommendation approach is weak as the MAE value is quite high. In this case, the semantic similarity is not taken into account to generate recommendations and only the Item-based CF approach is considered (see Formula 9). In contrast, when \(\alpha =1\), the proposed recommendation approach has made worse predictions because the semantic similarity is only considered to generate recommendations. This indicates that integrating the proposed semantic similarity with the item-based CF leads to improved recommendation outcomes. Nevertheless, the parameter α exhibits a positive influence on prediction accuracy, especially within the range of 0.2 to 0.8, where recommendation accuracy shows improvement. The optimal accuracy is achieved when α is set to 0.6. The experimental findings emphasize the enhancing effect of the parameter on prediction accuracy. Additionally, the effect of both similarity measures yields advantageous outcomes, emphasizing the essential integration of users’ ratings data with the proposed Fusion-based semantic similarity measure.

It is important to mention that, the semantically enhanced CF approach proposed by Mobasher (2003) is sensitive as well to parameter alpha as it linearly combines the item-based CF similarity and the semantic similarity. The combined similarities are used to generate predictions of unseen items. The parameter alpha is tested by following the same procedure as explained above used to test our approach. Based on the running experiments, the achieved optimal value of parameter alpha was 0.4 and K was 60.

6.3 Third Experimental Scenario

In terms of comparing the performance of the proposed Fusion-based semantic CF recommendation approach with other benchmark approaches, we have implemented two approaches proposed in the field of RSs. These approaches include the standard item-based CF proposed by Sarwar et al. [59] and the advanced semantically enhanced CF approach proposed by Mobasher et al. [16]. Standard item-based CF relies solely on users’ ratings for recommendations, while semantically enhanced CF aims to boost accuracy by addressing sparsity and new item problems. It combines item descriptions from web-based ontologies with users’ ratings to make predictions. Consequently, several experiments were conducted to investigate the effectiveness of the proposed Fusion-based Semantic CF recommendation approach compared to other benchmark approaches. The results of these experiments are outlined below.

6.3.1 Enhancement in Predictive Accuracy

The impact of the proposed Fusion-based semantic CF recommendation approach is evaluated against other benchmark approaches. A number of experiments are performed at different values of parameter K which varies from 10 to 80 with an increment of 10 and the parameter α is set to its optimal value for our proposed approach and semantically enhanced-CF approach proposed by Mobasher et al. [16] as illustrated in the second previous experiment. Figure 4 shows the MAE improvement of the proposed approach against other competing approaches according to the K parameter. It shows that the proposed approach achieves substantially better results in terms of prediction accuracy compared to the other competing approaches. As Fig. 4 shows, when the size of the K parameter is higher than 60, the MAE value tends to gradually increase for all compared approaches. The underlying rationale can be attributed to the diminished capacity for accurate predictions by all approaches as the neighborhood size for each item expanded, intensifying the differences between items. The minimum MAE achieved by the proposed approach is 0.788661 and obtained when K is 60, thus this value is selected as the optimal value of the K parameter for the given dataset. Further, the minimum achieved MAE values for the standard item-based CF and semantically enhanced CF competing approaches are 0.866752 and 0.803506, respectively. However, the proposed approach result indicates effective improvement in prediction accuracy for all K parameter values when compared with other competing approaches.

Fig. 4
figure 4

Prediction accuracy of the proposed approach in comparison to benchmark approaches

6.3.2 Improvement Extent for Sparsity Problem

In this experiment, the effectiveness of the proposed approach in handling the sparsity problem is investigated and examined against other considered benchmark approaches. Sparsity is one of the main problems that negatively influence the prediction accuracy of CF recommendation approaches. It occurs once the entire ratings of a dataset are few compared to the total number of the items, e.g. sparsity level of the considered MovieLens dataset in this study is 94.23%, which denotes the proportion of the non-zero entries that were actually rated by users to the total entries in the given dataset.

A number of experiments were carried out using the proposed approach and competing benchmark approaches considering different sparsity levels. For all conducted experiments using the involved recommendations approaches, parameters K and α were set to their optimal values. For each sparsity level, the MAE result of each approach was recorded. Then, the improvement in the prediction accuracy of the proposed recommendation approach against other competing benchmark approaches is calculated using the following formula:

$${Accuracy}_{imp}=\frac{\left|{MAE}_{appr1}-{MAE}_{appr2}\right|}{{MAE}_{appr1}},$$

where \({MAE}_{{appr}_{1}}\) and \({MAE}_{appr2}\) is the MAE value of approach 1 and approach 2, respectively. Note that, approach 1 is considered the proposed recommendation approach in this study.

Figure 5 shows the accuracy improvement of the proposed recommendation approach over all other competing approaches for all sparsity levels. The results reveal that the MAE of the proposed approach achieves a significant improvement over the traditional item-based CF and a better improvement against the semantically enhanced CF approach at all sparsity levels.

Fig. 5
figure 5

Prediction accuracy comparison on sparsity problem

Moreover, as depicted in Fig. 5, while the overall improvement in recommendation accuracy decreases for all approaches as the sparsity is increased, the proposed approach achieves larger improvements than the other approaches. As expected, this improvement starts to converge to 0 for very sparse datasets as shown in Fig. 5. The observed phenomenon is attributed to the inherent limitations posed by extremely sparse data, where neither approach is capable of producing accurate recommendations. Nonetheless, up to a training/test ratio of 40%, the proposed approach demonstrates a notable enhancement, the proposed approach provides improvement that exceeds 4.1459% in accuracy over the standard item-based CF approach and 1.0% over the other approach. These results indicate the effectiveness of our proposed fusion-based semantic CF approach to generate more accurate predictions at all sparsity levels.

6.3.3 Extent of Improvement for New Item Problem

The standard item-based CF recommendation approach suffers from generating accurate recommendations for new items because high-quality recommendations can only be obtained with sufficient data ratings. The effectiveness of the proposed recommendation approach in dealing with the new items problem is evaluated only against the semantically enhanced CF approach proposed by Mobasher et al. [16]; where the item-based CF approach is excluded from the experiments as it cannot make recommendations for new items.

Accordingly, the experiments are only conducted on the items that have been actually rated once in the considered MovieLens dataset. These items are considered to be new items, which have no ratings. Using this dataset, a number of experiments were conducted in which the K parameter is varied from 10 to 80 and parameter α being the corresponding optimal value for both the proposed recommendation approach and the other competing semantically enhanced CF benchmark approach. Figure 6 shows the obtained MAE results for both compared approaches. It can be observed that the obtained results of prediction accuracy for new items by the proposed approach have performed better in comparison to the other competing approach (i.e.) at all values of parameter K. The minimum MAE value achieved by the proposed recommendation approach is 0.86683 at neighborhood size equals 60. Whereas the minimum MAE for the competing approach equals 0.885263 and is reached when the neighborhood size K parameter equals 60. Therefore, our proposed approach demonstrated its effectiveness in making more accurate predictions for new item problems.

Fig. 6
figure 6

Prediction accuracy comparison for the new item problem

7 Conclusion

This paper introduces a hybrid semantic recommendation approach designed to address the limitations of existing recommendation methods, with a particular focus on issues such as sparsity and cold-start items that significantly affect recommendation accuracy. To achieve this, our proposed approach enhances the recommendation quality by integrating the semantic similarity of items with the item-based CF approach. Additionally, this paper introduces a semantic-based similarity measure that takes into account various components, including hierarchical relations, instance features, as well as implicit relationships. Limited attention has been given to exploring implicit ontological relationships, despite their potential for richer semantic representations. Thus, the proposed measure effectively captures shared semantic knowledge between instances as conceptualized in any given domain ontology. Implicit relationships can greatly enhance the semantic analysis of heterogeneous content, offering valuable insights into the similarities and differences between ontological instances in recommendation systems.

We demonstrated the effectiveness of our proposed Fusion-based semantic CF approach and the semantic similarity measure through comprehensive experiments. The results consistently outperformed other benchmark techniques, achieving a significant reduction in Mean Absolute Error. Our research highlights the pivotal role of integrating semantic knowledge in recommendation systems. By addressing the ‘cold-start problem’ and improving recommendation quality, our findings offer promising pathways for optimizing recommendation system architectures.

Future research in the field of recommendation systems is promising, particularly in the exploration of hybrid semantic similarity measures applied to dynamic ontologies. Dynamic ontologies are constantly evolving, and current studies largely focus on static ontologies, which overlook the potential changes and updates that real-world ontologies undergo. Addressing this research gap by developing adaptive approaches could lead to more robust and flexible similarity measures, enhancing the effectiveness and relevance of recommendation systems. Moreover, we intend to broaden our experimental evaluation by incorporating diverse datasets from various domains. This expansion aims to enhance the external validity of our findings. Subsequently, we plan to assess the proposed approach against other advanced similarity approaches for recommender systems, providing valuable insights into its effectiveness and adaptability across real-world scenarios.