1 Introduction

Social networking platforms have become immensely popular venues for sharing opinions and exchanging information [1, 2]. However, the sheer volume of user-generated posts produced daily on these platforms far exceeds what any individual can read and digest. Extracting the essential information from such large collections of posts has therefore become a critical capability for many applications.

Current posts exhibit two main characteristics: they are short, and their word co-occurrences are sparse. As shown in Table 1, the source posts of one point-of-interest (POI) on social media (e.g., Yelp) usually contain only a few short sentences with sparse word co-occurrences. To extract the key points of these sentences, keyphrase generation [3] can be adopted to summarize all posts of a POI. The generated keyphrases can then be used in downstream tasks such as similar POI search [4, 5], user sentiment analysis [6, 7], and POI recommendation [8, 9].

However, most previous work focuses on extracting existing phrases from target posts. For example, [7, 10] employ topic models to generate topical words as the keyphrases of a group of posts. Owing to the limitations of most topic models, these methods cannot generate keyphrases that do not appear in the targeted post. To address this problem, [3] recently introduced a sequence generation framework that can produce keyphrases beyond the target post: a neural seq2seq model that incorporates additional tweets related to the target post and generates keyphrases word by word. Still, these methods face a common challenge: when processing posts from social media, only a limited number of relevant posts exist for a single POI. Table 1 illustrates this challenge with a batch of Yelp posts commenting on the POI “The Vortex Bar And Grill - Midtown”. Each post contains only a few words, so the sparsity problem is inevitable. One remedy is to combine more relevant posts [3] to enrich the content. However, even though posts 1 to 6 in Table 1 all concern the same topic, the limited number of posts and the colloquial nature of social media language make it difficult to summarize the keyphrases “American (Traditional), Burgers, Restaurants, Bars, Nightlife” from them alone.

Table 1 POI: “The Vortex Bar And Grill” on Yelp.

To address the above challenges, we first propose a graph-based neural interest summarization model (UGraphNet) [11] that comprises three complementary innovations. The first is user collaboration, which leverages neighboring information by constructing a user-post-user bipartite graph to enrich sparse content. The second is corpus-level latent topic modeling over the constructed graph and the posts users are interested in. The third is jointly modeling the latent topic embeddings of all users and the interest prediction of the target users. Together, these components improve accuracy and alleviate data sparsity in user interest summarization and item recommendation, and UGraphNet achieves clear improvements over the baselines.

Moreover, we further improve UGraphNet by reconsidering the optimization of its second component. The previous method relies on matrix factorization [12] to obtain the hidden topical representations of user interests; this formulation lacks nonlinear transformations in the learning process, which degrades the final outcomes. More concretely, the posts a user is interested in may relate to only a few topics, and each topic may involve only a few salient words. Matrix factorization, being a linear transform, cannot capture such structure and thus prevents the model from learning better user interest representations. To this end, we explore a neural variational inference method (NIGraphNet) that endows topical representation learning with nonlinear transformations. Finally, we adopt a unified graph-based training loss that jointly learns the hidden topics and user relations for item recommendation.

In general, the contributions of this work are as follows:

  • We propose a novel method that leverages user relations and latent topics for social media interest summarization and item recommendation. Our model enables end-to-end training through a unified graph-based training loss.

  • We propose three main components: a contrastive learning loss, a topic modeling loss, and a graph-based learning loss, which achieve the above goals through joint learning. We further explore a neural variational inference method that endows topical representation learning with nonlinear transformations.

  • We experiment on two newly constructed social media datasets. Our model significantly outperforms all comparison methods, and ablation analysis further demonstrates the effectiveness of exploiting latent topic representations and user relations for automatic understanding of user language.

2 Related Work

This work is mainly in the line of three domains: user interest summarization, item recommendation, and topic modeling.

2.1 User Interest Summarization

Most previous works employ supervised or unsupervised methods to extract words from target documents to form a summary. For supervised learning, [10] use deep recurrent neural networks with sequence tagging for keyphrase extraction, and [13] further incorporate expert knowledge into the extraction. For unsupervised learning, various algorithms have been proposed, such as graph ranking [14] and document clustering [15]. However, these works only select keyphrases from source documents, which leads to a sparsity problem on short social media posts. Later, [16] propose to predict keyphrases in a sequence generation manner that allows the creation of absent keyphrases. Besides, some previous works based on topic modeling [17, 18] can effectively alleviate data sparsity with corpus-level latent topics. Different from them, we leverage user relations and latent topics on social media for user interest summarization, an aspect ignored in previous research that we study extensively here. In this way, our model can generate keyphrases beyond the limited number of relevant posts for the target user.

2.2 Item Recommendation

This task refers to social recommendation, which adopts social relations to improve content recommendation. Earlier work typically used directly linked neighbors to constrain and learn the representations of target users via matrix factorization [19]. Recently, with the rise of graph neural networks (GNNs) such as GCN [20], GraphSAGE [21], and GAT [22], much effort has been devoted to social recommendation. [23] use the directly linked relations of target users/items with a GNN to learn their representations. [24, 25] further exploit behavior patterns in user–item graphs to learn more powerful representations for both users and items, and [26] adopt a hypergraph neural network to explore high-order information in recommendation scenarios. Different from the above models, our user interest summarization method is learned jointly with language generation, which has not been explored in existing work.

2.3 Topic Modeling

To motivate the topic modeling component of UGraphNet, we review relevant work on hidden topic analysis. A prominent approach for exploring the relationships between documents and their hidden topics is matrix factorization [12], which takes the document–word matrix as input and produces a document–topic matrix and a topic–word matrix as output. It has been widely employed as a key component in previous works on hidden topic analysis [11, 27]. However, matrix factorization has two primary limitations. First, it cannot perform nonlinear transformations in topical representation learning, so previous models could not benefit from nonlinear parameter learning. Second, it is prone to overfitting. We therefore adopt a variational inference method [28] for topical learning: by utilizing the reparameterization trick in the inference process, it mitigates the overfitting issue and enables the model to capture more generalized user interests during generative learning.

3 Proposed Model

In this section, we describe how the proposed framework leverages user collaboration and latent topics for user interest summarization. Figure 1 shows the overall architecture, which consists of three modules: a contrastive learning loss, a topic modeling loss, and a graph-based generative learning loss. Formally, given a collection D of social media posts, we process each post into a bag-of-words vector \([t_1, t_2,...,t_{|V|}]\), a \(|V|\)-dimensional vector over the vocabulary, where \(|V|\) denotes the vocabulary size. Each post is also associated with latent topics, whose number we denote by K. Below, we first introduce the three modules and then describe how they are jointly trained.
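To make the preprocessing concrete, the following Python sketch builds the vocabulary and converts posts into the bag-of-words vectors described above; the function names and the toy posts are illustrative assumptions, not part of any released implementation.

```python
# Minimal sketch of the bag-of-words preprocessing described above.
# Names (build_vocab, posts_to_bow) are illustrative, not from the paper.
from collections import Counter

def build_vocab(posts, max_size=None):
    """Map each word to an index t_1 ... t_|V| over the corpus vocabulary."""
    counts = Counter(w for post in posts for w in post.lower().split())
    words = [w for w, _ in counts.most_common(max_size)]
    return {w: i for i, w in enumerate(words)}

def posts_to_bow(posts, vocab):
    """Turn each post into a |V|-dimensional bag-of-words count vector."""
    vectors = []
    for post in posts:
        vec = [0] * len(vocab)
        for w in post.lower().split():
            if w in vocab:
                vec[vocab[w]] += 1
        vectors.append(vec)
    return vectors

posts = ["great burgers and bar", "nightlife and burgers"]
vocab = build_vocab(posts)
D = posts_to_bow(posts, vocab)   # document-word matrix D used in Sect. 3.2
```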

Fig. 1 Overview of the proposed NIGraphNet model

3.1 Contrastive Learning Loss

As shown in the left part of Fig. 1, we exploit user collaboration by constructing an adjacency graph of users: two users are connected whenever they are interested in the same posts. Besides, assigning every user a unique embedding is impractical in large-scale scenarios, as it would make the number of parameters enormous. Inspired by [29], who represent users by the terms of their queries, we instead represent users with a smaller number of tag embeddings; in other words, each user is represented by a limited set of tags. We use one-hot encoding over the tag (word) lexicon (\(t_1,...,t_{|V|}\)) and map the tags to d-dimensional vectors with a mapping function f to represent users as follows:

$$\begin{aligned} \begin{aligned} {\textbf{h}}_{v_t} = f((t_1,...,t_{|V|}), {\textbf{M}}), \end{aligned} \end{aligned}$$
(1)

where \({\textbf{h}}_{v_t} \in {\mathbb {R}}^d\) denotes the embedding of a user \(v_t\), and \({\textbf{M}} \in {\mathbb {R}}^{|V| \times d}\) is the transformation matrix. After that, we adopt an attention method to fuse the information of a target user and its neighbors. First, we perform a message propagation step that handles the messages passed from neighboring nodes, which is given by:

$$\begin{aligned} \begin{aligned} \varvec{m}_{v_i \leftarrow v_j} = \text {MLP}(n_{v_j v_i} \oplus \varvec{h}_{v_j}) \cdot \varvec{h}_{v_j}, \end{aligned} \end{aligned}$$
(2)

where \(\varvec{m}_{v_i \leftarrow v_j} \in {\mathbb {R}}^d\) denotes the information passed from node \(v_j\) to \(v_i\), \(n_{v_j v_i}\) is the one-hot encoding of the neighbor type (e.g., one-hop (0, 1) or multi-hop neighbors (1, 0)), \(\text {MLP}(\cdot ) \in {\mathbb {R}}^{d \times d}\) denotes a multi-layer perceptron that takes as input both the neighbor type \(n_{v_j v_i}\) and the representation \(\varvec{h}_{v_j}\) of the neighboring user, and \(\oplus \) represents concatenation.

Then, we aggregate the information of the target node and the messages passing from its neighbors in an attentive way. The weight coefficient \(\alpha _{v_i,v_j}\) between two nodes can be formulated by:

$$\begin{aligned} \begin{aligned} \alpha _{v_i,v_j} = \frac{\text {exp}\bigg (\sigma ({\textbf {a}}^T \cdot [\varvec{W}\varvec{h}_{v_i} || \varvec{W} \varvec{m}_{v_i \leftarrow v_j}])\bigg )}{\sum _{v_k \in {\mathcal {N}}_{v_i}} \text {exp}\bigg (\sigma ({\textbf {a}}^T \cdot [\varvec{W}\varvec{h}_{v_i} || \varvec{W} \varvec{m}_{v_i \leftarrow v_k}])\bigg )}, \end{aligned} \end{aligned}$$
(3)

where \(\varvec{W} \in {\mathbb {R}}^{d \times d}\) is a shared weight matrix for mapping nodes into the same embedding space, \({\textbf {a}} \in {\mathbb {R}}^{2d}\) denotes a weight vector for learning the relations of the target node and its neighbors, \({\mathcal {N}}_{v_i}\) is the set of neighbors of node \(v_i\), and \(\sigma \) denotes the sigmoid function [30].

After that, with the learned weight coefficients \(\alpha _{v_i,v_j}\) and the neighboring messages \(\varvec{m}_{v_i \leftarrow v_j}\), the final representation of the target node (taking \(v_i\) as the target user \(v_t\)) can be formulated by:

$$\begin{aligned} \begin{aligned} {\textbf{h}}_{v_t}^L = \text {ReLU} \bigg (\sum _{v_j \in {\mathcal {N}}_{v_i}} \alpha _{v_i,v_j} {\textbf{W}} {\textbf{m}}_{v_i \leftarrow v_j} \bigg ), \end{aligned} \end{aligned}$$
(4)

where ReLU is an activation function [31] and L denotes the last layer of the network.
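The following PyTorch sketch illustrates Eqs. (2)–(4): message construction from the neighbor type and neighbor embedding, attention over neighbors, and the ReLU-activated aggregation. The module name, the reading of the “\(\cdot \)” in Eq. (2) as element-wise gating, and the input shapes are our assumptions; this is a sketch rather than the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NeighborAggregator(nn.Module):
    """Sketch of Eqs. (2)-(4): message passing plus attentive aggregation."""
    def __init__(self, d):
        super().__init__()
        self.mlp = nn.Linear(d + 2, d)        # takes [neighbor-type one-hot; h_vj]
        self.W = nn.Linear(d, d, bias=False)  # shared transform W
        self.a = nn.Parameter(torch.randn(2 * d))

    def forward(self, h_target, h_neighbors, neighbor_types):
        # h_target: (d,), h_neighbors: (N, d), neighbor_types: (N, 2) one-hot
        # Eq. (2): message m_{v_i <- v_j}; the "." is read as element-wise gating
        gate = self.mlp(torch.cat([neighbor_types, h_neighbors], dim=-1))
        messages = gate * h_neighbors
        # Eq. (3): sigmoid-scored attention weights over neighbors
        target = self.W(h_target).expand_as(messages)
        scores = torch.sigmoid(
            torch.cat([target, self.W(messages)], dim=-1) @ self.a)
        alpha = F.softmax(scores, dim=0)
        # Eq. (4): ReLU-activated weighted sum of transformed messages
        return F.relu((alpha.unsqueeze(-1) * self.W(messages)).sum(dim=0))

agg = NeighborAggregator(d=64)
types = torch.eye(2)[torch.randint(0, 2, (5,))]          # toy neighbor types
h_final = agg(torch.randn(64), torch.randn(5, 64), types)  # Eq. (4) output
```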

Finally, inspired by the recent advances in the contrastive learning work [32, 33], we introduce a contrastive learning loss \({\mathcal {L}}_c\) formulated by:

$$\begin{aligned} \begin{aligned} {\mathcal {L}}_c = \sum _{(v_t, v_p, v_n) \in {\mathcal {T}}} [\sigma (v_t, v_p; \varvec{h}) - \sigma (v_t, v_n; \varvec{h}) + \nabla ]_{+}, \end{aligned} \end{aligned}$$
(5)

where \(\varvec{h}\) denotes the hidden embeddings of users, \(v_t\) is the target user, \(v_p\) denotes its neighboring users, \(v_n\) denotes negative users drawn from the whole user set by the alias table method [34], which takes only O(1) time, \(\nabla \) is a margin hyper-parameter separating the positive pair from the corresponding negative one (set to 0.5 in our experiments), \({\mathcal {T}}\) denotes a training batch, and \([\cdot ]_{+}\) denotes the positive part, i.e., \(\max (\cdot , 0)\). The above contrastive learning loss (Eq. 5) explicitly encodes the similarity ranking among node pairs into the embedding vectors.
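A minimal sketch of the margin-based loss in Eq. (5) is given below. Since the scoring function \(\sigma (\cdot ,\cdot ; \varvec{h})\) is not spelled out, we read it as a sigmoid-squashed distance between the two embeddings, so the hinge pulls positives closer to the target than negatives by the margin; this reading is an assumption.

```python
import torch
import torch.nn.functional as F

def contrastive_loss(h_target, h_pos, h_neg, margin=0.5):
    """Sketch of Eq. (5) over batched (target, positive, negative) embeddings.
    sigma(v_t, v_x; h) is read as a sigmoid-squashed distance (an assumption)."""
    d_pos = torch.sigmoid(torch.norm(h_target - h_pos, dim=-1))
    d_neg = torch.sigmoid(torch.norm(h_target - h_neg, dim=-1))
    # [.]_+ keeps the positive part, enforcing d_pos + margin <= d_neg
    return F.relu(d_pos - d_neg + margin).mean()

h = torch.randn(3, 16)
loss_c = contrastive_loss(h, torch.randn(3, 16), torch.randn(3, 16))
```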

3.2 Topic Modeling Loss

In this part, the previous method UGraphNet [11] relies on a matrix factorization [12] method to obtain the topic modeling loss \({\mathcal {L}}_t\). More concretely, given the document–word matrix \({\textbf{D}}\), we decompose it into the product of the document–topic embedding matrix \(\mathbf {\Theta }\) and the topic–word embedding matrix \({\textbf{T}}\) with regularization as follows:

$$\begin{aligned} \begin{aligned} {\mathcal {L}}_t = \sum _{i \in {\mathcal {T}}} ({\textbf{D}}_i - \mathbf {\Theta }_i {\textbf{T}})^2 + \lambda (|| \mathbf {\Theta }_i ||^2_2 + || {\textbf{T}} ||^2_2), \end{aligned} \end{aligned}$$
(6)

where \({\textbf{D}} \in {\mathbb {R}}^{|D| \times V}\), D denotes the set of documents, V is the vocabulary size, \(\mathbf {\Theta } \in {\mathbb {R}}^{|D| \times k}\), \({\textbf{T}} \in {\mathbb {R}}^{k \times V}\), k is the dimension of the topic embedding, \(||\cdot ||^2_2\) is the \(l_2\)-norm regularization of the parameters, and \(\lambda \) is a harmonic factor for regularization. In Eq. (6), we explore the latent topics of the posts that the target user is interested in. The obtained document–topic embedding \(\mathbf {\Theta }\) is further used in generative learning.
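A minimal sketch of the regularized factorization objective in Eq. (6), with illustrative sizes and an illustrative value for \(\lambda \):

```python
import torch

def mf_topic_loss(D, Theta, T, lam=0.01):
    """Sketch of Eq. (6): regularized factorization of the document-word matrix D
    into document-topic Theta and topic-word T. `lam` is an illustrative value."""
    recon = Theta @ T                              # (|D|, V) reconstruction
    return ((D - recon) ** 2).sum() + lam * (Theta.pow(2).sum() + T.pow(2).sum())

num_docs, V, k = 100, 500, 20
D = torch.rand(num_docs, V)
Theta = torch.rand(num_docs, k, requires_grad=True)
T = torch.rand(k, V, requires_grad=True)
loss_t = mf_topic_loss(D, Theta, T)   # minimized jointly with the other losses
```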

Nevertheless, Eq. (6) lacks nonlinear transformations in topical feature learning, which may prevent the model from learning better user interest representations. To this end, our proposed NIGraphNet further explores a variational inference method [28] for topical learning, detailed as follows.

As shown in the right part of Fig. 1, given a collection of posts, i.e., the document–word matrix \({\textbf{D}}\), we adopt an encoder to obtain the document–topic embedding \(\mathbf {\Theta }\) by estimating its parameters, the mean \(\mu \) and the variance \(\delta \):

$$\begin{aligned} \begin{aligned} \mu = \text {ReLU}({\textbf{D}} {\textbf{M}}_{\mu }), \ \ \ \delta = \text {ReLU}({\textbf{D}} {\textbf{M}}_{\delta }), \end{aligned} \end{aligned}$$
(7)

where \({\textbf{M}}_{\mu } \in {\mathbb {R}}^{V \times k}\) and \({\textbf{M}}_{\delta } \in {\mathbb {R}}^{V \times k}\) are trainable weights, and ReLU is an activation function [31]. We utilize a Gaussian distribution with mean \(\mu \) and variance \(\delta \) to estimate the document–topic embedding \(\mathbf {\Theta }\). This allows us to incorporate model uncertainty in parameter inference, thereby mitigating the overfitting issue associated with the original matrix factorization method. By modeling the distribution of the document–topic embedding, we introduce a level of flexibility that helps alleviate the limitations of the previous approach. We then follow the generative process:

  1. Generate hidden topical features \(\mathbf {\Theta } \sim {\mathcal {N}}(\mu ,\delta ^2)\);

  2. For each document \(i \in D\):

    (a) Draw a document–topic distribution: \(\Phi _i = \text {Softmax}(\text {Sigmoid}(W_{\Theta } \mathbf {\Theta }_i))\);

    (b) For each word w in the vocabulary, draw a word–topic distribution: \(\phi _w = \text {Softmax}(\text {Sigmoid}(W_{\Phi } \Phi _i))\).

where \(\mathbf {\Theta } \in {\mathbb {R}}^{|D| \times k}\), \(W_{\Theta }\) and \(W_{\Phi }\) are trainable linear transformations, \(\Phi \in {\mathbb {R}}^{|D| \times k}\) denotes the document–topic distributions, k denotes the dimension of topic embeddings, and \(\phi \in {\mathbb {R}}^{V \times k}\) denotes the word–topic distributions. For end-to-end training, \(\mathbf {\Theta } \sim {\mathcal {N}}(\mu ,\delta ^2)\) can be re-parameterized [28] as:

$$\begin{aligned} \begin{aligned} \mathbf {\Theta } = \mu + \epsilon \cdot \delta , \end{aligned} \end{aligned}$$
(8)

where \(\epsilon \sim {\mathcal {N}}(0,1)\). After that, a decoder is applied to \(\mathbf {\Theta }\) to reconstruct the input as \(\hat{{\textbf{D}}}\):

$$\begin{aligned} \begin{aligned} \hat{{\textbf{D}}} = \text {Softmax}(\mathbf {\Theta } {\textbf{W}}_{d}), \end{aligned} \end{aligned}$$
(9)

where \({\textbf{W}}_{d} \in {\mathbb {R}}^{k \times V}\). We then minimize the difference between the original input \({\textbf{D}}\) and the reconstructed output \(\hat{{\textbf{D}}}\). The final topic modeling loss is given by:

$$\begin{aligned} \begin{aligned} \hat{{\mathcal {L}}}_{t} = || {\textbf{D}} - \hat{{\textbf{D}}} ||^2 + \eta (||\mu ||^2 + ||\delta ||^2), \end{aligned} \end{aligned}$$
(10)

where \(\eta \) is set to 0.001 in the experiments. In summary, we replace the objective in Eq. (6) with Eq. (10). The document–topic embedding \(\mathbf {\Theta }\) obtained from Eq. (8) will be used in the following section.

By incorporating the aforementioned neural inference method, we enhance our ability to perform nonlinear transformation in topical representation learning. Moreover, the utilization of the reparameterization trick during the inference process helps to mitigate the problem of overfitting and allows our models to capture more generalized user interests during the generative learning process.
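The following sketch assembles Eqs. (7)–(10) into a single module: the encoder estimates \(\mu \) and \(\delta \), the reparameterization trick samples \(\mathbf {\Theta }\), and the decoder reconstructs \(\hat{{\textbf{D}}}\). Layer names and sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NeuralTopicModule(nn.Module):
    """Sketch of Eqs. (7)-(10): encode D into (mu, delta), reparameterize Theta,
    decode back to the vocabulary, and penalize reconstruction plus norms."""
    def __init__(self, vocab_size, k):
        super().__init__()
        self.M_mu = nn.Linear(vocab_size, k, bias=False)
        self.M_delta = nn.Linear(vocab_size, k, bias=False)
        self.W_d = nn.Linear(k, vocab_size, bias=False)

    def forward(self, D, eta=0.001):
        mu = F.relu(self.M_mu(D))                    # Eq. (7)
        delta = F.relu(self.M_delta(D))
        eps = torch.randn_like(delta)
        Theta = mu + eps * delta                     # Eq. (8), reparameterization
        D_hat = F.softmax(self.W_d(Theta), dim=-1)   # Eq. (9), decoder
        loss = ((D - D_hat) ** 2).sum() \
               + eta * (mu.pow(2).sum() + delta.pow(2).sum())   # Eq. (10)
        return Theta, loss

ntm = NeuralTopicModule(vocab_size=500, k=20)
Theta, loss_t = ntm(torch.rand(8, 500))   # toy batch of 8 documents
```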

3.3 Generative Learning Loss

With the target user embedding \({\textbf{h}}_{v_t}^L\) from Eq. (4) that represents the user collaboration information, and the document–topic embedding \(\mathbf {\Theta }_{v_t}\) from Eq. (8) that represents the interests of the target user, we can construct the generative learning loss \({\mathcal {L}}_g\) as follows:

$$\begin{aligned} \begin{aligned} {\mathcal {L}}_g = - \sum _{v_t \in {\mathcal {T}}} \text {log} ( \sigma ([{\textbf{h}}_{v_t}^L; \mathbf {\Theta }_{v_t}]{\textbf{W}}_v)), \end{aligned} \end{aligned}$$
(11)

where \({\textbf{h}}_{v_t}^L \in {\mathbb {R}}^{1 \times d}\), \(\mathbf {\Theta }_{v_t} \in {\mathbb {R}}^{1 \times k}\), \({\textbf{W}}_v \in {\mathbb {R}}^{(d + k) \times 1}\) is a trainable weight, and [; ] denotes the concatenation operation. In Eq. (11), we fuse the information of the two domains (i.e., the user relations and the latent topics of interest), exploiting the assumption that relevant users may share similar interests.
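A minimal sketch of Eq. (11), assuming the batch of target users is stacked along the first dimension; the names are illustrative.

```python
import torch
import torch.nn as nn

def generative_loss(h_users, theta_users, W_v):
    """Sketch of Eq. (11): concatenate the graph embedding h_{v_t}^L with the
    document-topic embedding Theta_{v_t}, project with W_v, and take the
    negative log-sigmoid over the batch."""
    fused = torch.cat([h_users, theta_users], dim=-1)      # (batch, d + k)
    scores = torch.sigmoid(fused @ W_v).squeeze(-1)        # (batch,)
    return -torch.log(scores + 1e-8).sum()

d, k, batch = 64, 20, 8
W_v = nn.Parameter(torch.randn(d + k, 1))
loss_g = generative_loss(torch.randn(batch, d), torch.randn(batch, k), W_v)
```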

3.4 Learning and Inference

In the training stage, we adopt stochastic gradient descent [35] to minimize the total loss, which is given by:

$$\begin{aligned} \begin{aligned} {\mathcal {L}}_{total} = {\mathcal {L}}_{c} + \hat{{\mathcal {L}}}_{t} + {\mathcal {L}}_{g}. \end{aligned} \end{aligned}$$
(12)

With the above learning objective as shown in Eq. (12), we can: (1) exploit the user collaboration information with the contrastive learning loss (Eq. 5), (2) explore the latent topics of the semantic information to summarize user interests (Eq. 10), and (3) fuse the above information (Eq. 11) to simultaneously learn them in an end-to-end way.
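The sketch below wires the three losses into the joint objective of Eq. (12) and performs one SGD step. It reuses NeuralTopicModule, contrastive_loss, and generative_loss from the earlier sketches; the batch of user embeddings stands in for the outputs of Eq. (4), and all sizes are illustrative assumptions.

```python
import torch

# Joint objective of Eq. (12): one SGD step over the combined losses.
d, k, V = 64, 20, 500
ntm = NeuralTopicModule(vocab_size=V, k=k)
W_v = torch.nn.Parameter(torch.randn(d + k, 1))
optimizer = torch.optim.SGD(list(ntm.parameters()) + [W_v], lr=0.01)

h_users = torch.randn(8, d, requires_grad=True)   # placeholder for Eq. (4) outputs
loss_c = contrastive_loss(h_users, torch.randn(8, d), torch.randn(8, d))  # Eq. (5)
Theta, loss_t = ntm(torch.rand(8, V))                                     # Eq. (10)
loss_g = generative_loss(h_users, Theta, W_v)                             # Eq. (11)
loss_total = loss_c + loss_t + loss_g                                     # Eq. (12)
optimizer.zero_grad()
loss_total.backward()
optimizer.step()
```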

3.4.1 User Interest Inference

Based on the concatenated embedding of the user collaborative information \({\textbf{h}}_{v_t}^L\) and the user historical interest information \(\mathbf {\Theta }_{v_t}\), we compute the dot product with the topic–word embedding \({\textbf{T}}\) to generate a ranking list of output words, where the top K ones serve as the user interest summarization in the evaluation.

3.5 Post Recommendation Inference

Similarly, based on the \({\textbf{h}}_{v_t}^L\) and \(\mathbf {\Theta }_{v_t}\) of the target user, we generate a ranking list with the document–topic embeddings \(\mathbf {\Theta }\) of the candidate posts, where the top N ones serve as the post recommendation.
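The two ranking steps in Sects. 3.4.1 and 3.5 can be sketched as follows. Since it is left implicit how the concatenated \((d+k)\)-dimensional user embedding is aligned with the \(k\)-dimensional topic space, we assume the user representation has already been projected into that space; names and shapes are illustrative.

```python
import torch

def rank_words(user_repr, T, top_k=10):
    """Sketch of Sect. 3.4.1: dot the (assumed k-dim) user representation with the
    topic-word embedding T (k x V) and keep the top-K words."""
    scores = user_repr @ T                      # (V,) word scores
    return torch.topk(scores, top_k).indices    # indices of summarization words

def rank_posts(user_repr, Theta_posts, top_n=10):
    """Sketch of Sect. 3.5: score candidate posts by the dot product between the
    user representation and each post's document-topic embedding."""
    scores = Theta_posts @ user_repr            # (|D|,) post scores
    return torch.topk(scores, top_n).indices

T = torch.randn(20, 500)                # toy topic-word embedding
Theta_posts = torch.randn(1000, 20)     # toy document-topic embeddings
user_repr = torch.randn(20)
top_words = rank_words(user_repr, T)
top_posts = rank_posts(user_repr, Theta_posts, top_n=5)
```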

Table 2 The statistics of datasets

4 Experiments

In the experiments, we first evaluate the performance on user interest summarization tasks. Then, we conduct an ablation study to estimate the effect of the proposed components, including contrastive learning, topic modeling, and generative learning. Finally, we evaluate whether jointly learning user interests benefits the item recommendation task.

4.1 Datasets

We adopt two real-world datasets, Delicious and Yelp, which are widely used in social recommendation [36, 37]. The statistics of the datasets are shown in Table 2. Each dataset contains users, items, the interactions (browsing or visiting) between users and items, user summarizations of items, and item descriptions. “Avg. items interacted by per user” denotes the average number of items that a user has browsed or visited. “Avg. length of user summarization per item” denotes the average number of words users use to summarize an item. “Avg. length of description per item” denotes the average number of words used to comprehensively describe the characteristics of an item.

4.2 Comparison Methods

We include several traditional and state-of-the-art approaches that can be applied to user interest summarization, including probabilistic graph models and sequential learning models. Here are descriptions of the selected methods:

GSDMM [38] is a traditional and widely used probabilistic graphical model designed for short text modeling. Word and document representations are learned by combining Dirichlet and multinomial distributions.

DP-BMM [2] is another widely used probabilistic graphical model, which explicitly exploits the word pairs constructed from each document to enhance word co-occurrence patterns in short texts. It can naturally handle the topic drift problem of short text streams.

SEQ-TAG [10] is a state-of-the-art deep recurrent neural network model that can combine keywords and context information to automatically extract keyphrases from short texts.

SEQ2SEQ-CORR [39] exploits a sequence-to-sequence (seq2seq) architecture for keyphrase generation which captures correlation among multiple keyphrases in an end-to-end fashion.

TAKG [16] introduces a seq2seq-based neural keyphrase generation framework that takes advantage of the recent advance of neural topic models [28] to enable end-to-end training of latent topic modeling and keyphrase generation.

Different from the above methods, we exploit the potential usefulness of user collaboration and the latent topics exhibited in user interests and item contents, which previous research has ignored and which we study extensively here. We also present an ablation study to show the effectiveness of our proposed components. Our proposed models include:

UGraphNet [11] is our previously proposed graph-based neural interest summarization model that includes contrastive learning, topic modeling, and generative learning.

NIGraphNet is the improved version proposed in this work; it optimizes the second component of UGraphNet and endows topical representation learning with nonlinear transformations.

4.3 User Interest Summarization Results

In this section, we examine the performance of user interest summarization on social media. Performance is assessed by counting how many "hits" occur in an n-sized list of ranked words. To this end, we use the popular information retrieval metrics Hit Ratio (HR) and Mean Average Precision (MAP) for evaluation. For the Delicious and Yelp datasets, most items are summarized by users with 3 to 6 words on average (Table 2), thus HR@1, HR@5, and HR@10 are reported. Besides, MAP is measured over the top 10 predictions for both datasets.
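For reference, a plain-Python sketch of these metrics is shown below. HR@n is sometimes defined as a binary hit indicator; we show the fraction-of-hits variant as an assumption, and AP is truncated at the top 10 as described above.

```python
def hit_ratio_at_n(ranked, relevant, n):
    """HR@n: fraction of ground-truth words that appear in the top-n ranked list.
    The fraction form (rather than a binary hit indicator) is an assumption."""
    top = set(ranked[:n])
    return len(top & set(relevant)) / len(relevant)

def average_precision_at_n(ranked, relevant, n=10):
    """AP@n for one item; MAP averages this value over all evaluated items."""
    hits, score = 0, 0.0
    for i, w in enumerate(ranked[:n], start=1):
        if w in relevant:
            hits += 1
            score += hits / i
    return score / min(len(relevant), n) if relevant else 0.0

ranked = ["burgers", "bars", "pizza", "nightlife"]
relevant = {"burgers", "nightlife"}
print(hit_ratio_at_n(ranked, relevant, 5), average_precision_at_n(ranked, relevant))
```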

Table 3 Main comparison results displayed with scores in %.

The main comparison results are shown in Table 3, where the highest scores are highlighted in boldface and the underlined ones denote the second best. The last row is the improvements of our method compared with the best baseline. In general, we can observe that:

(1) Our models UGraphNet and NIGraphNet consistently outperform the other methods on all datasets under all metrics. This shows the usefulness of leveraging user neighboring information for interest summarization. Moreover, NIGraphNet improves over UGraphNet by 3.30%, 2.48%, 3.05%, and 4.91% in terms of HR@1, HR@5, HR@10, and MAP on Delicious, respectively, and by 25.95%, 79.68%, 97.08%, and 34.85% on Yelp. Interestingly, NIGraphNet gains larger improvements on Yelp than on Delicious; we attribute this to Yelp containing more text information than Delicious, as shown in Table 2. These improvements demonstrate that our improved version better explores the document–topic distributions.

(2) Besides, the second-best method, UGraphNet, achieves up to 24.86%, 14.70%, 14.67%, and 24.75% improvements over the third-best method TAKG in terms of HR@1, HR@5, HR@10, and MAP on Delicious, and gains 32.46%, 5.63%, 5.73%, and 25.62% improvements over the strongest baseline on Yelp. These improvements demonstrate the effectiveness of our methods in jointly modeling user relations and user interests.

(3) Among the baselines, the traditional methods GSDMM and DP-BMM perform poorly, which indicates that user interest summarization is a challenging task that probabilistic graphical models alone struggle to handle. In contrast, the neural sequence models SEQ-TAG, SEQ2SEQ-CORR, and TAKG yield better results. In particular, TAKG outperforms the other baselines, which suggests that exploiting latent topics helps with short texts. Our model goes a step further by exploring user relations and their latent topics, and achieves even larger improvements.

4.4 Ablation Analysis

To analyze the effectiveness of the proposed components on user interest summarization (introduced in Sect. 3) in our method, we conduct an ablation analysis as follows. In general, we have three ablated variants of our model:

Table 4 Ablation analysis

  • I. w/o CLoss (without contrastive learning loss): The CLoss (Eq. 5) is used to exploit user relations that help to distinguish the target user from its neighboring users and negative users. We remove the CLoss and keep the TLoss and GLoss for comparison.

  • II. w/o TLoss (without topic modeling loss): The TLoss (Eq. 10) aims to exploit the latent topics in short texts which can especially effectively alleviate the data sparsity in the user interest summarization.

  • III. w/o GLoss (without generative learning loss): The GLoss (Eq. 11) utilizes the assumption that relevant users share similar interests. We adopt it to generate keyphrases that are relevant to users’ latent topics.

The results of the ablation tests are shown in Table 4. Our method NIGraphNet outperforms all other variants. Specifically, NIGraphNet achieves 13.30%, 24.11%, 21.37%, and 34.66% improvements over the second-best variant in terms of HR@1, HR@5, HR@10, and MAP on Delicious, and obtains 15.32%, 35.97%, 41.69%, and 11.78% gains on Yelp, respectively. These results demonstrate the effectiveness of jointly learning the different components. We observe that the performance order on Delicious is w/o GLoss > w/o CLoss > w/o TLoss, which indicates that the topic modeling loss contributes the most to learning. By contrast, the order on Yelp is w/o CLoss > w/o TLoss > w/o GLoss, showing that the generative loss contributes the most while the contrastive learning loss contributes the least. In general, all parts contribute to the final performance, which clearly demonstrates their effectiveness.

4.5 Item Prediction

In this part, we evaluate whether unearthing potential user relations and jointly learning the latent topic representations can facilitate item prediction. Concretely, we adopt the standard evaluation metric area under the curve (AUC) [40] for predicting links between users and items. This metric represents the probability that a randomly chosen observed user–item link is scored higher than a randomly chosen non-existent link; a perfect ranking yields an AUC of one, while random guessing yields 0.5. The AUC metric has been widely used in recommendation tasks [9, 25]. The baselines include AMOUNT [41], IMP-GCN [42], and IRLM [43], which are state-of-the-art methods for user–item prediction. We report the comparison results in Fig. 2. Observations derived from this figure are as follows:

Table 5 Parameter analysis for \(\eta \) in Eq. (10), where the highest scores are marked in boldface

Our methods achieve the best AUC performance among all compared methods. Specifically, NIGraphNet obtains 0.9574 and 0.8896 on Delicious and Yelp, respectively, while UGraphNet obtains 0.9423 and 0.8775. Among the baselines, IRLM achieves the best result on Delicious (0.9276), and IMP-GCN achieves the best result on Yelp (0.8633). All the comparison methods either ignore semantic features or treat them as static values associated with nodes. By contrast, our models enable an end-to-end training process that jointly learns the latent topics and the user–item relations.
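A simple sampling-based sketch of the AUC evaluation described above: observed (positive) and non-existent (negative) links are scored by the model, and AUC is estimated as the fraction of sampled pairs in which the observed link scores higher, with ties counted as half. The scores and sample size here are illustrative.

```python
import random

def auc_from_scores(pos_scores, neg_scores, num_samples=10000):
    """Sketch of the AUC evaluation in Sect. 4.5: sample (observed, non-existent)
    link pairs and count how often the observed link scores higher."""
    wins = 0.0
    for _ in range(num_samples):
        p = random.choice(pos_scores)
        n = random.choice(neg_scores)
        if p > n:
            wins += 1.0
        elif p == n:
            wins += 0.5
    return wins / num_samples

print(auc_from_scores([0.9, 0.8, 0.7], [0.4, 0.6, 0.5]))   # toy link scores
```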

Fig. 2 Results of AUC comparison on the Delicious and Yelp datasets

4.6 Parameter Analysis

This section performs experiments with different values of \(\eta \) in Eq. (10) and analyzes their impact on the model's performance. The results are presented in Table 5. We observe that NIGraphNet achieves the best performance on both datasets when \(\eta =0.001\), and the second-best performance when \(\eta =0.0001\) on Delicious and \(\eta =0.01\) on Yelp. Note that \(\eta \) weights the penalty term in Eq. (10), which prevents the parameters from reaching excessively large values; imposing this restriction regulates the parameter values and facilitates a more balanced and stable optimization process.

4.7 Summary for Experimental Study

In general, our proposed extension method, NIGraphNet, achieves improvements over the original UGraphNet in various evaluation metrics. Specifically, on the Delicious dataset, NIGraphNet outperforms UGraphNet by 3.30%, 2.48%, 3.05%, and 4.91% in terms of HR@1, HR@5, HR@10, and MAP, respectively. On the Yelp dataset, NIGraphNet achieves improvements of 25.95%, 79.68%, 97.08%, and 34.85% over UGraphNet in the same metrics. Additionally, NIGraphNet demonstrates superior performance in terms of AUC, surpassing UGraphNet by 1.60% and 1.38% on the Delicious and Yelp datasets, respectively. The results of the ablation study further highlight the effectiveness of jointly learning different components within NIGraphNet.

5 Conclusion

In general, we propose a topic-aware graph-based neural interest summarization method, called UGraphNet, that can enhance user semantic mining for user interest summarization and item recommendation in social media. Moreover, we further propose an improved version, NIGraphNet, that can explore hidden topics with a variational inference approach. The main innovations of our work include a contrastive learning loss, a topic modeling loss, and a graph-based learning loss that can leverage user relations and latent topics on social media through joint training. Experiments on two newly constructed social media datasets demonstrate that our model can significantly outperform all the comparison methods. Ablation analysis is also conducted to show the superiority of our proposed components.