Weight normalization optimization movie recommendation algorithm based on three-way neural interaction networks

Heterogeneous information networks are increasingly used in recommendation algorithms. However, they lack an explicit representation of meta-paths. When bidirectional (two-way) neural interaction models are used for recommendation, the interaction between users and items is often ignored, which has a substantial impact on the accuracy of the recommendations. To better apply the interaction information, this study proposes a weight-normalized movie recommendation model (SCLW_MCRec) based on a three-way neural interaction network. The model constructs a three-way neural interaction network ⟨user, meta-path, item⟩ from meta-path contextual information, introducing meta-paths on top of the user-item representation to represent the user-item interaction information. A two-layer, one-dimensional convolutional neural network helps capture higher-order interaction features between the user and the item, making the model more powerful in terms of interaction. Adding a dropout layer to the interaction model alongside the two-layer convolutional neural network can prevent overfitting and discard irrelevant features to improve the recommendation. In addition, an extreme cross-entropy loss (argmaxminloss) that incorporates the properties of the argmin and argmax functions is designed to reduce the model loss. A weight-normalization optimization approach is used to better optimize the model and accelerate the convergence of stochastic gradient descent.
Compared to current state-of-the-art recommendation models, the SCLW_MCRec model improves the Prec evaluation index by 2.94–35.8%, Recall by 1.15–53.51%, and NDCG by 6.7–49.37% on the MovieLens dataset. The framework provides a significant improvement in recommendation accuracy and also solves the cold-start problem with application of interaction information.


Introduction
With the rapid development of artificial intelligence, machine learning, and deep learning, recommendation systems have gained popularity. Personalized recommendation systems create a close connection between users and information resources [1], aiming to provide users with information relevant to their interests; the scale of their impact is significant [2]. Classical recommendation methods such as matrix decomposition [3] model preferences through the interaction history between users and movies, for example, or through similarity functions that learn recommendations by judging the similarity of objects [4]. Trattner et al. captured the similarity of neighbors between users or movies based on their historical co-evaluation and subsequently recommended suitable movies [5]. Intelligent recommendation systems can suggest suitable movies based on different user preferences and are widely used for accurate matching of users and movies; Lavanya et al. enhanced movie recommendations by mixing different hobby profiles [6]. Auxiliary data are also exploited in recommendation systems; many approaches have been derived to further improve recommendation performance by exploiting contextual information [7,8].
With the rapid development of film, music, and online shopping, recommender systems have taken root and evolved; bidirectional neural networks, heterogeneous information networks (HINs), and knowledge graphs are shaping the next generation of recommender systems [9]. Chuan et al. improved recommendation accuracy by introducing knowledge graphs to capture heterogeneous information. HINs are often used in recommender systems to obtain information about interactions between different edge types and nodes, owing to their ability to flexibly characterize all types of heterogeneous data [10]. Meta-paths are sequences of relations connecting pairs of objects in HINs [29,30] and have been widely used to obtain semantic structural information about the different edge types and nodes related to recommendations [11,18]. Mukul et al. and Fulian et al. improved recommendation accuracy by introducing heterogeneous information networks to capture more heterogeneous information related to users and items. In this paper, we present an example of movie recommendation featuring a HIN. HIN-based recommendation methods can be classified into two types. The first type uses path-based semantic relevance, rather than a HIN, as a direct feature for recommendation relevance [12,13]; the second type performs transformations on path-based similarity to learn effective transformed features. Both approaches are designed to improve the representation of bidirectional user-item interactions by extracting meta-path-based features [14]. Hu et al. proposed a network model that introduces meta-paths into the recommendation algorithm to improve the accuracy of recommendations.
To better apply information about users and items, researchers have proposed models such as ItemKNN, Bayesian Personalized Ranking (BPR) [20], Matrix Factorization (MF), HeteRS [19], FMG [21], SVDFeature (hete) [23], MCRec [25], MCRec (avg), MCRec (mp), MCRec (rand), and MAGNN_Rec [32] for use in recommendation algorithms. However, different models apply different information, so Table 1 [36] summarizes which types of information each model applies. The information applied to a model includes user information, item information, additional information, supplementary information, user-item interaction information, and higher-order interaction information. Supplementary information includes click information, heterogeneous information, and attribute information. Additional information means information that completes the missing information. In the table, Y indicates that the model applies such information, while N indicates that it does not.
In movie recommendation algorithms [15,31], especially the traditional collaborative filtering algorithm [16,26], only the user's history of clicks [27] is used to make recommendations [17]. When collaborative filtering is used to make movie recommendations, only information about the user and the movies they have previously clicked on is used; no user-movie interaction information is applied. In addition, newly registered users and newly added movies have no such history to draw on, resulting in the inevitable cold-start problem for users and movies [28]. To solve these problems, it is necessary to apply user-item interaction information, to improve the application of associative interaction information, and to enhance user and item representations through user-item interaction information. A main area of research in the field of recommendation algorithms is the better application of user-item interaction information to improve recommendation accuracy. If more user-item interaction information can be used in recommendation algorithms, recommendation accuracy can be greatly improved. If user-item interaction information is ignored, causal and confusing information can affect the recommendation result and accuracy.
To improve the application of causal association information, reinforcement learning is a natural first candidate: with sufficient data feedback, its exploratory power can reach upper bounds unattainable by traditional machine learning. However, it suffers from several serious shortcomings: (1) poor sample utilisation; (2) difficulty balancing exploration and exploitation; and (3) learning difficulties due to delayed rewards [34]. Recommendation algorithms require fast adaptation: user needs change and alter the environment, often before the reinforcement learner has finished learning the existing needs, which in turn leads to non-convergence of reinforcement learning [33] recommendation algorithms. Therefore, when applying reinforcement learning to recommendation algorithms, it is necessary to build models based on changes in users' rating behaviour and their interest in watching movies, and to build suitable reinforcement recommendation algorithms according to the requirements of different projects. We need recommendation models that can learn consistently across different projects, so reinforcement learning [35] is not applicable to our recommendation algorithm.
To make better use of the interaction information between users and items, we construct a recommendation algorithm whose learning is more stable than that of reinforcement-learning-based recommendation. In this study, a three-way neural interaction model ⟨user, meta-path, item⟩ based on meta-path context is combined with a two-layer, one-dimensional convolutional neural network (CNN), introducing the characteristics of a collaborative attention mechanism into the model and allowing top-N recommendations to be made using meta-path-based contexts. The MCRec model can effectively represent users, items, and meta-path-based context learning; the introduction of a two-layer, one-dimensional CNN makes the model more powerful in terms of interaction. Adding a dropout layer to the interaction model alongside the two-layer CNN can prevent overfitting. The SCLW_MCRec model captures user and item information and their interaction through a three-part representation of the user, item, and user-item interaction information. The three-way neural network, the two-layer CNN, and the UMUM, UMGM, UUUM, and UMMM meta-paths obtain user-item interaction information, helping the model obtain path instance information related to the user-item pair. The extreme cross-entropy loss (argmaxminloss) combines the properties of the argmin and argmax functions and reduces the loss during model training. Weight normalization is used to better optimize the SCLW_MCRec model and accelerate the convergence of stochastic gradient descent. Extensive experiments on the MovieLens dataset show that the SCLW_MCRec model has better recommendation performance than the other models, with an improvement of 2.94-35.8% in the Prec evaluation index, 8.41-53.51% in Recall, and 24.52-49.37% in NDCG.
The main contributions of this paper are as follows.
1. The model constructs a three-way neural interaction network ⟨user, meta-path, item⟩ from meta-path contextual information.
2. The three-way neural network, the two-layer CNN, and the UMUM, UMGM, UUUM, and UMMM meta-paths obtain user-item interaction information, helping the model obtain path instance information related to the user-item pair.
3. An extreme cross-entropy loss is designed that introduces the characteristics of the argmin and argmax functions.
4. Weight normalization is used as the optimization method for the model.

General structural model
Compared to previous user-item learning, the SCLW_MCRec model introduces embedded learning of user-item interaction contexts, which captures relationships between users and items and thus affects the recommendation results. In contrast to the user and item embeddings in the two-way neural interaction model, the SCLW_MCRec model adds a further embedding, the meta-path-based contextual embedding. Although only one embedding structure is added, it addresses the application of user-item interaction information that two-way neural networks tend to overlook. For recommender systems, the interaction relationship has a direct impact on the recommendation result, just as one may judge the behavior of a person based on what they do, with the judgement resting on the relationship between them. In this study, we use a three-way neural interaction model based on meta-paths, consisting of users, items, and meta-paths. The model incorporates the meta-paths UMUM, UMGM, UUUM, and UMMM, which compensates for not otherwise considering the relationship between users and items and reduces the impact of cold starts on the recommendation results. The SCLW_MCRec model first obtains the overall user-item information. This information is then refined through the four meta-paths UMUM, UMGM, UUUM, and UMMM to obtain more accurate information about the user's interests. This approach stems from the idea of moving from information about the whole to information about the local. The architecture of the SCLW_MCRec model is shown in Fig. 1.
The ⟨user, meta-path, item⟩ three-way interaction neural network is constructed using the embedding representations of user, item, and meta-path to obtain user information, item information, and user-item interaction information, respectively. The interaction information of the UMUM, UMGM, UUUM, and UMMM meta-paths represents the final relevant path instances. A two-layer CNN is used to learn the embeddings of the final path instances, filtered for strong relevance to the user and item. The CNN is followed by a max-pooling layer to capture higher-order interaction features with greater relevance to the path instances. The path instances obtained after the max-pooling operation are processed by the dropout layer to filter out unrelated and confusing path instances. As users do not have the same degree of association with movie items obtained from different meta-paths, attention weights are assigned to the meta-paths when learning the user-item interaction representation, with higher weights assigned to those that are more relevant. Finally, the representations of user information, item information, and the path instance information processed through the meta-paths are fed to the MLP component, which models the non-linear function of complex interactions and produces the final recommendation.
The SCLW_MCRec model is obtained through contextual embedding based on meta-paths using a two-layer, one-dimensional CNN consisting of convolutional layers (generating new features through convolutional operations), a max-pooling layer (Maxpooling), and a dropout layer (Dropout). The convolutional layers consist of 128 and 256 kernels; 64 and 128 convolutional kernels are used for validation. In the model, a 1D convolutional layer is used to obtain the local features of the movie dataset. Maxpooling is used to downsample the information, reducing the number of features without losing the main features. Dropout is used to prevent overfitting, and a fully connected layer weighs the local features of the previously collected movie dataset. The SCLW_MCRec model uses a two-layer, one-dimensional CNN structure, as shown in Fig. 2. This structure captures the feature information of the path instances represented by the meta-path through the convolutional layers. The max-pooling layer captures the higher-order interaction features of the path instances, and the dropout layer improves the relevance of the interaction information for recommendations by discarding information with little or no relevance in the higher-order interaction features of the path instances.
From a mathematical point of view, the role of the SCLW_MCRec model is similar to matrix factorization, which decomposes a matrix and extracts the information it contains. The SCLW_MCRec model obtains relevant interaction information by first obtaining the feature information of the movie data related to the user and then refining it into meta-path information for UMUM, UMGM, UUUM, and UMMM, similar to first finding an entry point and then subdividing in detail from that entry point. It is this approach, moving from overall logical reasoning to local logical reasoning, that allows the SCLW_MCRec model to capture more information about

User and item embedding
Unlike HIN-based recommendation models, the meta-paths here are used as the context for interaction between users and items. The model characterizes the three-way interaction ⟨user, meta-path, item⟩ rather than the two-way interaction ⟨user, item⟩. To learn better through meta-paths to generate interactions for recommendation, the model introduces a further embedding, the meta-path-based context, in addition to the components used to learn user and item embeddings. The meta-path-based context is first modeled as a low-dimensional embedding using a hierarchical neural network. Starting from the initially learned embeddings for user, item, and meta-path-based contexts, a joint attention mechanism then enhances all three representations in turn. Using the meta-path-based context, a two-layer, one-dimensional CNN, and a dropout layer, the SCLW_MCRec model steadily improves the accuracy of movie recommendations. The symbols involved in the model are shown in Table 2. Each of these symbols plays a key role in the SCLW_MCRec model, and together they build the core of the model. In the SCLW_MCRec model, u and i represent user and item information respectively, |U| and |I| represent the total number of users and items respectively, and the remaining symbols represent individual modules or parameters and are described in detail in later sections.
For the embedding, a lookup layer is set up to convert user and item representations into low-dimensional dense vectors. For a given user-item pair ⟨u, i⟩, $m_u \in \mathbb{R}^{|U| \times 1}$ and $n_i \in \mathbb{R}^{|I| \times 1}$ are their individual representations. The parameter matrix $M \in \mathbb{R}^{|U| \times d}$ of the lookup layer stores the latent factors of the users, and $N \in \mathbb{R}^{|I| \times d}$ stores those of the items. |U| is the total number of users and |I| is the total number of items; d is the embedding dimension of users and items. The lookup performed by this layer is represented in the following equations:
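A minimal sketch of such a lookup, assuming $m_u$ and $n_i$ are one-hot vectors and that the embeddings are obtained as $x_u = M^{\top} m_u$ and $y_i = N^{\top} n_i$ (the usual form of a lookup layer; the sizes and values below are made up for illustration):

```python
# Toy lookup layer: a one-hot vector times the parameter matrix selects one row.
# Sizes and values are illustrative assumptions, not taken from the paper.

def one_hot(index, size):
    """Return a one-hot list of length `size` with a 1.0 at `index`."""
    return [1.0 if k == index else 0.0 for k in range(size)]

def lookup(one_hot_vec, matrix):
    """Compute matrix^T @ one_hot_vec, i.e. a d-dimensional embedding."""
    d = len(matrix[0])
    return [sum(one_hot_vec[r] * matrix[r][c] for r in range(len(matrix)))
            for c in range(d)]

# M has shape |U| x d (here 3 users, d = 2); N has shape |I| x d (here 2 items).
M = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
N = [[1.0, -1.0], [0.5, 0.25]]

x_u = lookup(one_hot(1, len(M)), M)  # embedding of user 1: row 1 of M
y_i = lookup(one_hot(0, len(N)), N)  # embedding of item 0: row 0 of N
```

In practice the multiplication is never carried out; frameworks index the row directly, which is why the layer is called a lookup.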

Meta-path-based interaction contexts
The SCLW_MCRec approach to building the meta-path-based context includes four steps. The first step is to embed a single path instance [18] using the two-layer CNN. The CNN structure consists of convolutional layers, a max-pooling layer [25], and a dropout layer, as shown in Fig. 2. Here m denotes a meta-path, of which there are four types (UMUM, UMGM, UUUM, and UMMM), and $X^m$ denotes the features of a path instance, which the CNN maps to an embedding using its learned parameters. The second step is to embed multiple paths [18]: because a meta-path generates more than one path instance, the output of the convolutional layers is filtered to obtain the K path instances most relevant to the user, denoted $\{h_m\}_{m=1}^{K}$, and max-pooling over them derives the meta-path embedding and retains the important dimensional features. The third step is dropout, which removes confusing information contained in the important features to make the recommendations more accurate. The last step is to aggregate the meta-path embeddings with an average-pooling operation, which prepares the context for the meta-path attention scores. In this aggregation, $f_{u \to i}$ denotes the meta-path-based contextual representation, G denotes the path instances from meta-path m, and $G_{u \to i}$ denotes the set of meta-paths used by the model for the interaction between the user and the movie item.
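The four steps above can be sketched in a few lines of plain Python. This is an illustration under stated assumptions rather than the paper's implementation: each path instance is assumed to be a sequence of node embedding vectors, the kernels are tiny hand-written lists instead of 128 and 256 learned filters, and all function names are hypothetical.

```python
import random

def conv1d_relu(seq, kernels):
    """Valid 1D convolution over a sequence of vectors, followed by ReLU.

    seq:     list of L vectors, each of length d_in
    kernels: list of F kernels, each a list of `width` vectors of length d_in
    returns: list of (L - width + 1) vectors, each of length F
    """
    width = len(kernels[0])
    out = []
    for start in range(len(seq) - width + 1):
        window = seq[start:start + width]
        feats = []
        for kernel in kernels:
            s = sum(k * x for krow, xrow in zip(kernel, window)
                    for k, x in zip(krow, xrow))
            feats.append(max(0.0, s))
        out.append(feats)
    return out

def max_over(vectors):
    """Element-wise maximum over a list of equal-length vectors."""
    return [max(col) for col in zip(*vectors)]

def mean_over(vectors):
    """Element-wise mean over a list of equal-length vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def dropout(vec, rate, rng, training=True):
    """Inverted dropout: zero each entry with probability `rate` when training."""
    if not training or rate == 0.0:
        return list(vec)
    return [0.0 if rng.random() < rate else v / (1.0 - rate) for v in vec]

def embed_instance(path, k1, k2):
    """Step 1: two stacked conv layers, then max over positions."""
    return max_over(conv1d_relu(conv1d_relu(path, k1), k2))

def embed_meta_path(instances, k1, k2, rate, rng, training=True):
    """Steps 2 and 3: max-pool the K instance embeddings, then dropout."""
    pooled = max_over([embed_instance(p, k1, k2) for p in instances])
    return dropout(pooled, rate, rng, training)

rng = random.Random(0)
# Two kernels per layer, width 2, input dimension 2 (toy values).
k1 = [[[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5], [-0.5, -0.5]]]
k2 = [[[1.0, 1.0], [0.0, 0.0]], [[0.0, 0.0], [1.0, -1.0]]]
# Two path instances (4 nodes each, 2-dimensional node features) per meta-path.
paths_a = [[[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]],
           [[0.0, 2.0], [1.0, 0.0], [0.0, 1.0], [2.0, 0.0]]]
paths_b = [[[1.0, 1.0], [1.0, 1.0], [0.0, 0.0], [1.0, 0.0]],
           [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]]

contexts = [embed_meta_path(p, k1, k2, 0.5, rng, training=False)
            for p in (paths_a, paths_b)]
c_ui = mean_over(contexts)  # Step 4: average pooling over meta-paths
```

Dropout is switched off here (`training=False`) so the result is deterministic; during training each entry would be kept or zeroed at random.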

Attention mechanism embedding module
In the model, it is possible to obtain information about the user and the item, and also about their interaction.The SCLW_MCRec model uses a different attention-mechanism embedding approach to obtain information about users and items through three components: user, item, and user-item interaction information (meta-path context representation).
$x_u$ is the embedding representation of users, $y_i$ is the embedding representation of movie items, and $c_m$ is the embedding representation of meta-path contexts. The attention mechanism for user and item embeddings uses a single-layer network to compute the attention vectors for user u and item i. The attention vectors $\beta_u$ and $\beta_i$ are then used to improve the user and item embeddings based on the calibrated meta-path context $c_{u \to i}$.
Here, $W_u$ and $W_{u \to i}$ are the weight matrices of the user attention layer and $b_u$ is its bias vector, while the weight matrix and bias vector of the item attention layer are $W_i$ and $b_i$, respectively. $g(\cdot)$ represents the sigmoid function. The final representations of users and items are then computed as the element-wise product ⊗ of each embedding with its attention vector.
Here, $x_u$, $y_i$, $m$, and $c_m$ denote the user's embedding, the item's embedding, the meta-path, and the contextual embedding of the meta-path, respectively. Because different meta-paths have different semantics in user-item interactions, the attention mechanism for the contextual representation of meta-paths uses a two-tier architecture that assigns interaction-specific attention weights to the meta-paths.
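The two steps just described (single-layer attention vectors, then element-wise reweighting) can be written as follows. This is a sketch consistent with the symbols defined in the text, not a transcription of the typeset equations: the names $\beta_u$ and $\beta_i$ for the attention vectors, the tilde notation for the final representations, and the item-side context matrix $W'_{u \to i}$ (which the text does not name) are assumptions.

```latex
\begin{aligned}
\beta_u &= g\left(W_u x_u + W_{u\to i}\, c_{u\to i} + b_u\right), &
\tilde{x}_u &= \beta_u \otimes x_u,\\
\beta_i &= g\left(W_i y_i + W'_{u\to i}\, c_{u\to i} + b_i\right), &
\tilde{y}_i &= \beta_i \otimes y_i.
\end{aligned}
```

Because $g$ is the sigmoid, every entry of $\beta_u$ and $\beta_i$ lies in (0, 1), so the attention vectors act as soft gates on the embedding dimensions.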
$$\alpha_{u,i,m} = \frac{\exp\left(\alpha^{(2)}_{u,i,m}\right)}{\sum_{m' \in G_{u\to i}} \exp\left(\alpha^{(2)}_{u,i,m'}\right)} \qquad (13)$$

The new embedding based on the meta-path context is obtained by weighting the meta-path embeddings $c_m$ with the attention weights $\alpha_{u,i,m}$. Finally, the three embedding vectors (user embedding, item embedding, and meta-path-based contextual embedding) are combined into a unified representation of the current interaction. The resulting unified representation is fed into the MLP, which implements a non-linear function to model complex interactions. The MLP component contains two hidden layers. The sigmoid function is used as the activation function, and the output layer has a ReLU function. The learning algorithm for the SCLW_MCRec model can be found in Algorithm 1. For training efficiency, we use l meta-paths and use Algorithm 1 to acquire the user-item interaction information. Given a node, all of its outgoing neighbors are filtered out and an information table is constructed so that a node can be sampled in O(1) time. In this way, the interaction information on path instances for the meta-path-generated interactions can be acquired in time O(l · L · N), where L is the path length and N is the maximum number of path instances considered for each meta-path.
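A small sketch of the softmax normalization in Eq. (13) and the attention-weighted context follows. The scores and vectors are made-up numbers, and combining the three embeddings by concatenation is an assumption, since the text only says they are combined into a unified representation.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scalar scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(contexts, scores):
    """Weight each meta-path context c_m by its softmax attention and sum."""
    weights = softmax(scores)
    dim = len(contexts[0])
    pooled = [sum(w * c[k] for w, c in zip(weights, contexts))
              for k in range(dim)]
    return weights, pooled

# Second-layer attention scores alpha^(2) for the four meta-paths (toy values).
meta_paths = ["UMUM", "UMGM", "UUUM", "UMMM"]
scores = [2.0, 1.0, 0.0, 1.0]
contexts = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]

weights, c_ui = attend(contexts, scores)

# Assumed combination: concatenate user, context, and item embeddings.
x_u, y_i = [0.3, 0.4], [1.0, -1.0]
unified = x_u + c_ui + y_i  # list concatenation, length 2 + 2 + 2
```

Subtracting the maximum score before exponentiating leaves the weights unchanged but avoids overflow for large scores.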

Loss function
To improve the movie recommendation algorithm and reduce the loss of the model as much as possible, a loss function is

The weights are determined by including the argmin and argmax functions to introduce the extreme value theorem to obtain more comprehensive user-item interaction information and reduce loss; m and n denote the results obtained by introducing the argmin and argmax functions for the actual and predicted values, respectively.
Compared with other loss functions, the loss function in this model can obtain more complete data features to reduce loss and make more accurate recommendations.

Optimizer
For model optimization, the convergence of stochastic gradient descent is accelerated by using weight normalization to reparameterize the weight vectors. Applying weight normalization to movie recommendation models shows clear advantages.
The weight-normalization approach considers a standard artificial neural network, in which the computation of each neuron consists of two parts: a weighted sum of the input features and an element-wise non-linearity. In this computation, b is the scalar bias term, w is the k-dimensional weight vector, and x is the k-dimensional vector of input features. Each weight vector w is re-parameterized in terms of a scalar parameter g and a parameter vector v, and stochastic gradient descent is then performed on the new parameters.
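Written out, and assuming the standard weight-normalization formulation (the symbols $y$ for the neuron output and $\phi$ for the non-linearity are introduced here for illustration), the neuron and its re-parameterization are:

```latex
y = \phi(w \cdot x + b), \qquad w = \frac{g}{\lVert v \rVert}\, v
```

With this choice $\lVert w \rVert = g$ regardless of $v$, so $g$ carries the length of the weight vector and $v / \lVert v \rVert$ carries its direction.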
Here, v is a k-dimensional vector, $\lVert v \rVert$ denotes the Euclidean norm of v, and g is a scalar. The neural network is trained in the new parameterization using standard stochastic gradient descent. The gradients of the loss function L with respect to the new parameters g and v are obtained by differentiating through the re-parameterization.
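Assuming the re-parameterization $w = (g / \lVert v \rVert)\, v$, differentiating through it gives the following gradients; this is a sketch of the standard weight-normalization result:

```latex
\nabla_g L = \frac{\nabla_w L \cdot v}{\lVert v \rVert}, \qquad
\nabla_v L = \frac{g}{\lVert v \rVert}\,\nabla_w L
           - \frac{g\,\nabla_g L}{\lVert v \rVert^{2}}\, v
```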
Here, $\nabla_w L$ is the usual gradient with respect to the weights w. Backpropagation with weight normalization therefore requires only minor modifications to the usual backpropagation equations and can be implemented with standard neural network software. The gradient with respect to v can also be written as $\nabla_v L = \frac{g}{\lVert v \rVert} M_w \nabla_w L$ with $M_w = I - \frac{w w^{\top}}{\lVert w \rVert^{2}}$, where $M_w$ is the projection matrix onto the complement of the vector w. This shows that weight normalization accomplishes two things: it scales the weight gradient by $g / \lVert v \rVert$, and it projects the gradient away from the current weight vector. Both effects help bring the covariance matrix of the gradient closer to identity and benefit optimization.
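As a numerical check of the two ways of writing the gradient, the following sketch computes both for a toy two-dimensional example. The function names and numbers are illustrative assumptions; the formulas are the standard weight-normalization ones.

```python
import math

def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))

def weight_from(g, v):
    """Re-parameterized weight vector w = (g / ||v||) * v."""
    norm = math.sqrt(dot(v, v))
    return [g * vi / norm for vi in v]

def grads_direct(g, v, grad_w):
    """Gradients w.r.t. g and v computed from the gradient w.r.t. w."""
    norm = math.sqrt(dot(v, v))
    grad_g = dot(grad_w, v) / norm
    grad_v = [g / norm * gw - g * grad_g / norm ** 2 * vi
              for gw, vi in zip(grad_w, v)]
    return grad_g, grad_v

def grad_v_projected(g, v, grad_w):
    """Projection form: (g / ||v||) * M_w @ grad_w, M_w = I - w w^T / ||w||^2."""
    norm = math.sqrt(dot(v, v))
    w = weight_from(g, v)
    coeff = dot(w, grad_w) / dot(w, w)
    return [g / norm * (gw - coeff * wi) for gw, wi in zip(grad_w, w)]

g, v = 2.0, [3.0, 4.0]        # ||v|| = 5
grad_w = [1.0, -2.0]          # pretend gradient of the loss w.r.t. w

w = weight_from(g, v)
grad_g, grad_v = grads_direct(g, v, grad_w)
grad_v_alt = grad_v_projected(g, v, grad_w)
```

Both forms give the same vector, and that vector is orthogonal to v, which is the projection property described above: a gradient step on v changes the direction of w but not, to first order, its length.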
In several experiments on film and television recommendation models, neural networks with weight normalization worked well over a wider range of learning rates than with the conventional parameterization. This study uses weight normalization to decouple the length of each weight vector from its direction by reparameterizing the weights in the neural network, which accelerates the convergence of stochastic gradient descent and improves the model optimization process considerably without introducing dependencies between training examples. This suggests that weight normalization can also be applied to deep reinforcement learning or generative models, and that its introduction into film and television recommendation systems is effective. Weight normalization is simpler than alternatives such as batch normalization and has a lower computational overhead, allowing more optimization steps to be performed in the same amount of time.

Experimental analysis
In this experiment, the SCLW_MCRec model is tested on the MovieLens movie dataset and evaluated against other models by modifying the loss function, activation function, optimizer, convolutional kernel, and network information structure using the Prec, Recall, and NDCG indices.

Datasets and selected meta-paths
The MovieLens dataset contains user and movie attribute information, ratings of individual movies by different users, and different types of interactions between users and movies, users and users, movies and movies, and movies and genres, as shown in Table 3. MovieLens is one of the most commonly used datasets for recommendation systems and a common test dataset for machine learning algorithms. Many well-known papers have used this dataset; it was also used in a historical recommendation system competition. Table 3 presents the data in the MovieLens dataset. The first column corresponds to the user, the item, the number of interactions between the two, and the meta-path information available. The other columns present statistics for the other relationships: User-Movie, User-User, Movie-Movie, and Movie-Genre. User-Movie corresponds to the UMUM meta-path, User-User to the UUUM meta-path, Movie-Movie to the UMMM meta-path, and Movie-Genre to the UMGM meta-path.
UMUM denotes the paths to movies followed by users who follow the same movies, constructed as user-movie-user-movie; UMGM denotes the paths to movies of the same genre as movies followed by the user, constructed as user-movie-genre-movie; UUUM denotes the paths to movies followed by users linked to the user, constructed as user-user-user-movie; and UMMM denotes the paths to movies associated with the movies that the user watches, constructed as user-movie-movie-movie. The dataset used in this paper is based on the public MovieLens dataset, which was refined by constructing the user-movie-user-movie, user-movie-genre-movie, user-user-user-movie, and user-movie-movie-movie paths. The resulting dataset contains information about the user, the movie, their interaction, and the path instances of the four meta-path contexts.
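To make the construction of path instances concrete, the sketch below enumerates UMUM and UMGM instances on a tiny hand-made graph. All identifiers and edges are illustrative assumptions, and the enumeration is exhaustive, whereas a real pipeline would cap or sample the number of instances per meta-path.

```python
# Toy heterogeneous graph; all identifiers and edges are made up.
user_movie = {"u1": ["m1"], "u2": ["m1", "m2"]}
movie_genre = {"m1": ["g1"], "m2": ["g1"], "m3": ["g2"]}

def invert(adj):
    """Reverse an adjacency map, e.g. user->movies into movie->users."""
    out = {}
    for src, dsts in adj.items():
        for dst in dsts:
            out.setdefault(dst, []).append(src)
    return out

movie_user = invert(user_movie)
genre_movie = invert(movie_genre)

def walk(start, hops):
    """Enumerate path instances by following a list of adjacency maps."""
    paths = [[start]]
    for adj in hops:
        paths = [p + [nxt] for p in paths for nxt in adj.get(p[-1], [])]
    return paths

# user-movie-user-movie and user-movie-genre-movie instances starting at u1.
umum = walk("u1", [user_movie, movie_user, user_movie])
umgm = walk("u1", [user_movie, movie_genre, genre_movie])
```

For `u1`, the UMUM walk reaches `m2` through `u2`, who shares `m1` with `u1`; the UMGM walk reaches `m2` because it shares genre `g1` with `m1`. Instances that loop back to an already visited node are kept here for simplicity.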

Evaluation indices
To validate the effectiveness of the SCLW_MCRec model, three evaluation indices, Prec, Recall, and NDCG, are used to evaluate the model recommendation performance.
Recall is the fraction of the items in T(u) that appear in R(u), and precision (Prec) is the fraction of the items in R(u) that appear in T(u), where R(u) represents the top-n recommendation list made to the user based on the user's behavior in the training set, and T(u) represents the set of items actually selected by the user after the system has recommended items to the user.
NDCG (normalized discounted cumulative gain) rewards relevant items that appear near the top of the recommendation list; in its definition, $r_i$ and $p_i$ denote the relevance and the rank position of item i from T(u) within R(u).
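The three indices can be sketched for a single user as follows. Two assumptions are made: relevance is binary (an item is either in T(u) or not) with the common log2 rank discount, and the scores are computed per user, whereas a full evaluation would aggregate over all users.

```python
import math

def precision_recall(recommended, relevant):
    """Precision and recall of a top-n list R(u) against the set T(u)."""
    hits = len(set(recommended) & set(relevant))
    return hits / len(recommended), hits / len(relevant)

def ndcg(recommended, relevant):
    """Binary-relevance NDCG@n with a log2 rank discount."""
    dcg = sum(1.0 / math.log2(rank + 1)
              for rank, item in enumerate(recommended, start=1)
              if item in relevant)
    ideal_hits = min(len(relevant), len(recommended))
    idcg = sum(1.0 / math.log2(rank + 1)
               for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg > 0 else 0.0

R_u = ["m1", "m2", "m3", "m4"]   # top-4 recommendation list (toy data)
T_u = {"m1", "m3", "m9"}         # items the user actually selected

prec, rec = precision_recall(R_u, T_u)
score = ndcg(R_u, T_u)
```

Here two of the four recommended items are relevant, so precision is 2/4 and recall is 2/3; NDCG is below 1 because one relevant item sits at rank 3 and another is missing from the list.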

Advanced baseline
This section describes several advanced recommendation models used as baselines and compares them with the SCLW_MCRec model. The comparison gives a more objective view of how effectively the SCLW_MCRec model obtains user-item interaction information through the three-way interaction neural network and the two-layer CNN with four meta-paths (UMUM, UMGM, UUUM, UMMM), and of the optimization effects of the extreme value loss function and the weight-normalization optimization method on film and television recommendation models.
ItemKNN: The ItemKNN model is a classical collaborative filtering model that recommends similar items based on those chosen by users in the past.
Bayesian Personalized Ranking (BPR) [20]: The BPR model is based on Bayesian theory to maximize the posterior probability with prior knowledge and minimize the pairwise ranking loss of implicit feedback.
Matrix Factorization (MF): This is a standard matrix decomposition method, trained with a cross-entropy loss for top-N recommendation.
HeteRS [19]: HeteRS is a recommendation method based on heterogeneous networks that uses multivariate Markov chains to model user preferences.
FMG [21]: This is a heterogeneous network-based rating prediction model.
MCRec [25]: This is a new type of deep neural network with a common attention mechanism for top-N recommendation using context based on rich meta-paths.
MAGNN_Rec [32]: This model uses graph neural networks to aggregate different levels of interaction information, such that the user-item representations obtained are more closely related to the meta-path context. The interaction information is efficiently applied to improve the accuracy of the recommendation performance.

Experimental results
For this experiment, all user implicit feedback records from the movie dataset were randomly divided into a training set (80%) and a test set (20%). The model evaluation results were compared in terms of precision (Prec), recall (Recall), and normalized discounted cumulative gain (NDCG) to determine the strengths and weaknesses of the different models. The results for each parameter of the model were compared with those of previous models using the same dataset, preprocessing method, optimizer, and loss function.
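A random 80/20 split of feedback records can be sketched as below. The record format, seed, and function name are illustrative assumptions; the text does not specify how the split was implemented.

```python
import random

def split_records(records, train_fraction=0.8, seed=0):
    """Shuffle the records with a fixed seed and cut them into train and test."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

# 50 toy (user, movie) implicit-feedback records.
records = [("u%d" % (k % 5), "m%d" % k) for k in range(50)]
train, test = split_records(records)
```

Fixing the seed keeps the split reproducible, which matters when several models are compared on the same partition.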
The validity of the SCLW_MCRec model was verified.The SCLW_MCRec model can effectively recommend movies.To demonstrate the effectiveness of the SCLW_ MCRec model, it was compared with the ItemKNN, BPR [20], MF, HeteRS [19], FMG [21], SVDFeature [23], and SVDFeaturemp models, a variant of the MCRec model, and the MCNGNN-Rec model.The ItemKNN model can only be used by the movies they have watched in the past, ignoring the information about the interaction between the user and the item..The BPR recommendation model adds implicit feedback to the movie information based on the history of movies watched, but is still ineffective in applying user-item interaction information.HeteRS [19], FMG [21], SVDFeature [23], and SVDFeaturemp models can obtain heterogeneous information and item attribute information for recommendation, but application of interaction information is still inadequate.mCRec model variants obtain user-item interaction information using meta-paths.However, the filtering of the obtained path instances is not sufficient.
Unlike the ItemKNN and BPR models, the SCLW_MCRec model can obtain user and item information, and also user-item interaction information. The three-way neural interaction network with a two-layer, one-dimensional CNN can obtain higher-order user-item interaction information. Unlike the MF, HeteRS [19], FMG [21], SVDFeature [23], and SVDFeaturemp models, the SCLW_MCRec model obtains user and item attributes and selects the most suitable of the four meta-path methods (UMUM, UMGM, UUUM, UMMM) using both interaction and attribute information. Compared to the MCRec and variant models, the SCLW_MCRec model is better at selecting path instances and applying higher-order interaction information across the UMUM, UMGM, UUUM, and UMMM meta-paths. Application of the extreme cross-entropy loss function and the weight-normalization optimization method can optimize the model and reduce model loss. Compared with the MAGNN_Rec model, the SCLW_MCRec model is less capable of aggregating information, but more capable of selecting suitable meta-path instances and obtaining higher-order interaction information for recommendation.
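The weight-normalization step can be sketched as follows, assuming it refers to the standard reparameterization w = g · v / ||v||, in which a weight vector is split into a direction v and a scalar length g. The helper names and the numbers are hypothetical, and the sketch covers a single weight vector rather than the full network.

```python
import math


def weight_norm(v, g):
    """Return w = g * v / ||v||, so that ||w|| equals |g| regardless of v's scale."""
    norm = math.sqrt(sum(x * x for x in v))
    return [g * x / norm for x in v]


def weight_norm_grads(v, g, grad_w):
    """Map a gradient with respect to w into gradients with respect to g and v.

    grad_g = (grad_w . v) / ||v||
    grad_v = (g / ||v||) * grad_w - (g * grad_g / ||v||^2) * v
    """
    norm = math.sqrt(sum(x * x for x in v))
    grad_g = sum(gw * x for gw, x in zip(grad_w, v)) / norm
    grad_v = [
        g / norm * gw - g * grad_g / (norm * norm) * x
        for gw, x in zip(grad_w, v)
    ]
    return grad_g, grad_v


if __name__ == "__main__":
    v, g = [3.0, 4.0], 2.0          # hypothetical direction and length parameters
    grad_w = [1.0, -0.5]            # hypothetical loss gradient w.r.t. w
    print(weight_norm(v, g))
    print(weight_norm_grads(v, g, grad_w))
```

Because the gradient with respect to v is orthogonal to v, a gradient step changes the direction of the weight vector without directly rescaling it, which is the usual explanation for why this reparameterization can speed up convergence of stochastic gradient descent.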
The SCLW_MCRec model has both advantages and disadvantages compared to currently available state-of-the-art recommendation models. To clearly demonstrate the strengths and weaknesses of the models, this section tests and evaluates different models using the Movielens dataset with the same parameters, and determines the Prec, Recall, and NDCG evaluation indices, as shown in Table 4. The SCLW_MCRec model produced a significant improvement in all evaluation indices compared to the individual recommendation algorithms. Compared with ItemKNN, BPR, MF, HeteRS, SVDFeature, and SVDFeaturemp, the SCLW_MCRec model had significantly improved Prec, Recall, and NDCG indices, as it can obtain user-item interaction information. Although the FMG and MCRec variant models also apply user-item interaction information, the SCLW_MCRec model can obtain more relevant interaction information, and its Prec, Recall, and NDCG indices all improved. Compared with the MAGNN_Rec model, the SCLW_MCRec model had improved Recall and NDCG indices; the NDCG increased by 6.7%. The experimental results demonstrate the effectiveness of the SCLW_MCRec model recommendation performance.
The SCLW_MCRec model is more effective for movie recommendation than the other recommendation models. As shown in Fig. 3, the evaluation indices of the SCLW_MCRec model are much higher than those of the other models, especially NDCG. The SCLW_MCRec model is better suited to analysis and recommendation with the Movielens dataset.
In this study, several experiments were conducted using the Movielens dataset. To demonstrate the validity of the proposed model, an ablation analysis was performed; the results are presented in Table 5. In the ablation experiments, the proposed model was ablated by deleting or replacing individual modules, while keeping the corresponding parameters constant. Four cases were considered: (1) the MCRec model with only the three-way neural network; (2) the SC_MCRec model with the three-way neural network and the two-layer, one-dimensional CNN; (3) the SCL_MCRec model with the three-way neural network, the two-layer, one-dimensional CNN, and the improved loss function; and (4) the SCLW_MCRec model with the three-way neural network, the two-layer, one-dimensional CNN, the improved loss function, and weight-normalization optimization. The other parameter settings were unchanged.
Compared to the MCRec model with only a three-way neural network, the SC_MCRec model with a three-way neural network and a two-layer, one-dimensional CNN obtained more user-item interaction information based on the meta-path context, and allowed the model to obtain better interaction information and higher-order user-item interaction features for accurate recommendations to users. The SCL_MCRec model with a three-way neural network, a two-layer, one-dimensional CNN, and an improved extreme cross-entropy loss function improved acquisition of higher-order interaction features and enabled the model to better reduce losses during the movie recommendation training process, improving the effectiveness of the model. The SCLW_MCRec model with a three-way neural network, a two-layer, one-dimensional CNN, an improved extreme cross-entropy loss function, and a weight-normalization optimization approach was more stable and better able to provide accurate recommendations to users.
To better describe the improvement in recommendation performance on the Movielens dataset provided by each module in the SCLW_MCRec model, the ablation experiment results are presented in the form of a line graph, as shown in Fig. 5. Each module in the model increases recommendation performance on the Movielens dataset, demonstrating the effectiveness of the SCLW_MCRec model in recommender systems.

Conclusion
A network model combining a three-way neural interaction network and a two-layer CNN was designed, and a cross-entropy loss function and weight-normalization optimization method were devised. The movie recommendation accuracy was greatly improved using four meta-path methods (UMUM, UMGM, UUUM, UMMM) to obtain user-item interaction information; the interaction information obtained through meta-path filtering is more in line with user interests. On the MovieLens dataset, the Prec, Recall, and NDCG evaluation indices all improved significantly. The SCLW_MCRec model compensates for the shortcomings of other recommendation models in learning effective representations of user, item, and meta-path contexts, with powerful interaction features that can more effectively process user-item interactions that are easily overlooked. However, for each new dataset, the meta-paths must be designed and annotated manually; the model does not provide a way to automatically design and select meta-paths based on data interaction information and directly apply them in other scenarios. In future research, the main goal is automatic selection of the single or multiple meta-paths that are best suited to the application, regardless of the dataset, filtering the interaction information that provides accurate recommendations to the user by means of a self-attention or similar attention mechanism. In addition, we aim to design optimal loss functions and optimization methods for different meta-path approaches to improve the convergence speed and reduce the loss of the model.

zhisheng925@163.com
1 School of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
2 College of Computer and Information Science, College of Software, Southwest University, Chongqing 400715, China

Fig. 2 Structure of two-layer CNN. Final expression: r_{u,i}

The extreme cross-entropy loss is designed with the advantages of the cross-entropy loss function and the characteristics of the argmin and argmax functions. The loss function refers to the abs function, which returns the absolute value, and can be used for training and recommendation with the Movielens dataset to reduce loss. The loss function also introduces the extreme value theorem in referring to the argmin and argmax functions, which includes the maximum value and does not ignore the minimum value, allowing the SCLW_MCRec model to obtain more comprehensive information about user-item interactions. The model loss function is expressed in the following equations, where y_i denotes the actual labeled value, and ŷ_i denotes the predicted labeled value. Loss = (1 + weight) · log ŷ_i.

Fig. 3 Bar chart comparing evaluation indices of advanced movie recommendation models

Fig. 4 Advanced baseline model evaluation index comparison line chart

Fig. 5 Comparison of ablation experiment results

Table 1 Recommendation algorithm model information contribution table

Table 4 Comparison of evaluation indices for advanced models

Table 5 Ablation study on Movielens dataset