Abstract
Recommendation systems are an effective solution to deal with information overload, particularly in the e-commerce sector, in which sequential recommendation is extensively utilized. Sequential recommendations aim to acquire users’ interests and provide accurate recommendations by analyzing users’ historical interaction sequences. To improve recommendation performance, it is vital to take into account the long- and short-term interests of users. Despite significant advancements in this domain, some issues need to be addressed. Conventional sequential recommendation models typically express each item with a uniform embedding, ignoring evolutionary patterns among item attributes, such as category, brand, and price. Moreover, these models often model users’ long- and short-term interests independently, failing to adequately address the issues of interest drift and short-term interest evolution. This study proposes a new model, the Feature-aware Long-Short Interest Evolution Network (FLSIE), to address the above-mentioned issues. Specifically, the model uses explicit feature embedding to represent item attribute information and employs a two-dimensional (2D) attention mechanism to distinguish the significance of individual features in a specific item and the relevance of each item in the interaction sequence. Furthermore, to avoid the issue of interest drift, the model employs a long-term interest guidance mechanism to enhance the representation of short-term interest and adopts a gated recurrent unit with attentional update gate to model the dynamic evolution of users’ short-term interest. Experimental results indicate that our presented model outperforms existing methods on three real-world datasets.
Keywords
Introduction
With the rapid growth of information, recommendation systems play an important role in e-commerce [1], news [2], social media [3], and other online platforms. A Sequential Recommendation (SR) is defined as recommending the subsequent item based on the user’s sequential historical behavior, aiming to improve user experience and bring colossal business value to the platform [4]. In the real world, users have both persistent long-term interests and dynamic short-term interests. Therefore, it is critical to accurately model the evolution of users’ long- and short-term interests from their complex behavior.
In recent studies, researchers have presented a number of SR models that aim at modeling user’s long-term and short-term interests and capturing the evolution patterns of users’ historical behavior. These models include traditional methods, neural network-based methods, and transformer-based methods. First, traditional methods mainly rely on Markov chains to learn users’ short-term interest [5, 6], but they ignore users’ long-term interest and cannot accurately capture dynamic changes in complex scenarios. Second, neural network-based methods include Recurrent Neural Network (RNN)-based methods [7, 8] and Convolutional Neural Network (CNN)-based approaches [9] that construct complex relationships by learning a large amount of user interaction behavior and capturing the evolution of short-term interests [10]. However, these methods only consider the user’s recent actions, resulting in the model having solely short-term memory and recommending only items related to the user’s recent behavior. Third, transformer-based models [11, 12] adopt a self-attention mechanism and have achieved satisfactory results. However, the self-attention mechanism has not considered the position information of sequences. Most algorithms fail to distinguish between users’ long-term and short-term interests and instead model the overall interaction sequence of the user. However, this distinction is crucial for practical sequential recommendations [13]. Despite the ability of these sequential methods to achieve satisfactory results, two challenges still need to be addressed.
This first challenge is how to capture users’ long- and short-term interests from the feature level. Most existing solutions use the item ID as the unique attribute for each item. However, each item is associated with multiple features, such as category, brand, and price. Essentially, user’s behavior is driven by the evolution of users’ preferences for item features. Therefore, using a uniform embedding to represent each item does not distinguish the importance of different item features. As illustrated in Fig. 1, the user initially chooses to browse the iMac and wants to browse the same brand. Thus, when browsing similar items (e.g., two mobile phones), users are more likely to click on those items with attributes they actually care about (e.g., brand and price). After purchasing a phone, the user’s needs may shift to phone supplements (e.g., headphones and chargers), so the user can shop according to the product type. Finally, while browsing headphone merchandise, the user will purchase AirPods. In this example, the user ends up interacting with earphones because he or she is interested in the Apple brand. AirPods are commonly regarded as an essential iPhone accessory, significantly elevating the demand for AirPods among users. Thus, each user has complex dependencies on different attributes, and the change in item attributes will dramatically affect the user’s decision.
An example of user sequential behavior and an illustration of the evolution of user preferences on item features.
The second challenge addressed in this research is how to model the evolution of long- and short-term interests. Most approaches typically model the user’s short-term and long-term interests independently and then concatenate them to reflect the their final interests. However, the linear concatenation between long-term and short-term interests lacks a modeling of user habits, which leads to interest drift [14, 15]. GRU4Rec [7] utilizes a gated recurrent unit (GRU) to extract information from user interaction sequences, while Caser [9] embeds a series of recent items into an “image” in the time and latent spaces and then uses convolutional filters to learn the sequence patterns, similar to using a convolution filter to learn local features of the image. Although these methods can successfully extract the main interests of users from the interaction sequence, they cannot model the evolution process of users’ interests and solve the problem of interest drift. For example, short-term interests take precedence when users browse similar items consecutively, whereas when they switch to substantially different items, users’ behavior is primarily influenced by long-term interests. As illustrated in Fig. 1, the user may consistently gravitate toward electronic products (a reflection of their long-term interest) while simultaneously becoming interested in clothes at a given moment (a short-term interest). Therefore, it is crucial to adaptively combine short- and long-term interests to predict future interactions.
In this study, we propose a novel model, namely FLSIE, to address the abovementioned issues. FLSIE can effectively leverage item features to capture users’ feature-level preferences. Moreover, FLSIE designs a long- and short-term interests evolution mechanism that not only captures the long-term and short-term interests of users from their behavioral sequences but also strengthens the influence of users’ habits on the evolution of their preferences. Finally, accurate prediction/recommendation is achieved by using user dynamic preference. The proposed method FLSIE includes two main parts.
For addressing the first challenge, we draw inspiration from the success of attribute-aware neural models [16, 17] and utilize explicit feature embedding to represent item information. This approach provides a solid foundation for accurately representing user interests. Furthermore, we adopt a 2D attention mechanism that includes vertical attention to obtain user feature-level preferences and horizontal attention to distinguish the significance of items in the interaction sequence. Therefore, the model can successfully capture the user’s short- and long-term interests in the attributes of each item within the sequences.
To solve the second challenge, we adopt the approach of user dynamic interest modeling [18, 19]. We analyze users’ long- and short-term interests to simulate the user interest evolution. Specifically, our model utilizes GRU to encode short-term preferences and explicitly extract users’ current interests. Then, we use a user’s habits to guide short-term preferences and reduce the interference of interest drift. Finally, we employ a gated recurrent unit with attentional update gate (AUGRU) [20, 10] to capture the evolution of dynamic preferences.
In summary, this study offers the following main contributions:
We introduce explicit feature embedding and utilize an attribute-aware attention mechanism to learn the feature representation of items and then capture users’ feature-level preferences from their interaction sequences. We propose a novel FLSIE that captures user preferences from their historical behavior sequence. Moreover, AUGRU is utilized to capture the evolution of user preferences. We also introduce a joint learning mechanism that combines long-term interests and current interests to learn the evolution of user interests. Extensive experiments are conducted on three public recommendation datasets (Amazon, Gowalla, and Yelp). The experimental results demonstrate that the FLSIE model achieves significant performance improvements in HR and NDCG compared with existing baselines.
This section provides an overview of the existing research on sequential recommendation. Considering that user preferences are constantly changing, SR models mainly rely on Markov chains, neural networks, and attention mechanisms to capture users’ short-term interest, long-term interest, and long- and short-term interests.
Sequential recommendation
In early studies, methods based on Markov chains were utilized to capture the user’s potential interest in the user interaction sequences. For example, Factorizing Personalized Markov Chains (FPMC) [5] is a notable next basket of recommendation algorithms that combines Matrix Factorization (MF) and Markov Chains (MC) to simulate a users’ general preferences and mine their sequential behavior, respectively. He et al. [21] employed first-order MCs, while He et al. [22] and He and McAuley et al. [6] utilized higher-order MCs to consider more dependencies for determining user preferences. However, those MC-based approaches only considered the relationship between users and items and only utilized the correlation between adjacent items to generate recommendations. Therefore, the methods neither could take into account the dependency relationships in the user behavior sequences nor capture the long-term preferences of users.
On the one hand, Recurrent Neural Networks (RNN) have been extensively utilized in recommendation systems because of their powerful feature extraction and generalization capabilities. For example, Hidasi et al. [7] used GRU to encode the user’s recent session to obtain the user’s current interest. Yuan et al. [8] classified the context information and redefined the GRU unit’s update gate and reset gate to model the dynamic user interests. Quadrana et al. [23] presented the HRNN, and Ma et al. [24] developed the HGN, both employing hierarchical RNN to capture users’ interests. Nevertheless, the RNN-based sequential recommendation model relied on the recurrent structure of the current action, resulting in high training costs, although it can capture the dependencies between items.
On the other hand, Convolutional Neural Networks (CNN) and Graph Neural Networks (GNN) are commonly used in sequential recommendation. Tang et al. [9] proposed a convolutional sequential embedding recommendation model called Caser that employs CNN to extract sequence features. Although Caser can extract higher-level features by using fully connected layers, it only considers the conditional probability distribution of the last item in the sequence. Consequently, capturing complex relationships and long-term dependencies in a sequence can be difficult. Moreover, Wu et al. [25] proposed SR-GNN that utilizes GNN to extract item-embedding vectors and combine long-term and short-term preferences via the attention layer. These sequential recommenders can effectively identify the user’s primary interests by utilizing sequential interactions. However, the aforementioned models fail to capture how user interests evolve over time.
Attention mechanism for sequential recommendation
Recently, transformer-based methods [26] have successfully captured long-term dependencies in sequences. Specifically, the self-attention mechanism module has achieved state-of-the-art results in sequence recommendation. For instance, Kang et al. [11] developed SASRec that utilizes a self-attention mechanism to consider the relationship between items for adaptively modeling user behavior. Sun et al. [12] developed BERT4Rec that utilizes bidirectional self-attention modules to capture pairwise relationships between user interactions and items. Li et al. [27] extended SASRec by increasing the items’ absolute position embeddings and time interval embeddings. Li et al. [28] and Liu et al. [29] employed an encoder-decoder framework to model user-item interactions and capture short-term preferences. Furthermore, Lin et al. [30] used a dual-interest factorization-heads attention sequential recommendation model that captures shifting patterns of behavior through positive and negative feedback. In their work, skipping and not skipping can be viewed as negative and positive feedback. Compared with RNN and CNN, the attention mechanism has higher parallelism and can better model users’ diverse preferences. However, the abovementioned methods mainly focus on capturing users’ interests from recent behavior, failing to capture users’ general interests and complex conversion patterns.
Modeling on item attributes and evolution
In the realm of sequential recommendation, modeling user interests involves two critical aspects: Attribute-aware and interaction behavior. On the one hand, additional item attributes are used to enhance the potential representation of vectors, which have been proven to play a critical role in alleviating sparse interaction data. For instance, Zhang et al. [31] adopted feature-based self-attention to learn how features transfer between items. Moreover, some studies [32, 33, 34, 35] adopted different decoupled representation learning methods based on the correlation between items and their attributes to achieve higher expressive power. Furthermore, Zhu et al. [36] utilized a disentangled self-attention mechanism to simulate the interactions of complex features. Although sequential recommendation methods mentioned above have been successful, they ignore the evolution of user interests among item attributes.
On the other hand, the evolution of users’ interests can be captured using rich interaction information between users and items. For instance, Lv et al. [17] and Ying et al. [37] divided user’s interaction sequences into long- and short-term subsequences and then explicitly extracted embeddings for long- and short-term interests. Ma et al. [38] modeled long- and short-term interests of users, utilizing bilinear functions to accurately capture the co-occurrence patterns between relevant items. Some works [39, 40] not only focused on the changes in user sequences but also paid attention to the evolution of user preferences over time. Ma et al. [1] divided user sequences based on attribute values and then modeled the item evolution over attribute dynamics with these users. These works have successfully extracted the main interests of users through sequential interactions, but it ignores the evolution of users’ short-term interest.
Different from the aforementioned works, our method introduces a unified framework for capturing the long- and short-term evolution of user behavior. This framework partitions the user’s behavior sequences into long- and short-term subsequences and employs attention mechanisms and RNN to extract the user’s long-term interest and short-term interest in the item attributes. Moreover, AUGRU is utilized to flexibly combine long- and short-term interests. The experimental results show that this approach can achieve a significant improvement in recommendation accuracy and provide a better recommendation experience.
Problem statement
Before delving into the discussion of our proposed model, let us first clearly define the problem and basic concepts. Formally, assume
In summary, our goal is to encode the user’s short-term and long-term behavior sequences
List of key notations
List of key notations
Overall structure of FLSIE.
In this section, we discuss the various components of the proposed FLSIE model. Figure 2 shows the overall structure of the FLSIE model, which comprises the side information embedding module, the long-term interest modeling module, the short-term interest evolution module, and the prediction module. Initially, we split the user’s historical interaction records into two sequences: fixed-length long-term interaction sequences
Input embedding with side information
In this subsection, we initially introduce how to explicitly embed side information from the system, such as item attributes. This process involves transforming each attribute into its corresponding embedding representation, enhancing the model’s ability to convey the significance and impact of these attributes. Subsequently, by employing an attention mechanism, we can more precisely capture users’ varying levels of attention across different attributes, thereby constructing a more accurate representation of user feature-level preferences.
Explicit feature embedding
In e-commerce recommendation scenarios such as Taobao, users pay attention to a certain product and its attributes, including category, price, brand, and so on. Different user attributes, such as gender, age, and city, will generate different interests in the items. Side information, such as user and item attributes, can provide additional information for improving prediction accuracy. Nonetheless, the majority of existing sequential models continue to utilize a unified embedding to represent each item. These models fail to distinguish the importance of different item features. Therefore, we utilize an item-embedding approach based on various feature scales of the item. Formally, let K be the number of features in the system. Then, we denote the explicit embedding of feature
where
Similarly, we generate the embedding of feature
where
Then, we concatenate features of different scales of the user to represent the user’s embedding which is defined as:
On the basis of explicit feature embedding, we utilize an attention mechanism to acquire user feature-level preferences. In traditional attention sequential recommendation models, users’ attention to items is usually horizontally distributed along the sequence, allowing for distinguishing these users’ attention to different items. However, the previous models often overlook the importance of different item features. By contrast our model associates each item with multiple features. To describe users’ interests in more detail from the perspective of features, we design vertical attention for features and horizontal attention for items. Formally, for the behavior sequences
Vertical Attention.
The purpose of implementing vertical attention is to enhance item embedding by assigning weights to different item features. Specifically, after fusing the side information, the embedding of item
where
Horizontal Attention.
The purpose of horizontal attention is to discern the relative significance of various items within a specific sequence. Assume that the enhanced historical item embeddings are
The user’s long-term interest indicates the user’s historical habits. In the online shopping scenario, users’ interaction is characterized by randomness, but in most cases, they will interact with the items they are interested in. Therefore, unrelated actions can affect the expression of a user’s long-term interest. To capture users’ long-term interest, we use horizontal attention for items in historical interactions to reduce the impact of irrelevant behavior.
This model leverages the item set
where
A user’s short-term interest is determined by the items they have recently interacted with. Previous studies [9, 41] indicate that users tend to engage with items that are closely associated with the content they have accessed recently. Therefore, in the sequential recommendation, the users’ short-term interest must be simulated effectively, as reflected by recently visited items.
shows the structures of GRU and AUGRU. Here, 
A) Interest Extractor Layer.
To extract the user’s short-term interest and global dependencies, we represent the user’s k most recent interactions by using the embedding sequence
where
However,
B) Interest Guidance Layer.
Given the diversity of interests, users’ interests may drift. The influence of interest drifting on user behavior may manifest as users becoming interested in certain electronic products in a specific period. At the same time, users may need clothes in another period. Therefore, our model introduces an interest guidance attention mechanism to enhance users’ short-term interest coding. The function of the attention mechanism is described as follows:
where
C) Interest Evolution Layer.
In earlier study [37], users’ final preferences were formed by concatenating their long-term and short-term interests, which limited the performance of the model. In the interest evolution layer, we use a GRU with attention mechanism (AUGRU) to model the evolution of user interest. As AUGRU adaptively fuses users’ short-term and long-term interests, the interference of interest drift can be effectively avoided. The neural framework of AUGRU is shown in Fig. 3. The calculation formula of AUGRU is similar to that of GRU in the interest extraction layer:
where
Prediction
The basic idea of the FLSIE model is to acquire the long-term interest
where
[h] : Learning Algorithm of FLSIE
convergence shufe the set of observations (u,
observation (u,
compute
compute loss L according to Eq. (16)
update model parameters
return
Our model optimization aims to provide a ranked list of items based on the short- and long-term behavior sequences. Therefore, we prioritize the relative ranking of predicted ratings instead of their absolute values. During the training process, let
where
Experimental setting
In this section, we provide a detailed description of the experimental setup, including the datasets used, evaluation metrics, baseline models, and implementation details.
Datasets
We evaluate the performance of our proposed FLSIE model in terms of its ability to recommend the next item on three real-world datasets, namely, Beauty, Yelp, and Gowalla. The details are introduced as follows:
In evaluating the model’s performance, the leave-one-out strategy [45] is employed following the pre-processing steps from prior works [32, 37]. The presence of a review or rating in these datasets indicates implicit feedback, demonstrating user interaction with an item. Timestamps are utilized to establish the order of actions in the sequence. Furthermore, for the above mentioned dataset, all items with less than 20 user interactions are removed. Then, we consider all user interactions in the recent period as a session to represent the user’s short-term behavioral sequences. The rest of the interactions represents the user’s long-term behavioral sequences. Here, users with less than 3 item sessions or less than 3 sessions should be filtered out. Therefore, for the historical sequence
Statistics of three datasets
The recommendation model generate a ranked list of items for each user. To evaluate the baseline model and our model, we utilize two commonly used evaluation metrics, namely, Hit Ratio at rank K (HR@K) and Normalized Discounted Cumulative Gain at rank K (NDCG@K) [4, 21], with K chosen from {10, 20}.
where
To evaluate the proposed model, we compare FLSIE with eight sequential recommendation methods. These methods include traditional factorization-based methods, deep-learning based methods, and long- and short-term sequential recommendation methods.
To achieve a fair comparison, we implement all baseline models and our proposed model using the recommendation framework RecBole [46] while ensuring that identical settings are used for all models. We utilize the Adam optimizer [43] to optimize all models while implementing early stopping when no improvement is observed on the validation dataset after 20 epochs. The batch size is 256, while the learning rate and feature embedding size are varied within the ranges of [1e-1, 1e-2, 1e-3, 1e-4] and [32, 64, 128, 256], respectively. The L2 regularization factor
Performance comparison with baselines
Comparative performance of all models on three datasets. The bold and underlined values in each line represent the optimal and second-best results, respectively (the larger the value, the better the performance). The last column shows the percentage increase in performance in contrast to the best baseline
Comparative performance of all models on three datasets. The bold and underlined values in each line represent the optimal and second-best results, respectively (the larger the value, the better the performance). The last column shows the percentage increase in performance in contrast to the best baseline
In this section, we compare the recommended performance of the proposed model FLSIE with the abovementioned baselines on three real public datasets, as shown in Table 3. The observation results are as follows:
BPR, FPMC, and ADAR all adopt matrix factorization methods. However, their results are lower than those of other baseline methods because BPR, FPMC, and ADAR do not effectively use sequence information. Nevertheless, FPMC performs better than BPR because the former models the user’s recent interactions with a first-order Markov chain. ADAR considers the attributes of items, so its results are higher than those of the other two baselines. This finding indicates that attribute information can enhance the representation of items. GRU4Rec and NARM perform better than traditional baselines, suggesting that RNN-based models can effectively handle sequence information within sessions. NARM considers the user’s current intention and outperforms GRU4Rec on all three datasets. NARM achieves excellent results, only slightly lower than the strongest baseline. STAMP and SASRec both perform well. On the Amazon and Gowalla datasets, SASRec becomes the strongest baseline, indicating that self-attention mechanisms have relatively good predictive power. STAMP outperforms GRU4Rec on the Gowalla and Yelp datasets because it reduces the impact of interest drift by treating the last click as the current interest. SHAN outperforms others on the sparse Yelp dataset by distinguishing between the long-term and short-term interests of users. Additionally, the number of long-term items exceeds that of short-term items, hence the difficulty for SASRec with a self-attention mechanism to assign appropriate weights to vital items within the short-term sequence.
Finally, our proposed model outperforms all baselines on the Amazon, Gowalla, and Yelp datasets, achieving optimal performance. Specifically, on the Amazon dataset, FLSIE gains 21.06% HR and 18.71% NDCG improvements on average compared with the strongest baseline SASRec. On the Yelp dataset, our model achieves more than 10% improvement in the HR and more than 9% improvement in the NDCG. Furthermore, the model has attained some improvement on the Gowalla dataset. FLSIE first leverages attribute-enhanced item representation and can learn user interests from item attributes in a fine-grained manner. The model incorporates attention mechanisms and RNNs to model the long- and short-term interests of users. In addition, our model employs habit-guiding mechanisms to reduce interest drift and deliver more personalized recommendations based on long- and short-term interest evolution.
Our model consists of several components: an explicit feature embedding module, long-term and short-term interest modeling modules, a habit-guided attention module, and a long- and short-term interest evolution module. To investigate the effectiveness of each component, we conduct an ablation study by removing some components to obtain different variant models. FLSIE-Att is the model in which the item attributes are removed, FLSIE-S is the model with only the user’s long-term interest is modeled, FLSIE-L is the model that only considers the user’s recent preferences, FLSIE-Hb is the model in which the habit-guided attention module is removed, and FLSIE-Agg is the model that directly concatenates the long-term and short-term interest models without considering the adaptive fusion between them. We conduct an ablation study by comparing our model to these five variants.
Performance comparison of FLSIE and its variants on three different datasets. The horizontal axis represents the various models, while the primary and secondary ordinates show the models’ scores on HR@20 and NDCG@20 evaluation metrics, respectively. The bar chart presents the HR@20 metric, while the line chart depicts the NDCG@20 metric.
Figure 4 shows the ablation results. Our proposed full model (i.e., FLSIE) consistently outperforms all of its variants. We have observed the following: First, FLSIE utilizes explicit feature embedding to learn user interests at a much finer granularity from attributes. Given the different sparsity levels of the datasets, FLSIE-Att performs better on the dense Amazon dataset than on the sparse Gowalla and Yelp datasets, effectively alleviating the cold-start problem. Second, on the Amazon dataset, FLSIE-S performs better than FLSIE-L. By contrast, on the Gowalla and Yelp datasets, FLSIE-L performs better than FLSIE-S, indicating that differentiating between users’ long-term interest and short-term interest can affect their next actions. Meanwhile, on datasets in which FLSIE-S performs well, FLSIE-Hb has a better performance improvement, suggesting that the habit-guiding mechanism can reduce user interest drift. Finally, the results of the FLSIE-Agg model are slightly lower than those of the full model. Thus, by modeling the user’s long-term and short-term interest evolution, we can adaptively integrate the two interests to provide more personalized recommendations to users.
We study the impact of hyper parameters on model performance, including the size of attribute embedding dimension
Influence of an Attribute’s Latent Dimensionality
.
We conduct comparative experiments with five different sizes of attribute-embedding dimensions
First, a much small
Influence of the latent dimensionality 
Performance of FLSIE with varying s and l values at NDCG@20 on the Amazon and Yelp datasets.
Impact of Short- and Long-Term Max Sequence Lengths.
We investigate the impact of the short-term and long-term maximum sequence lengths (represented by parameters s and l, respectively) on our model. To validate our findings, we conduct experiments with varying sequence lengths on different datasets and evaluate the model’s effectiveness by using the NDCG@20 metric. As shown in Fig. 6, increasing the sequence length within a certain range can improve the model’s performance. However, setting s and l to very extremely values may result in losing the interest information of users, whereas setting them to large values may cause sparsity issues and increase the computational costs. Two main reasons contribute to this phenomenon. First, as the sequence length increases, much more historical interaction sequences are captured and modeled. Second, overly long sequences may contain too much redundant information and introduce noise, which can limit the model’s performance. Our results indicate that it is advantageous to consider a user’s previous and recent interactions when modeling his/her ultimate interest. To optimize training efficiency, we set s
In this study, we propose a novel sequential recommendation framework called the Feature-aware Long-Short Interest Evolution Network (FLSIE). FLSIE aims to capture users’ preferences for item features and effectively model the evolution of their long-term and short-term interests. Specifically, the model employs explicit feature embeddings to represent item attribute information and utilizes a two-dimensional (2D) attention mechanism to differentiate the importance of individual features within specific items and their relevance in the interaction sequences. Furthermore, to address the issue of interest drift, we introduce a long-term interest guidance mechanism to enhance the representation of short-term interests. Additionally, we adopt AUGRU to dynamically model the evolution of users’ short-term interests. The experimental results demonstrate that the FLSIE model outperforms other baseline methods in terms of HR and NDCG evaluation metrics on three public datasets.
However, we also recognize that the FLSIE model has some limitations, including its relatively high computational complexity and the treatment of various user-item interactions (e.g., click, bookmark, add to the shopping cart, and purchase) as equal. In reality, different behavior types contribute differently to users’ interests, which deserves further consideration and optimization. In the future, to further enhance the model’s performance, we will plan to employ GNN to study the temporal evolution relationship between different user-item interaction types. By modeling the dynamic relationships among various interaction types, we aim to more accurately capture changes in user interests and improve the accuracy of the recommendation model.
Footnotes
Acknowledgments
This work was supported by the Chunhui Plan Cooperative Project of Ministry of Education (Grant No: Z2011018), the Sichuan Science and Technology Program (Grant No: 2023YFQ0044) and the Postgraduate Course Teaching Team Project of Xihua University (Grant No: YTD202303).
