Abstract
Service recommendation systems play a crucial role in delivering personalized user experiences across various domains. However, capturing the heterophily patterns and the multi-dimensional nature of user-service item interactions poses significant challenges. To address these challenges, we propose a service recommendation model named
Introduction
In the era of explosive growth in digital services, service recommendation systems serve as the core hub connecting users with massive service items, aiming to precisely capture users’ dynamic preferences and model complex interaction relationships to improve service recommendation accuracy. By filtering vast amounts of content, this approach has become critical for enhancing user experiences in diverse applications, such as social network platforms, 1 streaming services, 2 , and online shopping websites. 3
The rise of graph neural networks (GNNs) has brought revolutionary advancements to service recommendation systems, effectively modeling implicit collaborative signals by capturing the topological structure of user-service item interaction graphs.4–6 Early GNN-based collaborative filtering (CF) models, such as LightGCN 7 and UltraGCN, 8 significantly improved service recommendation accuracy by aggregating neighbor node features through graph convolution operations.
However, these models inherently rely on pairwise interaction modeling, which limits their ability to capture only direct associations between users and service items. As a result, they face significant challenges in modeling complex, high-order relationships. Specifically, they struggle to effectively represent semantic correlations that arise from multiple users interacting with the same service item simultaneously, or from users engaging with multiple heterophilic (i.e., dissimilar) service items. For instance, discovering latent cross-category interests—such as users who simultaneously prefer both “technology information” and “outdoor equipment”—or identifying cross-group interaction patterns—such as users from different age groups collectively engaging with a “health management application”—requires modeling beyond pairwise interactions. Unfortunately, the traditional binary edge-based modeling approach, which only captures point-wise user-item interactions, is insufficient for capturing such nuanced, high-order correlations. This limitation results in service recommendations being confined to a single dimension and lacking deep insights into users’ diverse needs and group commonalities.
To overcome the representational bottleneck of binary interactions in service recommendation, hypergraphs have been introduced into relevant frameworks9–11 — their core advantage lies in naturally supporting group-level interaction modeling via hyperedges that connect multiple nodes (e.g., a hyperedge linking one user to all service items they interact with, or one service item to all users who engage with it). Typical hypergraph-based methods, such as DHCF 12 and SHT, 13 have demonstrated strengths in high-order relationship modeling: DHCF adopts a two-pass hypergraph neural network to separately learn user and service item embeddings, while SHT integrates global collaborative signals through hypergraph transformation and attention strategies. However, both models lack explicit mechanisms for heterophily modeling, leading to critical limitations in capturing real-world diverse interactions:DHCF treats all nodes within a hyperedge as homogeneous, failing to distinguish cross-category signals. For instance, it cannot preserve a user’s simultaneous preference for “technology information services” and “outdoor equipment services,” resulting in the loss of cross-type interaction information. SHT perpetuates the “homophily assumption” of traditional GNNs, assuming nodes of the same type (e.g., same-category service items, users with similar demographic features) have similar embeddings. It thus cannot adjust information propagation for heterophilic nodes — such as when users from different age groups collectively select a “health management app” — and overlooks the unique value of dissimilar nodes in hyperedges.In summary, existing hypergraph recommendation models (e.g., DHCF, SHT) excel at high-order interaction modeling but uniformly neglect the prevalent heterophily interactions in practical scenarios — i.e., the phenomenon where heterophilic nodes (cross-category service items, users with dissimilar features) are connected by the same hyperedge. This oversight directly limits their ability to model users’ diverse preferences and complex service correlations.
Heterophily refers to the phenomenon where linked nodes have dissimilar features and different class labels,14,15 As illustrated in Figure 1. In contrast to hypergraph models that ignore heterophily, recent heterophily-aware methods (e.g., Hetero

Example of heterophilic pattern in user and service item hypergraphs.
In real-world service recommendation scenarios, users frequently interact with heterophilic service items (e.g., browsing movies, games, and educational services simultaneously), requiring models to capture both the tight correlations among homogeneous service items, the differential features of heterophilic service items, and the diverse latent preferences behind these interactions. However, three types of existing models fall short here: First, existing hypergraph convolutional networks (HGCNs)
18
rely on deeply stacked layers to aggregate multi-hop neighbor information, which triggers severe over-smoothing issues in strongly heterophilous scenarios—feature disparities of heterophilic nodes are progressively diluted, diminishing inter-layer representation discriminability; Second, hypergraph models like DHCF and SHT have path dependency on “homophily signals”, so they cannot dynamically adjust information propagation weights for heterophilic nodes (e.g., users of different age groups co-interacting with a service item), exacerbating the modeling deficiency of heterophilous patterns
19
; Third, multi-interest models like ComiRec (Cen et al., 2020)—which generate multiple interest embeddings to capture cross-category preferences—are based on sequential data or bipartite graphs, not hypergraphs, so they cannot integrate high-order interaction semantics (e.g., latent interests reflected by a user’s co-interaction with “movies + games + education” services) into interest modeling. These limitations make it challenging to capture all relevant signals (e.g., shared preferences, cross-genre interests) and irrelevant signals (e.g., irrelevant demographic overlaps), which is a key motivation for developing
To address the above limitations, this paper proposes the
The main contributions of this work are summarized as follows: We propose a service recommendation model named We propose a heterophily-aware attention mechanism in We design a dynamic multi-interest learning module that generates multi-interest embeddings by disentangled users’ latent interest spaces and dynamically activates relevant interests based on target service item features, enabling fine-grained modeling of users’ cross-category preferences.
The remainder of this paper is organized as follows: Section 2 introduces the related work. Section 3 gives problem statement and basic definitions. Section 4 presents empirical observations of the challenges of heterophily on service datasets. Section 5 provides a detailed description of
Traditional collaborative filtering and graph neural network-based service recommendation
The core objective of service recommendation systems is to capture dynamic preferences by modeling user-service item interactions. Early approaches such as matrix factorization (MF)20–22 and neighborhood-based collaborative filtering (CF)23,24 learn user and service item embeddings from explicit or implicit feedback but struggle to capture high-order structural information. With the development of graph neural networks (GNNs), graph-based collaborative filtering models (e.g., LightGCN, 7 NGCF 25 ) have significantly improved service recommendation accuracy by aggregating neighbor node features through graph convolutions and encoding interaction structures into node representations. These models rely on the edge structure of user-service item bipartite graphs and focus on pairwise interaction modeling, but they are limited in handling complex multi-way relationships (e.g., multiple users interacting with the same service item or users engaging with multi-category service items), making it difficult to mine implicit high-order collaborative signals.
Hypergraph neural networks and high-Order interaction modeling
To overcome the representational bottleneck of binary interactions, hypergraphs have been introduced into the service recommendation field due to their ability to model group-level interactions. Hypergraphs connect multiple nodes through hyperedges, naturally capturing diverse associations between users and service items (e.g., service item sets interacted with by a user or user groups associated with a service item). Typical methods like SHT, 13 which integrates global collaborative signals via hypergraph transformation and attention mechanisms, and DHCF, 12 which uses a two-pass hypergraph neural network to separate user and service item embeddings, have demonstrated advantages in high-order relationship modeling. However, existing hypergraph models generally adhere to the ‘homophily assumption’—assuming that homogeneous nodes have similar embeddings-while ignoring the prevalent heterophily interactions in real-world scenarios, where heterophilic nodes (e.g., cross-category service items, users of different age groups) are connected by hyperedges. This neglect of heterophily signals leads to semantic gaps in modeling complex interaction patterns.
Heterophily learning
Heterophily refers to the tendency of nodes with different attributes or types to form connections, has gained increasing attention in graph learning.14,15 Traditional GNNs rely on homophily signals, leading to significant performance degradation on heterophilous graphs. To address this, heterophily-aware methods have been proposed, such as Hetero
Multi-Interest modeling, dynamic preference disentangling and others
User preferences are often multi-faceted, spanning multiple categories or domains. Dynamic multi-interest models (e.g., DMI-GNN, 26 ComiRec 27 ) generate multiple interest embeddings via attention mechanisms or disentangling networks and activate relevant interests based on target service items, effectively capturing cross-category preferences. However, most existing multi-interest methods are based on sequential data or bipartite graph structures and do not integrate with hypergraph-based high-order interaction modeling, leading to neglect of high-order semantics in user-service item interactions (e.g., latent interest dimensions reflected by multiple service items) during interest disentangling. 15 Additionally, these methods do not explicitly address the impact of heterophily on interest activation—traditional interest matching mechanisms may fail when target service items are heterogeneous from users’ historical interaction service items.
To address historical interaction scarcity in cold-start recommendation, 28 researchers have explored four complementary research threads: Content Features29–31: exploits inherent user/item descriptive attributes (e.g., profiles, descriptions) with data-incomplete or efficient learning to model cold-start entities; Graph Relations32–34: enrich cold-start entities’ context via interaction graph enhancement, heterogeneous/attribute/knowledge graph extension, or graph aggregator optimization; Domain Information35–37: borrow information from related, data-rich domains via transfer, alignment, or domain-invariant representation learning; World Knowledge from LLMs38–40: prompt or fine-tune large language models to inject external commonsense and semantic priors for previously unseen users or items. Our work falls within the graph-relations thread: we leverage higher-order structures to alleviate cold-start scarcity, and we explicitly integrate heterophily modeling—a perspective still largely unexplored in this line of research.
Multimodal recommendation, a prominent direction in recommender systems, effectively enhances performance via diverse modalities. It captures latent cross-modal correlations and recovers complementary information unavailable to unimodal or implicit interaction-based methods. Notable advancements have been made recently with innovative frameworks.41,42 Yet a key challenge remains—multimodal data is not always accessible in practice, and this constitutes a research direction we will consider in future work.
Problem formulation
In this section, we present the preliminaries and problem statement. Specifically, we first define the user-service item interactions and user-service item hypergraphs, followed by the definitions of heterophily patterns and homophily metrics. Then we give the definition of the proposed problem.
Service interaction graph
In service recommendation scenarios, the interaction graph for service recommendation serves as a core data structure for modeling explicit associations between users and service items. Typically, it is a bipartite graph
User-Service item hypergraphs
The hypergraphs in this paper are divided into user hypergraphs and service item hypergraphs:
User Hypergraph Service Item Hypergraph
Hyperedge construction adheres to the core theoretical principles of maintaining the consistency of high-order interaction semantics and preserving heterophily signals, and is divided into two types: user hyperedges and service hyperedges. User hyperedges are built based on the collaborative filtering assumption that “users with similar interaction patterns share consistent preference” and the principle of “heterophily tolerance”. They are formed by grouping users who meet either of the following criteria: a relatively high similarity in their service category preference distributions, or co-interaction with several types of homophily services. For example, consider a user Alice, who has interacted with services such as book borrowing and offline restaurant reservations, and a user Bob, who has interacted with services like library resource access and chain café bookings. Although there are differences in their service usage scenarios—specifically, Alice may be more inclined to offline consumption while Bob prefers online consumption, reflecting a certain degree of heterophily—their overall interaction preference distributions show high similarity, and they have also co-interacted with several homophily services under the “daily life service” category. Thus, Alice and Bob are incorporated into the same user hyperedge—this not only captures the commonalities in their preferences but also preserves the heterophily-related differences between them. Service hyperedges follow the logic of “explicit preservation of heterophily”, grouping all services interacted with by a single user into a single hyperedge. Taking the Alice mentioned above as an example, all services she has interacted with are integrated into a single service hyperedge. within which homophilous and heterophilous services coexist. This coexistence provides essential signals for subsequent heterophily modeling.
Heterophily in hypergraphs
In the context of user-service item hypergraphs, heterophily manifests as users or service items with disparate features being interconnected via the same hyperedge. Unlike binary interaction graphs, hypergraphs often exhibit heterophily patterns. This is because they are capable of modeling diverse high-order relationships, grouping users or service items with distinct characteristics within the same hyperedge based on shared interactions or contexts. Identifying heterophily patterns is of paramount importance for enhancing service recommendation accuracy. By discerning these patterns, models can distinguish between varied and contrasting relationships within the hypergraph structure. This enables a more nuanced understanding of user-service item interactions, facilitating the provision of more targeted and precise service recommendations.
Hypergraph homophily ratio
Homophily metrics quantify the correlation between graph structural adjacency and node attributes/labels, which is a core basis for evaluating the applicability of graph models to specific scenarios. Traditional homophily metrics are primarily designed for pairwise graphs (with binary edges connecting only two nodes) and have been widely adopted in homogeneous graph learning as Formula 1:
This metric only characterizes pairwise edge-level homophily. Node-level homophily (equation (??)) averages the label agreement over each node’s immediate neighbours, but still collapses relationships into pairwise comparisons:
In service-recommendation hypergraphs, a single hyperedge can simultaneously connect users who prefer “technology news,” “outdoor equipment,” and “health management.” Because the traditional metrics never inspect the label composition of an entire multi-node group, they systematically underestimate or overestimate the true degree of homophily and overlook the high-order semantic cues needed to model cross-category preferences. We therefore require a measure that evaluates label consistency within each hyperedge as a whole rather than merely between node pairs.
Thus, to quantitatively analyze heterophily patterns in hypergraphs, we define the Hypergraph Homophily Ratio metric
Where
Let a service interaction graph
Empirical observations
To deeply investigate the intrinsic correlation between hypergraph homophily and the performance of graph neural networks (GNNs), we conducted an empirical study to analyze the effects of different homophily levels on the performance of service recommendation models. Specifically, we want to empirically examine whether the recommendation performance of classical GNN models degrades as the the degree of heterophily in the underlying user-service interaction patterns increases. To address this, we select Steam and MovieLens (the details of these two datasets will be introduced in Section 6.) as the datasets and employ a 2-layer GCN 43 with ReLU activation and a hidden size of 64 as the baseline model. Additionally, two widely used metrics for service recommendation, Recall@K and NDCG@K (details of these metrics will also be introduced in Section 6) are selected to evaluate the recommendation performance. The research results are presented in Table 1, from which the following observations can be made:
The recommendation result of GCN. H(G) denotes the hypergraph homophily ratio in sec 3.
The recommendation result of GCN. H(G) denotes the hypergraph homophily ratio in sec 3.
These two findings demonstrate that the hypergraph homophily ratio significantly impacts the service recommendation performance of graph convolutional networks (GCN): hypergraph structures with stronger homophily more efficiently support traditional GCN in capturing node similarity, while performance degradation of GCN in strongly heterophilous scenarios exposes their reliance on homophily signals and insufficient modeling of heterophilous interactions. These results inspire us that the limitations of traditional GNNs in heterophilous hypergraphs fundamentally stem from the lack of mechanisms to process heterophilous signals in high-order interactions, urgently requiring the design of targeted mechanisms to explicitly capture heterophily patterns.
In this section, we provide a detailed analysis of our proposed

The main modules of the proposed
Given the initial feature embeddings of user node
Building upon traditional graph attention mechanisms (e.g., GAT
44
), to calibrate raw attention weights and effectively capture interaction signals between heterophilic node pairs in hypergraphs, we utilize the heterophily discrepancy to adjust information propagation. Taking the service item hypergraph as an example, after computing the initial attention coefficient
To explicitly model the degree of heterophily between nodes, we introduce the discrepancy
This heterophily discrepancy
To normalize attention coefficients and eliminate magnitude differences, we apply softmax to obtain the final weight coefficient
The output feature of each node is then obtained by computing the linear combination of features weighted by these coefficients:
Following these steps, we derive the user feature
In the context of hypergraph recommendation, traditional models often overlook the coupling of homophily and heterophily signals, leading to feature confusion and over-smoothing in high-order interaction modeling. The high-order structure of hypergraphs (hyperedges connecting multiple nodes) and the heterophilous feature distribution pose two core challenges for contrastive learning: On one hand, nodes within hyperedges exhibit both homophilous and heterophilous associations, yet traditional models fail to explicitly separate these two types of signals. For example, ‘action games’ and ‘strategy games’ interacted by a user form heterophilous connections in hyperedges but are aggregated as homophilous signals by conventional methods, resulting in blurred feature representations and loss of cross-category interest information. On the other hand, in deep hypergraph convolution, the feature differences of heterophilous nodes are gradually diluted with increasing layers. Especially in strongly heterophilous scenarios where the hypergraph homophily ratio
To address the above challenges, we propose a Disentangled Hypergraph Contrastive Module (DHCM), which explicitly separates homophily and heterophily representations and designs dual contrastive tasks to enhance feature discriminability. The core idea of this module is to decompose high-order interactions within hyperedges into homophilous semantic channels and heterophilous semantic channels. Through contrastive learning, it respectively reinforces the aggregation of homogeneous nodes and the differentiation of heterophilous nodes, thereby breaking through the dependency of traditional models on the “homophily assumption”. Following this line of thought, each graph node is represented as:
To enforce the statistical independence between
Next, we design a contrastive auxiliary task to learn
Concretely, we randomly select some edges from all possible edges within
Homophily contrastive subtask: Encourage node pairs that belong to the same hyperedge to have higher similarity in the Heterophily contrastive subtask: Enforces that pairs of nodes not belonging to the same hyperedge are more similar in the
Accordingly, the contrastive loss is defined as equation (12). This mask-contrastive strategy takes the user set
For pairs
Similarly, by applying the same approach to the user hypergraph, we obtain
Users often exhibit diverse preferences through interactions with multiple heterophilic service items in hypergraphs (e.g., users favoring both action and strategy games in Figure 1), where traditional single user representations struggle to capture such diverse interests. To address this, we design a multi-interest learning module that decouples users’ latent interest spaces to generate dynamic interest representations relevant to target service items, enabling precise modeling of multivariate preferences in heterophilous scenarios.
For user node
To solve the interest adaptation problem in service recommendations, we introduce a target-oriented attention mechanism to dynamically activate the most relevant interest embeddings based on the features of target service item
A user representation
For user
Integrating the disentangling loss in equation (11) , the contrastive loss in equation (14),with the cross-entropy loss equation (19), the overall objective function of
The space complexity of the model is dominated by three components: hypergraph structure storage, node feature storage, and dynamic multi-interest vector storage. The model exhibits an overall lightweight nature with no redundant space overhead, and the space requirements of each module are as follows: In the hypergraph construction phase, it is necessary to store hyperedge structures (
The time complexity of the model shows linear growth in both training and inference phases, without the full-graph pairwise computation redundancy of traditional GNNs. The time cost consists of four core modules: Hypergraph construction (
In conclusion, while our model incorporates hypergraph construction and dynamic multi-interest learning modules, its overall space complexity remains lightweight (compatible with consumer-grade hardware) and time complexity shows linear growth (without full-graph pairwise redundancy of traditional GNNs), leading to controllable resource consumption. It offers a more effective methodology for service recommendation by accurately capturing both homophily/heterophily signals and high-order interaction semantics in hypergraphs.
Experiments
Datasets
The statistics of the two datasets after preprocessing are detailed in Table 2.
Description of datasets.
Description of datasets.
We selected two widely used metrics in service recommendation tasks, Recall@K and Normalized Discounted Cumulative Gain@K (NDCG@K),46,47 to evaluate the performance of our model and baselines. Here,
Baselines
We compared our model with several state-of-the-art methods: GCN
43
: A fundamental graph neural network model that aggregates node neighbor information through graph convolution operations to learn node representations in graph structures. LightGCN
7
: This is a streamlined version of GCN, omitting the feature transformations and non-linear activations present in traditional GCN. GCE-GNN
48
: A session-based recommendation (without explicit user identities, relying on interaction sequences within sessions) GNN model. It captures users’ short-term interaction behaviors through graph neural networks enhanced by global context. DHCF
12
: A hypergraph-aware divide-and-conquer recommendation approach using a two-channel hypergraph neural network to achieve separate and simultaneous learning of user and service item embeddings. SHT
13
: A hypergraph-aware transformer based approach employing hypergraph attention strategies. The model further proposes a data augmentation method by supplementing CF signals with different perspectives. HMGSR
49
: This presents a hierarchical attention mechanism to learn higher-order features on motif adjacency matrices. Hetero DMI-GNN
26
: A multi-inteCaptures users’ dynamic multi-interest preferences in sessions, modeling the diversity and evolution of interests via graph neural networks.
Parameter settings
We conducted experiments on an Ubuntu server equipped with an Intel Xeon Silver 4210 CPU, an NVIDIA A100 SXM4 GPU (40GB), and 251GB of RAM. The model was implemented in PyTorch 2.0 and compiled in a Python 3.10 environment.
To achieve optimal performance, we fine-tuned all hyperparameters and report the key settings. The experiments were trained using the AdamW optimizer with mini-batch gradient descent. Through empirical optimization, we determine the hyper-parameters of our model as follows:
The initial learning rate ( Batch Size is set to Contrastive Loss Weight ( Number of Interests ( Disentangling Loss ( The number of hidden dimensions in one head of multi-level attention mechanism The number of multi-head attention mechanism Epochs (with Early Stopping) are set to 100-200 (Patience=10).
For all baselines, we utilized the source code provided by the authors and performed hyperparameter tuning. We then compared these optimized results with those of our proposed model.
Performance comparison
Tables 3 and 4 ,5 summarize the experimental results of all models on the three datasets. The optimal performance for each evaluation metric is highlighted in bold. Experimental results demonstrate that our model exhibits significant advantages in both strong heterophily (Steam) and moderate heterophily (MovieLens) scenarios, with its core strengths stemming from the precise modeling of high-order heterophilic interactions and users’ diverse preferences.
Performance comparison of
GNN and baseline models on the steam dataset.
Performance comparison of
Performance comparison of
Performance comparison of
When comparing different types of baseline models, traditional GNNs and session-based models reveal significant limitations. As a fundamental graph convolutional model, GCN achieves an NDCG@20 of only 13.56% on the Steam dataset, less than half of our model’s performance. This stems from its reliance on binary edge modeling, which fails to capture multi-node interactions within hyperedges (e.g., users interacting with multiple heterophilic service items simultaneously), and the lack of heterophily awareness leads to the loss of cross-category interest information. Session-based models GCE-GNN and DMI-GNN, focusing on short-term interaction sequences or lacking explicit user identities, perform poorly in long-term preference modeling. For example, DMI-GNN achieves a Recall@20 of only 26.65% on Steam, far lower than our model’s 56.25%.
Although hypergraph-based baseline models SHT and DHCF can model high-order interactions, their performance is limited by the lack of explicit heterophily handling. SHT integrates global signals through hypergraph transformation and attention mechanisms, achieving an NDCG@10 of 21.88% on Steam. However, due to the failure to calibrate weights for heterophilic nodes, the dominance of homophily signals undervalues the information contribution of cross-feature users (e.g., different age groups). DHCF uses a two-pass hypergraph neural network to separate user/service item embeddings, achieving a Recall@20 of 57.24% on MovieLens. However, the absence of a heterophily-aware mechanism prevents it from effectively distinguishing the semantic features of heterophilic service items within hyperedges (e.g., movie genre differences), resulting in a recommendation accuracy 12.9% lower than our model.
The heterophilic graph model Hetero
To evaluate the contributions of key components to the overall performance of the model, we conducted ablation experiments on the Steam and MovieLens datasets. As shown in Table 6, we evaluated different model variants: (1) ‘w/o 1’: Uses only the standard GAT module without the heterophily-aware mechanism. (2) ‘w/o 2’: Excludes the contrastive learning module. (3) ‘w/o 3’: Omits the multi-interest learning component, directly computing the dot product between user and service item representations from the previous step.
Performance comparison of ablation study.
Performance comparison of ablation study.
From the results, we observe significant performance degradation across core metrics on both datasets when removing the heterophily-aware mechanism (variant ‘w/o 1’). This indicates that the mechanism effectively captures complex interaction patterns in heterophilic graphs, enhancing feature aggregation efficiency between different node types and significantly improving the model’s ability to capture user preferences. Without this mechanism, the graph attention module defaults to conventional homophily-based aggregation, failing to leverage complementary information in heterophilic structures and thus reducing recommendation accuracy.Removing the contrastive module (variant ‘w/o 2’) leads to a further decline in performance. The auxiliary module provides regularization constraints through additional supervision signals (e.g., temporal features of user behavior or service item context attributes), mitigating overfitting. Its absence weakens the model’s generalization ability, particularly in sparse interaction scenarios, where it cannot infer latent user interests from multi-dimensional information, thereby affecting recommendation diversity and accuracy. Eliminating the multi-interest learning module (variant ‘w/o 3’) results in the most substantial performance drop, underscoring its critical role in capturing dynamic and diverse user preferences. By disentangling user representations into multiple interest vectors, the module enables finer-grained matching with multi-dimensional service item features, avoiding the oversimplification of user interests inherent in direct dot-product operations. Without this component, the model fails to adapt to users’ contextual needs, especially in long-term interaction scenarios, leading to a significant decline in recommendation quality.
The ablation results systematically validate the indispensability of each core component: the heterophily-aware mechanism enhances structural modeling through optimized information aggregation in heterophilic graphs, the contrastive module strengthens generalization via multi-source supervision, and multi-interest learning enables precise preference matching through disentangled user representations. These components form a synergistic framework that collectively enables high-precision recommendation. The data further demonstrate that in complex recommendation scenarios, models relying solely on basic interaction modeling or simplistic feature matching struggle to address the diversity and dynamics of user interests, necessitating structured designs and multi-dimensional feature fusion for performance breakthroughs.
Cold-start is a prevalent challenge in practical service recommendation, primarily manifesting as user cold-start (new users with
As shown in Table 7,
Cold-start evaluation.
Cold-start evaluation.
We also investigated the impact of the number of interests on model performance. As illustrated in Figure 3, on the Steam dataset, as the number of interests increases, the Recall@20 and NDCG@20 metrics exhibit a trend of first increasing and then decreasing. When the number of interests is 3, both metrics reach their peaks, at which point the model most efficiently captures and utilizes users’ diverse interests. A similar trend is observed on the MovieLens dataset, where Recall@20 and NDCG@20 achieve their highest values when the number of interests is 5. These results indicate that for different datasets, an optimal number of interests exists, which can effectively disentangle users’ diverse preferences while avoiding information redundancy or deficiency caused by excessive or insufficient interest dimensions. This validates the critical role of reasonably setting the number of interests in the dynamic multi-interest learning module for achieving precise matching of user needs in recommendations.

The impact of interests number
Figure 4 explores the influence of the parameter

The impact of the hyperparameter
To intuitively verify the ability of the DHCM module to decouple homophily-heterophily signals of nodes, we conduct a visualization analysis of node embeddings at different training stages. By observing the distribution of nodes and their connection relationships in a 2D space, we quantify the decoupling effect of the module.
Multiple types of nodes are selected for the experiment. Node embeddings are extracted at four key training stages: the 1st epoch, 40th epoch, 80th epoch, and 120th epoch. Their distribution characteristics are visualized via 2D dimensionality reduction (

Impact of node disentanglement.
The core function of the DHCM module is to explicitly separate the “homophily signals” and “heterophily signals” in node representations: From a process perspective, it promotes node embeddings to evolve gradually from “mixed representations” to “dual-cluster separated representations”, with the decoupling degree improving monotonically with the number of training epochs; From an effect perspective, complete decoupling lays the foundation for subsequent “homophily aggregation” and “heterophily interaction”, enabling the model to more accurately capture the two types of core relationships in the user-service network and ultimately improve recommendation performance. Through the above visualization analysis, the decoupling ability of the DHCM module is intuitively verified, providing visual-level support for its effectiveness in user-service recommendation tasks.
Against the backdrop of challenges in modeling users’ diverse preferences and strongly heterophilous interactions in digital service recommendation, this paper proposes the
Footnotes
Acknowledgements
This work is supported by the Joint Fund Key Program of the National Natural Science Foundation of China(U23B2029) and Projects of International Cooperation and Exchanges NSFC (Grant No. 62061136006).
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
