A comprehensive review on zero-shot-learning techniques

Abstract

Advancements in computational capabilities have enabled the implementation of advanced deep learning models across various domains of knowledge, yet the increasing complexity and scarcity of data in specialized areas pose significant challenges. Zero-shot learning (ZSL), a subset of transfer learning, has emerged as an innovative solution to these challenges, focusing on classifying unseen categories present in the test set but absent during training. Unlike traditional methods, ZSL utilizes semantic descriptions, like attribute lists or natural language phrases, to map intermediate features from the training data to unseen categories effectively, enhancing the model’s applicability across diverse and complex domains. This review provides a concise synthesis of the advancements, methodologies, and applications in the field of zero-shot learning, highlighting the milestones achieved and possible future directions. We aim to offer insights into the contemporary developments in ZSL, serving as a comprehensive reference for researchers exploring the potentials and challenges of implementing ZSL-based methodologies in real-world scenarios.

Keywords

Big data transfer learning deep learning zero-shot learning

1. Introduction

Over the past decade, advances in computational capabilities and the availability of large datasets have paved the way for the application of complex deep learning models across diverse domains of knowledge such as finance, education, and life sciences among others. While the increase in computational power has been significant, the challenges associated with managing increasing data size and complexity have also grown. It should also be noted that in some specialized research areas, the scarcity of data further complicates the development of effective deep learning models.

To overcome the issues mentioned above, transfer learning can be utilized as an effective strategy. This approach involves leveraging pre-trained neural networks that have been designed for a different task, rather than building and training a deep neural network from scratch. Most layers from the existing model can be retained, requiring only the upper layers to be fine-tuned to suit the specific needs of a new application. This not only speeds up the training process but also reduces the amount of data needed for effective training. In this way, transfer learning serves as both a time-efficient and data-efficient solution for the development of deep learning models [1].

Zero-shot learning, a specialized type of transfer learning, that addresses the challenge of identifying categories in the test set that were not present during training. In this context, categories involved in training are referred to as ‘seen’ while those appearing only during testing are called ‘unseen’. Distinct from traditional supervised learning, zero-shot learning does not have the luxury of accessing samples from the “unseen” categories during training. To compensate for this, semantic descriptions are employed for each category, typically provided in the form of attribute lists or natural language phrases. The foundational idea is to extract intermediate features from the training data that can be applied to map test samples to the ‘unseen’ categories. Such intermediate features may encompass elements like color, texture, or specific aspects of objects. Because these features are likely to exist in both seen and unseen categories, they enable the formulation of discriminative descriptions for more complex concepts, thus making transfer learning to unseen classes more approachable [2].

Figure 1.

A structured depiction of the various zero-shot learning frameworks that are subject to analysis in this review. The diagram categorizes the frameworks into modality and attribute-based, learning strategy-based, and advanced techniques, with further subdivisions such as hybrid, generative modeling, and instance-based approaches, among others.

Table 1

Top 20 most cited zero-shot learning papers over the last years

Reference	Year	Citation count
[7]	2019	668
[8]	2023	354
[9]	2022	271
[10]	2022	95
[11]	2021	88
[12]	2023	86
[13]	2021	58
[14]	2020	57
[15]	2023	54
[16]	2022	52
[17]	2020	50
[18]	2019	48
[19]	2022	47
[20]	2021	44
[21]	2019	44
[22]	2022	44
[23]	2022	39
[24]	2023	39
[25]	2022	37
[26]	2021	36

The progression in the realm of zero-shot-based machine and deep learning methodologies has been remarkable, exemplified by an array of review articles and the unveiling of many distinct tools designed to advance numerous domains of knowledge [3, 4, 5, 6, 7] as well as the papers listed in Table 1. Nevertheless, the field remains intricate, attributed to the continuous expansion in the number of knowledge domains adopting machine learning-based tools as well as to the ever-increasing complexity of datasets. Therefore, in light of these developments and challenges, our work aims to provide a comprehensive review of the contemporary developments and innovations in the field of zero-shot learning, spanning broadly over the previous decade. Through this synthesis, we aim to delineate the contemporary research landscape, highlighting both the achieved milestones and forthcoming challenges.

1.1 Public datasets for evaluating zero-shot learning methodologies

Due to their versitility and adaptability, zero-shot learning techniques are applied across various domains, most prominently in computer vision, to address the limitations inherent to more conventional supervised learning methodologies. Although some of the datasets used for zero-shot learning weren’t originally created for it, they’ve been very helpful for research in this field because they cover a wide range of classes and examples. Below, we explore a variety of publicly available datasets that are frequently used for experimentation and validation of zero-shot learning-based techniques.

The CUB-200–2011 (Caltech-UCSD Birds 200) dataset [27], contains images representing 200 bird species, manually annotated with class labels and bounding boxes. This dataset, originated from the web, has been a notable benchmark for ZSL object recognition and has found applications in various domains including visual feature learning, multi-class recognition, object retrieval, attribute learning, and unsupervised domain adaptation.

Furthermore, ImageNet [28] stands out as one of the most prevalent datasets for image classification tasks, offering a diverse range of image data across 1000 classes. It has become an indispensable resource for a variety of ZSL applications such as image annotation, zero-shot object detection, and image retrieval due to its extensive and varied data offering.

In addition, the Animals with Attributes (AwA) dataset [29], is a benchmark compilation focusing on animal attributes, enriched with abstract attributes like stripes or horns. Comprising 33 animal categories, each with 50 images, it serves as a versatile tool for a range of ZSL tasks including attribute-based image classification.

Similarly, the aPascal/aYahoo datasets [30], extensions of the Pascal and Yahoo datasets respectively, were constructed with ZSL in mind. The former is laden with images across 20 object classes annotated with 200 attributes, while the latter focuses on 10 animal classes with annotations of 128 attributes, enriching the spectrum of ZSL research.

Visual Genome [31] is another remarkable dataset, constructed from Visual Genome images, offering class labels and scene graphs, which has been pivotal for ZSL applications like visual relationship detection and semantic segmentation.

Sun Attributes (SUN) [32], constructed from the SUN dataset, with annotations for 717 categories, provides an alternative to ImageNet for object classification and is a reputable benchmark for ZSL tasks like image annotation and zero-shot learning.

Lastly, the NUS-WIDE dataset [33], containing images from Flickr annotated with 81 concepts, has proven to be suitable for ZSL tasks, including multi-label classification and zero-shot retrieval, widening the horizon for research in zero-shot learning methodologies.

2. Learning strategy-based methods

2.1 Metric learning methods

In the dynamic landscape of zero-shot learning, Metric Learning methods have ascended to prominence, serving as a cornerstone in the pursuit of recognizing novel classes that elude conventional training. These methods specialize in the acquisition and adept utilization of distance metrics and similarity measures, fundamentally enabling the comparison of data points spanning both the familiar and the uncharted. Within this context, Xu et al. [24] contribute significantly by delving into the intricate realm of diagnosing compound bearing faults, harnessing the might of metric learning to discern complex fault patterns. Meanwhile, Huang et al. [34] introduce the Hippocampus-heuristic Character Recognition Network (HCRN), an embodiment of metric learning principles that accentuates the importance of learning features in pairs of input samples, a pivotal facet in metric learning’s arsenal.

Expanding the horizons of this field, Kutbi [35] introduces ZDDA, an algorithm that transcends traditional boundaries, excelling not only in metric learning but also radiating applicability across various domains. McCartney et al. [36] push the envelope further, demonstrating the adaptability of metric learning by seamlessly integrating it into the realm of EEG-based image retrieval, thus underscoring its versatility across divergent domains. Additionally, Xu [37] offers a bespoke encoding method, meticulously designed for decoding individual faults nestled within the complexity of compound fault signals, reiterating the pivotal role metric learning plays in signal processing.

Further advancing the frontier of signal processing, Dong et al. [38] present the ‘signal recognition and reconstruction convolutional neural networks (SR2CNN)’ framework, where metric learning combines harmoniously with loss functions to yield a powerful tool for signal recognition. Fu et al. [39] champion the cause of metric learning, contributing a robust maximum margin framework that breathes life into semantic manifold-based recognition. Meanwhile, Deznabi et al. [17] step into the realm of biology, unveiling DeepKinZero, an innovative approach designed explicitly for predicting kinases responsible for phosphorylating specific protein sites.

Shifting our focus to the visual domain, Ji et al. [40] meticulously craft a deep metric learning framework tailored for image zero-shot learning. Furthermore, Huang et al. [41] introduce CPDN, a deep metric learning model purpose-built for the challenges of Generalized Zero-Shot Learning (GZSL), showcasing the adaptability of metric learning in accommodating diverse zero-shot learning scenarios. In tandem, Ji et al. [42] and Fu et al. [43] enrich the landscape with novel manifold distance metrics, illuminating visual recognition tasks with their innovative contributions. Finally, Guo et al. [44] round off this illustrious list with a one-step recognition framework, primed and ready to tackle the uncharted territories of novel classes, underscoring the indispensable role of metric learning in seamlessly handling data from previously unencountered categories.

Collectively, these algorithms not only underscore the versatility and potential of Metric Learning but also encapsulate the essence of progress within the realm of zero-shot learning research.

2.2 Classifier-based methods

Classifier-based methods shine as powerful tools for recognizing previously unseen classes and advancing the boundaries of traditional recognition approaches. These innovative techniques leverage a variety of strategies, including class-level semantics, domain-specific adaptations, and advanced classification techniques.

Lv et al. [45] introduce TSVR, which enhances classifier capabilities through semantic-visual fusion pairs and domain-specific batch normalization, illustrating the potential of fine-tuned classifiers. Yu et al. [46] present KDCGN, a framework that directly generates classifiers conditioned on class-level semantics, streamlining the training process for unseen classes.

Venturing into uncharted domains, Freitas et al. [47] boldly apply zero-shot learning to the classification of marine materials, demonstrating its adaptability across diverse domains. Cheraghian et al. [48] pioneer a 2D zero-shot learning approach meticulously tailored for 3D point cloud classification. In doing so, they effectively address challenges such as domain adaptation, hubness, and data bias, further expanding the horizons of zero-shot learning’s applicability.

Ji et al. [49] contribute a novel method for zero-shot image classification, addressing class-imbalance issues and highlighting the adaptability of classifier-based strategies. Kim et al. [50] propose a deep attribute based on CNN features, enriching the field with discriminative and classifying properties.

Additionally, Liu et al. [51] present a co-training framework for zero-shot learning, fostering iterative knowledge transfer and strengthening classifier capabilities. Li et al. [52] explore the realm of superclasses in both feature and semantic spaces, facilitating knowledge transfer and enhancing recognition of samples from unseen classes. Hou et al. [53] introduce a discriminative comparison classifier tailored for generalized zero-shot learning tasks. Duan et al. [54] focus on Brain-Computer Interfaces (BCI) based on motor imagery for EEG signal recognition, leveraging zero-shot techniques to augment recognition capabilities. Li et al. [55] contribute an innovative technique for zero-shot generalized classification, further pushing the boundaries of classifier-based methods.

Zhang et al. [56] introduce a two-branch network designed to regress to class-level semantic embeddings, enhancing the interpretability and capabilities of classifiers within the zero-shot learning paradigm. Del et al. [57] embark on a journey that explores artwork instance recognition through two distinct approaches, encompassing conventional recognition and the intricate realm of zero-shot learning for unseen artwork instances, highlighting the adaptability of classifiers. Gui et al. [58] propose a pioneering GZSL-based learning approach for PolSAR data classification, predicting labels for both known and previously unseen classes, showcasing the potential of classifier-based strategies in handling complex real-world data. Cheng et al. [59] introduce an ingenious Random Forest-based zero-shot image classification approach, leveraging relative attributes (RAs) to enhance image recognition capabilities, demonstrating the versatility of classifiers in diverse image analysis tasks. In parallel, Qin et al. [60] harness a class-wise discrete descent algorithm and a multi-output neural network to predict multiple attributes from low-level features, transcending traditional image classification methods and highlighting classifiers’ capacity to extract rich attribute information. Complementing these innovations, Liu et al. [61] present a framework that simultaneously learns attribute-attribute connections and attribute classifiers, offering a holistic perspective on attribute-based classification and the interplay of features.

These diverse contributions collectively underscore the significance of classifier-based methods in addressing intricate recognition tasks across a spectrum of domains, from art recognition to remote sensing, image classification, and beyond.

2.3 Instance-based methods

Within the realm of zero-shot learning, instance-based methods have garnered attention for their unique approaches and innovative strategies. Yang et al. [62] introduce an Iterative Class Prototype Calibration technique, emphasizing the iterative refinement of class prototypes as a key strategy. Pham et al. [63] take a distinctive approach with PencilNet1, a method that detects racing gates by uniting predictions into a single pose tuple, demonstrating an instance-based perspective in its operation. Zarei et al. [64] tackle domain shift and the hubness problem in zero-shot learning through the use of a learned kernel distance function and a theoretical-based prototype learning strategy, enhancing the adaptability of zero-shot learning models. Li et al. [65] propose the Multiple Semantic Subspaces Network, which leverages the concept of semantic subspaces for improved zero-shot learning performance. Meanwhile, Xie et al. [66] introduce an innovative improvement to zero-shot learning by utilizing unseen images to train a model more effectively, with the aid of a novel training dataset called Virtual Mainstay samples.

In a semi-supervised fashion, Xu et al. [67] present the Low-Rank Semantic Grouping (LSG) model, which seeks to enhance the performance of zero-shot learning. Xie [68] contributes a Feature Enhancement Framework tailored for zero-shot learning tasks, enriching feature representations. Song et al. [69] explore the use of physics-based electromagnetic simulated images for learning the features of unseen targets within a zero-shot learning context, demonstrating the versatility of instance-based methods. Liu et al. [70] propose a Convolutional Prototype Learning framework that accounts for distribution conformity, enhancing the discriminative power of prototypes. Lv et al. [71] address bias reduction towards seen classes in zero-shot semantic segmentation, presenting a novel approach to promote fairness in recognition. Rahman et al. [72] introduce a Deep Multiple Instance Learning framework, shedding light on the potential of multiple instance learning techniques. Finally, Guo et al. [44] present a framework that incorporates transferred samples from source classes with pseudo labels and modifies the standard support vector machine formulation, offering a unique perspective on zero-shot learning.

Table 2
Learning strategy-based methods

Name	Method	Year	Ref
Xu et al.	Metric learning	2023	[24]
HCRN	Metric learning	2022	[34]
ZDDA	Metric learning	2021	[35]
McCartney et al.	Metric learning	2022	[36]
Xu et al.	Metric learning	2022	[37]
SR2CNN	Metric learning	2021	[38]
Fu et al.	Metric learning	2019	[39]
DeepKinZero	Metric learning	2020	[17]
Dual-Triplet Network	Metric learning	2020	[40]
GZSL	Metric learning	2020	[41]
Ji et al.	Metric learning	2018	[42]
Fu et al.	Metric learning	2017	[43]
Guo et al.	Metric learning	2017	[44]
TSVR	Classifier-based	2023	[45]
KDCGN	Classifier-based	2021	[46]
Freitas et al.	Classifier-based	2022	[47]
Cheraghian et al.	Classifier-based	2022	[48]
Ji et al.	Classifier-based	2021	[49]
Kim et al.	Classifier-based	2022	[50]
Liu et al.	Classifier-based	2021	[51]
Li et al.	Classifier-based	2020	[52]
Hou et al.	Classifier-based	2020	[53]
Duan et al.	Classifier-based	2020	[54]
Li et al.	Classifier-based	2020	[55]
Zhang et al.	Classifier-based	2020	[56]
Del Chiaro et al.	Classifier-based	2019	[57]
Gui et al.	Classifier-based	2018	[58]
Cheng et al.	Classifier-based	2022	[59]
Qin et al.	Classifier-based	2016	[60]
Liu et al.	Classifier-based	2014	[61]
Yang et al.	Instance-based	2022	[62]
PencilNet1	Instance-based	2022	[63]
Zarei et al.	Instance-based	2021	[64]
Li et al.	Instance-based	2021	[65]
VMAN	Instance-based	2021	[66]
Xu et al.	Instance-based	2021	[67]
Xie at al.	Instance-based	2020	[68]
Song et al.	Instance-based	2020	[69]
Liu et al.	Instance-based	2020	[70]
Lv et al.	Instance-based	2020	[71]
Deep0Tag	Instance-based	2020	[72]
Guo et al.	Instance-based	2017	[44]

The diverse learning strategy-based methods mentioned above and listed in Table 2 below, collectively contribute to the evolving landscape of zero-shot learning, showcasing innovative approaches and strategies tailored for various recognition tasks and challenges.

3. Generative and hybrid methods

3.1 Generative modelling

Generative modeling stands as a formidable pillar within the realm of zero-shot learning, harnessing the power of probabilistic modeling to overcome the challenges of recognizing unseen classes. This category hosts a diverse ensemble of innovative approaches, each meticulously crafted to narrow the divide between known and unknown categories. Liu et al. [73]introduce the discriminative cross-aligned variational autoencoder (DCA-VAE), a model dedicated to learning the intricate joint distribution of classes and attributes, paving the way for deeper understanding and more accurate predictions.

Meanwhile, Cheng et al. [74]unveil a hybrid routing transformer tailored explicitly for zero-shot learning tasks, drawing from the transformative capabilities of the transformer architecture to enhance recognition performance. Addressing bias concerns in feature generation for unseen classes, Yang et al. [75]present the ABA-GAN, a generative adversarial network that takes a proactive stance on fairness. In parallel, Ye et al. [76]introduce LCR-GAN, a GAN-based method that aligns the distributions of visual features and semantic attributes, thereby enriching zero-shot learning’s potential. Gao et al. [77]propose a bidirectional generative network fortified with cycle consistency, effectively bridging the chasm between visual and semantic domains. Li et al. [78]bring forth the AMAZ attribute-modulated generative meta-model, offering novel avenues for leveraging generative capabilities in zero-shot learning endeavors. In the pursuit of learning enhancement, Wei et al. [79]leverage generative replay techniques to augment the learning process. Tang et al. [80]contribute to this landscape by introducing a dedicated GAN structure tailored for zero-shot learning, enhancing the synthesis of visual features for unseen classes. In a parallel development, Liu et al. [81]delve into the domain of WGAN-based sample synthesis, harnessing the power of Generative Adversarial Networks (GANs) to create samples that bridge the gap between known and unknown categories. Mahapatra et al. [82]take a self-supervised learning approach and employ GradCAM saliency maps to synthesize features for unseen classes, showcasing the versatility of generative models.

Exploring the bidirectional connection between visual and semantic spaces, Li et al. [83]introduce Boomerang-GAN, a model that outperforms previous approaches in recognition and segmentation tasks. Guo et al. [84]employ meta-learning to generate fake visual features, effectively addressing domain bias issues with their CMPN model, while Xie et al. [85]present MGA-GAN, a Generative Adversarial Network tailored for generalized zero-shot learning. Gull et al. [25]advance the field with iVAE, a model based on the Variational Autoencoder (VAE) that excels in zero-shot learning tasks. Ma et al. [86]introduce GAN-MVAE, a fusion of a generative adversarial network and a multi-modal variational autoencoder, paving the way for generalized zero-shot learning. Shinzaki et al. [87]explore robust adversarial reinforcement learning techniques to tackle zero-shot adaptation in beam-tracking, demonstrating the adaptability of generative models across diverse applications within zero-shot learning.

The frontier of zero-shot learning is marked by the relentless innovation of generative models, which play a pivotal role in expanding its boundaries. Liu et al. [88]employ a cascade Generative Adversarial Networks (GANs) strategy to forge a path towards feature generation, enriching the model’s capacity for zero-shot tasks. Chen et al. [89]introduce a novel flow-based generative framework tailored for Generative Zero-Shot Learning (GZSL), setting the stage for enhanced feature synthesis. Shermin et al. [90]navigate the GZSL landscape with the bidirectional mapping coupled generative adversarial network (BMCoGAN), leveraging bidirectional mappings to advance feature synthesis capabilities. Deng et al. [91]usher in the Quality-Verifying Adversarial Network (QVAN), augmented with an l12 constraint, elevating feature synthesis quality. Ye et al. [92]tackle the persistent domain shift challenge in GZSL tasks through their Discriminative Learning GAN, effectively aligning distributions to enhance feature generation. Li et al. [93]pioneer the Augmented Semantic Feature Based Generative Network (ASFGN), dedicated to the synthesis of visual features for unseen classes. Luo et al. [94]present a groundbreaking Dual VAEGAN framework, unifying Variational Autoencoders (VAEs) and GANs, producing clear visual features for zero-shot learning.

Xie et al. [95]unveil a Generative Network-Based approach that leverages semantic features as input to synthesize visual features as output, bridging the gap between domains. Guo et al. [96]introduce a zero-shot augmentation learning model (ZSAL) that collaborates with medical professionals to generate virtual images for the computer-aided diagnosis of rare diseases. Feng et al. [97]pioneer a Dual-knowledge-source-based generative model, while Liu et al. [98]introduce the Cross-class generative network. Li et al. [99]contribute to the landscape with a GAN-based ZSL approach, and Song et al. [100]unveil the Domain-aware Stacked AutoEncoder (DaSAE), a model built on two interactive stacked auto-encoders for domain-aware projections.

Continuing the narrative of generative models in zero-shot learning, Ponti et al. [101]present a Bayesian generative model tailored for neural parameters within unseen task-language combinations. This innovation opens doors to more intricate and nuanced learning scenarios. Geng et al. [102]introduce a knowledge graph-based framework for Zero-Shot Learning (ZSL), featuring an attentive Zero-Shot Learner (AGCN) and an explanation generator. This model taps into the rich resource of knowledge graphs to enhance the learning process. Wang et al. [103]address the challenging zero-shot domain adaptation problem by developing a Conditional Coupled Generative Adversarial Network (CoCoGAN), leveraging generative capabilities to adapt to new domains seamlessly. Kim et al. [104]present the Zero-Shot Generative Adversarial Network (ZSGAN), a model designed to tackle the challenges posed by data imbalance, particularly pertinent in zero-shot scenarios. Ma et al. [105]propose the similarity-preserving GAN (SPGAN) to generate visual features for unseen classes while preserving the similarity relationships within the data. Liu et al. [106]advance the field with a dual-stream GAN, designed to excel in zero-shot visual classification tasks. Chi et al. [107]introduce the Dual Adversarial Distribution Network (DADN), specially crafted for zero-shot cross-media retrieval, showcasing the versatility of generative models in diverse applications. Gao et al. [108]contribute a zero-shot learning method based on contractive stacked autoencoders, providing a unique approach to feature generation. Shao et al. [109]propose a multi-channel Gaussian Mixture VAE model that excels in generalized zero-shot learning tasks, leveraging the power of Gaussian Mixture models. Gao [110]introduces Zero-VAE-GAN, while Ding et al. [111]develop a two-stage generative adversarial network tailored specifically for zero-shot learning. These models collectively push the boundaries of what is possible in zero-shot learning through generative prowess.

3.2 Hybrid methods

A cluster of hybrid-based methods has emerged, each fusing different techniques and paradigms to address the complex challenges presented by unseen classes and data scarcity. Ji et al. (2021) introduce the UPL method, which leverages the power of two constraints – an autoencoder and a triplet loss – within the episodic training paradigm, showcasing its adaptability in both traditional ZSL and generalized GZSL settings [112]. In parallel, Zhang et al. (2022) propose a pioneering SMDM-based approach, bridging the gap between familiar and unfamiliar concepts by inferring unseen relations from seen relations using semantic metrics generated by BERT [113]. Yao et al. (2023) introduce GhostShuffleNet (GSNet), a specialized framework tailored for the recognition of Unmanned Aerial Vehicle (UAV) images. GSNet stands out by amalgamating the Zero-Shot Neural Architecture Search (NAS) algorithm with other pertinent features, thereby showcasing the significance of domain-specific optimizations in Zero-Shot Learning (ZSL) systems operating in niche domains such as UAV imagery [114].

Li et al. (2023) present BGSNet, a two-branch, end-to-end network that offers a unique perspective on ZSL. BGSNet excels by harmonizing generalization and specialization capabilities, operating at both the instance and dataset levels. This approach underscores the importance of balance and synergy between these two critical facets of ZSL to enhance recognition accuracy across diverse datasets and instances [115]. Hu et al. (2022) present a hybrid approach that harnesses both a feature-attribute embedding model and a generative feature model to bridge the gap between visual and semantic domains [116]. In parallel, Ao et al. (2022) introduce a cross-modal prototype learning method, Ao et al. (2022), which integrates knowledge from both textual and visual modalities to enhance zero-shot learning performance [117]. Dong et al. (2022) propose a G-ZSL method that utilizes two statistical techniques to establish boundaries between domains, facilitating knowledge transfer between seen and unseen classes [22]. Li et al. (2022) contribute the ERPCNet, an effective, efficient, and explainable model that demonstrates its ability to transfer knowledge from observed to unseen classes in both ZSL and GZSL settings [118]. Liu et al. (2022) introduce a semantics-guided spatial attention mechanism and learn discriminative prototypes for each class [119]. Yun et al. (2022) present SALN, which employs an $\lambda$ 1,2-norm constraint to generate semantic representations and project them into the visual space [120]. Xu et al. (2022) propose the Holistically Associated Model, which comprehensively considers both visual and semantic information for zero-shot learning [121].

Bian et al. (2022) leverage cross-modality information and relation prototypes, deploying them effectively for classifying previously unseen medical images [122]. Meanwhile, Li et al. (2022) enhance classification results through a fusion of a semantic embedding network and an auxiliary classifier [123]. Song et al. (2022) introduce the Semantic-Visual Combination Propagation Network (CPN), which seamlessly combines semantic and visual representations while incorporating an auto-encoder to bridge the gap between these domains [124]. Zhang et al. (2022) contribute Cluster-Prototype Matching (CPM), harnessing sample distribution information and the Kuhn-Munkres algorithm to match clusters with class prototypes, thereby improving zero-shot classification [125]. Lu et al. (2022) propose a GZSL meta-learning approach that leverages class-level semantic knowledge and employs an entropy gate approach to tackle complex recognition tasks [126].

Shermin et al. (2022) introduce an integrated network that employs two sub-networks for the EL and FS categories of methods. It utilizes mutual learning and mutual information, exemplifying the integration of diverse techniques in hybrid zero-shot learning [127]. Liu et al. (2022) present AREES, a comprehensive approach that combines an attention mechanism, a decomposition structure, and a multimodal VAE, demonstrating its hybrid nature [128]. Li et al. (2022) contribute TUPL, designed specifically for the GZSDA challenge, showcasing its adaptability to complex zero-shot scenarios [129]. Chen et al. (2022) introduce GNDAN, which incorporates RAN and RGAT to generate both global and local embeddings, effectively addressing challenges in zero-shot learning [130]. Kwon et al. (2022) employ a two-stream autoencoder-based gating model, a hybrid approach focusing on feature generation and efficiency [131]. In parallel, Xu et al. (2022) combine generative mixup networks with semantic graph alignment and a triplet gradient matching loss, exemplifying the fusion of generative and discriminative methods for improved performance [132]. Jia et al. (2022) explore active learning for a visual explainable approach, adding another dimension to the hybrid landscape [133]. Lastly, Ye et al. (2022) employ triplet loss for ZSL image classification, effectively harnessing the power of generative adversarial networks in their approach [134].

Yao et al. (2022) propose an attribute-induced bias-eliminating (AIBE) module and an attention graph attribute embedding process, showcasing their commitment to eliminating biases and improving attribute-based recognition [135]. Li et al. (2021) present Locality-Preservation Deep Cross-Modal Embedding Networks (LPDCMENs), an end-to-end method tailored for zero-shot remote sensing scene classification [136]. Liu et al. (2021) introduce an adversarial strategy involving a projector and classifier, revolutionizing unseen object recognition [137]. Zhang et al. (2021) contribute an encoder-decoder framework with an attention mechanism, adding another layer of sophistication to the zero-shot learning landscape [138]. Nihal et al. (2021) leverage the Linear Discriminant Analysis (LDA) classifier and DenseNet101 for Bangla sign language recognition, showcasing the versatility of zero-shot learning across domains [139].

Qian et al. (2021) propose a Cross-Domain Lifelong Reinforcement Algorithm with Zero-Shot Policy Generation (CDLRL-ZPG), highlighting the potential of reinforcement learning in zero-shot settings [140]. Xie et al. (2021) present a GAN-CST-based approach incorporating Class Knowledge Overlay (CKO), semi-supervised learning, and a triplet loss, demonstrating the power of combining multiple techniques [20]. Xu et al. (2021) introduce complementary attributes and rank aggregation as a supplement to existing methods, exemplifying a collaborative approach to zero-shot learning [141]. Min et al. (2021) contribute the Domain-Oriented Semantic Embedding (DOSE) network, a domain-specific approach that incorporates specialized sub-projections and a cycle consistency approach [142]. Ding et al. (2021) utilize a latent space and two domain classifiers for both ZSL and supervised classification tasks, showcasing the potential for hybrid methods [143].

In a distinctive approach, Wen et al. (2020) introduce a ZSL-based method rooted in Traditional Chinese Medicine concepts, bridging the gap between ancient wisdom and modern AI [14]. Zhang et al. (2020) present a deep learning architecture that addresses domain shift problems in GZSL through a KL Divergence constraint, exemplifying the use of constraints in zero-shot learning [144]. Wang et al. (2020) propose Deep Attribute Prediction (DeepAP), a model that leverages a class-attribute matrix to explore attribute-class correlations and incorporates weighted attributes for zero-shot image classification [145]. Zhang et al. (2020) put forward a hierarchical prototype learning approach (HPL) for zero-shot recognition, leveraging class prototypes and semantic spaces to differentiate between seen and unseen classes [146]. Li et al. (2020) contribute a zero-shot learning procedure that maintains semantic consistency between visual and semantic spaces while learning class prototypes, demonstrating the significance of semantic alignment [147].

Zhang et al. (2019) introduce a probabilistic model with triplet learning and Non-Negative Matrix Factorization (NMF), illustrating the integration of probabilistic methods and traditional machine learning techniques [148]. Ji et al. (2020) innovate with an adversarial feature fusion network that fuses different class semantic prototypes to generate pseudo visual features, highlighting the power of feature fusion [149].

Liu et al. (2020) contribute to Explainable Zero-Shot Learning (XZSL) with a novel vision-attribute embedding module and a multi-channel explanation model, shedding light on the interpretability of ZSL systems [150]. Changpinyo et al. (2019) propose two innovative frameworks for ZSL using manifold embeddings and synthesized “exemplars,” expanding the repertoire of techniques available for handling the complexities of unseen class recognition [151]. Jia et al. (2020) introduce the DUET model, comprising a Deep Embedding Transfer (DET) module and an Unseen Visual Feature Generation (UVG) module, pushing the boundaries of feature transfer and visual feature synthesis in ZSL [152].

Liu et al. (2020) present the Label-Activating Framework (LAF) through Indirect Attribute Prediction (IAP) for Generalized Zero-Shot Learning (GZSL), emphasizing the role of attribute predictions in improving recognition across seen and unseen classes [21]. Ding et al. (2019) propose the Cross-Domain Mapping (CDM) model, addressing the domain shift problem in ZSL by mapping visual features to a common domain, showcasing the importance of domain adaptation [153]. Jiang et al. (2019) put forward a novel ZSL method leveraging class similarities to adjust the visual-semantic embedding for unseen classes, highlighting the value of semantic alignment [154].

Ji et al. (2019) introduce a synthesized approach based on dictionary learning, merging traditional machine learning techniques with ZSL concepts [155]. Zhang et al. (2019) present a hybrid approach involving random attribute selection and conditional GAN, demonstrating the potential for combining various strategies to enhance ZSL [156]. Zhang et al. (2019) further contribute with a dual-verification network for zero-shot classification, highlighting the importance of verifying both feature and attribute spaces for accurate recognition [157].

Yu et al. (2018) propose ASTE and SPASS techniques to improve the accuracy of unseen class recognition, along with a fast training (FT) strategy to enhance classification efficiency, reflecting their dedication to practical advancements in ZSL [158]. Liu et al. (2018) introduce CORL, a fusion of ontology and reinforcement learning, to construct classification rules based on attribute annotations, underscoring the fusion of knowledge-driven and data-driven approaches [159].

Sumbul et al. (2018) use image features acquired through a CNN and additional information from manually selected attributes, a natural language model, and a scientific taxonomy for the identification of street trees in aerial data, showcasing the versatility of ZSL in diverse domains [160]. Song et al. (2017) propose a deep neural network architecture consisting of a generator and an interpreter to tackle the issue of limited training samples for Automatic Target Recognition (ATR) of Synthetic Aperture Radar (SAR) [161]. Yu et al. (2017) introduce the Regularized Cross-Modality Ranking (ReCMR) approach, emphasizing the exploration of relationships between different modalities through hinge ranking loss and regularizers [162].

Table 3
Generative modelling and hybrid zsl methods

Name	Method	Year	Ref
DCA-VAE	Generative modeling	2023	[73]
Cheng et al.	Generative modeling	2023	[74]
ABA-GAN	Generative modeling	2023	[75]
LCR-GAN	Generative modeling	2023	[76]
Gao et al.	Generative modeling	2023	[77]
AMAZ	Generative modeling	2023	[78]
Incremental ZSL	Generative modeling	2022	[79]
Tang et al.	Generative modeling	2022	[80]
Liu et al.	Generative modeling	2022	[81]
Mahapatra et al.	Generative modeling	2022	[82]
Boomerang-GAN	Generative modeling	2022	[83]
CMPN	Generative modeling	2022	[84]
MGA-GAN	Generative modeling	2022	[85]
iVAE	Generative Modeling	2022	[25]
GAN-MVAE	Generative modeling	2022	[86]
Shinzaki et al.	Generative modeling	2022	[87]
cascade GAN	Generative modeling	2022	[88]
GSMFlow	Generative modeling	2022	[89]
BMCoGAN	Generative modeling	2022	[90]
QVAN	Generative modeling	2022	[91]
Discriminative Learning GAN	Generative modeling	2022	[92]
ASFGN	Generative modeling	2021	[93]
Dual VAEGAN	Generative modeling	2021	[94]
Xie et al.	Generative modeling	2021	[95]
ZSAL	Generative modeling	2021	[96]
Feng, L., Zhao, C.	Generative modeling	2021	[97]
Liu et al.	Generative modeling	2021	[98]
Li et al.	Generative modeling	2020	[99]
DaSAE	Generative modeling	2021	[100]
Ponti et al.	Generative modeling	2021	[101]
Geng et al.	Generative modeling	2021	[102]
CoCoGAN	Generative modeling	2022	[103]
ZSGAN	Generative modeling	2020	[104]
SPGAN	Generative modeling	2020	[105]
Dual-Stream GAN	Generative modeling	2020	[106]
DADN	Generative modeling	2020	[107]
Gao et al.	Generative modeling	2019	[108]
Shao et al.	Generative modeling	2020	[109]
Zero-VAE-GAN	Generative modeling	2020	[110]
Ding et al.	Generative modeling	2019	[111]
UPL	Hybrid	2021	[112]
SMDM	Hybrid	2022	[113]
GhostShuffleNet	Hybrid	2023	[114]
BGSNet	Hybrid	2023	[115]
Hu et al.	Hybrid	2022	[116]
CMPL	Hybrid	2022	[117]
G-ZSL	Hybrid	2022	[22]
ERPCNet	Hybrid	2022	[118]
MFHI	Hybrid	2022	[119]
SALN	Hybrid	2022	[120]
Holistically Associated Model	Hybrid	2022	[121]
Bian et al.	Hybrid	2022	[122]
Li et al.	Hybrid	2022	[123]
Semantic-Visual CPN	Hybrid	2022	[124]
CPM	Hybrid	2022	[125]
Lu et al.	Hybrid	2022	[126]

Table 3, continued
Name	Method	Year	Ref
Shermin et al.	Hybrid	2022	[127]
AREES	Hybrid	2022	[128]
TUPL	Hybrid	2022	[129]
GNDAN	Hybrid	2022	[130]
Kwon G., Al Regib G.	Hybrid	2022	[131]
Xu et al.	Hybrid	2022	[132]
Jia el atl.	Hybrid	2022	[133]
DCR-GAN	Hybrid	2022	[134]
AIBE	Hybrid	2022	[135]
LPDCMENs	Hybrid	2021	[136]
Adversarial Strategy	Hybrid	2021	[137]
Zhang et al.	Hybrid	2021	[138]
Nihal et al.	Hybrid	2021	[139]
CDLRL-ZPG	Hybrid	2021	[140]
GAN-CST	Hybrid	2021	[20]
Complementary Attributes	Hybrid	2021	[141]
DOSE	Hybrid	2021	[142]
SE-OOD	Hybrid	2021	[143]
Wen et al.	Hybrid	2020	[14]
Zhang et al.	Hybrid	2020	[144]
DeepAP	Hybrid	2020	[145]
HPL	Hybrid	2020	[146]
Li et al.	Hybrid	2020	[147]
Zhang et al.	Hybrid	2019	[148]
Ji et al.	Hybrid	2020	[149]
DME	Hybrid	2020	[150]
Changpinyo et al.	Hybrid	2019	[151]
DUET	Hybrid	2020	[152]
LAF	Hybrid	2020	[21]
CDM	Hybrid	2019	[153]
Jiang et al.	Hybrid	2019	[154]
Ji et al.	Hybrid	2019	[155]
Zhang et al.	Hybrid	2019	[156]
Dual-Verification Network	Hybrid	2019	[157]
ASTE & SPASS	Hybrid	2018	[158]
CORL	Hybrid	2018	[159]
Sumbul et al.	Hybrid	2018	[160]
Song Q., Xu F.	Hybrid	2017	[161]
ReCMR	Hybrid	2017	[162]
MCME-DA	Hybrid	2017	[163]
Fu et al.	Hybrid	2015	[164]

Ji et al. (2017) focus on manifold constraints and domain adaptation for knowledge transfer with MCME-DA, exemplifying the significance of domain alignment and constraints [163]. Fu et al. (2015) propose an approach leveraging transductive multi-view embedding and heterogeneous multi-view hypergraph label propagation for effective zero-shot recognition, illustrating the potential of multi-view and transductive learning techniques [164]. These innovative methods which are also listed in Table 3, continue to shape and advance the field of ZSL, offering a rich tapestry of approaches to tackle the complexities of recognizing unseen classes across diverse domains and applications.

4. Modality and attribute-based methods

4.1 Multi-modal ZSL

In the realm of Multi-modal Zero-Shot Learning (MZSL), researchers have introduced innovative approaches to tackle the challenges associated with leveraging multiple data modalities for enhanced recognition performance. Cao et al. (2022) propose a Multi-modal feature fusion model designed to excel in supervised learning tasks. This model showcases the significance of fusing information from diverse modalities to bolster recognition accuracy across various domains [165].

Chen (2022) addresses a crucial issue encountered in Generalized Zero-Shot Learning (GZSL) by introducing MM-APANN. This model focuses on mitigating incongruence between visual features and semantic attributes, contributing to more effective recognition in multi-modal settings [166].

Additionally, exploring the broader implications of Multi-modal ZSL, researchers have delved into domains such as robotics. A notable example is the work by Lázaro-Gredilla et al. (2019), which discusses a comprehensive framework for aiding robots in interpreting high-level concepts. This framework incorporates principles from mental imagery and other pertinent sources, signifying the importance of multi-modal approaches in enabling robots to comprehend and interact with their surroundings [167].

4.1.1 Multilabel MZSL

Graph Convolution Networks have proven to be a valuable tool in addressing the complexities of Multi-label Zero-Shot Learning (MZSL). Ou et al. (2020) present a Graph Convolution Network-based MZSL model, highlighting the potential of graph-based techniques to facilitate the recognition of multiple labels associated with a single instance. This approach demonstrates the importance of leveraging graph structures to enhance the performance of MZSL systems, particularly when dealing with multi-label scenarios [168].

4.2 Graph-based methods

Graph-based approaches have become instrumental in advancing Attribute-Based Zero-Shot Learning (ZSL), unraveling the latent potential of attribute relationships. Within this dynamic field, these methods have redefined how attributes are harnessed to create more effective ZSL models. In the evolving landscape of Attribute-Based Zero-Shot Learning (ZSL), graph-based methodologies have emerged as pioneers, harnessing the power of attribute relationships to redefine the efficacy of ZSL models. One such groundbreaking innovation is MR-Selection, introduced by Feng et al. (2023), which offers a novel zero-shot band selection approach for hyperspectral image (HSI) classification. This method leverages dynamic structure-aware graph convolutional networks to yield remarkable results [169].

Roy et al. (2022) have also made a substantial contribution by devising a graph convolution network-based autoencoder that generates commonsense embeddings. This innovative approach enhances the interpretability of visual data and bolsters ZSL models’ performance [19].

The realm of graph-based techniques continues to evolve with Xu et al.’s (2022) introduction of Poincaré graph convolutional networks, pushing the boundaries of graph-based methods and their application in ZSL [170]. Furthermore, Wang et al. (2022) have unveiled LND-GMF, a methodology featuring a neighborhood-based gating system. This approach represents a significant stride in improving attribute-based ZSL models [171].

Mancini et al. (2022) have enriched this landscape by presenting Compositional Cosine Graph Embedding, a potent technique for effectively capturing attribute relationships [172]. Additionally, Gao et al. (2021) have introduced the Prototype-Sample Graph Neural Network (PS-GNN), specifically designed for video ZSL, underscoring the versatility of graph-based ZSL methods [173].

Moreover, Wang et al. (2021) have taken a pioneering step by presenting GAZL, a novel active learning approach tailored for designer convolutional graph networks (GCNs). This method finds application in zero-shot image classification, adding a valuable facet to the evolving field of graph-based ZSL [174]. These innovative graph-based approaches collectively illuminate the path towards more robust and efficient ZSL models by exploring and harnessing the intricate web of attribute relationships.

4.3 Embedding-based methods

Within the domain of Embedding-Based Zero-Shot Learning (ZSL), innovative approaches are reshaping the landscape of attribute-driven models, offering novel perspectives on feature representations and semantic embeddings. Rao et al. (2023) introduce the Dual Projective model, strategically designed to navigate the challenges of generalized ZSL by adeptly combining both feature and semantic spaces, thereby enhancing model performance in diverse scenarios [8]. Meanwhile, Han et al. (2022) delve into the realm of semantic contrastive embedding, unveiling a technique that leverages the power of semantic information to refine feature representations, ultimately contributing to the improved effectiveness of ZSL models [175].

The intriguing convergence of graph-based methodologies with embedding-based techniques is exemplified by Hu et al.’s (2022) work on RIAEm, a method that incorporates a region graph network and attribute feature embedding. This hybrid approach offers a promising avenue for exploring the synergy between graph structures and embeddings in ZSL [176]. Barros et al. (2022) break new ground with Malware-SMELL, a ZSL model rooted in an S-Space representation, catering to the unique challenges of malware classification through embedding-based techniques [177].

In the field of bioinformatics, Kulmanov et al. (2022) present DeepGOZero, a pioneering application of ontology embeddings for more accurate predictions of proteins’ functions, showcasing the wide-ranging impact of embedding-based methods [178]. Moreover, Liu et al. (2022) introduce LSA and SRSA, innovative approaches aimed at addressing the projection domain shift problem, enhancing the adaptability of zero-shot learning models through embeddings [179]. Yang et al. (2021) introduce an embedding-based ZSL model with a self-focus mechanism, shedding light on the significance of attention mechanisms in embedding-based approaches [180].

An entirely different perspective of embedding-based ZSL is unveiled by Buckchash et al. (2021) with their Zero-Shot Visual Anomaly Recognition (VAR) approach. It operates on raw image frames drawn over a Grassmann product space, offering a unique perspective on anomaly detection and classification [26]. Furthermore, Lin et al. (2021) propose Class Label Autoencoder with Structure Refinement (CLASR), a novel ZSL approach that adapts to multi-semantic embedding spaces, highlighting the importance of structured embeddings in ZSL [181]. Lastly, Wang et al. (2021) pioneer a group-based attribute/object collaborative learning model that employs a structured sparse method to constrain model parameters, thereby enhancing the efficacy of zero-shot learning in complex scenarios [182]. Guo et al. (2021) present AMS-SFE, a method that ingeniously employs a shared semantic feature space and an autoencoder-based expansion of semantic features to bridge the gap between seen and unseen classes, demonstrating the power of collaborative embeddings in ZSL [183].

In a different vein, Yu et al. (2019) introduce the latent space encoding (LSE), an encoder-decoder approach that unfolds new dimensions in zero-shot learning. LSE utilizes latent spaces to create discriminative embeddings, redefining the way feature representations are employed in ZSL [184]. Long et al. (2018) tackle the challenge of data synthesis for unseen classes with their UVDS approach, providing a robust framework for generating previously unseen data instances, thereby extending ZSL to uncharted territories [185]. Jiang et al. (2019) propose a novel perspective on zero-shot learning by leveraging class similarities to adjust visual-semantic embeddings. This approach offers a flexible means of tailoring embeddings to the intricacies of different ZSL scenarios [186].

Jin et al. (2019) delve into the intricacies of embedding-based ZSL, incorporating center loss and a varying learning rate to enhance feature discrimination and the overall learning process, underscoring the importance of optimization strategies in ZSL [18]. Shen et al. (2019) embark on a journey into binary embedding-based zero-shot learning, exploring the potential of binary representations to capture complex semantic relationships, providing new insights into encoding semantics in ZSL [187]. Meng et al. (2019) pioneer a new framework for ZSL, collaboratively learning a latent subspace and cross-modal embedding, illustrating how a fusion of modalities and latent representations can enrich the ZSL process [188]. Niu et al. (2019) introduce an adaptive approach to visual-semantic mapping, accompanied by progressive label refinement for ZSL. Their scalable version, DEEP AEZSL, empowers ZSL models to adapt and refine their semantic mappings in a data-driven manner, addressing challenges related to evolving data distributions [189].

Rahman et al. (2018) shed light on Class Adapting Principal Directions (CAPDs), a novel method for mapping image features to semantically meaningful spaces, offering a fresh perspective on feature representations in ZSL [190]. Meng et al. (2018) introduce a low-rank-representation (LRR) based manifold-regularization approach, which seamlessly incorporates locality and similarity information to foster the learning of discriminative semantic representations, showcasing the versatility of embedding-based techniques [191]. Long et al. (2018) propose a comprehensive ZSL framework that effectively maps semantic embeddings to a discriminative representation space, integrating KLDA, CLN, and KRR to create a powerful tool for zero-shot learning across diverse domains [192]. Lastly, Ji et al. (2017) embark on a journey of fusion, combining various types of side information and visual features into a shared semantic space, revealing a holistic approach to embedding-based ZSL, where diverse sources of knowledge converge [193].

Table 4
Modality and attribute-based methods

Name	Method	Year	Ref
MFF	Multi-modal ZSL	2022	[165]
MM-APANN	Multi-modal ZSL	2022	[166]
Lázaro-Gredilla et al.	Multi-modal ZSL	2019	[167]
MZSL-GCN	Multilabel MZSL	2020	[168]
MR-Selection	Graph based	2023	[169]
Roy et al.	Graph based	2022	[19]
Poincaré graph	Graph based	2022	[170]
LND-GMF	Graph based	2022	[171]
Mancini et al.	Graph based	2022	[172]
PS-GNN	Graph based	2021	[173]
GAZL	Graph based	2021	[174]
Dual Projectile ZSL	Embedding based	2023	[8]
Dual Projectile ZSL	Embedding based	2022	[175]
RIAE	Embedding based	2022	[176]
Malware-SMELL	Embedding based	2022	[177]
DeepGOZero	Embedding based	2022	[178]
LSA, SRSA	Embedding based	2022	[179]
Yang et al.	Embedding based	2021	[180]
zero-shot VAR	Embedding based	2021	[26]
CLASR	Embedding based	2021	[181]
Wang et al.	Embedding based	2021	[182]
AMS-SFE	Embedding based	2021	[183]
LSE	Embedding based	2019	[184]
UVDS	Embedding based	2018	[185]
Jiang et al.	Embedding based	2019	[186]
Jin et al.	Embedding based	2019	[18]
BZSL	Embedding based	2019	[187]
Meng M., Yu J.	Embedding based	2019	[188]
AEZSL	Embedding based	2019	[189]
CAPDs	Embedding based	2018	[190]
Low-Rank-Representation for ZSL	Embedding based	2018	[191]
Long et al.	Embedding based	2018	[192]
MBFA ZSL	Embedding based	2017	[193]

These pioneering modality and attribute-based methods collectively also listen in Table 4 redefine the boundaries of ZSL, demonstrating the power of semantic embeddings and feature representations in addressing the intricacies of real-world applications across diverse domains.

5. Advanced techniques

5.1 Attention mechanisms

Advanced Techniques for Zero-Shot Learning (ZSL) have witnessed a transformative shift with the incorporation of attention mechanisms. These mechanisms have emerged as indispensable tools for unraveling the intricacies of both visual and semantic data, substantially enhancing the interpretability and overall performance of ZSL models.

In a recent innovation by Xie et al. (2023), an end-to-end attention-based embedding network takes center stage, dedicated to uncovering the most salient image components for ZSL. This approach offers the dual advantage of precisely localizing relevant image regions and extracting discriminative features essential for accurate ZSL [3].

Yang et al. (2022) present the Spatial Response Attention (SRA) model, which leverages spatial attention localization. This model takes a step further by introducing a novel Attribute Attention Cross Entropy loss, refining the alignment between visual features and semantic attributes with unprecedented precision [194]. Meng et al. (2022) harness the potential of attention mechanisms to focus on pivotal image parts essential for distinguishing categories. This innovative approach refines the process of feature extraction, further elevating the ZSL performance by concentrating on the most informative regions of the visual data [195].

In another avenue of research, Zhu et al. (2022) combine attention mechanisms with Kullback-Leibler divergence, forging a powerful synergy between interpretability and information theoretic principles. This amalgamation enriches the understanding of the underlying data distribution, paving the way for enhanced ZSL [9].

Conversely, in the study by Liu et al. (2021), the Semantic-diversity Transfer Network introduces a multi-attention architecture, emphasizing not only the richness of semantic information but also the integration of diverse attention mechanisms. This holistic framework provides a comprehensive perspective on data, bolstering the capabilities of ZSL models [196]. These pioneering advances underscore the pivotal role of attention mechanisms in reshaping how ZSL models interpret both visual and semantic information, ultimately expanding the boundaries of performance and interpretability in real-world ZSL applications.

5.2 Other

Table 5
Other advanced zsl methods

Name	Method	Year	Ref
Xie et al.	Attention mechanisms	2023	[3]
SRA	Attention mechanisms	2022	[194]
Meng et al.	Attention mechanisms	2022	[195]
Zhu et al.	Attention mechanisms	2022	[9]
Liu et al.	Attention mechanisms	2021	[196]
DARN	Other	2023	[197]
Chen et al.	Other	2023	[15]
ZeroNAS	Other	2022	[198]
Tf-GCZSL	Other	2022	[199]
Meta-learning ZSL	Other	2022	[200]
Yucel et al.	Other	2022	[201]
NucNormZSL	Other	2021	[202]
Wang et al.	Other	2021	[203]
ZSL-CPLSR	Other	2021	[204]
Pamungkas et al.	Other	2021	[205]
Visual Structure Constraint XSL	Other	2021	[206]
Kim et al.	Other	2021	[207]
GPAC	Other	2021	[208]
Inf-FG framework	Other	2021	[209]
Wu et al.	Other	2021	[210]
SELAR	Other	2021	[211]
LTI-ST	Other	2021	[212]
PSD	Other	2020	[213]
Liu et al.	Other	2020	[214]
Luo et al.	Other	2020	[215]
DDIP	Other	2020	[216]
Mishra et al.	Other	2020	[217]
Pradham et al.	Other	2020	[218]
Tang et al.	Other	2020	[219]
Rostami et al.	Other	2020	[220]
Wang et al.	Other	2019	[221]
Liu et al.	Other	2018	[222]
Zhang et al.	Other	2019	[223]
LUVP	Other	2018	[224]
Yu et al.	Other	2018	[225]
Haptic ZSL	Other	2018	[226]
Zhang et al.	Other	2018	[227]
Luo et al.	Other	2018	[228]

The field of zero-shot learning expands beyond traditional applications, with innovative tools and techniques emerging to tackle a diverse range of challenges Table 5. In recent developments, a novel Deep Attention Relation Network (DARN) has been crafted to enhance bearing fault diagnosis (BFD), paving the way for improved machinery reliability [197]. Simultaneously, a vision-based system emerges to assess the productivity of excavators engaged in earthmoving tasks, marking significant strides in industrial efficiency and monitoring [15].

Moreover, advancements extend to the core architecture of zero-shot learning models. ZeroNAS, a specialized neural architecture, stands out as a game-changer, surpassing conventional methods and opening doors to new possibilities in model design [198]. On the frontier of continual learning, Tf-GCZSL introduces task-free generalized continual zero-shot learning, revolutionizing how machines adapt and acquire knowledge over time [199]. The realm of meta-learning leaves an indelible mark on zero-shot learning as well, with innovative models demonstrating the capacity to learn and generalize from limited labeled data [200].

In the realm of robustness, focus has sharpened on the resilience of discriminative ZSL models to image corruptions, shedding light on the model’s reliability under challenging real-world conditions [201]. Meanwhile, NucNormZSL leverages nuclear norm to engineer a low-rank solution within the source domain and applies regularization in the target domain, exemplifying innovative domain adaptation techniques [202].

Expanding into the domain of semantic segmentation, a groundbreaking meta-learning-based model redefines zero-shot semantic segmentation, addressing complex challenges in scene understanding [203].

Zhao and colleagues [204] put forward ZSL-CPLSR, a novel approach designed to tackle the intricate challenges tied to recognizing previously unseen classes. Expanding into the domain of natural language processing, Pamungkas et al. [205] delve into the investigation of low-resource language detection for hate speech. Their pioneering work revolves around joint teaching models that harness diverse bilingual language presentations.

Furthering the sophistication of zero-shot learning systems, Wan [206] introduces a visual structure constraint applied to category centers. This innovative approach fortifies the projection of unseen semantic midpoints, contributing to the advancement of Zero-Shot Learning systems. Kim [207] pioneers a self-supervision technique tailored for vector-level CNN features, elevating the performance of zero-shot learning outcomes. Meanwhile, Zhang and colleagues [208] present the Generic Plug-in Attribute Correction (GPAC) module, a remarkable addition that enhances existing models for Generalized Zero-Shot Learning (GZSL). This module diligently preserves the semantic meaning of attributes, addressing challenges specific to the GZSL setting. Shifting focus to the intricacies of under-constrained ZSL problems, Han [209] introduces the Inf-FG framework. This framework employs two parallel streams, offering a comprehensive strategy to tackle the challenges inherent in such scenarios. Wu [210] details an innovative algorithm founded on the Gauss-Seidel iteration and Barzilai-Borwein stepsize. This algorithm has a pivotal role in reducing domain shift and mitigating information loss, further enhancing the robustness of zero-shot learning models.

Yang [211] introduces SELAR, a focused approach aimed at enhancing the performance of Generalized Zero-Shot Learning (GZSL). This method contributes significantly to the advancement of GZSL capabilities.

Turning to the domain of image indexing and retrieval, Kan [212] presents the LTI-ST system. This novel system prioritizes efficiency and scalability, marking a noteworthy contribution to the field. Zhang and colleagues [213] propose the PSD method, designed explicitly to address challenges related to Discriminative Classifiers (DCNs) in the context of generalized ZSL.

Cross-modal transfer between visual and tactile modalities is a unique challenge addressed by Liu [214], who introduces a structured approach employing dictionary learning. On the front of feature extraction, Luo [215] pioneers a feature extractor tailored to datasets. This innovative technique mitigates the issue of feature mismatch when utilizing pre-trained neural networks for Zero-Shot Learning.

Meanwhile, Wang [216] unveils DDIP, a novel method strategically designed to tackle knowledge transfer challenges between distinct classes and domains in both Zero-Shot Learning (ZSL) and Generalized Zero-Shot Learning (GZSL). In the domain of action recognition, Mishra [217] explores unsupervised methods to overcome limitations associated with supervised techniques.

Pradhan [218] shifts focus to land cover mapping in Malaysia, leveraging high-resolution orthophotos to delve into the applications of Zero-Shot Learning. Lastly, Tang [219] pioneers a noise-contrastive estimation method, contributing to the transfer of knowledge from seen categories to unseen ones, further enriching the ZSL landscape. Rostami [220] introduces coupled dictionary learning as an approach to lifelong learning within the context of zero-shot learning, opening doors to new possibilities and expanding the field’s horizons.

Wang et al. [221] lead the charge by proposing a deep learning framework that combines Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) for tactile material recognition, promising advancements in understanding materials through touch. Building on this progress, Liu et al. [222] introduce a pioneering generalized zero-shot learning approach that leverages web video knowledge to detect anomalous activities in surveillance videos, potentially revolutionizing the field of security and anomaly detection.

In tandem, Zhang et al. [223] present an optimization approach designed to address the challenges posed by Generalized Zero-Shot Learning (GZSL). They frame GZSL as a triple verification problem and employ complementary losses, enhancing the robustness of GZSL models. Li et al. [224] delve into the realm of hubness issues and domain shifts within Zero-Shot Learning with their LUVP method, aiming to improve the reliability and accuracy of ZSL models. Yu et al. [225] break new ground with their novel ZSL approach for computer vision, harnessing bidirectional mapping-based semantic relationship modeling to reshape the way machines perceive visual concepts and their intricate relationships.

Advancing robotics, Abderrahmane et al. [226] put forth an optimized Zero-Shot Learning algorithm for haptic recognition, enabling a robot hand to recognize novel objects without prior training data. Expanding the scope, Zhang et al. [227] describe a deep semi-supervised method that uses descriptive texts instead of labels to obtain more accurate semantics of the categories. Addressing domain shift and hubness problems, Luo et al. [228] formulate Zero-Shot Learning as attribute regression, offering novel insights into mitigating these challenges.

6. Conclusion and future directions

In an era where data are simultaneously expanding and become more intricate, effectively training deep learning models has become an extremely complex task, especially in domains characterized by lack of large datasets. Transfer learning emerges as a pivotal strategy in this context, providing a pathway to harness pre-existing neural networks for new tasks, thereby economizing on data and computational resources. As a category of transfer-learning, zero-shot learning (ZSL), tackles the nuanced challenge of identifying and categorizing ‘unseen’ data during testing, leveraging semantic descriptions and intermediate features extracted from ‘seen’ training data to navigate through the unknowns. Together, transfer learning and ZSL present an indispensable tool, that can be used for tackling problems inherent to the complexities of vast and sparse datasets, and holding the promise of propelling effective model development across various knowledge domains.

Comparative evaluation of different categories of zero-shot learning algorithms involves a comprehensive analysis of their performance, scalability, generalization capabilities, and robustness across diverse datasets and domains. Semantic-based methods excel in capturing high-level semantic relationships but may face challenges in handling fine-grained distinctions and noisy data. Embedding-based approaches offer flexibility and scalability by directly learning feature representations from data, but their performance may be limited by the quality and diversity of training data. Hybrid methods aim to leverage the advantages of both semantic and embedding information, providing a more comprehensive understanding of classes. They often achieve improved generalization performance by integrating semantic knowledge into the embedding space. Meta-learning methods offer a promising avenue for zero-shot learning by learning to adapt to new classes with limited labeled data, but their applicability may be constrained by computational complexity and the availability of meta-training data.

Zero-shot learning (ZSL) has changed how we use machine learning, allowing us to work with new categories of data that models haven’t seen during training. This is crucial for fields where data is scarce and has been especially transformative across various domains. It is becoming a cornerstone in machine learning, providing solutions for dealing with scarce and complex data and promising advancements across many distinct fields of research. One significant advantage of zero-shot learning techniques is their ability to generalize to unseen classes, allowing models to recognize and classify objects or concepts not encountered during training. This capability is invaluable in scenarios where obtaining labeled data for all possible classes is impractical or costly. Additionally, zero-shot learning promotes model scalability by reducing the need for continuous retraining as new classes emerge. It fosters adaptability, making it suitable for dynamic environments.

The ongoing innovations in this area are fascinating and we believe that they will most likely lead to more refined and impactful tools and solutions in the very near future. Out of the 221 papers surveyed in this review, 42 employ generative modelling, 5 leverage attention-based mechanisms in ZSL, and 57 incorporate a blend of methodologies with zero-shot learning, highlighting the adaptability and escalating significance of these models in varied knowledge domains. Such versatile approaches are instrumental in managing complex or scarcely available data types, reflecting potential dominance in fields that struggle due to data limitations.

However, it should be noted that zero-shot learning-based methodologies are also riddled with challenges that need addressing in order to reach their full potential. A critical concern is enhancing scalability and generalization as it is difficult for most current ZSL methods to adapt to extensive datasets and to generalize to datasets exhibiting substantial divergence from the ones used for training. Addressing the handling of domain shift is equally crucial; many ZSL methods struggle to notice the small details and meanings in visual attributes of target data points, requiring better models to understand these small differences effectively.

Furthermore, mitigating bias in cross-modal learning is imperative. The representation of data from disparate domains in a unified embedding space often leads to the acquisition of incorrect correlations and associations. Therefore, refinements in cross-modal learning methodologies are essential to ensure accurate and unbiased learning outcomes.

Another inherent challenge is the anticipation of unseen classes during training. ZSL models, although adept at recognizing unseen classes, typically do not undergo explicit training on these classes independently, potentially compromising their performance on unseen data. The refinement of training methodologies to enable models to anticipate and adapt to unseen classes more effectively is a pressing need.

Despite these challenges, the versatility of zero-shot learning is unfolding unparalleled possibilities across diverse domains of knowledge. Its adaptability and the potential it holds, promise to discover applications well beyond the existing ones, thus influencing many different aspects of our lives. It is plausible that techniques inherent to ZSL could serve as catalysts for upcoming innovations, resulting in novel paradigms across diverse fields of research and enabling unprecedented advancements and solutions to enduring challenges and problems.

References

Géron

. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow. ISBN: 9781098125974: “O’Reilly Media, Inc.”; 2022.

Benois-Pineau

Zemmari

. Multi-faceted deep learning: Models and data. Springer Nature; 2021.

Xie

Zhang

Xiong

Shao

. towards zero-shot learning: A brief review and an attention-based embedding network. IEEE Transactions on Circuits and Systems for Video Technology. 2023 Mar; 33(3): 1181-97. Available from: doi: 10.1109/tcsvt.2022.3208071.

Pourpanah

Abdar

Luo

Zhou

Wang

Lim

, et al. A review of generalized zero-shot learning methods. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2022.

Cao

Sun

Zhang

Ren

, et al. A review on multimodal zero-shot learning. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery. 2023; 13(2): e1488.

Kirk

Zhang

Grefenstette

Rocktäschel

. A survey of zero-shot generalisation in deep reinforcement learning. Journal of Artificial Intelligence Research. 2023; 76: 201-64.

Wang

Zheng

Miao

. A survey of zero-shot learning: Settings, methods, and applications. ACM Transactions on Intelligent Systems and Technology (TIST). 2019; 10(2): 1-37.

Rao

Yang

Zeng

Wang

. Dual projective zero-shot learning using text descriptions. ACM Transactions on Multimedia Computing, Communications, and Applications. 2023 Jan; 19(1): 1-17. Available from: doi: 10.1145/3514247.

Zhu

HKA

Yeung

Lam

. Microplastic pollution assessment with digital holography and zero-shot learning. APL Photonics. 2022 Jul; 7(7). Available from: doi: 10.1063/5.0093439.

10.

Singh

Thakur

. Meta-DZSL: a meta-dictionary learning based approach to zero-shot recognition. Applied Intelligence. 2022; 52(14): 15938-60.

11.

Sun

. Research progress of zero-shot learning. Applied Intelligence. 2021; 51: 3600-14.

12.

Zhang

Wang

Ren

Gao

. SMDM: Tackling zero-shot relation extraction with semantic max-divergence metric learning. Applied Intelligence. 2023; 53(6): 6569-84.

13.

Luo

Wang

Pourpanah

. Dual VAEGAN: A generative model for generalized zero-shot learning. Applied Soft Computing. 2021; 107: 107352.

14.

Wen

Jiang

. Grouping attributes zero-shot learning for tongue constitution recognition. Artificial Intelligence in Medicine. 2020 Sep; 109: 101951. Available from: doi: 10.1016/j.artmed.2020.101951.

15.

Chen

Xiao

Zhang

Zhu

. Automatic vision-based calculation of excavator earthmoving productivity using zero-shot learning activity recognition. Automation in Construction. 2023 Feb; 146: 104702. Available from: doi: 10.1016/j.autcon.2022.104702.

16.

Kulmanov

Hoehndorf

. DeepGOZero: improving protein function prediction from sequence and zero-shot learning based on ontology axioms. bioRxiv. 2022.

17.

Deznabi

Arabaci

Koyutürk

Tastan

. DeepKinZero: Zero-shot learning for predicting kinase–phosphosite associations involving understudied kinases. Bioinformatics. 2020; 36(12): 3652-61.

18.

Jin

Xie

Huang

Cao

Wang

. Discriminant zero-shot learning with center loss. Cognitive Computation. 2019; 11: 503-12.

19.

Roy

Ghosal

Cambria

Majumder

Mihalcea

Poria

. Improving zero-shot learning baselines with commonsense knowledge. Cognitive Computation. 2022; 14(6): 2212-22.

20.

Xie

Zeng

Xiang

Yang

Liu

. Class knowledge overlay to visual feature learning for zero-shot image classification. Computer Vision and Image Understanding. 2021 Jun; 207: 103206. Available from: doi: 10.1016/j.cviu.2021.103206.

21.

Liu

Gao

Han

Shao

. Label-activating framework for zero-shot learning. Neural Networks. 2020 Jan; 121: 1-9. Available from: doi: 10.1016/j.neunet.2019.08.023.

22.

Dong

Hwang

Sigal

Xue

. Learning the compositional domains for generalized zero-shot learning. Computer Vision and Image Understanding. 2022 Aug; 221: 103454. Available from: doi: 10.1016/j.cviu.2022.103454.

23.

Barros

Chagas

Oliveira

Queiroz

Ramos

. Malware-SMELL: A zero-shot learning strategy for detecting zero-day vulnerabilities. Computers and Security. 2022; 120: 102785.

24.

Liang

Ding

Yan

. A zero-shot fault semantics learning model for compound fault diagnosis. Expert Systems with Applications. 2023; 221: 119642.

25.

Gull

Arif

. Generalized zero-shot learning using identifiable variational autoencoders. Expert Systems with Applications. 2022 Apr; 191: 116268. Available from: doi: 10.1016/j.eswa.2021.116268.

26.

Buckchash

Raman

. Towards zero shot learning of geometry of motion streams and its application to anomaly recognition. Expert Systems with Applications. 2021 Sep; 177: 114916. Available from: doi: 10.1016/j.eswa.2021.114916.

27.

Wah

Branson

Welinder

Perona

Belongie

. The caltech-ucsd birds-200-2011 dataset. 2011.

28.

Deng

Dong

Socher

Fei-Fei

. Imagenet: A large-scale hierarchical image database. 2009 IEEE conference on computer vision and pattern recognition. Ieee. 2009; 248-55.

29.

Lampert

Nickisch

Harmeling

. Learning to detect unseen object classes by between-class attribute transfer. 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE. 2009; 951-8.

30.

APascal-aYahoo Image Data Collection. Department of Computer Science, University of Illinois at Urbana-Champaign.

31.

Krishna

Zhu

Groth

Johnson

Hata

Kravitz

, et al. Visual genome: Connecting language and vision using crowdsourced dense image annotations. International Journal of Computer Vision. 2017; 123: 32-73.

32.

Patterson

Hays

. Sun attribute database: Discovering, annotating, and recognizing scene attributes. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE. 2012. 2751-8.

33.

Chua

Tang

Hong

Luo

Zheng

. Nus-wide: A real-world web image database from national university of singapore. In: Proceedings of the ACM International Conference on Image and Video Retrieval. 2009; 1-9.

34.

Huang

Luo

Wang

. Hippocampus-heuristic character recognition network for zero-shot learning in Chinese character recognition. Pattern Recognition. 2022; 130: 108818.

35.

Kutbi

Peng

. Zero-shot deep domain adaptation with common representation learning. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2021; 44(7): 3909-24.

36.

McCartney

Devereux

Martinez-del Rincon

. A zero-shot deep metric learning approach to brain–computer interfaces for image retrieval. Knowledge-Based Systems. 2022; 246: 108556.

37.

Zhou

Zhao

Fan

Ding

Yuan

. Zero-shot learning for compound fault diagnosis of bearings. Expert Systems with Applications. 2022; 190: 116197.

38.

Dong

Jiang

Zhou

Lin

Shi

. SR2CNN: Zero-shot learning for signal recognition. IEEE Transactions on Signal Processing. 2021; 69: 2316-29.

39.

Wang

Dong

Jiang

Wang

Xue

, et al. Vocabulary-informed zero-shot and open-set learning. IEEE transactions on pattern analysis and machine intelligence. 2019; 42(12): 3136-52.

40.

Wang

Pang

Shao

. Dual triplet network for image zero-shot learning. Neurocomputing. 2020; 373: 90-7.

41.

Huang

Lin

Huangfu

. Class-prototype discriminative network for generalized zero-shot learning. IEEE Signal Processing Letters. 2020; 27: 301-5.

42.

Sun

Guo

Pang

. Semantic softmax loss for zero-shot learning. Neurocomputing. 2018; 316: 369-75.

43.

Xiang

Kodirov

Gong

. Zero-shot learning on semantic class prototype graph. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017; 40(8): 2009-22.

44.

Guo

Ding

Han

Gao

. Zero-shot learning with transferred samples. IEEE Transactions on Image Processing. 2017; 26(7): 3277-90.

45.

Zhang

Yang

Feng

Duan

. Learning cross-domain semantic-visual relationships for transductive zero-shot learning. Pattern Recognition. 2023; 141: 109591.

46.

Han

Zhang

. Knowledge distillation classifier generation network for zero-shot learning. IEEE Transactions on Neural Networks and Learning Systems. 2021.

47.

Freitas

Silva

. Hyperspectral imaging zero-shot learning for remote marine litter detection and classification. Remote Sensing. 2022; 14(21): 5516.

48.

Cheraghian

Rahman

Chowdhury

Campbell

Petersson

. Zero-shot learning on 3d point cloud objects and beyond. International Journal of Computer Vision. 2022; 130(10): 2364-84.

49.

Pang

Zhang

. Semantic-guided class-imbalance learning model for zero-shot image classification. IEEE Transactions on Cybernetics. 2021; 52(7): 6543-54.

50.

Kim

Lee

Byun

. Discriminative deep attributes for generalized zero-shot learning. Pattern Recognition. 2022; 124: 108435.

51.

Liu

Dong

. An iterative co-training transductive framework for zero shot learning. IEEE Transactions on Image Processing. 2021; 30: 6943-56.

52.

Guan

Xiang

Wang

Wen

. Transferrable feature and projection learning with class hierarchy for zero-shot learning. International Journal of Computer Vision. 2020; 128: 2810-27.

53.

Hou

Xia

Zhang

Gao

. Discriminative comparison classifier for generalized zero-shot learning. Neurocomputing. 2020; 414: 10-7.

54.

Duan

Pang

Zheng

, et al. Zero-shot learning for EEG classification in motor imagery-based BCI system. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2020; 28(11): 2411-9.

55.

Fang

. Learning domain invariant unseen features for generalized zero-shot classification. Knowledge-Based Systems. 2020; 206: 106378.

56.

Zhang

Wang

Liu

Shen

Wei

Zhang

, et al. Towards effective deep embedding for zero-shot learning. IEEE Transactions on Circuits and Systems for Video Technology. 2020; 30(9): 2843-52.

57.

Del Chiaro

Bagdanov

Del Bimbo

. Webly-supervised zero-shot learning for artwork instance recognition. Pattern Recognition Letters. 2019; 128: 420-6.

58.

Gui

Wang

Yang

. A generalized zero-shot learning framework for PolSAR land cover classification. Remote Sensing. 2018; 10(8): 1307.

59.

Cheng

Qiao

Wang

. Random forest classifier for zero-shot learning based on relative attribute. IEEE transactions on neural networks and learning systems. 2017; 29(5): 1662-74.

60.

Qin

Wang

Liu

Chen

Shao

. Beyond semantic attributes: Discrete latent attributes learning for zero-shot recognition. IEEE Signal Processing Letters. 2016; 23(11): 1667-71.

61.

Liu

Zhang

Chen

. Attribute relation learning for zero-shot classification. Neurocomputing. 2014; 139: 34-46.

62.

Yang

Sun

Yang

Wang

Chen

, et al. Iterative class prototype calibration for transductive zero-shot learning. IEEE Transactions on Circuits and Systems for Video Technology. 2022; 33(3): 1236-46.

63.

Pham

Sarabakha

Odnoshyvkin

Kayacan

. Pencilnet: Zero-shot sim-to-real transfer learning for robust gate perception in autonomous drone racing. IEEE Robotics and Automation Letters. 2022; 7(4): 11847-54.

64.

Zarei

Taheri

Long

. Kernelized distance learning for zero-shot recognition. Information Sciences. 2021; 580: 801-18.

65.

Han

Guo

Zhao

. Disentangled features with direct sum decomposition for zero shot learning. Neurocomputing. 2021; 426: 216-26.

66.

Xie

Zhang

Yao

Zhang

Zhao

Shao

. VMAN: A virtual mainstay alignment network for transductive zero-shot learning. IEEE Transactions on Image Processing. 2021; 30: 4316-29.

67.

Zeng

Lian

Ding

. Semi-supervised low-rank semantics grouping for zero-shot learning. IEEE Transactions on Image Processing. 2021; 30: 2207-19.

68.

Xie

Cao

Ming

. A further study on biologically inspired feature enhancement in zero-shot learning. International Journal of Machine Learning and Cybernetics. 2021; 12: 257-69.

69.

Song

Chen

Cui

. EM simulation-aided zero-shot learning for SAR automatic target recognition. IEEE Geoscience and Remote Sensing Letters. 2020 Jun; 17(6): 1092-6. Available from: doi: 10.1109/lgrs.2019.2936897.

70.

Liu

Zhang

Zhu

Zheng

Zhao

Cheng

. Convolutional prototype learning for zero-shot recognition. Image and Vision Computing. 2020 Jun; 98: 103924. Available from: doi: 10.1016/j.imavis.2020.103924.

71.

Liu

Wang

Zhao

Yang

. Learning unbiased zero-shot semantic segmentation networks via transductive transfer. IEEE Signal Processing Letters. 2020; 27: 1640-4. Available from: doi: 10.1109/lsp.2020.3023340.

72.

Rahman

Khan

Barnes

. Deep0tag: Deep multiple instance learning for zero-shot image tagging. IEEE Transactions on Multimedia. 2019; 22(1): 242-55.

73.

Liu

Gao

Han

Shao

. A discriminative cross-aligned variational autoencoder for zero-shot learning. IEEE Transactions on Cybernetics. 2023 Jun; 53(6): 3794-805. Available from: doi: 10.1109/tcyb.2022.3164142.

74.

Cheng

Wang

Zhang

Han

Zhang

. Hybrid routing transformer for zero-shot learning. Pattern Recognition. 2023 May; 137: 109270. Available from: doi: 10.1016/j.patcog.2022.109270.

75.

Yang

Zhang

Yang

Deng

. Adaptive bias-aware feature generation for generalized zero-shot learning. IEEE Transactions on Multimedia. 2023; 25: 280-90. Available from: doi: 10.1109/tmm.2021.3125134.

76.

Pan

Luo

Shen

. Learning mlatent representations for generalized zero-shot learning. IEEE Transactions on Multimedia. 2023; 25: 2252-65. Available from: doi: 10.1109/tmm.2022.3145237.

77.

Gao

Hou

Qin

Shen

Long

Liu

, et al. Visual-semantic aligned bidirectional network for zero-shot learning. IEEE Transactions on Multimedia. 2023; 25: 1649-64. Available from: doi: 10.1109/tmm.2022.3145666.

78.

Liu

Yao

Chang

. Attribute-modulated generative meta learning for zero-shot learning. IEEE Transactions on Multimedia. 2023; 25: 1600-10. Available from: doi: 10.1109/tmm.2021.3139211.

79.

Wei

Deng

Yang

Tao

. Incremental zero-shot learning. IEEE Transactions on Cybernetics. 2022 Dec; 52(12): 13788-99. Available from: doi: 10.1109/tcyb.2021.3110369.

80.

Tang

. Zero-shot learning via structure-aligned generative adversarial network. IEEE Transactions on Neural Networks and Learning Systems. 2022 Nov; 33(11): 6749-62. Available from: doi: 10.1109/tnnls.2021.3083367.

81.

Liu

Chen

Liu

Zhang

. From less to more: Progressive generalized zero-shot detection with curriculum learning. IEEE Transactions on Intelligent Transportation Systems. 2022 Oct; 23(10): 19016-29. Available from: doi: 10.1109/tits.2022.3151073.

82.

Mahapatra

Reyes

. Self-supervised generalized zero shot learning for medical image classification using novel interpretable saliency maps. IEEE Transactions on Medical Imaging. 2022 Sep; 41(9): 2443-56. Available from: doi: 10.1109/tmi.2022.3163232.

83.

Jing

Zhu

Shen

. Investigating the bilateral connections in generative zero-shot learning. IEEE Transactions on Cybernetics. 2022 Aug; 52(8): 8167-78. Available from: doi: 10.1109/tcyb.2021.3050803.

84.

Guo

Liang

Xie

. Cross-modal propagation network for generalized zero-shot learning. Pattern Recognition Letters. 2022 Jul; 159: 125-31. Available from: doi: 10.1016/j.patrec.2022.05.009.

85.

Xie

Zhang

Liu

Zhu

Liu

Shao

, et al. Generalized zero-shot learning with multiple graph adaptive generative networks. IEEE Transactions on Neural Networks and Learning Systems. 2022 Jul; 33(7): 2903-15. Available from: doi: 10.1109/tnnls.2020.3046924.

86.

Yang

Ran

. GAN-MVAE: A discriminative latent feature generation framework for generalized zero-shot learning. Pattern Recognition Letters. 2022 Mar; 155: 77-83. Available from: doi: 10.1016/j.patrec.2022.02.002.

87.

Shinzaki

Koda

Yamamoto

Nishio

Morikura

Shirato

, et al. Zero-shot adaptation for mmWave beam-tracking on overhead messenger wires through robust adversarial reinforcement learning. IEEE Transactions on Cognitive Communications and Networking. 2022 Mar; 8(1): 232-45. Available from: doi: 10.1109/tccn.2021.3116231.

88.

Liu

Zhang

Yang

Liu

. Learning discriminative and representative feature with cascade GAN for generalized zero-shot learning. Knowledge-Based Systems. 2022 Jan; 236: 107780. Available from: doi: 10.1016/j.knosys.2021.107780.

89.

Chen

Luo

Wang

Huang

. GSMFlow: Generation shifts mitigating flow for generalized zero-shot learning. IEEE Transactions on Multimedia. 2022; 1-12. Available from: doi: 10.1109/tmm.2022.3190678.

90.

Shermin

Teng

Sohel

Murshed

. Bidirectional mapping coupled GAN for generalized zero-shot learning. IEEE Transactions on Image Processing. 2022; 31: 721-33. Available from: doi: 10.1109/tip.2021.3135480.

91.

Deng

Xiang

Gao

Xia

Gao

. Zero-shot learning based on quality-verifying adversarial network. IEEE Transactions on Multimedia. 2022; 24: 4526-37. Available from: doi: 10.1109/tmm.2021.3119854.

92.

Pan

Shen

. Alleviating domain shift via discriminative learning for generalized zero-shot learning. IEEE Transactions on Multimedia. 2022; 24: 1325-37. Available from: doi: 10.1109/tmm.2021.3063616.

93.

Chen

Liu

. Augmented semantic feature based generative network for generalized zero-shot learning. Neural Networks. 2021 Nov; 143: 1-11. Available from: doi: 10.1016/j.neunet.2021.04.014.

94.

Luo

Wang

Pourpanah

. Dual VAEGAN: A generative model for generalized zero-shot learning. Applied Soft Computing. 2021 Aug; 107: 107352. Available from: doi: 10.1016/j.asoc.2021.107352.

95.

Xie

Xiang

Zeng

Yang

Liu

. Cross knowledge-based generative zero-shot learning approach with taxonomy regularization. Neural Networks. 2021 Jul; 139: 168-78. Available from: doi: 10.1016/j.neunet.2021.02.009.

96.

Guo

Luo

Bhuiyan

MZA

Ren

Zhang

Zhou

. Zero shot augmentation learning in internet of biometric things for health signal processing. Pattern Recognition Letters. 2021 Jun; 146: 142-9. Available from: doi: 10.1016/j.patrec.2021.03.012.

97.

Feng

Zhao

. Transfer increment for generalized zero-shot learning. IEEE Transactions on Neural Networks and Learning Systems. 2021 Jun; 32(6): 2506-20. Available from: doi: 10.1109/tnnls.2020.3006322.

98.

Liu

Zhang

Yang

. Cross-class generative network for zero-shot learning. Information Sciences. 2021 May; 555: 147-63. Available from: doi: 10.1016/j.ins.2020.12.063.

99.

Zhang

Dou

. Bidirectional generative transductive zero-shot learning. Neural computing and applications. 2021; 33: 5313-26.

100.

Song

Shi

Xie

Zhang

. Domain-aware Stacked AutoEncoders for zero-shot learning. Neurocomputing. 2021 Mar; 429: 118-31. Available from: doi: 10.1016/j.neucom.2020.12.017.

101.

Ponti

Vulić

Cotterell

Parovic

Reichart

Korhonen

. Parameter space factorization for zero-shot learning across tasks and languages. Transactions of the Association for Computational Linguistics. 2021; 9: 410-28.

102.

Geng

Chen

Yuan

Zhang

Chen

. Explainable zero-shot learning via attentive graph convolutional network and knowledge graphs. Semantic Web. 2021; 12(5): 741-65.

103.

Wang

Jiang

. Learning across tasks for zero-shot domain adaptation from a single source domain. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2022 Oct; 44(10): 6264-79. Available from: doi: 10.1109/tpami.2021.3088859.

104.

Kim

Lee

Byun

. Unseen image generating domain-free networks for generalized zero-shot learning. Neurocomputing. 2020 Oct; 411: 67-77. Available from: doi: 10.1016/j.neucom.2020.05.043.

105.

Shen

. Similarity preserving feature generating networks for zero-shot learning. Neurocomputing. 2020 Sep; 406: 333-42. Available from: doi: 10.1016/j.neucom.2019.08.111.

106.

Liu

Yao

Zheng

Luo

Zhao

Lyu

. Dual-stream generative adversarial networks for distributionally robust zero-shot learning. Information Sciences. 2020 May; 519: 407-22. Available from: doi: 10.1016/j.ins.2020.01.025.

107.

Chi

Peng

. Zero-shot cross-media embedding learning with dual adversarial distribution network. IEEE Transactions on Circuits and Systems for Video Technology. 2020 Apr; 30(4): 1173-87. Available from: doi: 10.1109/tcsvt.2019.2900171.

108.

Gao

Zheng

. A zero-shot learning method for fault diagnosis under unknown working loads. Journal of Intelligent Manufacturing. 2020; 31: 899-909.

109.

Shao

. Generalized zero-shot learning with multi-channel gaussian mixture VAE. IEEE Signal Processing Letters. 2020; 27: 456-60. Available from: doi: 10.1109/lsp.2020.2977498.

110.

Gao

Hou

Qin

Chen

Liu

Zhu

, et al. Zero-VAE-GAN: Generating unseen features for generalized and transductive zero-shot learning. IEEE Transactions on Image Processing. 2020; 29: 3665-80. Available from: doi: 10.1109/tip.2020.2964429.

111.

Ding

Shao

. Generative zero-shot learning via low-rank embedded semantic dictionary. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2019 Dec; 41(12): 2861-74. Available from: doi: 10.1109/tpami.2018.2867870.

112.

Cui

Pang

Zhang

. Zero-shot classification with unseen prototype learning. Neural Computing and Applications. 2021; 1-11.

113.

Zhang

Wang

Ren

Gao

. SMDM: Tackling zero-shot relation extraction with semantic max-divergence metric learning. Applied Intelligence. 2023; 53(6): 6569-84.

114.

Yao

Wang

Ding

Zhong

Bullock

, et al. Lightweight network learning with Zero-Shot Neural Architecture Search for UAV images. Knowledge-Based Systems. 2023 Jan; 260: 110142. Available from: doi: 10.1016/j.knosys.2022.110142.

115.

Liu

Chang

McAuley

Yao

. Diversity-boosted generalization-specialization balancing for zero-shot learning. IEEE Transactions on Multimedia. 2023; 1-11. Available from: doi: 10.1109/tmm.2023.3236211.

116.

Cheng

Chen

Jiang

. Attribute-Based Zero-Shot Learning for Encrypted Traffic Classification. IEEE Transactions on Network and Service Management. 2022 Dec; 19(4): 4583-99. Available from: doi: 10.1109/tnsm.2022.3183247.

117.

Zhang

Liu

. Cross-modal prototype learning for zero-shot handwritten character recognition. Pattern Recognition. 2022 Nov; 131: 108859. Available from: doi: 10.1016/j.patcog.2022.108859.

118.

Liu

Yao

Wang

McAuley

Chang

. An entropy-guided reinforced partial convolutional network for zero-shot learning. IEEE Transactions on Circuits and Systems for Video Technology. 2022 Aug; 32(8): 5175-86. Available from: doi: 10.1109/tcsvt.2022.3147902.

119.

Liu

Zhang

Zhu

Zheng

Zhao

Cheng

. MFHI: Taking modality-free human identification as zero-shot learning. IEEE Transactions on Circuits and Systems for Video Technology. 2022 Aug; 32(8): 5225-37. Available from: doi: 10.1109/tcsvt.2021.3137216.

120.

Yun

Wang

Hou

Gao

. Attributes learning network for generalized zero-shot learning. Neural Networks. 2022 Jun; 150: 112-8. Available from: doi: 10.1016/j.neunet.2022.02.018.

121.

Han

. Holistically associated transductive zero-shot learning. IEEE Transactions on Cognitive and Developmental Systems. 2022 Jun; 14(2): 437-47. Available from: doi: 10.1109/tcds.2021.3049274.

122.

Bian

Yuan

Wei

Zheng

. Domain adaptation meets zero-shot learning: An annotation-efficient approach to multi-modality medical image segmentation. IEEE Transactions on Medical Imaging. 2022 May; 41(5): 1043-56. Available from: doi: 10.1109/tmi.2021.3131245.

123.

Hou

Lai

Yang

. Cross-modal distribution alignment embedding network for generalized zero-shot learning. Neural Networks. 2022 Apr; 148: 176-82. Available from: doi: 10.1016/j.neunet.2022.01.007.

124.

Song

Zhang

. Semantic-visual combination propagation network for zero-shot learning. IEEE Transactions on Circuits and Systems II: Express Briefs. 2022 Apr; 69(4): 2341-5. Available from: doi: 10.1109/tcsii.2021.3136250.

125.

Zhang

ao Geng

Wang

Sun

Shi

, et al. A zero-shot learning framework via cluster-prototype matching. Pattern Recognition. 2022 Apr; 124: 108469. Available from: doi: 10.1016/j.patcog.2021.108469.

126.

Wang

. Learn more from less: Generalized zero-shot learning with severely limited labeled data. Neurocomputing. 2022 Mar; 477: 25-35. Available from: doi: 10.1016/j.neucom.2022.01.007.

127.

Shermin

Teng

Sohel

Murshed

. Integrated generalized zero-shot learning for fine-grained classification. Pattern Recognition. 2022 Feb; 122: 108246. Available from: doi: 10.1016/j.patcog.2021.108246.

128.

Liu

Dang

Gao

Han

Shao

. Zero-shot learning with attentive region embedding and enhanced semantics. IEEE Transactions on Neural Networks and Learning Systems. 2022; 1-12. Available from: doi: 10.1109/tnnls.2022.3202014.

129.

Fang

Chen

. Generalized zero-shot domain adaptation with target unseen class prototype learning. Neural Computing and Applications. 2022; 34(20): 17793-807.

130.

Chen

Hong

Xie

Peng

You

Ding

, et al. GNDAN: Graph navigated dual attention network for zero-shot learning. IEEE Transactions on Neural Networks and Learning Systems. 2022; 1-14. Available from: doi: 10.1109/tnnls.2022.3155602.

131.

Kwon

Regib

. A gating model for bias calibration in generalized zero-shot learning. IEEE Transactions on Image Processing. 2022; 1-1. Available from: doi: 10.1109/tip.2022.3153138.

132.

Zeng

Lian

Ding

. Generative Mixup Networks for Zero-Shot Learning. IEEE Transactions on Neural Networks and Learning Systems. 2022; 1-12. Available from: doi: 10.1109/tnnls.2022.3142181.

133.

Jia

Chen

Zhang

. Towards visual explainable active learning for zero-shot classification. IEEE Transactions on Visualization and Computer Graphics. 2022 Jan; 28(1): 791-801. Available from: doi: 10.1109/tvcg.2021.3114793.

134.

Lyu

Huang

. Disentangling semantic-to-visual confusion for zero-shot learning. IEEE Transactions on Multimedia. 2022; 24: 2828-40. Available from: doi: 10.1109/tmm.2021.3089017.

135.

Yao

Min

Zhang

. Attribute-induced bias eliminating for transductive zero-shot learning. IEEE Transactions on Multimedia. 2022; 24: 1933-42. Available from: doi: 10.1109/tmm.2021.3074252.

136.

Zhu

Zhang

. Learning deep cross-modal embedding networks for zero-shot remote sensing image scene classification. IEEE Transactions on Geoscience and Remote Sensing. 2021 Dec; 59(12): 10590-603. Available from: doi: 10.1109/tgrs.2020.3047447.

137.

Liu

. Adversarial strategy for transductive zero-shot learning. Information Sciences. 2021 Nov; 578: 750-61. Available from: doi: 10.1016/j.ins.2021.06.085.

138.

Zhang

Zhu

Zhang

Huang

. Visual-guided attentive attributes embedding for zero-shot learning. Neural Networks. 2021 Nov; 143: 709-18. Available from: doi: 10.1016/j.neunet.2021.07.031.

139.

Nihal

Rahman

Broti

Deowan

. Bangla sign alphabet recognition with zero-shot and transfer learning. Pattern Recognition Letters. 2021 Oct; 150: 84-93. Available from: doi: 10.1016/j.patrec.2021.06.020.

140.

Qian

Xiong

Liu

. Zero-shot policy generation in lifelong reinforcement learning. Neurocomputing. 2021 Jul; 446: 65-73. Available from: doi: 10.1016/j.neucom.2021.02.058.

141.

Tsang

Liu

. Complementary attributes: A new clue to zero-shot learning. IEEE Transactions on Cybernetics. 2021 Mar; 51(3): 1519-30. Available from: doi: 10.1109/tcyb.2019.2930744.

142.

Min

Yao

Xie

Zha

Zhang

. Domain-oriented semantic embedding for zero-shot learning. IEEE Transactions on Multimedia. 2021; 23: 3919-30. Available from: doi: 10.1109/tmm.2020.3033124.

143.

Ding

Zhong

. A semantic encoding out-of-distribution classifier for generalized zero-shot learning. IEEE Signal Processing Letters. 2021; 28: 1395-9. Available from: doi: 10.1109/lsp.2021.3092227.

144.

Zhang

Liu

Long

Zhang

Shao

. Deep transductive network for generalized zero shot learning. Pattern Recognition. 2020 Sep; 105: 107370. Available from: doi: 10.1016/j.patcog.2020.107370.

145.

Wang

Chen

Cheng

Chen

Liu

. Zero-shot learning based on deep weighted attribute prediction. IEEE Transactions on Systems, Man, and Cybernetics: Systems. 2020 Aug; 50(8): 2948-57. Available from: doi: 10.1109/tsmc.2018.2837670.

146.

Zhang

Gui

Zhu

Zhao

Liu

. Hierarchical prototype learning for zero-shot recognition. IEEE Transactions on Multimedia. 2020 Jul; 22(7): 1692-703. Available from: doi: 10.1109/tmm.2019.2959433.

147.

Fang

. Zero shot learning based on class visual prototypes and semantic consistency. Pattern Recognition Letters. 2020 Jul; 135: 368-74. Available from: doi: 10.1016/j.patrec.2020.04.029.

148.

Zhang

Mao

Long

Yang

Shao

. A probabilistic zero-shot learning method via latent nonnegative prototype synthesis of unseen classes. IEEE Transactions on Neural Networks and Learning Systems. 2019; 1-15. Available from: doi: 10.1109/tnnls.2019.2955157.

149.

Chen

Wang

Zhang

. Multi-modal generative adversarial network for zero-shot learning. Knowledge-Based Systems. 2020 Jun; 197: 105847. Available from: doi: 10.1016/j.knosys.2020.105847.

150.

Liu

Tuytelaars

. A deep multi-modal explanation model for zero-shot learning. IEEE Transactions on Image Processing. 2020; 29: 4788-803. Available from: doi: 10.1109/tip.2020.2975980.

151.

Changpinyo

Chao

Gong

Sha

. Classifier and exemplar synthesis for zero-shot learning. International Journal of Computer Vision. 2020; 128: 166-201.

152.

Jia

Zhang

Wang

Shan

Tan

. Deep unbiased embedding transfer for zero-shot learning. IEEE Transactions on Image Processing. 2020; 29: 1958-71. Available from: doi: 10.1109/tip.2019.2947780.

153.

Ding

Wang

. Cross-domain mapping learning for transductive zero-shot learning. Computer Vision and Image Understanding. 2019 Oct; 187: 102784. Available from: doi: 10.1016/j.cviu.2019.07.004.

154.

Jiang

Wang

Shan

Chen

. Adaptive metric learning for zero-shot recognition. IEEE Signal Processing Letters. 2019 Sep; 26(9): 1270-4. Available from: doi: 10.1109/lsp.2019.2917148.

155.

Wang

Pang

Han

. Class-specific synthesized dictionary model for zero-shot learning. Neurocomputing. 2019 Feb; 329: 339-47. Available from: doi: 10.1016/j.neucom.2018.10.069.

156.

Zhang

Long

Liu

Shao

. Adversarial unseen visual feature synthesis for Zero-shot Learning. Neurocomputing. 2019 Feb; 329: 12-20. Available from: doi: 10.1016/j.neucom.2018.10.043.

157.

Zhang

Long

Yang

Shao

. Dual-verification network for zero-shot learning. Information Sciences. 2019 Jan; 470: 43-57. Available from: doi: 10.1016/j.ins.2018.08.048.

158.

Guo

Pang

. Transductive zero-shot learning with adaptive structural embedding. IEEE Transactions on Neural Networks and Learning Systems. 2018 Sep; 29(9): 4116-27. Available from: doi: 10.1109/tnnls.2017.2753852.

159.

Liu

Yao

Ding

. Combining ontology and reinforcement learning for zero-shot classification. Knowledge-Based Systems. 2018 Mar; 144: 42-50. Available from: doi: 10.1016/j.knosys.2017.12.022.

160.

Sumbul

Cinbis

Aksoy

. Fine-grained object recognition and zero-shot learning in remote sensing imagery. IEEE Transactions on Geoscience and Remote Sensing. 2018 Feb; 56(2): 770-9. Available from: doi: 10.1109/tgrs.2017.2754648.

161.

Song

. Zero-shot learning of SAR target feature space with deep generative neural networks. IEEE Geoscience and Remote Sensing Letters. 2017 Dec; 14(12): 2245-9. Available from: doi: 10.1109/lgrs.2017.2758900.

162.

Guo

Pang

. Zero-shot learning with regularized cross-modality ranking. Neurocomputing. 2017 Oct; 259: 14-20. Available from: doi: 10.1016/j.neucom.2016.06.085.

163.

Pang

Guo

Zhang

. Manifold regularized cross-modal embedding for zero-shot learning. Information Sciences. 2017 Feb; 378: 48-58. Available from: doi: 10.1016/j.ins.2016.10.025.

164.

Hospedales

Xiang

Gong

. Transductive multi-view zero-shot learning. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2015 Nov; 37(11): 2332-45. Available from: doi: 10.1109/tpami.2015.2408354.

165.

Cao

Huang

Patwary

MJA

Wang

. MFF: Multi-modal feature fusion for zero-shot learning. Neurocomputing. 2022 Oct; 510: 172-80. Available from: doi: 10.1016/j.neucom.2022.09.070.

166.

Chen

Lan

Zheng

. Generalized zero-shot learning via multi-modal aggregated posterior aligning neural network. IEEE Transactions on Multimedia. 2022; 24: 177-87. Available from: doi: 10.1109/tmm.2020.3047546.

167.

Lázaro-Gredilla

Lin

Guntupalli

George

. Beyond imitation: Zero-shot task transfer on robots by learning concepts as cognitive programs. Science Robotics. 2019 Jan; 4(26). Available from: doi: 10.1126/scirobotics.aav3150.

168.

Domeniconi

Zhang

. Multi-label zero-shot learning with graph convolutional networks. Neural Networks. 2020 Dec; 132: 333-41. Available from: doi: 10.1016/j.neunet.2020.09.010.

169.

Feng

Bai

Zhang

Shang

Jiao

. MR-selection: A meta-reinforcement learning approach for zero-shot hyperspectral band selection. IEEE Transactions on Geoscience and Remote Sensing. 2023; 61: 1-20. Available from: doi: 10.1109/tgrs.2022.3231870.

170.

Liu

Han

. Meta hyperbolic networks for zero-shot learning. Neurocomputing. 2022 Jun; 491: 57-66. Available from: doi: 10.1016/j.neucom.2022.03.040.

171.

Wang

Zhang

. Domain-aware multi-modality fusion network for generalized zero-shot learning. Neurocomputing. 2022 Jun; 488: 23-35. Available from: doi: 10.1016/j.neucom.2022.02.056.

172.

Mancini

Naeem

Xian

Akata

. Learning graph embeddings for open world compositional zero-shot learning. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2022; 1-1. Available from: doi: 10.1109/tpami.2022.3163667.

173.

Gao

Zhang

. Learning to model relationships for zero-shot video classification. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2021 Oct; 43(10): 3476-91. Available from: doi: 10.1109/tpami.2020.2985708.

174.

Wang

Zhao

Zhuang

. Graph active learning for GCN-based zero-shot classification. Neurocomputing. 2021 May; 435: 15-25. Available from: doi: 10.1016/j.neucom.2020.12.127.

175.

Han

Chen

Yang

. Semantic contrastive embedding for generalized zero-shot learning. International Journal of Computer Vision. 2022; 130(11): 2606-22.

176.

Zhao

Peng

. Region interaction and attribute embedding for zero-shot learning. Information Sciences. 2022 Sep; 609: 984-95. Available from: doi: 10.1016/j.ins.2022.07.096.

177.

Barros

Chagas

Oliveira

Queiroz

Ramos

. Malware-SMELL: A zero-shot learning strategy for detecting zero-day vulnerabilities. Computers and Security. 2022; 120: 102785.

178.

Kulmanov

Hoehndorf

. DeepGOZero: Improving protein function prediction from sequence and zero-shot learning based on ontology axioms. Bioinformatics. 2022 Jun; 38(Supplement_1): i238-45. Available from: doi: 10.1093/bioinformatics/btac256.

179.

Liu

Gao

Han

Liu

Shao

. Zero-shot learning via a specific rank-controlled semantic autoencoder. Pattern Recognition. 2022 Feb; 122: 108237. Available from: doi: 10.1016/j.patcog.2021.108237.

180.

Yang

Huang

Zhang

Goulermas

Hussain

. Coarse-grained generalized zero-shot learning with efficient self-focus mechanism. Neurocomputing. 2021 Nov; 463: 400-10. Available from: doi: 10.1016/j.neucom.2021.08.027.

181.

Lin

Fan

Chen

Zhao

. Class label autoencoder with structure refinement for zero-shot learning. Neurocomputing. 2021 Mar; 428: 54-64. Available from: doi: 10.1016/j.neucom.2020.11.061.

182.

Wang

Gong

Cheng

. Zero-shot learning based on multitask extended attribute groups. IEEE Transactions on Systems, Man, and Cybernetics: Systems. 2021 Mar; 51(3): 2003-11. Available from: doi: 10.1109/tsmc.2019.2912206.

183.

Guo

. A novel perspective to zero-shot learning: towards an alignment of manifold structures via semantic feature expansion. IEEE Transactions on Multimedia. 2021; 23: 524-37. Available from: doi: 10.1109/tmm.2020.2984091.

184.

Guo

Zhang

. Zero-shot learning via latent space encoding. IEEE Transactions on Cybernetics. 2019; Oct; 49(10): 3755-66. Available from: doi: 10.1109/tcyb.2018.2850750.

185.

Long

Liu

Shen

Shao

. Zero-shot learning using synthesised unseen visual data with diffusion regularisation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2018 Oct; 40(10): 2498-512. Available from: doi: 10.1109/tpami.2017.2762295.

186.

Jiang

Wang

Shan

Chen

. Adaptive metric learning for zero-shot recognition. IEEE Signal Processing Letters. 2019 Sep; 26(9): 1270-4. Available from: doi: 10.1109/lsp.2019.2917148.

187.

Shen

Zhou

Yang

Liu

Shen

. Scalable zero-shot learning via binary visual-semantic embeddings. IEEE Transactions on Image Processing. 2019 Jul; 28(7): 3662-74. Available from: doi: 10.1109/tip.2019.2899987.

188.

Meng

. Zero-shot learning via robust latent representation and manifold regularization. IEEE Transactions on Image Processing. 2019 Apr; 28(4): 1824-36. Available from: doi: 10.1109/tip.2018.2881926.

189.

Niu

Cai

Veeraraghavan

Zhang

. Zero-shot learning via category-specific visual-semantic mapping and label refinement. IEEE Transactions on Image Processing. 2019 Feb; 28(2): 965-79. Available from: doi: 10.1109/tip.2018.2872916.

190.

Rahman

Khan

Porikli

. A unified approach for conventional zero-shot, generalized zero-shot, and few-shot learning. IEEE Transactions on Image Processing. 2018 Nov; 27(11): 5652-67. Available from: doi: 10.1109/tip.2018.2861573.

191.

Meng

Zhan

. Zero-shot learning via low-rank-representation based manifold regularization. IEEE Signal Processing Letters. 2018 Sep; 25(9): 1379-83. Available from: doi: 10.1109/lsp.2018.2857201.

192.

Long

Shen

Liu

Xie

Yang

. Zero-shot learning via discriminative representation extraction. Pattern Recognition Letters. 2018 Jul; 109: 27-34. Available from: doi: 10.1016/j.patrec.2017.09.030.

193.

Xie

Pang

Chen

Zhang

. Zero-shot learning with Multi-Battery Factor Analysis. Signal Processing. 2017 Sep; 138: 265-72. Available from: doi: 10.1016/j.sigpro.2017.03.023.

194.

Yang

Zhang

Tong

. Semantic-aligned reinforced attention model for zero-shot learning. Image and Vision Computing. 2022 Dec; 128: 104586. Available from: doi: 10.1016/j.imavis.2022.104586.

195.

Meng

Wei

. Learning multipart attention neural network for zero-shot classification. IEEE Transactions on Cognitive and Developmental Systems. 2022 Jun; 14(2): 414-23. Available from: doi: 10.1109/tcds.2020.3044313.

196.

Liu

Dong

. Semantic-diversity transfer network for generalized zero-shot learning via inner disagreement based OOD detector. Knowledge-Based Systems. 2021 Oct; 229: 107337. Available from: doi: 10.1016/j.knosys.2021.107337.

197.

Chen

Deng

Wang

. Deep attention relation network: A zero-shot learning method for bearing fault diagnosis under unknown domains. IEEE Transactions on Reliability. 2023 Mar; 72(1): 79-89. Available from: doi: 10.1109/tr.2022.3177930.

198.

Yan

Chang

Guan

Zhu

, et al. ZeroNAS: Differentiable generative adversarial networks search for zero-shot learning. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2022 Dec; 44(12): 9733-40. Available from: doi: 10.1109/tpami.2021.3127346.

199.

Gautam

Parameswaran

Mishra

Sundaram

. Tf-GCZSL: Task-free generalized continual zero-shot learning. Neural Networks. 2022 Nov; 155: 487-97. Available from: doi: 10.1016/j.neunet.2022.08.034.

200.

Singh

Thakur

. Meta-DZSL: A meta-dictionary learning based approach to zero-shot recognition. Applied Intelligence. 2022; 52(14): 15938-60.

201.

Yucel

Cinbis

Duygulu

. How robust are discriminatively trained zero-shot learning models? Image and Vision Computing. 2022 Mar; 119: 104392. Available from: doi: 10.1016/j.imavis.2022.104392.

202.

Singh

Thakur

. NucNormZSL: nuclear norm-based domain adaptation in zero-shot learning. Neural Computing and Applications. 2022; 1-22.

203.

Wang

Duan

Zhang

. Context-sensitive zero-shot semantic segmentation model based on meta-learning. Neurocomputing. 2021; 465: 465-75.

204.

Zhao

Zhang

Liu

. Zero-shot learning via the fusion of generation and embedding for image recognition. Information Sciences. 2021; 578: 831-47.

205.

Pamungkas

Basile

Patti

. A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection. Information Processing and Management. 2021; 58(4): 102544.

206.

Wan

Chen

Liao

. Visual structure constraint for transductive zero-shot learning in the wild. International Journal of Computer Vision. 2021; 129: 1893-909.

207.

Kim

Lee

Byun

. Zero-shot learning with self-supervision by shuffling semantic embeddings. Neurocomputing. 2021; 437: 1-8.

208.

Zhang

Bai

Long

Liu

Shao

. A plug-in attribute correction module for generalized zero-shot learning. Pattern Recognition. 2021; 112: 107767.

209.

Han

Yang

. Inference guided feature generation for generalized zero-shot learning. Neurocomputing. 2021; 430: 150-8.

210.

Yan

Chen

Huang

. Joint visual and semantic optimization for zero-shot learning. Knowledge-Based Systems. 2021; 215: 106773.

211.

Yang

Wang

Herranz

van de Weijer

. On implicit attribute localization for generalized zero-shot learning. IEEE Signal Processing Letters. 2021; 28: 872-6.

212.

Kan

Cen

Vladimir

. Zero-shot learning to index on semantic trees for scalable image retrieval. IEEE Transactions on Image Processing. 2020; 30: 501-16.

213.

Zhang

Liu

Yao

Long

. Pseudo distribution on unseen classes for generalized zero shot learning. Pattern Recognition Letters. 2020; 135: 451-8.

214.

Liu

Sun

Fang

Guo

. Cross-modal zero-shot-learning for tactile object recognition. IEEE Transactions on Systems, Man, and Cybernetics: Systems. 2018; 50(7): 2466-74.

215.

Luo

Wang

Cao

. A novel dataset-specific feature extractor for zero-shot learning. Neurocomputing. 2020; 391: 74-82.

216.

Wang

Zhang

Long

Shao

. Learning discriminative domain-invariant prototypes for generalized zero shot learning. Knowledge-Based Systems. 2020; 196: 105796.

217.

Mishra

Pandey

Murthy

. Zero-shot learning for action recognition using synthesized features. Neurocomputing. 2020; 390: 117-30.

218.

Pradhan

Al-Najjar

Sameen

Tsang

Alamri

. Unseen land cover classification from high-resolution orthophotos using integration of zero-shot learning and convolutional neural networks. Remote Sensing. 2020; 12(10): 1676.

219.

Tang

Yang

. Zero-shot learning by mutual information estimation and maximization. Knowledge-Based Systems. 2020; 194: 105490.

220.

Rostami

Isele

Eaton

. Using task descriptions in lifelong machine learning for improved performance and zero-shot transfer. Journal of Artificial Intelligence Research. 2020; 67: 673-704.

221.

Wang

Liu

Sun

Pan

. Fabric recognition using zero-shot learning. Tsinghua Science and Technology. 2019; 24(6): 645-53.

222.

Liu

Huang

Dong

. Generalized zero-shot learning for action recognition with web-scale video data. World Wide Web. 2019; 22(2): 807-24.

223.

Zhang

Long

Guan

Shao

. Triple verification network for generalized zero-shot learning. IEEE Transactions on Image Processing. 2018; 28(1): 506-17.

224.

Fang

Feng

. Learning unseen visual prototypes for zero-shot classification. Knowledge-Based Systems. 2018; 160: 176-87.

225.

Guo

Zhang

Ling

, et al. Transductive zero-shot learning with a self-training dictionary approach. IEEE transactions on cybernetics. 2018; 48(10): 2908-19.

226.

Abderrahmane

Ganesh

Crosnier

Cherubini

. Haptic zero-shot learning: Recognition of objects never touched before. Robotics and Autonomous Systems. 2018; 10511-25.

227.

Zhang

Liu

Luo

Chang

Zheng

. Deep semisupervised zero-shot learning with maximum mean discrepancy. Neural computation. 2018; 30(5): 1426-47.

228.

Luo

Huang

Feng

Wang

. Zero-shot learning via attribute regression and class prototype rectification. IEEE Transactions on Image Processing. 2017; 27(2): 637-48.

A comprehensive review on zero-shot-learning techniques

Abstract

Keywords

1. Introduction

2. Learning strategy-based methods

2.1 Metric learning methods

2.2 Classifier-based methods

2.3 Instance-based methods

Table 2 Learning strategy-based methods

3.1 Generative modelling

3.2 Hybrid methods

Table 3 Generative modelling and hybrid zsl methods

4.1 Multi-modal ZSL

4.1.1 Multilabel MZSL

4.2 Graph-based methods

4.3 Embedding-based methods

Table 4 Modality and attribute-based methods

5.1 Attention mechanisms

5.2 Other

Table 5 Other advanced zsl methods

References

Table 2
Learning strategy-based methods

Table 3
Generative modelling and hybrid zsl methods

Table 4
Modality and attribute-based methods

Table 5
Other advanced zsl methods