Prompt-based Few-Shot Text Classification With Multi-Granularity Label Augmentation and Adaptive Verbalizer

Authors

Huang, Deling
Li, Zanxiong
Yu, Jian
Zhou, Yulong

Item type

Journal Article

Publisher

MDPI AG

Abstract

Few-Shot Text Classification (FSTC) aims to classify text accurately into predefined categories using minimal training samples. Recently, prompt-tuning-based methods have achieved promising results by constructing verbalizers that map input data to the label space, thereby fully exploiting the features of pre-trained language models (PLMs). However, existing verbalizer construction methods often rely on external knowledge bases, which require complex noise filtering and manual refinement, making the process time-consuming and labor-intensive, while PLM-based approaches frequently overlook inherent prediction biases. Furthermore, conventional data augmentation methods focus on modifying input instances while neglecting the integral role of label semantics in prompt tuning. This disconnect often leads to a trade-off in which increased sample diversity comes at the cost of semantic consistency, yielding only marginal improvements. To address these limitations, we first propose a novel Bayesian Mutual Information-based method that optimizes the label mapping to retain general PLM features while reducing reliance on irrelevant or unfair attributes, thereby mitigating latent biases. Building on this mapping, we propose two synergistic generators that synthesize semantically consistent samples by integrating label-word information from the verbalizer, enriching the data distribution and alleviating sparsity. To guarantee the reliability of the augmented set, we propose a Low-Entropy Selector that serves as a semantic filter, retaining only high-confidence samples and thereby shielding the model from ambiguous supervision signals. Finally, we propose a Difficulty-Aware Adversarial Training framework that fosters generalized feature learning, enabling the model to withstand subtle input perturbations. Extensive experiments demonstrate that our approach outperforms state-of-the-art methods on most few-shot and full-data splits, with F1-score improvements of up to +2.8% on the standard AG’s News benchmark and +1.0% on the challenging DBPedia benchmark.
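
The Low-Entropy Selector described above filters augmented samples by predictive entropy. As a rough illustration only (this is not the paper's implementation; the function name, threshold value, and tensor shapes are hypothetical), a minimal PyTorch sketch of such an entropy-based confidence filter:

```python
import torch
import torch.nn.functional as F

def entropy_filter(logits: torch.Tensor, threshold: float) -> torch.Tensor:
    """Return a boolean mask keeping samples whose predictive entropy
    falls below `threshold`, i.e., the high-confidence samples.

    logits: (num_samples, num_classes) raw classifier scores for each
            augmented sample over the label space.
    """
    probs = F.softmax(logits, dim=-1)           # (N, C) class probabilities
    log_probs = F.log_softmax(logits, dim=-1)   # numerically stable log-probabilities
    entropy = -(probs * log_probs).sum(dim=-1)  # Shannon entropy per sample
    return entropy < threshold                  # keep only low-entropy samples

# Example: retain only the augmented samples the model is confident about.
logits = torch.randn(8, 4)   # 8 hypothetical augmented samples, 4 classes
mask = entropy_filter(logits, threshold=0.5)
kept = logits[mask]
```

Under this sketch, samples whose prediction distribution is near-uniform (high entropy) are discarded as ambiguous supervision, matching the selector's stated goal of retaining only high-confidence samples.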

Keywords

08 Information and Computing Sciences, 46 Information and computing sciences, few-shot learning, text classification, prompt tuning, data augmentation, adversarial training

Source

Information, 17(1), Article 58. MDPI AG. ISSN: 2078-2489. doi: 10.3390/info17010058

Rights statement

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.