Residential College | false |
Status | 已發表Published |
Expert Knowledge-Guided Length-Variant Hierarchical Label Generation for Proposal Classification | |
Meng Xiao1,2; Ziyue Qiao1,2; Yanjie Fu3; Yi Du1; Pengyang Wang4; Yuanchun Zhou1 | |
2021-12 | |
Conference Name | 21st IEEE International Conference on Data Mining, ICDM 2021 |
Source Publication | Proceedings - IEEE International Conference on Data Mining, ICDM |
Volume | 2021-December |
Pages | 757-766 |
Conference Date | 07-10 December 2021 |
Conference Place | Auckland, New Zealand |
Publisher | IEEE |
Abstract | To advance the development of science and technology, research proposals are submitted to open-court competitive programs developed by government agencies (e.g., NSF). Proposal classification is one of the most important tasks to achieve effective and fair review assignment. Proposal classification aims to classify a proposal into a length-variant sequence of labels. In this paper, we formulate the proposal classification problem into a hierarchical multi-label classification task. Although there are certain prior studies, proposal classification exhibit unique features: 1) the classification result of a proposal is in a hierarchical discipline structure with different levels of granularity; 2) proposals contain multiple types of documents; 3) domain experts can empirically provide partial labels that can be leveraged to improve task performances. In this paper, we focus on developing a new deep proposal classification framework to jointly model the three features. We design a deep transformer-based encoder-decoder framework. In this framework, we use a two-level (word-level and document-level) Transformer structure as an encoder to learn the embedding feature vectors of proposals. The decoder generates labels from the starting coarse-grained level to a certain fine-grained level to form the hierarchical discipline tree. In particular, to sequentially generate labels, we leverage previously-generated labels to predict the label of next level; to integrate partial labels from experts, we use the embedding of these empirical partial labels to initialize the state of neural networks. Our model can automatically identify the best length of label sequence to stop next label prediction. Finally, we present extensive results to demonstrate that our method can jointly model partial labels, textual information, and semantic dependencies in label sequences and, thus, achieve advanced performances. |
Keyword | Hierarchical Multi-label Classification Text Classification Transformer |
DOI | 10.1109/ICDM51629.2021.00087 |
URL | View the original |
Indexed By | CPCI-S |
Language | 英語English |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Information Systems |
WOS ID | WOS:000780454100077 |
Scopus ID | 2-s2.0-85125198568 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Affiliation | 1.Computer Network Information Center, Chinese Academy of Sciences, Beijing 2.University of Chinese Academy of Sciences, Beijing 3.Department of Computer Science, University of Central Florida, Orlando 4.University of Macau, Macau |
Recommended Citation GB/T 7714 | Meng Xiao,Ziyue Qiao,Yanjie Fu,et al. Expert Knowledge-Guided Length-Variant Hierarchical Label Generation for Proposal Classification[C]:IEEE, 2021, 757-766. |
APA | Meng Xiao., Ziyue Qiao., Yanjie Fu., Yi Du., Pengyang Wang., & Yuanchun Zhou (2021). Expert Knowledge-Guided Length-Variant Hierarchical Label Generation for Proposal Classification. Proceedings - IEEE International Conference on Data Mining, ICDM, 2021-December, 757-766. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment