Abstract
In Chinese classics, the sentiment attitudes or thoughts of the ancients regarding specific environments, people, and events were generally expressed in the form of poetry. Compared with previous attempts to classify the polarity of poetry, sentiment terms can be used to detect more fine-grained humanity knowledge in literary information resources. However, the existing techniques of domain sentiment lexicon construction fail to take full advantage of deep learning and linguistic knowledge, which cannot ensure the term integrity and accuracy. To this end, this work proposes a novel approach for the construction of a sentiment lexicon via the combination of supervised sentiment term extraction and classification, aiming at incorporating multi-dimensional linguistic knowledge into a two-phase deep learning model. A character-sequence labeling model for term extraction is first constructed by fusing the emotion radical features of Chinese characters, and term embedding augmentation via word knowledge is then carried out to classify the extracted terms. Experiments on Chinese poetry and its appreciation texts validate the superiority of the proposed method, and the model incorporating linguistic knowledge is found to outperform the benchmark models in different metrics. A fine-grained sentiment lexicon with two first classes, five-second classes, 15 third classes, and 14,368 domain terms and unregistered terms is constructed via hierarchical term classification, thereby contributing to the advancement of the interpretability of the humanities computing of classical Chinese poetry.
Original language | English |
---|---|
Pages (from-to) | 2325-2346 |
Number of pages | 22 |
Journal | Neural Computing and Applications |
Volume | 35 |
Issue number | 3 |
DOIs | |
Publication status | Published - 2023 Jan |
Bibliographical note
Funding Information:This paper was supported by the National Natural Science Foundation of China [72074108], the Postgraduate Research & Practice Innovation Program of Jiangsu Province [KYCX21_0026], the National Research Foundation of Korea [2022R1A2B5B02002359], the Fundamental Research Funds for the Central Universities [010814370113], as well as the program of the China Scholarship Council (award to Wei Zhang for 1 year’s study abroad at the Yonsei University). We would like to acknowledge the annotator, Tao Fan, a doctoral student in Information Science from Nanjing University, as well as his valuable suggestions. We would also like to express our special thanks to the editor and reviewers for their very constructive comments and suggestions.
Funding Information:
This paper was supported by the National Natural Science Foundation of China [72074108], the Postgraduate Research & Practice Innovation Program of Jiangsu Province [KYCX21_0026], the National Research Foundation of Korea [2022R1A2B5B02002359], the Fundamental Research Funds for the Central Universities [010814370113], as well as the program of the China Scholarship Council (award to Wei Zhang for 1 year’s study abroad at the Yonsei University). We would like to acknowledge the annotator, Tao Fan, a doctoral student in Information Science from Nanjing University, as well as his valuable suggestions. We would also like to express our special thanks to the editor and reviewers for their very constructive comments and suggestions.
Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
All Science Journal Classification (ASJC) codes
- Software
- Artificial Intelligence