
Agricultural Disease Named Entity Recognition with Pointer Network Based on RoFormer Pre-trained Model
WANGTong, WANGChunshan, LIJiuxi, ZHUHuaji, MIAOYisheng, WUHuarui
Agricultural Disease Named Entity Recognition with Pointer Network Based on RoFormer Pre-trained Model
[Objective] With the development of agricultural informatization, a large amount of information about agricultural diseases exists in the form of text. However, due to problems such as nested entities and confusion of entity types, traditional named entities recognition (NER) methods often face challenges of low accuracy when processing agricultural disease text. To address this issue, this study proposes a new agricultural disease NER method called RoFormer-PointerNet, which combines the RoFormer pre-trained model with the PointerNet baseline model. The aim of this method is to improve the accuracy of entity recognition in agricultural disease text, providing more accurate data support for intelligent analysis, early warning, and prevention of agricultural diseases. [Methods] This method first utilized the RoFormer pre-trained model to perform deep vectorization processing on the input agricultural disease text. This step was a crucial foundation for the subsequent entity extraction task. As an advanced natural language processing model, the RoFormer pre-trained model's unique rotational position embedding approach endowed it with powerful capabilities in capturing textual positional information. In agricultural disease text, due to the diversity of terminology and the existence of polysemy, traditional entity recognition methods often faced challenges in confusing entity types. However, through its unique positional embedding mechanism, the RoFormer model was able to incorporate more positional information into the vector representation, effectively enriching the feature information of words. This characteristic enabled the model to more accurately distinguish between different entity types in subsequent entity extraction tasks, reducing the possibility of type confusion. After completing the vectorization representation of the text, this study further emploied a pointer network for entity extraction. The pointer network was an advanced sequence labeling approach that utilizes head and tail pointers to annotate entities within sentences. This labeling method was more flexible compared to traditional sequence labeling methods as it was not restricted by fixed entity structures, enabling the accurate extraction of all types of entities within sentences, including complex entities with nested relationships. In agricultural disease text, entity extraction often faced the challenge of nesting, such as when multiple different entity types are nested within a single disease symptom description. By introducing the pointer network, this study effectively addressed this issue of entity nesting, improving the accuracy and completeness of entity extraction. [Results and Discussions] To validate the performance of the RoFormer-PointerNet method, this study constructed an agricultural disease dataset, which comprised 2 867 annotated corpora and a total of 10 282 entities, including eight entity types such as disease names, crop names, disease characteristics, pathogens, infected areas, disease factors, prevention and control methods, and disease stages. In comparative experiments with other pre-trained models such as Word2Vec, BERT, and RoBERTa, RoFormer-PointerNet demonstrated superiority in model precision, recall, and F1-Score, achieving 87.49%, 85.76% and 86.62%, respectively. This result demonstrated the effectiveness of the RoFormer pre-trained model. Additionally, to verify the advantage of RoFormer-PointerNet in mitigating the issue of nested entities, this study compared it with the widely used bidirectional long short-term memory neural network (BiLSTM) and conditional random field (CRF) models combined with the RoFormer pre-trained model as decoding methods. RoFormer-PointerNet outperformed the RoFormer-BiLSTM, RoFormer-CRF, and RoFormer-BiLSTM-CRF models by 4.8%, 5.67% and 3.87%, respectively. The experimental results indicated that RoFormer-PointerNet significantly outperforms other models in entity recognition performance, confirming the effectiveness of the pointer network model in addressing nested entity issues. To validate the superiority of the RoFormer-PointerNet method in agricultural disease NER, a comparative experiment was conducted with eight mainstream NER models such as BiLSTM-CRF, BERT-BiLSTM-CRF, and W2NER. The experimental results showed that the RoFormer-PointerNet method achieved precision, recall, and F1-Score of 87.49%, 85.76% and 86.62%, respectively in the agricultural disease dataset, reaching the optimal level among similar methods. This result further verified the superior performance of the RoFormer-PointerNet method in agricultural disease NER tasks. [Conclusions] The agricultural disease NER method RoFormer-PointerNet, proposed in this study and based on the RoFormer pre-trained model, demonstrates significant advantages in addressing issues such as nested entities and type confusion during the entity extraction process. This method effectively identifies entities in Chinese agricultural disease texts, enhancing the accuracy of entity recognition and providing robust data support for intelligent analysis, early warning, and prevention of agricultural diseases. This research outcome holds significant importance for promoting the development of agricultural informatization and intelligence.
agricultural disease / named entity recognition / entity nesting / RoFormer pre-trained model / pointer network {{custom_keyword}} /
Table 1 Example of entity type annotation表1 实体类型标注示例 |
实体类型 | 示例 | 标签 | 数量/个 |
---|---|---|---|
病害名称 | 小麦白粉病、黄瓜蔓枯病 | Disease | 1 014 |
作物名称 | 小麦、黄瓜、番茄 | Crop | 1 115 |
病害特征 | 暗绿色水渍状软腐 | Feature | 4 359 |
病原 | 葫芦科刺盘孢 | Pathogeny | 571 |
发病地区 | 长江中下游、中国上海 | Region | 617 |
发病因素 | 低温高湿、连作 | Factor | 1 109 |
防治方法 | 合理密植、轮作换茬 | Method | 1 123 |
发病阶段 | 结果期、成株期 | Period | 374 |
Table 2 NER experimental parameter settings表2 NER实验参数设置 |
参数 | 数值 |
---|---|
max_sqe_len | 256 |
epoch | 50 |
Batch_size | 32 |
warmup_proportion | 0.01 |
learning rate | 3e-5 |
Dropout | 0.5 |
Table 3 Performance comparison of model with different embedded vector表3 不同嵌入向量模型性能对比 |
模型 | 精确率/% | 召回率/% | F 1值/% |
---|---|---|---|
PointerNet | 80.03 | 74.52 | 77.17 |
Word2Vec-PointerNet | 80.67 | 75.11 | 77.79 |
BERT-PointerNet | 85.58 | 79.64 | 82.50 |
RoBerta-PointerNet | 86.03 | 79.91 | 82.86 |
RoFormer-PointerNet | 87.49 | 85.76 | 86.62 |
Table 4 Performance comparison of RoFormer combined with different decoding models表4 RoFormer联合不同解码模型的性能对比 |
模型 | 精确率/% | 召回率/% | F 1值/% |
---|---|---|---|
RoFormer-BiLSTM | 85.48 | 78.47 | 81.82 |
RoFormer-CRF | 84.32 | 77.84 | 80.95 |
RoFormer-BiLSTM-CRF | 85.20 | 80.45 | 82.75 |
RoFormer-PointerNet | 87.49 | 85.76 | 86.62 |
Table 5 Performance comparison of different NER models表5 不同NER模型的性能对比 |
模型 | 精确率/% | 召回率/% | F 1值/% |
---|---|---|---|
BilSTM-CRF | 71.93 | 73.66 | 72.79 |
BERT-BiLSTM-CRF | 85.43 | 76.04 | 80.46 |
RoBERT-BiLSTM-CRF | 84.38 | 78.29 | 81.22 |
TPLinker | 82.57 | 85.60 | 84.06 |
UIE | 72.93 | 58.02 | 64.63 |
MRC | 81.77 | 83.58 | 82.57 |
W2NER | 76.29 | 81.77 | 78.55 |
Cascade | 87.46 | 84.22 | 85.81 |
RoFormer-PointerNet | 87.49 | 85.76 | 86.62 |
1 |
孙新, 任翔渝, 郑洪超, 等. 基于参数迁移的领域命名实体识别方法[J]. 情报工程, 2022, 8(3): 13-27.
{{custom_citation.content}}
{{custom_citation.annotation}}
|
2 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
3 |
冀振燕, 孔德焱, 刘伟, 等. 基于深度学习的命名实体识别研究[J]. 计算机集成制造系统, 2022, 28(6): 1603-1615.
{{custom_citation.content}}
{{custom_citation.annotation}}
|
4 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
5 |
徐帅博. 基于枸杞病虫害知识图谱的问答系统研究与实现[D]. 银川: 宁夏大学, 2020.
{{custom_citation.content}}
{{custom_citation.annotation}}
|
6 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
7 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
8 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
9 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
10 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
11 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
12 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
13 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
14 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
15 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
16 |
金彦亮, 谢晋飞, 吴迪嘉. 基于分层标注的中文嵌套命名实体识别[J]. 上海大学学报(自然科学版), 2022, 28(2): 270-280.
{{custom_citation.content}}
{{custom_citation.annotation}}
|
17 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
18 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
19 |
王泽儒, 柳先辉. 基于指针级联标注的中文实体关系联合抽取模型[J]. 武汉大学学报(理学版), 2022, 68(3): 304-310.
{{custom_citation.content}}
{{custom_citation.annotation}}
|
20 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
21 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
22 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
23 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
24 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
25 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
26 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
27 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
28 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
29 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
30 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
31 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
32 |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
{{custom_ref.label}} |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
/
〈 |
|
〉 |