1673-159X

CN 51-1686/N

基于LSTM-SNP的命名实体识别

Named Entity Recognition Based on the LSTM-SNP

  • 摘要: 脉冲神经P系统(SNPs)是抽象于生物神经元信息交互机制的高效并行计算系统。LSTM-SNP首次结合非线性SNP和长短期记忆神经网络(LSTM),从而形成门控机制可解释的深度学习通用模型。LSTM-SNP作为传统序列分析模型LSTM的最新变体,在处理典型自然语言处理序列分析问题的性能表现未见相关研究。文章以命名实体识别任务为基础,通过在LSTM-SNP上增补不同的深度学习组件,对LSTM-SNP与传统LSTM以及其变体BiLSTM的性能差异进行了全面分析,为在自然语言处理任务中使用LSTM-SNP模型提供可靠的改进参考。通过以CoNLL-2003和OntoNotes5.0为标准数据集的对比实验,发现:LSTM-SNP模型与LSTM模型具有类似的实体识别性能,但随着预处理的操作,LSTM-SNP模型的整体性能提升更为显著;LSTM-SNP模型对命名实体的识别是一种行之有效的方法,且具有较大的应用潜力。

     

    Abstract: Spiking neural P systems (SNPs) are efficient parallel computing systems abstracted from the mechanism of information exchange between biological neurons. For the first time, LSTM-SNPs combine nonlinear SNPs and long short-term memory (LSTM) to form a universal deep learning model that gating mechanisms can explain. LSTM-SNPs, the latest variant of the traditional sequence analysis model LSTM, has yet to be studied on the performance of typical sequence analysis in natural language processing. This paper comprehensively analyzes the performance difference in the named entity recognition tasks between LSTM-SNPs, traditional LSTMs, and its variant BiLSTM by adding different deep learning components. The study provides a reliable reference for applying the LSTM-SNP model in natural language processing tasks. The results of comparative experiments based on CoNLL-2003 and OntoNotes 5.0 data sets indicate the LSTM-SNP model has a similar entity recognition performance to the LSTM model. In further research,the overall model performance can be improved significantly with the pretreatment operation. The results show the LSTM-SNP model is an effective method for named entity recognition and has great application potential.

     

/

返回文章
返回