Trigger word identification for public security events based on BiLSTM

易士翔; 尹宏鹏; 郑恒毅

doi:10.13374/j.issn2095-9389.2019.09.012

Public security event trigger identification based on Bidirectional LSTM

Abstract

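Abstract: A fusion model based on a bidirectional long short-term memory network (BiLSTM) and a feed-forward neural network is proposed for identifying trigger words of public security events. First, the BiLSTM extracts high-level semantic features from the entire text, which avoids the manual feature extraction required by earlier machine learning methods. Second, the features are concatenated and passed to the feed-forward neural network, which identifies and classifies event trigger words. Experimental results show that, compared with the baseline models, the proposed method achieves better performance on the Chinese emergency corpus (CEC), with a Micro-F1 of 78.47%. In addition, the importance of different concatenated features for trigger word identification is discussed: three types of text-analysis features (part of speech, syntax, and entity) are compared and analyzed, leading to the conclusion that syntactic features contribute most to event trigger word identification.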

Abstract: As internet coverage continues to expand, obtaining valuable information from large amounts of fragmented, semi-structured text has become a major challenge, given the vast volume of public social information. Event trigger identification can effectively mine and refine text information so that users can quickly and accurately find what they need; it has therefore become an active research area in natural language processing. An event trigger is generally a word or phrase that marks the occurrence of an event. Trigger word identification plays an important role in knowledge base construction, intelligent search engines, automatic question answering, and automatic summarization. However, text data are characterized by high dimensionality and ambiguity. Existing identification methods are mostly based on complex manual feature engineering or consider only the features within a fixed text window, so a large number of features must be analyzed and selected by hand. Heavy reliance on natural language processing tools prevents such models from being applied at scale and leads to cascading error propagation and complicated feature engineering. This paper proposes a fusion model based on bidirectional long short-term memory (BiLSTM) and feed-forward neural networks for the trigger identification task on public security events. First, high-level features of the entire text are extracted by the BiLSTM, which avoids the manual feature extraction required by existing machine learning methods. Then, the concatenated features are fed into a feed-forward neural network to identify and classify event triggers. The experimental results show that the proposed method outperforms the baseline models on the Chinese emergency corpus (CEC), with a Micro-F1 of 78.47%. In addition, the importance of different concatenated features in trigger word identification is discussed, and three types of text-analysis features (part of speech, syntax, and entity) are compared and analyzed. It is concluded that syntactic features are the most helpful for event trigger word identification.
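
The Micro-F1 figure reported above pools true positives, false positives, and false negatives over all trigger types before computing precision and recall. The following is a minimal sketch of that metric, assuming token-level labels and a hypothetical `"O"` label for non-trigger tokens; the function name, the label names, and the token-level matching are illustrative assumptions, not the scoring protocol used in the paper.

```python
def micro_f1(gold, pred, negative_label="O"):
    """Micro-averaged F1 over token-level trigger labels.

    Assumption: `negative_label` marks non-trigger tokens and is excluded
    from the counts, so only trigger decisions are scored.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        if p != negative_label and p == g:
            tp += 1  # trigger found with the correct type
        else:
            if p != negative_label:
                fp += 1  # predicted a trigger that is absent or mistyped
            if g != negative_label:
                fn += 1  # gold trigger missed or mistyped

    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels for a six-token sentence (not taken from CEC).
gold = ["O", "TYPE_A", "O", "TYPE_B", "O", "TYPE_C"]
pred = ["O", "TYPE_A", "TYPE_B", "O", "O", "TYPE_D"]
print(micro_f1(gold, pred))
```

In this toy example there is one correct trigger, two spurious predictions, and two missed gold triggers, so precision and recall are both 1/3. A mistyped trigger counts as both a false positive and a false negative, which is a common convention but still an assumption here.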
