Abstract
Chinese named entity recognition (CNER) aims to identify entity names such as person names and organization names from Chinese raw text and thus can quickly extract the entity information that people are concerned about from large-scale texts. Recent studies attempt to improve performance by integrating lexicon words into char-based CNER models. These existing studies, however, usually focus on leveraging the context-free words in lexicon without considering the contextual information of words and subwords in the sentences. To address this issue, in addition to utilizing the lexicon words, we further propose to construct a hierarchical tree structure representation composed of characters, subwords and context-aware predicted words from segmentor to represent each sentence for CNER. Based on the tree-structure representation, we propose a hierarchical long short-term memory (HiLSTM) framework, which consists of hierarchical encoding layer, fusion layer and CRF layer, to capture linguistic knowledge at different levels. On the one hand, the interactions within each level help to obtain the contextual information. On the other hand, the propagations from the lower-levels to the upper-levels can provide additional semantic knowledge for CNER. Experimental results on three widely used CNER datasets show that our proposed HiLSTM model achieves significant improvement over several strong benchmark methods.
Similar content being viewed by others
References
Chen Y B, Xu L H, Liu K, et al. Event extraction via dynamic multi-pooling convolutional neural networks. In: Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2015. 167–176
Miwa M, Bansal M. End-to-end relation extraction using LSTMs on sequences and tree structures. In: Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2016. 1105–1116
Diefenbach D, Lopez V, Singh K, et al. Core techniques of question answering systems over knowledge bases: a survey. Knowl Inform Syst, 2018. https://hal.archives-ouvertes.fr/hal-01637143/document
Yang J F, Guan Y, He B, et al. Corpus construction for named entities and entity relations on Chinese electronic medical records. J Softw, 2016, 27: 2725–2746
Song L F, Zhang Y, Gildea D, et al. Leveraging dependency forest for neural medical relation extraction. In: Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2019. 208–218
Tian Y H, Ma W C, Xia F, et al. ChiMed: a Chinese medical corpus for question answering. In: Proceedings of the 18th BioNLP Workshop and Shared Task, 2019. 250–260
Saito K, Nagata M. Multi-language named-entity recognition system based on HMM. In: Proceedings of Annual Meeting of the Association for Computational Linguistics Workshop on Multilingual and Mixed-language Named Entity Recognition, 2003
Yu H K, Zhang H P, Liu Q, et al. Chinese named entity identification using cascaded hidden Markov model. J Commun, 2006, 27: 87–94
Solorio T, López A. Learning named entity classifiers using support vector machines. In: Proceedings of Conference on Computational Linguistics and Natural Language Processing (CICLing), 2004. 158–167
Mccallum A, Li W. Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. In: Proceedings of Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), 2003. 188–191
Chiu J P C, Nichols E. Named entity recognition with bidirectional LSTM-CNNs. Trans Assoc Comput Linguist, 2016, 4: 357–370
Lample G, Ballesteros M, Subramanian S, et al. Neural architectures for named entity recognition. In: Proceedings of Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), 2016. 260–270
Liu L Y, Shang J B, Xu F, et al. Empower sequence labeling with task-aware neural language model. In: Proceedings of Association for the Advance of Artificial Intelligence (AAAI), 2018. 5253–5260
Dong C H, Zhang J J, Zong C Q, et al. Character-based LSTM-CRF with radical-level features for Chinese named entity recognition. In: Proceedings of International Conference on Computer Processing of Oriental Languages (ICCPOL), 2016. 239–250
Zhang Y, Yang J. Chinese NER using lattice LSTM. In: Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2018. 1554–1564
Gui T, Zou Y C, Zhang Q, et al. A lexicon-based graph neural network for Chinese NER. In: Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2019. 1040–1050
Liu W, Xu T G, Xu Q H, et al. An encoding strategy based word-character LSTM for Chinese NER. In: Proceedings of North American Chapter of the Association for Computational Linguistics (NAACL), 2019. 2379–2389
Sui D B, Chen Y B, Liu K, et al. Leverage lexical knowledge for Chinese named entity recognition via collaborative graph network. In: Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2019. 3828–3838
Gong C, Li Z H, Zhang M, et al. Multi-grained Chinese word segmentation. In: Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2017. 703–714
Heinzerling B, Strube M. BPEmb: tokenization-free pre-trained subword embeddings in 275 languages. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC), 2018. 2989–2993
Collobert R, Weston J, Bottou L, et al. Natural language processing (almost) from scratch. J Mach Learn Res, 2011, 12: 2493–2537
He H F, Sun X. A unified model for cross-domain and semi-supervised named entity recognition in Chinese social media. In: Proceedings of Association for the Advance of Artificial Intelligence (AAAI), 2017. 3216–3222
Zhao H, Kit C Y. Unsupervised segmentation helps supervised learning of character tagging for word segmentation and namedentity recognition. In: Proceedings of SIGHAN Workshop on Chinese Language Processing, 2008
Peng N Y, Dredze M. Named entity recognition for Chinese social media with jointly trained embeddings. In: Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2015. 548–554
He H F, Sun X. F-score driven max margin neural network for named entity recognition in Chinese social media. In: Proceedings of European Chapter of the Association for Computational Linguistics (EACL), 2017. 713–718
Peng N Y, Dredze M. Improving named entity recognition for Chinese social media with word segmentation representation learning. In: Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2016. 149–155
Cao P F, Chen Y B, Liu K, et al. Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In: Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2018. 182–192
Sennrich R, Haddow B, Birch A. Neural machine translation of rare words with subword units. In: Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2016. 1715–1725
Weischedel R, Palmer M, Marcus M, et al. OntoNotes Release 4.0. Philadelphia: Linguistic Data Consortium, 2011
Levow G A. The third international Chinese language processing backoff: word segmentation and named entity recognition. In: Proceedings of SIGHAN Workshop on Chinese Language Processing, 2006. 108–117
Noreen E. Computer-intensive methods for testing hypotheses. Biometrics, 1990, 46: 540–541
Zhang S X, Qin Y, Wen J, et al. Word segmentation and named entity recognition for SIGHAN bakeoff3. In: Proceedings of SIGHAN Workshop on Chinese Language Processing, 2006. 158–161
Zhou J S, Qu W G, Zhang F. Chinese named entity recognition via joint identification and categorization. Chinese J Electron, 2013, 22: 225–230
Li S, Zhao Z, Hu R F, et al. Analogical reasoning on Chinese morphological and semantic relations. In: Proceedings of Annual Meeting of the Association for Computational Linguistics (ACL), 2018. 138–143
Acknowledgements
This work was supported by National Science Fund for Distinguished Young Scholars (Grant No. 61525205), National Natural Science Foundation of China (Grant No. 61876116), and Project Funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gong, C., Li, Z., Xia, Q. et al. Hierarchical LSTM with char-subword-word tree-structure representation for Chinese named entity recognition. Sci. China Inf. Sci. 63, 202102 (2020). https://doi.org/10.1007/s11432-020-2982-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11432-020-2982-y