Abstract
Using the fine-grained entity typing method of distant supervision, when assigning type labels to entity mention, since the knowledge base contains all type labels of the entity, noisy labels will be introduced. This paper proposed a Fine-grained Entity Typing model combined with Features (FETF) to reduce the negative impact of noisy labels. It is different from the previous methods such as manual annotation and heuristic rule pruning. The model not only improves the classification efficiency, but also does not need to reduce the size of the training set, which can improve the overall performance of the classification model. FETF divides the training set into clean dataset and noisy dataset according to the type numbers of entity mention in candidate type set, and constructs different objective functions for them to achieve the purpose of reducing the impact of noisy labels. At the same time, FETF can use the feature generator to jointly learn the relational features of entity mention - type label, as well as the similarity and hierarchical features of type label - type label. The feature generator extracts semantic features other than context, so as to help the fine-grained entity typing model to assign type labels to entity mention. In addition, we introduce adversarial training in the context processor, which can effectively alleviate the model overfitting noisy labels, and improve the robustness and generalization ability of the model. Experimental results on the public datasets show that the method proposed in this paper can effectively alleviate the negative impact of noisy labels on the fine-grained entity typing model, and outperforms previous methods in accuracy, Macro F1 value and Micro F1 value.
Similar content being viewed by others
Notes
The baselines results are reported on [36].
References
Nadeau D, Sekine S (2007) A survey of named entity recognition and classification[J]. Lingvisticae Investigationes 30(1):3–26
Stern R, Sagot B, Béchet F (2012) A joint named entity recognition and entity linking system[C]// In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, EACL, Avignon, France, April 23–27 2012. 52–60
Liu Y, Liu K, Xu L et al (2014) Exploring fine-grained entity type constraints for distantly supervised relation extraction[C]//Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 2107–2116
Yaghoobzadeh Y, Adel H, Schütze H (2017) Noise Mitigation for Neural Entity Typing and Relation Extraction[C]//Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. 1183–1194
Han S, Kwon S, Yu H et al (2017) Answer ranking based on named entity types for question answering[C]//Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication. 1–4
Das R, Zaheer M, Reddy S et al (2017) Question Answering on Knowledge Bases and Text using Universal Schema and Memory Networks[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 358–365
Collins M, Singer Y (1999) Unsupervised models for named entity classification[C]//1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. 100–110
Cucerzan S, Yarowsky D (1999) Language independent named entity recognition combining morphological and contextual evidence[C]//1999 joint SIGDAT conference on empirical methods in natural language processing and very large corpora. 90–99
Bikel DM, Schwartz R, Weischedel RM (1999) An Algorithm that Learns What’s in a Name[J]. Mach Learn 34(1–3):211–231
Borthwick AE (1999) A maximum entropy approach to named entity recognition[M]. New York University
McCallum A, Li W (2003) Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons[C]//Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003. 188–191
Isozaki H, Kazawa H (2002) : The 19th International Conference on Computational Linguistics. 2002, 1: 1–7
Ling X, Weld DS (2012) Fine-Grained Entity Recognition[C]// Conference on Artificial Intelligence, AAAI, Toronto, Ontario, Canada, July 22–26, 2012. AAAI Press, 94–100
Mintz M, Bills S, Snow R et al (2009) Distant supervision for relation extraction without labeled data[C]//Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Singapore, 2–7 August 2009. Stroudsburg, PA: The Association for Computer Linguistics, 1003–1011
Ji H, Grishman R (2008) Refining event extraction through cross-document inference[C]//46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-08: HLT. 254–262
Lin T, Etzioni O (2012) No noun phrase left behind: detecting and typing unlinkable entities[C]//Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning. 893–903
Dong X, Gabrilovich E, Heitz G et al (2014) Knowledge vault: A web-scale approach to probabilistic knowledge fusion[C]//Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. 601–610
Lee C, Hwang YG, Oh HJ et al (2006) Fine-grained named entity recognition using conditional random fields for question answering[C]//Asia information retrieval symposium. Springer, Berlin, Heidelberg, 581–587
Sekine S (2008) Extended Named Entity Ontology with Attribute Information[C]// Proceedings of the International Conference on Language Resources and Evaluation, LREC 2008, Marrakech, Morocco, 26 May – 1 June 2008. European Language Resources Association,
Yosef MA, Bauer S, Hoffart J et al (2012) Hyena: Hierarchical type classification for entity names[C]// 24th International Conference on Computational Linguistics, COLING, Mumbai, India, 8–15 December 2012. India: Indian Institute of Technology Bombay, 1361–1370
Gillick D, Lazic N, Ganchev K et al (2014) Context-dependent fine-grained entity type tagging[J].Computer Science,
Elman JL (1990) Finding structure in time[J]. Cogn Sci 14(2):179–211
Rosenblatt F (1957) The perceptron, a perceiving and recognizing automaton Project Para[M]. Cornell Aeronautical Laboratory
Yogatama D, Gillick D, Lazic N (2015) Embedding methods for fine grained entity type classification[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL(Volume 2: Short Papers), Beijing, China, July 26–31, 2015. Stroudsburg, PA: The Association for Computer Linguistics, 291–296
Del Corro L, Abujabal A, Gemulla R et al (2015) Conference on Empirical Methods in Natural Language Processing, EMNLP, Lisbon, Portugal, September 17–21, 2015. Stroudsburg, PA: The Association for Computational Linguistics, 2015 868–878
Shimaoka S, Stenetorp P, Inui K et al (2016) An Attentive Neural Architecture for Fine-grained Entity Type Classification[C]//Proceedings of the 5th Workshop on Automated Knowledge Base Construction, AKBC@NAACL-HLT, San Diego, CA, USA, June 17, 2016. Stroudsburg, PA: The Association for Computational Linguistics, 69–74
Dong X, Gabrilovich E, Heitz G et al (2014) Knowledge vault: A web-scale approach to probabilistic knowledge fusion[C]//Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, New York, August 24–27, New York: ACM, 2014. 601–610
Yaghoobzadeh Y, Schütze H (2015) Conference on Empirical Methods in Natural Language Processing. 2015. 715–725
Xu B, Luo Z, Huang L et al (2018) Metic: Multi-instance entity typing from corpus[C]//Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 903–912
Jin H, Hou L, Li J et al (2018) Attributed and predictive entity embedding for fine-grained entity typing in knowledge bases[C]//Proceedings of the 27th international conference on computational linguistics. 282–292
Ren X, He W, Qu M et al (2016) Conference on Empirical Methods in Natural Language Processing, EMNLP, Austin, Texas, USA, November 1–4, 2016. Stroudsburg, PA: The Association for Computational Linguistics, 2016: 1369–1378
Abhishek A, Anand A, Awekar A (2017) Fine-grained entity type classification by jointly learning representations and label embeddings[C]//Proceedings of the 15th Confer-ence of the European Chapter of the Association for Computational Linguistics, EACL, Valencia, Spain, April 3–7, 2017. Stroudsburg, PA: The Association for Computational Linguistics, 797–807
Xu P, Barbosa D (2018) Neural fine-grained entity type classification with hierarchy-aware loss[C]//Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, New Orleans, Louisiana, USA, June 1–6, 2018. Stroudsburg, PA: The Association for Computational Linguistics, 16–25
Chen B, Gu X, Hu Y et al (2019) Improving distantly-supervised entity typing with compact latent space clustering[C]//Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, Minneapolis, MN, USA, June 2–7, 2019. Stroudsburg, PA: The Association for Computational Linguistics, 2862–2872
Xin J, Zhu H, Han X et al (2018) Conference on Empirical Methods in Natural Language Processing, EMNLP, Brussels, Belgium, October 31 - November 4, 2018. Stroudsburg, PA: The Association for Computational Linguistics, 2018: 993–998
Zhang H, Long D, Xu G et al (2020) Learning with Noise: Improving Distantly-Supervised Fine-grained Entity Typing via Automatic Relabeling[C]// Twenty-Ninth International Joint Conference on Artificial Intelligence and Seventeenth Pacific Rim International Conference on Artificial Intelligence, IJCAI, Yokohama, Japan, 2020. California, ijcai.org, 3808–3815
Miyato T, Dai AM, Goodfellow I (2017) Adversarial Training Methods for Semi-Supervised Text Classification[C]// 5th International Conference on Learning Representations, ICLR, Toulon, France, April 24–26,
Madry A, Makelov A, Schmidt L et al (2017) Towards deep learning models resistant to adversarial attacks[J]. arXiv preprint arXiv:1706.06083,
Zhou B, Khashabi D, Tsai CT et al (2018) Zero-shot open entity typing as type-compatible grounding[C]// Conference on Empirical Methods in Natural Language Processing, EMNLP, Brussels, Belgium, October 31 - November 4, 2018. Stroudsburg, PA: The Association for Computational Linguistics, : 2065–2076
Pan X, Cassidy T, Hermjakob U et al (2015) Unsupervised entity linking with abstract meaning representation[C]//Proceedings of the 2015 conference of the north american chapter of the association for computational linguistics: Human language technologies, NAACL-HLT, Denver, Colorado, USA, May 31 - June 5, 2015. Stroudsburg, PA: The Association for Computational Linguistics, : 1130–1139
Weischedel R, Brunstein A (2005) BBN pronoun comention and entity type corpus[J]. Linguistic Data Consortium, Philadelphia, p 112
Pennington J, Socher R, Manning CD, Glove (2014) : Global vectors for word representation[C]//Proceedings of the 2014 conference on empirical methods in natural language processing, EMNLP, Doha, Qatar, October 25–29, 2014. Stroudsburg, PA: The Association for Computational Linguistics, : 1532–1543
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Qi, Z., Wan, T. & Fei, C. A Fine-Grained Entity Typing Method Combined with Features. Neural Process Lett 54, 3793–3809 (2022). https://doi.org/10.1007/s11063-022-10786-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-022-10786-w