Skip to main content
Log in

A Fine-Grained Entity Typing Method Combined with Features

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

Using the fine-grained entity typing method of distant supervision, when assigning type labels to entity mention, since the knowledge base contains all type labels of the entity, noisy labels will be introduced. This paper proposed a Fine-grained Entity Typing model combined with Features (FETF) to reduce the negative impact of noisy labels. It is different from the previous methods such as manual annotation and heuristic rule pruning. The model not only improves the classification efficiency, but also does not need to reduce the size of the training set, which can improve the overall performance of the classification model. FETF divides the training set into clean dataset and noisy dataset according to the type numbers of entity mention in candidate type set, and constructs different objective functions for them to achieve the purpose of reducing the impact of noisy labels. At the same time, FETF can use the feature generator to jointly learn the relational features of entity mention - type label, as well as the similarity and hierarchical features of type label - type label. The feature generator extracts semantic features other than context, so as to help the fine-grained entity typing model to assign type labels to entity mention. In addition, we introduce adversarial training in the context processor, which can effectively alleviate the model overfitting noisy labels, and improve the robustness and generalization ability of the model. Experimental results on the public datasets show that the method proposed in this paper can effectively alleviate the negative impact of noisy labels on the fine-grained entity typing model, and outperforms previous methods in accuracy, Macro F1 value and Micro F1 value.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. The baselines results are reported on [36].

References

  1. Nadeau D, Sekine S (2007) A survey of named entity recognition and classification[J]. Lingvisticae Investigationes 30(1):3–26

    Article  Google Scholar 

  2. Stern R, Sagot B, Béchet F (2012) A joint named entity recognition and entity linking system[C]// In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, EACL, Avignon, France, April 23–27 2012. 52–60

  3. Liu Y, Liu K, Xu L et al (2014) Exploring fine-grained entity type constraints for distantly supervised relation extraction[C]//Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 2107–2116

  4. Yaghoobzadeh Y, Adel H, Schütze H (2017) Noise Mitigation for Neural Entity Typing and Relation Extraction[C]//Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. 1183–1194

  5. Han S, Kwon S, Yu H et al (2017) Answer ranking based on named entity types for question answering[C]//Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication. 1–4

  6. Das R, Zaheer M, Reddy S et al (2017) Question Answering on Knowledge Bases and Text using Universal Schema and Memory Networks[C]//Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 358–365

  7. Collins M, Singer Y (1999) Unsupervised models for named entity classification[C]//1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. 100–110

  8. Cucerzan S, Yarowsky D (1999) Language independent named entity recognition combining morphological and contextual evidence[C]//1999 joint SIGDAT conference on empirical methods in natural language processing and very large corpora. 90–99

  9. Bikel DM, Schwartz R, Weischedel RM (1999) An Algorithm that Learns What’s in a Name[J]. Mach Learn 34(1–3):211–231

    Article  Google Scholar 

  10. Borthwick AE (1999) A maximum entropy approach to named entity recognition[M]. New York University

  11. McCallum A, Li W (2003) Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons[C]//Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003. 188–191

  12. Isozaki H, Kazawa H (2002) : The 19th International Conference on Computational Linguistics. 2002, 1: 1–7

  13. Ling X, Weld DS (2012) Fine-Grained Entity Recognition[C]// Conference on Artificial Intelligence, AAAI, Toronto, Ontario, Canada, July 22–26, 2012. AAAI Press, 94–100

  14. Mintz M, Bills S, Snow R et al (2009) Distant supervision for relation extraction without labeled data[C]//Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Singapore, 2–7 August 2009. Stroudsburg, PA: The Association for Computer Linguistics, 1003–1011

  15. Ji H, Grishman R (2008) Refining event extraction through cross-document inference[C]//46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-08: HLT. 254–262

  16. Lin T, Etzioni O (2012) No noun phrase left behind: detecting and typing unlinkable entities[C]//Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning. 893–903

  17. Dong X, Gabrilovich E, Heitz G et al (2014) Knowledge vault: A web-scale approach to probabilistic knowledge fusion[C]//Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. 601–610

  18. Lee C, Hwang YG, Oh HJ et al (2006) Fine-grained named entity recognition using conditional random fields for question answering[C]//Asia information retrieval symposium. Springer, Berlin, Heidelberg, 581–587

  19. Sekine S (2008) Extended Named Entity Ontology with Attribute Information[C]// Proceedings of the International Conference on Language Resources and Evaluation, LREC 2008, Marrakech, Morocco, 26 May – 1 June 2008. European Language Resources Association,

  20. Yosef MA, Bauer S, Hoffart J et al (2012) Hyena: Hierarchical type classification for entity names[C]// 24th International Conference on Computational Linguistics, COLING, Mumbai, India, 8–15 December 2012. India: Indian Institute of Technology Bombay, 1361–1370

  21. Gillick D, Lazic N, Ganchev K et al (2014) Context-dependent fine-grained entity type tagging[J].Computer Science,

  22. Elman JL (1990) Finding structure in time[J]. Cogn Sci 14(2):179–211

    Article  Google Scholar 

  23. Rosenblatt F (1957) The perceptron, a perceiving and recognizing automaton Project Para[M]. Cornell Aeronautical Laboratory

  24. Yogatama D, Gillick D, Lazic N (2015) Embedding methods for fine grained entity type classification[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL(Volume 2: Short Papers), Beijing, China, July 26–31, 2015. Stroudsburg, PA: The Association for Computer Linguistics, 291–296

  25. Del Corro L, Abujabal A, Gemulla R et al (2015) Conference on Empirical Methods in Natural Language Processing, EMNLP, Lisbon, Portugal, September 17–21, 2015. Stroudsburg, PA: The Association for Computational Linguistics, 2015 868–878

  26. Shimaoka S, Stenetorp P, Inui K et al (2016) An Attentive Neural Architecture for Fine-grained Entity Type Classification[C]//Proceedings of the 5th Workshop on Automated Knowledge Base Construction, AKBC@NAACL-HLT, San Diego, CA, USA, June 17, 2016. Stroudsburg, PA: The Association for Computational Linguistics, 69–74

  27. Dong X, Gabrilovich E, Heitz G et al (2014) Knowledge vault: A web-scale approach to probabilistic knowledge fusion[C]//Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, New York, August 24–27, New York: ACM, 2014. 601–610

  28. Yaghoobzadeh Y, Schütze H (2015) Conference on Empirical Methods in Natural Language Processing. 2015. 715–725

  29. Xu B, Luo Z, Huang L et al (2018) Metic: Multi-instance entity typing from corpus[C]//Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 903–912

  30. Jin H, Hou L, Li J et al (2018) Attributed and predictive entity embedding for fine-grained entity typing in knowledge bases[C]//Proceedings of the 27th international conference on computational linguistics. 282–292

  31. Ren X, He W, Qu M et al (2016) Conference on Empirical Methods in Natural Language Processing, EMNLP, Austin, Texas, USA, November 1–4, 2016. Stroudsburg, PA: The Association for Computational Linguistics, 2016: 1369–1378

  32. Abhishek A, Anand A, Awekar A (2017) Fine-grained entity type classification by jointly learning representations and label embeddings[C]//Proceedings of the 15th Confer-ence of the European Chapter of the Association for Computational Linguistics, EACL, Valencia, Spain, April 3–7, 2017. Stroudsburg, PA: The Association for Computational Linguistics, 797–807

  33. Xu P, Barbosa D (2018) Neural fine-grained entity type classification with hierarchy-aware loss[C]//Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, New Orleans, Louisiana, USA, June 1–6, 2018. Stroudsburg, PA: The Association for Computational Linguistics, 16–25

  34. Chen B, Gu X, Hu Y et al (2019) Improving distantly-supervised entity typing with compact latent space clustering[C]//Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, Minneapolis, MN, USA, June 2–7, 2019. Stroudsburg, PA: The Association for Computational Linguistics, 2862–2872

  35. Xin J, Zhu H, Han X et al (2018) Conference on Empirical Methods in Natural Language Processing, EMNLP, Brussels, Belgium, October 31 - November 4, 2018. Stroudsburg, PA: The Association for Computational Linguistics, 2018: 993–998

  36. Zhang H, Long D, Xu G et al (2020) Learning with Noise: Improving Distantly-Supervised Fine-grained Entity Typing via Automatic Relabeling[C]// Twenty-Ninth International Joint Conference on Artificial Intelligence and Seventeenth Pacific Rim International Conference on Artificial Intelligence, IJCAI, Yokohama, Japan, 2020. California, ijcai.org, 3808–3815

  37. Miyato T, Dai AM, Goodfellow I (2017) Adversarial Training Methods for Semi-Supervised Text Classification[C]// 5th International Conference on Learning Representations, ICLR, Toulon, France, April 24–26,

  38. Madry A, Makelov A, Schmidt L et al (2017) Towards deep learning models resistant to adversarial attacks[J]. arXiv preprint arXiv:1706.06083,

  39. Zhou B, Khashabi D, Tsai CT et al (2018) Zero-shot open entity typing as type-compatible grounding[C]// Conference on Empirical Methods in Natural Language Processing, EMNLP, Brussels, Belgium, October 31 - November 4, 2018. Stroudsburg, PA: The Association for Computational Linguistics, : 2065–2076

  40. Pan X, Cassidy T, Hermjakob U et al (2015) Unsupervised entity linking with abstract meaning representation[C]//Proceedings of the 2015 conference of the north american chapter of the association for computational linguistics: Human language technologies, NAACL-HLT, Denver, Colorado, USA, May 31 - June 5, 2015. Stroudsburg, PA: The Association for Computational Linguistics, : 1130–1139

  41. Weischedel R, Brunstein A (2005) BBN pronoun comention and entity type corpus[J]. Linguistic Data Consortium, Philadelphia, p 112

    Google Scholar 

  42. Pennington J, Socher R, Manning CD, Glove (2014) : Global vectors for word representation[C]//Proceedings of the 2014 conference on empirical methods in natural language processing, EMNLP, Doha, Qatar, October 25–29, 2014. Stroudsburg, PA: The Association for Computational Linguistics, : 1532–1543

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tao Wan.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Qi, Z., Wan, T. & Fei, C. A Fine-Grained Entity Typing Method Combined with Features. Neural Process Lett 54, 3793–3809 (2022). https://doi.org/10.1007/s11063-022-10786-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-022-10786-w

Keywords

Navigation