当前位置: X-MOL 学术IEEE Trans. Neural Netw. Learn. Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Combining Knowledge Graph and Word Embeddings for Spherical Topic Modeling
IEEE Transactions on Neural Networks and Learning Systems ( IF 10.4 ) Pub Date : 2021-09-24 , DOI: 10.1109/tnnls.2021.3112045
Hafsa Ennajari , Nizar Bouguila , Jamal Bentahar

Probabilistic topic models are considered as an effective framework for text analysis that uncovers the main topics in an unlabeled set of documents. However, the inferred topics by traditional topic models are often unclear and not easy to interpret because they do not account for semantic structures in language. Recently, a number of topic modeling approaches tend to leverage domain knowledge to enhance the quality of the learned topics, but they still assume a multinomial or Gaussian document likelihood in the Euclidean space, which often results in information loss and poor performance. In this article, we propose a Bayesian embedded spherical topic model (ESTM) that combines both knowledge graph and word embeddings in a non-Euclidean curved space, the hypersphere, for better topic interpretability and discriminative text representations. Extensive experimental results show that our proposed model successfully uncovers interpretable topics and learns high-quality text representations useful for common natural language processing (NLP) tasks across multiple benchmark datasets.

中文翻译:

结合知识图和词嵌入进行球形主题建模

概率主题模型被认为是文本分析的有效框架,可以揭示未标记文档集中的主要主题。然而,传统主题模型推断的主题往往不清楚且不易解释,因为它们没有考虑语言中的语义结构。最近,许多主题建模方法倾向于利用领域知识来提高所学习主题的质量,但它们仍然假设欧几里得空间中的多项式或高斯文档似然,这通常会导致信息丢失和性能不佳。在本文中,我们提出了一种贝叶斯嵌入式球形主题模型(ESTM),它将知识图和词嵌入结合在非欧几里得弯曲空间(超球面)中,以实现更好的主题可解释性和判别性文本表示。
更新日期:2021-09-24
down
wechat
bug