当前位置: X-MOL 学术Food Qual. Prefer. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Fast and automated sensory analysis: Using natural language processing for descriptive lexicon development
Food Quality and Preference ( IF 4.9 ) Pub Date : 2020-07-01 , DOI: 10.1016/j.foodqual.2020.103926
Leah M. Hamilton , Jacob Lahne

Abstract As sensory evaluation relies upon humans accurately communicating their sensory experience, the diverse and overlapping vocabulary of flavor descriptors remains a major challenge. The lexicon generation protocols used in methods like Descriptive Analysis are expensive and time-consuming, while the post-facto analyses of natural vocabulary in “quick and dirty” methods like Free Choice or Flash Profiling require considerable subjective decision-making on the part of the analyst. A potential alternative for producing lexicons and analyzing the sensory attributes of products in nonstandardized text can be found in Natural Language Processing (NLP). NLP tools allow for the analysis of larger volumes of free text with fewer subjective decisions. This paper describes the steps necessary to automatically collect, clean, and analyze existing product descriptions from the web. As a case study, online reviews of international whiskies from two prominent websites (2309 reviews from WhiskyCast and 4289 reviews from WhiskyAdvocate) were collected, preprocessed to only retain potentially-descriptive nouns, adjectives, and verbs, and then the final term list was grouped into a flavor wheel using Correspondence Analysis and Agglomerative Hierarchical Clustering. The wheel is compared to an existing Scotch flavor wheel. The ease of collecting nonstandardized descriptions of products and the improved speed of automated methods can facilitate collection of descriptive sensory data for products where no lexicon exists. This has the potential to speed up and standardize many of the bottlenecks in rapid descriptive methods and facilitate the collection and use of very large datasets of product descriptions.

中文翻译:

快速和自动化的感官分析:使用自然语言处理进行描述性词典开发

摘要 由于感官评估依赖于人类准确传达他们的感官体验,因此风味描述符的多样化和重叠词汇仍然是一个重大挑战。描述性分析等方法中使用的词典生成协议既昂贵又耗时,而在“快速而肮脏”的方法(如自由选择或 Flash 分析)中对自然词汇的事后分析需要大量的主观决策。分析师。在自然语言处理 (NLP) 中可以找到在非标准化文本中生成词典​​和分析产品感官属性的潜在替代方案。NLP 工具允许以较少的主观决定分析大量的自由文本。本文描述了自动收集、清洁、并分析来自网络的现有产品描述。作为案例研究,收集了来自两个著名网站(WhiskyCast 的 2309 条评论和 WhiskyAdvocate 的 4289 条评论)对国际威士忌的在线评论,进行预处理以仅保留可能具有描述性的名词、形容词和动词,然后对最终术语列表进行分组使用对应分析和凝聚层次聚类进入风味轮。该轮与现有的苏格兰风味轮进行了比较。收集产品的非标准化描述的便利性和自动化方法的改进速度可以促进对不存在词典的产品的描述性感官数据的收集。
更新日期:2020-07-01
down
wechat
bug