An Exponential Factorization Machine with Percentage Error Minimization to Retail Sales Forecasting,arXiv - CS - Information Retrieval

当前位置： X-MOL 学术 › arXiv.cs.IR › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

An Exponential Factorization Machine with Percentage Error Minimization to Retail Sales Forecasting
arXiv - CS - Information Retrieval Pub Date : 2020-09-22 , DOI: arxiv-2009.10619
Chongshou Li, Brenda Cheang, Zhixing Luo and Andrew Lim

This paper proposes a new approach to sales forecasting for new products with long lead time but short product life cycle. These SKUs are usually sold for one season only, without any replenishments. An exponential factorization machine (EFM) sales forecast model is developed to solve this problem which not only considers SKU attributes, but also pairwise interactions. The EFM model is significantly different from the original Factorization Machines (FM) from two-fold: (1) the attribute-level formulation for explanatory variables and (2) exponential formulation for the positive response variable. The attribute-level formation excludes infeasible intra-attribute interactions and results in more efficient feature engineering comparing with the conventional one-hot encoding, while the exponential formulation is demonstrated more effective than the log-transformation for the positive but not skewed distributed responses. In order to estimate the parameters, percentage error squares (PES) and error squares (ES) are minimized by a proposed adaptive batch gradient descent method over the training set. Real-world data provided by a footwear retailer in Singapore is used for testing the proposed approach. The forecasting performance in terms of both mean absolute percentage error (MAPE) and mean absolute error (MAE) compares favourably with not only off-the-shelf models but also results reported by extant sales and demand forecasting studies. The effectiveness of the proposed approach is also demonstrated by two external public datasets. Moreover, we prove the theoretical relationships between PES and ES minimization, and present an important property of the PES minimization for regression models; that it trains models to underestimate data. This property fits the situation of sales forecasting where unit-holding cost is much greater than the unit-shortage cost.

中文翻译：

具有最小百分比误差的零售销售预测指数分解机

本文提出了一种新的销售预测方法，用于对提前期长但产品生命周期短的新产品进行销售预测。这些 SKU 通常只售一季，没有任何补货。为了解决这个问题，开发了指数分解机（EFM）销售预测模型，该模型不仅考虑了 SKU 属性，还考虑了成对交互。EFM 模型与原始因子分解机 (FM) 有两个显着区别：(1) 解释变量的属性级公式和 (2) 正响应变量的指数公式。与传统的 one-hot 编码相比，属性级形成排除了不可行的属性内交互，并导致更有效的特征工程，而对于正但非偏态分布响应，指数公式被证明比对数转换更有效。为了估计参数，百分比误差平方 (PES) 和误差平方 (ES) 通过在训练集上提出的自适应批量梯度下降方法最小化。新加坡一家鞋类零售商提供的真实世界数据用于测试所提出的方法。平均绝对百分比误差 (MAPE) 和平均绝对误差 (MAE) 方面的预测性能不仅与现成模型相比，而且与现有销售和需求预测研究报告的结果相比都具有优势。两个外部公共数据集也证明了所提出方法的有效性。此外，我们证明了 PES 和 ES 最小化之间的理论关系，并提出回归模型的 PES 最小化的一个重要特性；它训练模型以低估数据。此属性适合销售预测的情况，即单位持有成本远大于单位短缺成本。

更新日期：2020-09-23

点击分享查看原文

点击收藏

阅读更多本刊最新论文

全部期刊列表>>