当前位置: X-MOL 学术J. Choice Model. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Estimation of discrete choice models with hybrid stochastic adaptive batch size algorithms
Journal of Choice Modelling ( IF 2.8 ) Pub Date : 2020-08-22 , DOI: 10.1016/j.jocm.2020.100226
Gael Lederrey , Virginie Lurkin , Tim Hillel , Michel Bierlaire

The emergence of Big Data has enabled new research perspectives in the discrete choice community. While the techniques to estimate Machine Learning models on a massive amount of data are well established, these have not yet been fully explored for the estimation of statistical Discrete Choice Models based on the random utility framework. In this article, we provide new ways of dealing with large datasets in the context of Discrete Choice Models. We achieve this by proposing new efficient stochastic optimization algorithms and extensively testing them alongside existing approaches. We develop these algorithms based on three main contributions: the use of a stochastic Hessian, the modification of the batch size, and a change of optimization algorithm depending on the batch size. A comprehensive experimental comparison of fifteen optimization algorithms is conducted across ten benchmark Discrete Choice Model cases. The results indicate that the HAMABS algorithm, a hybrid adaptive batch size stochastic method, is the best performing algorithm across the optimization benchmarks. This algorithm speeds up the optimization time by a factor of 23 on the largest model compared to existing algorithms used in practice. The integration of the new algorithms in Discrete Choice Models estimation software will significantly reduce the time required for model estimation and therefore enable researchers and practitioners to explore new approaches for the specification of choice models.



中文翻译:

混合随机自适应批次大小算法的离散选择模型估计

大数据的出现为离散选择社区提供了新的研究视角。尽管已经建立了在大量数据上估计机器学习模型的技术,但尚未充分探索这些技术来基于随机效用框架估计统计离散选择模型。在本文中,我们提供了在离散选择模型的上下文中处理大型数据集的新方法。我们通过提出新的高效随机优化算法并与现有方法一起对其进行广泛测试来实现这一目标。我们基于以下三个主要方面来开发这些算法:随机Hessian的使用,批大小的修改以及根据批大小更改的优化算法。在十个基准离散选择模型案例中进行了十五种优化算法的综合实验比较。结果表明,HAMABS算法是一种混合的自适应批次大小随机方法,在整个优化基准测试中是性能最好的算法。与实际使用的现有算法相比,该算法在最大模型上的优化时间缩短了23倍。离散选择模型估计软件中新算法的集成将显着减少模型估计所需的时间,因此使研究人员和从业人员能够探索用于选择模型规范的新方法。混合自适应批次大小随机方法是优化基准上性能最好的算法。与实际使用的现有算法相比,该算法在最大模型上的优化时间缩短了23倍。在离散选择模型估计软件中集成新算法将大大减少模型估计所需的时间,因此使研究人员和从业人员能够探索用于选择模型规范的新方法。混合自适应批次大小随机方法是优化基准上性能最好的算法。与实际使用的现有算法相比,该算法在最大模型上的优化时间缩短了23倍。在离散选择模型估计软件中集成新算法将大大减少模型估计所需的时间,因此使研究人员和从业人员能够探索用于选择模型规范的新方法。

更新日期:2020-08-22
down
wechat
bug