Deep neural networks for choice analysis: A statistical learning theory perspective,Transportation Research Part B: Methodological

当前位置： X-MOL 学术 › Transp. Res. Part B Methodol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Deep neural networks for choice analysis: A statistical learning theory perspective
Transportation Research Part B: Methodological ( IF 6.8 ) Pub Date : 2021-05-03 , DOI: 10.1016/j.trb.2021.03.011
Shenhao Wang , Qingyi Wang , Nate Bailey , Jinhua Zhao

Although researchers increasingly use deep neural networks (DNN) to analyze individual choices, overfitting and interpretability issues remain obstacles in theory and practice. This study presents a statistical learning theoretical framework to examine the tradeoff between estimation and approximation errors, and between the quality of prediction and of interpretation. It provides an upper bound on the estimation error of the prediction quality in DNN, measured by zero-one and log losses, shedding light on why DNN models do not overfit. It proposes a metric for interpretation quality by formulating a function approximation loss that measures the difference between true and estimated choice probability functions. It argues that the binary logit (BNL) and multinomial logit (MNL) models are the specific cases of DNNs, since the latter always has smaller approximation errors. We explore the relative performance of DNN and classical choice models through three simulation scenarios comparing DNN, BNL, and binary mixed logit models (BXL), as well as one experiment comparing DNN to BNL, BXL, MNL, and mixed logit (MXL) in analyzing the choice of trip purposes based on the National Household Travel Survey 2017. The results indicate that DNN can be used for choice analysis beyond the current practice of demand forecasting because it has the inherent utility interpretation and the power of automatically learning utility specification. Our results suggest DNN outperforms BNL, BXL, MNL, and MXL models in both prediction and interpretation when the sample size is large ( $\geq O (10^{4})$ ), the input dimension is high, or the true data generating process is complex, while performing worse when the opposite is true. DNN outperforms BNL and BXL in zero-one, log, and approximation losses for most of the experiments, and the larger sample size leads to greater incremental value of using DNN over classical discrete choice models. Overall, this study introduces the statistical learning theory as a new foundation for high-dimensional data, complex statistical models, and non-asymptotic data regimes in choice analysis, and the experiments show the effective prediction and interpretation of DNN for its applications to policy and behavioral analysis.

中文翻译：

用于选择分析的深度神经网络：一种统计学习理论的观点

尽管研究人员越来越多地使用深度神经网络（DNN）来分析个人选择，但是过拟合和可解释性问题仍然是理论和实践中的障碍。这项研究提出了一种统计学习理论框架，以检验估计误差与近似误差之间以及在预测质量与解释质量之间的权衡。它提供了DNN中预测质量的估计误差的上限（以零一和对数损失衡量），从而阐明了DNN模型不会过拟合的原因。它通过公式化函数近似损失来度量解释真实性和估计的选择概率函数之间的差异，从而为解释质量提出了一个度量标准。它认为二进制logit（BNL）和多项式logit（MNL）模型是DNN的特殊情况，因为后者总是具有较小的近似误差。我们通过比较DNN，BNL和二进制混合logit模型（BXL）的三种模拟方案，以及通过比较DNN与BNL，BXL，MNL和混合logit（MXL）的一项实验，探索DNN和经典选择模型的相对性能。根据《 2017年全国家庭出行调查》对出行目的的选择进行了分析。结果表明，DNN可以用于需求分析之外的当前需求预测方法，因为它具有内在的效用解释和自动学习效用规范的功能。我们的结果表明，当样本量较大时，DNN在预测和解释方面都优于BNL，BXL，MNL和MXL模型（我们通过比较DNN，BNL和二进制混合logit模型（BXL）的三种模拟方案，以及通过比较DNN与BNL，BXL，MNL和混合logit（MXL）的一项实验，探索DNN和经典选择模型的相对性能。根据《 2017年全国家庭出行调查》对出行目的的选择进行了分析。结果表明，DNN可以用于需求分析之外的当前需求预测方法，因为它具有内在的效用解释和自动学习效用规范的功能。我们的结果表明，当样本量较大时，DNN在预测和解释方面都优于BNL，BXL，MNL和MXL模型（我们通过比较DNN，BNL和二进制混合logit模型（BXL）的三种模拟方案，以及通过比较DNN与BNL，BXL，MNL和混合logit（MXL）的一项实验，探索DNN和经典选择模型的相对性能。根据《 2017年全国家庭出行调查》对出行目的的选择进行了分析。结果表明，DNN可以用于需求分析之外的当前需求预测方法，因为它具有内在的效用解释和自动学习效用规范的功能。我们的结果表明，当样本量较大时，DNN在预测和解释方面都优于BNL，BXL，MNL和MXL模型（和混合logit（MXL）在基于《 2017年美国家庭出行调查》的出行目的选择分析中。结果表明，DNN不仅可以用于当前的需求预测实践，还可以用于选择分析，因为它具有内在的效用解释和强大的功能。自动学习实用程序规范。我们的结果表明，当样本量较大时，DNN在预测和解释方面都优于BNL，BXL，MNL和MXL模型（和混合logit（MXL）在基于《 2017年美国家庭出行调查》的出行目的选择分析中。结果表明，DNN不仅可以用于当前的需求预测实践，还可以用于选择分析，因为它具有内在的效用解释和强大的功能。自动学习实用程序规范。我们的结果表明，当样本量较大时，DNN在预测和解释方面都优于BNL，BXL，MNL和MXL模型（ $\geq Ø （ 10^{4} ）$ ），则输入维数较高，或者真实数据生成过程很复杂，而当反之成立时，输入数据的性能就会变差。在大多数实验中，DNN在零一，对数和逼近损失方面均优于BNL和BXL，并且与传统的离散选择模型相比，较大的样本量导致使用DNN的增量值更大。总体而言，本研究引入了统计学习理论，将其作为选择分析中高维数据，复杂统计模型和非渐近数据体制的新基础，并且实验证明了DNN在其应用于政策和决策方面的有效预测和解释。行为分析。

更新日期：2021-05-03

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文

全部期刊列表>>