当前位置: X-MOL 学术Stat. Model. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Semi-supervised clustering of time-dependent categorical sequences with application to discovering education-based life patterns
Statistical Modelling ( IF 1 ) Pub Date : 2021-03-08 , DOI: 10.1177/1471082x21989170
Yingying Zhang 1 , Volodymyr Melnykov 2 , Igor Melnykov 3
Affiliation  

A new approach to the analysis of heterogeneous categorical sequences is proposed. The first-order Markov model is employed in a finite mixture setting with initial state and transition probabilities being expressed as functions of time. The expectation–maximization algorithm approach to parameter estimation is implemented in the presence of positive equivalence constraints that determine which observations must be placed in the same class in the solution. The proposed model is applied to a dataset from the British Household Panel Survey to evaluate the association between the education background and life outcomes of study participants. The analysis of the survey data reveals many interesting relationships between the level of education and major life events.



中文翻译:

基于时间的分类序列的半监督聚类及其在发现基于教育的生活模式中的应用

提出了一种分析异构分类序列的新方法。一阶马尔可夫模型用于有限混合设置,其初始状态和跃迁概率表示为时间的函数。参数估计的期望最大化算法方法是在存在正等价约束的情况下实施的,该约束确定必须将哪些观测值放入解决方案的同一类中。拟议的模型应用于英国家庭调查的数据集,以评估受教育者的教育背景和生活成果之间的关联。对调查数据的分析揭示了教育水平与重大生活事件之间的许多有趣关系。

更新日期:2021-03-09
down
wechat
bug