Breathing new life into death certificates: Extracting handwritten cause of death in the LIFE-M project
Explorations in Economic History (IF 1.857), Pub Date: 2022-09-13, DOI: 10.1016/j.eeh.2022.101474
Martha J Bailey, Susan H Leonard, Joseph Price, Evan Roberts, Logan Spector, Mengying Zhang

The demographic and epidemiological transitions of the past 200 years are well documented at an aggregate level. Understanding differences in individual and group risks for mortality during these transitions requires linkage between demographic data and detailed individual cause of death information. This paper describes the digitization of almost 185,000 causes of death for Ohio to supplement demographic information in the Longitudinal, Intergenerational Family Electronic Micro-database (LIFE-M). To extract causes of death, our methodology combines handwriting recognition, extensive data cleaning algorithms, and the semi-automated classification of causes of death into International Classification of Diseases (ICD) codes. Our procedures are adaptable to other collections of handwritten data, which require both handwriting recognition and semi-automated coding of the information extracted.
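The semi-automated classification step can be pictured with a minimal sketch. It is not the LIFE-M pipeline: the lookup table, the `clean` and `classify` helpers, and the 0.8 similarity cutoff are all assumptions made for illustration, and the handful of ICD-10 codes shown may not match the ICD revision the project used. The idea is only that transcribed strings are normalized, matched against known terms with some tolerance for transcription errors, and left for manual review when no confident match exists.

```python
import difflib
import re

# Illustrative lookup only: a few ICD-10 codes, not the project's actual table.
ICD_LOOKUP = {
    "pneumonia": "J18.9",
    "typhoid fever": "A01.0",
    "diphtheria": "A36.9",
    "whooping cough": "A37.9",
}


def clean(text):
    """Lowercase, replace non-letter characters with spaces, collapse whitespace."""
    text = re.sub(r"[^a-z\s]", " ", text.lower())
    return re.sub(r"\s+", " ", text).strip()


def classify(raw, cutoff=0.8):
    """Return (icd_code, matched_term), or (None, None) to flag for manual review."""
    cleaned = clean(raw)
    if cleaned in ICD_LOOKUP:
        return ICD_LOOKUP[cleaned], cleaned
    # Tolerate transcription errors by taking the closest known term, if any
    # reaches the (assumed) similarity cutoff.
    match = difflib.get_close_matches(cleaned, list(ICD_LOOKUP), n=1, cutoff=cutoff)
    if match:
        return ICD_LOOKUP[match[0]], match[0]
    return None, None


for transcription in ["Pneumonia.", "Typhiod  Fever", "accident"]:
    print(transcription, "->", classify(transcription))
```

Entries that come back as `(None, None)` are the ones a human coder would handle, which is what makes the approach semi-automated rather than fully automatic.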




Updated: 2022-09-14