当前位置: X-MOL 学术J. R. Stat. Soc. A › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Estimating the number of persons with HIV in jails via web scraping and record linkage
The Journal of the Royal Statistical Society, Series A (Statistics in Society) ( IF 2 ) Pub Date : 2022-08-10 , DOI: 10.1111/rssa.12909
Bonnie E Shook-Sa 1 , Michael G Hudgens 1 , Andrew L Kavee 2 , David L Rosen 3
Affiliation  

This paper presents methods to estimate the number of persons with HIV in North Carolina jails by applying finite population inferential approaches to data collected using web scraping and record linkage techniques. Administrative data are linked with web-scraped rosters of incarcerated persons in a non-random subset of counties. Outcome regression and calibration weighting are adapted for state-level estimation. Methods are compared in simulations and are applied to data from the US state of North Carolina. Outcome regression yielded more precise inference and allowed for county-level estimates, an important study objective, while calibration weighting exhibited double robustness under misspecification of the outcome or weight model.

中文翻译:

通过网络抓取和记录链接估计监狱中的艾滋病毒感染者人数

本文提出了通过对使用网络抓取和记录链接技术收集的数据应用有限人口推理方法来估计北卡罗来纳州监狱中艾滋病毒感染者人数的方法。行政数据与从网络上抓取的非随机县子集中的被监禁者名册相关联。结果回归和校准权重适用于州级估计。在模拟中对方法进行了比较,并将其应用于来自美国北卡罗来纳州的数据。结果回归产生了更精确的推论,并允许进行县级估计,这是一个重要的研究目标,而校准权重在结果或权重模型的错误指定下表现出双重稳健性。
更新日期:2022-08-10
down
wechat
bug