Methods for big data in social sciences
Mathematical Population Studies (IF 1.4), Pub Date: 2019-04-03, DOI: 10.1080/08898480.2019.1597577
Enrica Amaturo, Biagio Aragona

The diffusion of digital technologies and social networks has multiplied the forms of digital data that can be employed for social research. The two main forms are native digital data, which are produced in social networks, search engines, or blogs, and digitized data, which are analog data converted into digital form (Rogers, 2013). Big data are originally produced on the Internet. They make it possible to analyze behaviors without interfering with individuals (Webb et al., 1966). One example is the data used in web platform analytics, such as Google Correlate, whose purpose is to reveal the co-occurrences associated with a keyword searched through the Google search engine. This tool helped to predict the flu epidemic in the US well before the US Centers for Disease Control and Prevention did (Ginsberg et al., 2009). This example shows that digital web platforms enable innovations in data analysis. Another example of native digital data is the data voluntarily uploaded to social networks, blogs, and websites. These data are mainly textual or visual (images and videos) and often unstructured. A third example is transactional data and the Internet of things. Transactions made through digital devices, such as smartphones, scanners, tablets, and chip cards (credit cards, shopping cards), produce data with some structure. These data comprise metadata (date, time, duration, or expenditure) associated with the transactions. Objects connected to the Internet (the Internet of things), such as sensors for health monitoring, home automation, and driving assistance, usually produce structured data, which can be organized and analyzed.

Digitized data previously existed in analog form, for example images, videos, and scanned or digitally photographed documents uploaded to the web, such as museum collections or libraries available online. The digital humanities have converted this material into digital form. Another example is computer-assisted surveys, in which the data are entered into digital databases. Web surveys are now conducted through the Internet (by e-mail) (Amaturo and Aragona, 2016) and make it possible to reach a large sample on a small budget.

Updated: 2019-04-03