A temporal graph framework for intelligence extraction in social media networks
Introduction
Flexibility and widespread adoption have made social media (SM) a powerful means of communication and collaboration in modern organizations [1]. Businesses increasingly use social media networks (SMNs) to support relationship development and network tie formation between knowledge workers and assets [2], such as software, patents, and expertise. An SMN is a digitized network of entities that represent corresponding entities connected socially in the physical world [2], [3]. For example, software developers discuss program code issues and vulnerabilities in SMNs (e.g., on GitHub) to facilitate timely resolution. Marketers promote their brands in SMNs (e.g., on Facebook) to increase customer recruitment and loyalty. Scientists participate in SMNs (e.g., on ResearchGate) to learn about innovations and patents. These SMNs produce tremendous benefits to organizations and economies [4] and are estimated to create more than $1 trillion in value because of enhanced communication and improved collaboration [1]. As more knowledge workers are engaged in SMNs, it is instructive to identify and predict activity trends from these networks for various applications such as enhancing social cybersecurity[5] and connecting heterogeneous expertise [6]. In the software development profession, it has been found that social media (SM) enables a much faster identification of software vulnerabilities (nearly 90 days earlier) than using the National Vulnerabilities Database (NVD) maintained by the US government [7]. Despite this, accurately extracting intelligence from complex, evolving SMNs over time can challenge the management and protection of valuable knowledge assets.
SMNs allow knowledge workers to interact with each other through collaborating on related knowledge assets over time. The term “knowledge asset” encompasses companies’ core competencies, areas of expertise, and deep pools of talents [6]. Increased diffusion of knowledge assets due to SMN usage is challenging the management of these assets. For example, GitHub SMNs are widely used to support collaborative software development in financial transaction using cryptocurrency, which relies on decentralized computer software programs to manage transactions and issue money. The software validates a public list of records (known as “blockchain”) to ensure security. In recent years, interest in cryptocurrency software has increased dramatically [8], [9] (e.g., Bitcoin was estimated to account for $76 billion of illegal transactions (or 46% of all its transactions) per year [10]). Prices of cryptocurrencies can vary dramatically because of illegal activities (e.g., hacking, theft) and shifting investor sentiment [11], [12]. As developers of cryptocurrency software interact in SMNs [9], identifying activity trends and extracting intelligence from the online community can be difficult, challenging the state-of-the-art technology [13]. These trends and intelligence could help managers to spot disparate areas of expertise and enhance cybersecurity. The GitHub SMNs consist of user-repository connections, in which knowledge assets are the know-how. Other examples of using SMNs in managing knowledge assets (and the type of ties) include organizational expertise networks (expert-topic), talent networks (people-job), and online review networks (item-review).
Traditional research on SMNs focuses primarily on interpersonal interactions [4] in which each node is assumed to be a human user and each link is a tie between two users. While this assumption conforms with the dominant theoretical lens for information systems (IS) research [14], it excludes the possibility of non-human nodes and ties connecting a non-human agent delegated or created by human users. Recent advances in IS and artificial intelligence (AI) have greatly facilitated human capabilities to manage knowledge assets by overcoming this “human-centric” assumption, allowing managers and developers to delegate tasks to non-human agents in SMNs. Inspired by the agent interaction theories and the IS delegation theoretical framework [15], we consider an SMN to consist of rich and dynamic interactions of participating human and non-human entities. This view allows a more flexible modeling of SMNs and supports the development of a novel framework for managing knowledge assets, as explained in the following sections.
In addition, some prior studies in social network analysis assume the number and properties of network nodes to be static (i.e., unchanged) over time [16]. By contrast, real-world SMNs seldom follow this assumption. Human behavior is often affected by information timeliness, such as recency, primary, trends and periodicity, whose behavioral effect on SMNs is not well understood. This lack of understanding makes it difficult to use knowledge assets strategically or identify cyber threats effectively (e.g., [17]). However, existing models lack the capability to accurately represent evolving SMNs (e.g., data are not modeled as evolving SMNs over time [18], [19]), such as those of cryptocurrency software development communities. Extant theories, though useful in traditional social settings, are not widely used in modeling dynamic SMNs [20].
This research developed a theoretically grounded temporal graph framework for extracting cyber intelligence in dynamic SMNs. We define “cyber intelligence” as the result of “acquisition, interpretation, collation, assessment, and exploitation of information” [21], [22] in SM and SMNs. The framework provides theoretically-based constructs to model collective behavior and temporal effects in SMNs, and supports flexible specification of reference history to learn from past behavioral data. Based on the framework, four models were developed to represent human behavior of conforming to collective norms and temporal effects of information (recency and primacy, trends, and periodicity). The models were used to predict and simulate the activities of cryptocurrency software SMNs extracted from GitHub cryptocurrency software repositories. Activity history of the SMNs and pricing information of the related cryptocurrencies are combined dynamically to enhance the accuracy of prediction and fidelity of simulation. The research seeks to answer these questions: (1) What are the factors affecting SMN behavior? (2) How can these factors be modeled to explain and predict SMN behavior? (3) How do the models perform in predicting SMN behavior of a cryptocurrency software development community?
In our experiments, the models were used to predict and simulate the SMN activities of 83,536 repositories related to three cryptocurrencies: Bitcoin, Monero, and Ethereum. Each model’s predictions were compared against the ground truth (actual data) to calculate various performance measures and metrics pertaining to contribution, influence, popularity, and engagement. The measures were then compared across 18 configurations of the models to provide insights into their performance. We believe the research contributes to (1) developing new framework and models of SMN behavior, (2) providing a new application and a three-step process of model deployment for diverse domains such as cybersecurity and knowledge management (KM), (3) increasing situation awareness on cryptocurrency software SMNs, and (4) providing a new example of SMN research in the IS field.
Section snippets
Literature Review
Dynamic social networks have been studied in diverse disciplines, including mathematics [23], physics [24], computer science [25], [26], IS [4], [27], statistics [28], sociology [29], and economics [30], among others. Various theories and methods have been developed to facilitate understanding and prediction of collective behavior, network dynamics, nodal behavior, and link structure. The prevalence of SM promotes further interest in the modeling of human behavior in dynamic SMNs, prompting the
A Temporal Graph Framework
This research developed a temporal graph framework for modeling dynamic social networks and for predicting future SMN trends and activities, with a goal to enhance the management and security of knowledge assets in SMNs. Developed based on a design science paradigm [80], the framework incorporates all the core components (influence, collective, network, and time) identified in the literature review to build predictive models of dynamic SMN behavior. Emphasis is placed on modeling networks as
Experiments on SMN Prediction
This section describes the experiments conducted to study the effectiveness of the models on predicting and simulating cryptocurrency software SMN activities. The following provides the experimental goals and design, evaluation methods, performance measures, and experimental findings.
Experimental Findings
This section reports the results of evaluating the models’ performance in accurately predicting cryptocurrency software SMN activities and discusses the implication of the findings. Overall, REM model variants achieved the best performance in 9 (out of 18) measures. The following sections provide detailed results that are grouped into four social aspects explained above.
Conclusion
A powerful and popular tool for managing professional and virtual work, SMNs enable the formation of complex networks of knowledge workers and related assets, thereby facilitating understanding of human behavior in virtual organizations. For example, modeling the dynamic SMNs of cryptocurrency software may provide cyber intelligence and increase situation awareness. However, the factors affecting behavioral trends and activity levels in these networks are not well understood. This research
CRediT authorship contribution statement
Wingyan Chung: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data curation, Writing – original draft, Writing – review & editing, Project administration, Visualization, Supervision, Funding acquisition. Vincent S. Lai: Conceptualization, Methodology, Resources, Funding acquisition, Writing – review & editing, Project administration.
Acknowledgements
The authors thank the journal editors and reviewers for their valuable comments and input.
Dr. Wingyan Chung is Professor of Computer Science in Soules College of Business at The University of Texas at Tyler. His scholarly interests and expertise include business analytics, machine learning, social media analytics, cybersecurity, network science, data science, knowledge management, and human-computer interaction. He has published extensively in IS, CS, and scientific journals, including Journal of Management Information Systems, Information & Management, ACM Transactions on MIS,
References (95)
- et al.
Tensions and ties in social media networks: Towards a model of understanding business relationship development and business performance enhancement through the use of linkedin
Industrial Marketing Management
(2016) - et al.
Social Network Analysis: methods and applications
(1994) Toward a theory of behavioral contagion
Psychological Review
(1966)Technology and Society
(2016)- et al.
Probabilistic community and role model for social networks
Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining
(2015) - et al.
Temporal link prediction using matrix and tensor factorizations
ACM Transactions on Knowledge Discovery from Data
(2011) - et al.
Capturing business value with social technologies
McKinsey Quarterly
(2012) - et al.
Interaction models for detecting nodal activities in temporal social media networks
ACM Transactions on Management Information Systems
(2019) - et al.
What’s different about social media networks? A framework and research agenda
MIS Quarterly
(2014) - et al.
Social cybersecurity
Military Review
(2019)
Managing your mission-critical knowledge
Harvard Business Review
Multiple social platforms reveal actionable signals for software vulnerability awareness: A study of Github, Twitter and Reddit
PLOS ONE
Bitcoin: A Look at the Past and the Future
Hacking activities increase along with cryptocurrency pricing (2/26/2018)
techopedia
Sex, drugs, and bitcoin: How much illegal activity is financed through cryptocurrencies?
The Review of Financial Studies
Bitcoin sinks after exchange reports hack (Aug. 2, 2016)
Wall Street Journal
Social media as an intelligence goldmine
Cyber Security Review
The next generation of research on IS use: a theoretical framework of delegation to and from agentic IS artifacts
MIS Quarterly
Link prediction in social networks: the state-of-the-art
Science China-Information Sciences
The Illicit Cryptocurrency Mining Threat
Technical Report
How does social media impact bitcoin value? A test of the silent majority hypothesis
Journal of Management Information Systems
The digital traces of bubbles: feedback cycles between socio-economic signals in the bitcoin economy
Journal of the Royal Society Interface
The online crowd: A contradiction in terms? On the potentials of Gustave Le Bon’s crowd psychology in an analysis of affective blogging
Distinktion: Journal of Social Theory
Intelligence, information technology, and information warfare
A visual framework for knowledge discovery on the web: An empirical study on business intelligence exploration
Journal of Management Information Systems
On random graphs
Publicationes Mathematicae
Temporal networks
Physics Reports
A statistical infinite feature cascade-based approach to anomaly detection for dynamic social networks
Computer Communications
Measuring temporal patterns in dynamic social networks
ACM Transantions on Knowledge Discovery from Data
Social network analysis: a powerful strategy, also for the information sciences
Journal of information Science
Introduction to stochastic actor-based models for network dynamics
Social Networks
Social and Economic Networks
The psychology of crowd dynamics
Blackwell Handbook of Social Psychology: Group processes
The crowd: A study of the popular mind
Collective Behavior
Contagion and differentiation in unethical behavior: The effect of one bad apple on the barrel
Psychological Science
Birds of a feather: Homophily in social networks
Annual Review of Sociology
The psychology of social impact
American Psychologist
Social impact theory: A field test of source strength, source immediacy and number of targets
Basic and Applied Social Psychology
A theory of social interactions
Journal of Political Economy
Social exchange theory
Annual Review of Sociology
The psychology of interpersonal relations
Real-time novel event detection from social media
IEEE 33rd International Conference on Data Engineering
The order of presentation in persuasion
Recency and primacy in persuasion as a function of the timing of speeches and measurements
The Journal of Abnormal and Social Psychology
Cited by (1)
Building Human Digital Twins: Cases for Intelligence and Security Informatics
2023, Proceedings - 2023 IEEE International Conference on Intelligence and Security Informatics, ISI 2023
Dr. Wingyan Chung is Professor of Computer Science in Soules College of Business at The University of Texas at Tyler. His scholarly interests and expertise include business analytics, machine learning, social media analytics, cybersecurity, network science, data science, knowledge management, and human-computer interaction. He has published extensively in IS, CS, and scientific journals, including Journal of Management Information Systems, Information & Management, ACM Transactions on MIS, Communications of the ACM, IEEE Computer, Decision Support Systems, and Scientific Reports, among others.
Dr. Vincent S. Lai is an Emeritus Professor of MIS at The Chinese University of Hong Kong. His current research focuses on innovation assimilation, online auctions, technology imitation, virtual collaboration, and global IS strategy. His findings in these research areas have been published extensively in IS journals, including Journal of Management Information Systems, Information & Management, Communications of the ACM, Decision Support Systems, Decision Sciences, European Journal of Information Systems, and IEEE Transactions on Engineering Management, among others.