Abstract
Structural diversity of a user in a social network is the number of social contexts in his/her contact neighborhood. The problem of structural diversity search is to find the top-k vertices with the largest structural diversity in a graph. However, when identifying distinct social contexts, existing structural diversity models (e.g., t-sized component, t-core, and t-brace) are sensitive to an input parameter of t. To address this drawback, we propose a parameter-free structural diversity model. Specifically, we propose a novel notation of discriminative core, which automatically models various kinds of social contexts without parameter t. Leveraging on discriminative cores and h-index, the structural diversity score for a vertex is calculated. We study the problem of parameter-free structural diversity search in this paper. An efficient top-k search algorithm with a well-designed upper bound for pruning is proposed. To further speed up the computation, we design a novel parallel algorithm for efficient top-k search over large graphs. The parallel algorithm computes diversity scores for a batch of vertices simultaneously using multi-threads. Extensive experiment results demonstrate the parameter sensitivity of existing t-core based model and verify the superiority of our methods.
Similar content being viewed by others
References
Aridhi, S., Brugnara, M., Montresor, A., Velegrakis, Y.: Distributed k-core decomposition and maintenance in large dynamic graphs. In: DEBS, pp 161–168. ACM (2016)
Arifuzzaman, S., Khan, M., Marathe, M.: Fast parallel algorithms for counting and listing triangles in big graphs. TKDD 14(1), 1–34 (2019)
Batagelj, V., Zaversnik, M.: An o (m) algorithm for cores decomposition of networks arXiv preprint cs/0310049 (2003)
Bonchi, F, Gullo, F., Kaltenbrunner, A., Volkovich, Y.: Core decomposition of uncertain graphs. In: KDD, pp 1316–1325. ACM (2014)
Chang, L., Qin, L.: Cohesive subgraph computation over large sparse graphs: algorithms, data structures, and programming techniques. Springer, Berlin (2018)
Chang, L., Zhang, C., Lin, X., Qin, L.: Scalable top-k structural diversity search. In: ICDE, pp 95–98 (2017)
Cheng, H., Zhong, M., Wang, J., Qian, T.: Keyword search based mashup construction with guaranteed diversity. In: DEXA, pp 423–433 (2019)
Cheng, J., Ke, Y., Chu, S., Özsu, M. T.: Efficient core decomposition in massive networks. In: ICDE, pp 51–62 (2011)
Chiba, N., Nishizeki, T.: Arboricity and subgraph listing algorithms. SIAM J. Comput. 14(1), 210–223 (1985)
Ding, F., Zhuang, Y.: Ego-network probabilistic graphical model for discovering on-line communities. Appl Intell. 48(9), 3038–3052 (2018)
Esfandiari, H., Lattanzi, S., Mirrokni, V.: Parallel and streaming algorithms for k-core decomposition. In: ICML, pp 1397–1406 (2018)
Fang, Y., Cheng, R., Luo, S., Hu, J.: Effective community search for large attributed graphs. Proc. VLDB Endow. 9(12), 1233–1244 (2016)
Fang, Y., Wang, Z., Cheng, R., Wang, H., Hu, J.: Effective and efficient community search over large directed graphs. IEEE Trans. Knowl. Data Eng. 31(11), 2093–2107 (2018)
Fang, Y., Huang, X., Qin, L., Zhang, Y., Zhang, W., Cheng, R., Lin, X.: A survey of community search over big graphs 1–40. VLDB J 29(1), 353–392 (2020)
Fang, Y., Yang, Y., Zhang, W., Lin, X., Cao, X.: Effective and efficient community search over large heterogeneous information networks. Proc VLDB Endow 13(6), 854–867 (2020)
Galimberti, E., Bonchi, F., Gullo, F.: Core decomposition and densest subgraph in multilayer networks. In: CIKM, pp 1807–1816. ACM (2017)
Goyal, A., Lu, W., Lakshmanan, L.V.S.: CELF++: optimizing the greedy algorithm for influence maximization in social networks. In: WWW, pp 47–48 (2011)
Hirsch, J. E.: An index to quantify an individual’s scientific research output. Proc. Natl. Acad. Sci. 102(46), 16569–16572 (2005)
Hou, J., Wang, S., Wu, G., Fu, G., Jia, S., Wang, Y., Li, B., Zhang, L.: Parallel strongly connected components detection with multi-partition on gpus. In: ICCS, pp 16–30. Springer (2019)
Huang, X., Cheng, H., Li, R. -H., Qin, L., Yu, J. X.: Top-k structural diversity search in large networks. PVLDB 6(13), 1618–1629 (2013)
Huang, X., Cheng, H., Li, R., Qin, L., Yu, J. X.: Top-k structural diversity search in large networks. VLDB J. 24(3), 319–343 (2015)
Huang, X., Cheng, H., Yu, J. X.: Attributed community analysis: global and ego-centric views. IEEE Data Eng. Bull. 39(3), 29–40 (2016)
Huang, J., Huang, X., Zhu, Y., Xu, J.: Parameter-free structural diversity search. In: WISE, pp 677–693 (2019)
Huang, X., Lakshmanan, L. V., Xu, J.: Community search over big graphs. Morgan & Claypool Publishers, San Rafael (2019)
Huang, J., Huang, X., Xu, J.: Truss-based structural diversity search in large graphs. arXiv preprint arXiv:2007.05437 (2020)
Huckfeldt, R. R., Sprague, J.: Citizens, Politics and Social Communication: Information and Influence in an Election Campaign. Cambridge University Press, Cambridge (1995)
Jakma, P., Orczyk, M., Perkins, C.S., Fayed, M.: Distributed k-core decomposition of dynamic graphs. In: StudentWorkshop@CoNEXT, pp 39–40. ACM (2012)
Jiang, J., Huang, X., Choi, B., Xu, J., Bhowmick, S. S., ppkws, L. X. u.: An efficient framework for keyword search on public-private networks. In: ICDE, pp 457–468 (2020)
Jin, J., Luo, J., Khemmarat, S., Dong, F., Gao, L.: Gstar: an efficient framework for answering top-k star queries on billion-node knowledge graphs. World Wide Web 22(4), 1611–1638 (2019)
Kempe, D., Kleinberg, J. M., Tardos, É.: Maximizing the spread of influence through a social network. In: KDD, pp 137–146 (2003)
Kim, M. -S., Lee, S., Han, W. -S., Park, H., Lee, J.-H.: Dsp-cc-: I/o efficient parallel computation of connected components in billion-scale networks. ICDE 27(10), 2658–2671 (2015)
Leskovec, J., Krevl, A: SNAP Datasets: Stanford large network dataset collection http://snap.stanford.edu/data (2014)
Levorato, V.: Core decomposition in directed networks: Kernelization and strong connectivity. In: CompleNet, vol. 549, pp 129–140 (2014)
Li, R., Yu, J. X., Mao, R.: Efficient core maintenance in large dynamic graphs. TKDE 26(10), 2453–2465 (2014)
Liu, G., Shi, Q., Zheng, K., Li, Z., Liu, A., Xu, J.: Context-aware graph pattern based top-k designated nodes finding in social graphs. World Wide Web 22(2), 751–770 (2019)
Liu, Q., Zhu, Y., Zhao, M., Huang, X., Xu, J., Gao, Y.: Vac: vertex-centric attributed community search. In: ICDE, pp 937–948 (2020)
Mcauley, J., Leskovec, J.: Discovering social circles in ego networks. TKDD 8(1), 4 (2014)
Montresor, A., Pellegrini, F. D., Miorandi, D.: Distributed k-core decomposition. TPDS 24(2), 288–300 (2013)
Nguyen, D., Lenharth, A., Pingali, K.: A lightweight infrastructure for graph analytics. In: SOSP, pp 456–471 (2013)
Sanz-Cruzado, J., Pepa, S. M., Castells, P.: Structural novelty and diversity in link prediction. In: Companion Proceedings of the The Web Conference, vol. 2018, pp 1347–1351 (2018)
Sarıyüce, A. E., Gedik, B., Jacques-Silva, G., Wu, K.-L., Çatalyürek, Ü. V.: Streaming algorithms for k-core decomposition. PVLDB 6(6), 433–444 (2013)
Shang, Z.J., Yu, X., Zhang, Z.: Tufast: a lightweight parallelization library for graph analytics. In: ICDE, pp 710–721. IEEE (2019)
Shun, J., Blelloch, G. E.: Ligra: a lightweight graph processing framework for shared memory. In: Proceedings of the 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp 135–146 (2013)
Smith, S., Liu, X., Ahmed, N.K., Tom, A.S., Petrini, F., Karypis, G.: Truss decomposition on shared-memory parallel systems. In: HPEC, pp 1–6. IEEE (2017)
Su, J., Kamath, K., Sharma, A., Ugander, J., Goel, S.: An experimental study of structural diversity in social networks. arXiv preprint arXiv:1909.03543 (2019)
Tang, Y., Shi, Y., Xiao, X.: Influence maximization in near-linear time: a martingale approach. In: SIGMOD Conference, pp 1539–1554. ACM (2015)
Ugander, J., Backstrom, L., Marlow, C., Kleinberg, J.: Structural diversity in social contagion. PNAS 109(16), 5962–5966 (2012)
Wang, W., Gu, Y., Wang, Z., Yu, G.: Parallel triangle counting over large graphs. In: DASFAA, pp 301–308. Springer (2013)
Wang, R., Wang, S., Zhou, X.: Parallelizing approximate single-source personalized pagerank queries on shared memory. VLDBJ 28(6), 923–940 (2019)
Wen, D., Qin, L., Zhang, Y., Lin, X., Yu, J.X.: I/O efficient core graph decomposition Application to degeneracy ordering. TKDE 31(1), 75–90 (2019)
Wu, H., Cheng, J., Lu, Y., Ke, Y., Huang, Y., Yan, D., Wu, H.: Core decomposition in large temporal graphs. In: BigData, pp 649–658 (2015)
Zhang, Y., Yu, J. X., Zhang, Y., Qin, L.: A fast order-based approach for core maintenance. In: ICDE, pp 337–348 (2017)
Zhang, Y., Wang, L., Zhu, J. J., Wang, X., Pentland, A.: The strength of structural diversity in online social networks. arXiv preprint arXiv:1906.00756 (2019)
Zhang, Q., Li, R., Yang, Q., Wang, G., Qin, L.: Efficient top-k edge structural diversity search. In: ICDE, pp 205–216 (2020)
Acknowledgments
This work is supported by the NSFC Nos. 61702435, 61972291, RGC Nos. 12200917, 12200817, CRF C6030-18GF, and the National Science Foundation of Hubei Province No. 2018CFB519.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to the Topical Collection: Special Issue on Web Information Systems Engineering 2019
Guest Editors: Reynold Cheng, Nikos Mamoulis, and Xin Huang
Rights and permissions
About this article
Cite this article
Huang, J., Huang, X., Zhu, Y. et al. Parallel algorithms for parameter-free structural diversity search on graphs. World Wide Web 24, 397–417 (2021). https://doi.org/10.1007/s11280-020-00843-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-020-00843-6