A novel temporal and spatial panorama stream processing engine on IoT applications

https://doi.org/10.1016/j.jii.2020.100143Get rights and content

Abstract

Nowadays, more and more streaming data are generated with the development of Internet of Things. Although, streaming data show great application values in practical scenarios, raw streams from terminal sensors are quite massive, heterogeneous and complex, and those features make it difficult for applications to deal with them. In order to simplify streaming data processing for applications, a novel temporal and spatial panorama stream processing engine is proposed. This stream processing engine forms an effective link between bottom sensors and upper stream-based applications, and provides configurable, flexible, available and usable stream services for various upper applications. With support of Internet plus and domain data, raw streaming data are formatted thorough data fusion and are represented in temporal, spatial, logic and storage views. Fusion data are encapsulated with multiple strategies and methods of clustering models on every dimension. According to configurable strategies, encapsulated data are extracted, partitioned and distributed as the form of dynamical variable gratitude data blocks into stream channels. Our engine is applied to a practical application scenario. Case study of dynamic power distribution application based on people crowd streaming data proves the customized service capabilities of our engine. And the outstanding performance of the engine is shown further by experiments evaluation in this case.

Introduction

In recent years, with the rapid development of Internet of Things (IoT), IoT technologies are applied in more and more industrial and daily life scenarios, which leads to the boost of streaming data. Streaming data show great significance because of their real-time feature, and now become the basis of those applications like real-time analysis, continuous computation, and online machine learning.

However, streaming data also bring many challenges. In practical scenarios, IoT streaming data tend to be massive, heterogeneous and complex. Along with the expansion of smart cities, great quantity of sensing equipment is applied. Meanwhile, the quality and frequency of data collection are also improved. Billions of IoT devices have been put into use nowadays, and they have continuously generated zettabytes of data up to now. Such streaming data are not only massive but also heterogeneous. There are various kinds of sensors, which are different in operating principle, physical scenarios and transport protocols, generating streaming data diverse in dimensions, structures and formats. Besides, streaming data are quite complex as well. Different from traditional data formats, streaming data consist of both the data themselves and their contexts. Only when temporal, spatial and logical views of contexts are considered, the panoramic information of streaming data can be shown clearly.

Because of the features of streaming data, stream-based applications are complex and difficult to develop. IoT streaming data processing becomes an important and costly process for application development. It is impossible for all applications to deal with this process by themselves. Developers need to focus more on the business logic of an application. Therefore, it is worthwhile to realize an independent engine for IoT streaming data processing. The engine gathers origin data from IoT sensors and provides formatted stream service to upper applications. With the use of this engine, the stream-based applications can be much lighter in weight and more convenient to develop and maintain.

In this paper, a novel temporal and spatial panorama stream processing engine on IoT applications is proposed by us. The engine consists of a panorama data model and interfaces to IoT sensors, the Internet and upper applications. Kernel of our engine is this temporal and spatial panorama data model. In the model, streaming data are gathered, formatted, classified, extracted, partitioned and distributed.

Based on the fusion of streaming data generated from IoT scenarios and supporting data from the Internet and domain models, the information representation basis is constructed. Data are represented in unified format by dimensions from multiple categories. Fused data are encapsulated through our proposed parallel data classification framework. Multiple strategies and methods of data clustering models can operate in parallel mode. Though configurable extraction, partition and distribution, the encapsulated data are sent to applications in forms of variable granularity data blocks through stream channels. Without redundant and complex development work, flexible and customized stream services can be configured and realized easily using our engine.

Our proposed engine is also applied to a practical intelligent energy network scenario. In this application case, our engine is used to provide data service for dynamic power distribution application. The complex stream service needed by this application is configured and registered in convenience using our engine. The outstanding performance of our engine is also proved through several experiments and quantitative analysis.

The rest parts of this paper will be structured as follows. Related works of or study are introduced in Section 2. And in Section 3, the framework of our proposed temporal and spatial panorama stream processing engine is illustrated briefly. In Section 4, the methodologies and design details of the panorama data model are explained further. Section 5 presents the practical case of our engine. And the performance of the engine is evaluated by several experiments in Section 6. Finally, our contributions and future works are concluded in Section 7.

Section snippets

Related works

In the recent years, Internet of Things has been an area of research focus [1] and attracted attention of industrial community [2]. IoT technology helps traditional manufacturing firms turn to service-oriented perspective of business [3] and makes it possible of real-time monitoring in manufacturing environments [4]. The development of IoT leads to the booming of streaming data, and brings much convenience as well as many challenges [5]. Blockchain technology leads to great progress in security

Framework

The framework of our proposed temporal and spatial stream processing engine is shown in Fig. 1. The engine connects complex IoT streaming data and upper applications. It consists of a temporal and spatial panorama data model and provides interfaces to terminal sensors, IoT applications and external supporting systems.

The streaming data generated from terminal sensors in industrial scenarios are gathered through unified IoT interface. Considering heterogeneity of streaming data, the interface is

Panorama information representation

Panorama information representation is the basis of our temporal and spatial panorama data model, which integrates the origin streaming data and external supporting data to produce a unified information format. The information representation structure is shown in Fig. 2.

Information of our system is stored in a category-dimension-attribute three-level data model. Each piece of streaming data is encapsulated as a three-level tuple. In first level, each tuple is represented by its temporal

System implementation architecture

The implementation architecture of our system is shown in Fig. 7. The architecture can be divided hierarchically into several layers. Bottom of the architecture consists of sensing layer and supporting data layer, and they are the basic data source of our system. Sensors in sensing layer upload data in two main modes. They are request mode and push mode. Sensors in request mode encapsulate data as a form, and issue a request to send the form according to specified protocol each time. Sensors in

Evaluation

In the application scenario of intelligent energy network system, we take the stream service of dynamic power distribution as an example. The configuration process in Section 5.3 illustrates the usable, available, flexible and configurable features of our engine. The functional advantages of our engine should be evaluated through quantitative analysis. Because clustering is core part of the engine, and the quality of clustering directly influence the efficiency and result of data extraction,

Conclusion

Streaming data is valuable for IoT applications, but stream processing for massive and heterogeneous raw data from terminal sensors is complex and difficult. Aiming to provide configurable, flexible, available and usable streaming data service for IoT applications, a novel temporal and spatial panorama stream processing engine is proposed by us. It is easy to register and configure stream service using our engine, which effectively reduce the workload of developers. Our main contributions are

CRediT authorship contribution statement

Yifan Yin: Methodology, Writing - original draft, Writing - review & editing, Software. Boyi Xu: Data curation, Investigation. Hongming Cai: Conceptualization, Visualization, Supervision. Han Yu: Software, Validation.

Declaration of Competing Interest

None

Acknowledgements

This work was supported by the National Natural Science Foundation of China under Grant 61972243, and in part by the Science and Technology Commission of Shanghai Municipality under Grant 17DZ1201502.

References (28)

  • Y. Lu

    Blockchain and the related issues: a review of current research topics

    J. Manag. Anal.

    (2018)
  • E. Baccarelli et al.

    Energy-efficient dynamic traffic offloading and reconfiguration of networked data centers for big data stream mobile computing: review, challenges, and a case study

    IEEE Netw.

    (2016)
  • S. Boubiche et al.

    Big data challenges and data aggregation strategies in wireless sensor networks

    IEEE Access

    (2018)
  • E. Begoli et al.

    Apache calcite: a foundational framework for optimized query processing over heterogeneous data sources

  • Cited by (17)

    • A survey on 802.11 MAC industrial standards, architecture, security & supporting emergency traffic: Future directions

      2021, Journal of Industrial Information Integration
      Citation Excerpt :

      IEEE 802.11 [114] standards 802.11a support 5 GHz frequency band and 54 Mbps data rate, 802.11b support 2.4 GHz frequency and 11Mpbs data rate, 802.11 g support 2.4 GHz frequency band and data rate 54 Mbps. Details are as in Table 10 [119-122] VoIP WLAN Industrial Integration networks have observed the fastest growth in the world of industrial communication.

    • A reliability and performance GSPN-Based model for anti-collision RFID algorithms under noisy channels in industrial internet of things

      2021, Computers in Industry
      Citation Excerpt :

      But each target scenario has its particular challenges and opportunities. In this vibrating IoT ecosystem, the industrial landscape has been strongly shocked by the transforming nature of the IoT, which has ultimately paved the way for the rise of the Industry 4.0 (Liu et al., 2017; Zhang and Chen, 2020; Yin et al., 2020). When coming to the RFID standard, industrial applications have exploited it to perform cost-effective identification of objects for different functions.

    • PoEWAL: A lightweight consensus mechanism for blockchain in IoT

      2020, Pervasive and Mobile Computing
      Citation Excerpt :

      The things are devices that have limited storage, computation, and energy with communication capabilities. These IoT devices generate, process, and exchange a large amount of information including the transfer of security-critical real-time data over the Internet without human interaction [2]. The multitude of such connected devices makes the system smart, fault-tolerant, and reliable.

    View all citing articles on Scopus
    View full text