Skip to main content
Log in

Analysis and elimination of noise-induced temperature error in processor thermal control

  • Published:
Real-Time Systems Aims and scope Submit manuscript

Abstract

Processor thermal control in real-time systems is crucial because overheated processors may result in critical performance degradation or even system failure. The main challenges in processor thermal control in real-time systems are as follows: (i) the need to satisfy both real-time and thermal constraints; (ii) uncertain system dynamics; and (iii) thermal sensor noise. The first two issues have been resolved in substantial studies while the issue of sensor noise has not been thoroughly addressed. In this paper, we experimentally identify that even a small zero-mean sensor noise can induce a significant steady-state error in processor thermal control. This steady-state temperature error is contrary to the intuition that zero-mean sensor noise normally induces zero-mean fluctuations around the target temperature. We rigorously analyze the phenomenon based on the stochastic averaging theory and quantify the error in a closed form in terms of noise statistics and system parameters. We propose a thermal control architecture for effectively eliminating the thermal error and tightly controlling the processor temperature. Through extensive experiments, we validate the proposed architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

Notes

  1. Noise level with variance of 1 is a mild condition in practice (Rotem et al. 2006; Long et al. 2008).

  2. We have chosen three standard deviations in an arbitrary manner. We can use a different value if needed. However, use of too high a margin value slows down the transient of temperature control. Three standard deviations can eliminate NITE substantially while at the same time causing little degradation of the transient performance of the temperature control.

References

  • Ahmed R, Ramanathan P, Saluja KK (2016) Necessary and sufficient conditions for thermal schedulability of periodic real-time tasks under fluid scheduling model. ACM Trans Embed Comput Syst (TECS) 15(3):49

    Google Scholar 

  • Avirneni NDP, Ramesh PK, Somani AK (2016) Utilization aware power management in reliable and aggressive chip multi processors. IEEE Trans Comput 65(3):979–991

    Article  MathSciNet  Google Scholar 

  • Bai EW, Fu LC, Sastry SS (1988) Averaging analysis for discrete time and sampled data adaptive systems. IEEE Trans Circuits Syst 35(2):137–148

    Article  Google Scholar 

  • Becker M, Sandström K, Behnam M, Nolte T (2014) Dynamic power management for thermal control of many-core real-time systems. In: Proc. workshop on adaptive and reconfigurable embedded systems (APRES)

  • Bellosa F, Weissel A, Waitz M, Kellner S (2003) Event-driven energy accounting for dynamic thermal management. In: Proc. workshop on compilers and operating systems for low power (COLP)

  • Beneventi F, Bartolini A, Tilli A, Benini L (2014) An effective gray-box identification procedure for multicore thermal modeling. IEEE Trans Comput 63(5):1097–1110

    Article  MathSciNet  Google Scholar 

  • Chantem T, Hu XS, Dick RP (2011) Temperature-aware scheduling and assignment for hard real-time applications on mpsocs. IEEE Trans Very Large Scale Integr VLSI Syst 19(10):1884–1897

    Article  Google Scholar 

  • Chu HH, Tsai TH, Chen YS (2012) Thermal-aware service rate adjustment for cyber-physical systems. In: Proc. international conference on anti-counterfeiting, security and identification (ASID)

  • Cui Y, Zhang W, He B (2017) A variation-aware adaptive fuzzy control system for thermal management of microprocessors. IEEE Trans Very Large Scale Integr VLSI Syst 25(2):683–695

    Article  Google Scholar 

  • Eun Y, Hamby ES (2014) Noise induced loss of tracking in systems with saturating actuators and anti-windup. ASME J Dyn Syst Meas Control 136(5):p054501 (6 pages)

  • Ferreira AP, Mosse D, Oh JC (2007) Thermal faults modeling using a rc model with an application to web farms. In: Proc. Euromicro conference on real-time systems (ECRTS)

  • Floyd MS, Ghiasi S, Keller TW, Rajamani K, Rawson F, Rubio JC, Ware MS (2007) System power management support in the ibm power6 microprocessor. IBM J Res Dev 51(6):733–746

    Article  Google Scholar 

  • Fu Y, Kottenstette N, Chen Y, Lu C, Koutsoukos XD, Wang H (2010) Feedback thermal control for real-time systems. In: Proc. real-time and embedded technology and applications symposium (RTAS)

  • Fu Y, Kottenstette N, Lu C, Koutsoukos XD (2012) Feedback thermal control of real-time systems on multicore processors. In: Proc. international conference on embedded software (EMSOFT)

  • Goodwin GC, Graebe SF, Salgado ME (2000) Control system design. Prentice Hall, Englewood Cliffs

    Google Scholar 

  • Hettiarachchi PM, Fisher N, Ahmed M, Wang LY, Wang S, Shi W (2012) The design and analysis of thermal-resilient hard-real-time systems. In: Proc. real-time and embedded technology and applications symposium (RTAS)

  • Intel (2009) i7-800 and i5-700 desktop processor series and lga1156 socket: thermal/mechanical specifications and design guidelines

  • Khairnasov S, Naumova A (2015) Heat pipes application in electronics thermal control systems. Front Heat Pipes (FHP)

  • Kim D, Park KJ, Eun Y, Son SH, Lu C (2015) When thermal control meets sensor noise: analysis of noise-induced temperature error. In: Proc. real-time and embedded technology and applications symposium (RTAS), pp 98–107

  • Kornaros G, Pnevmatikatos D (2013) A survey and taxonomy of on-chip monitoring of multicore systems-on-chip. ACM Trans Des Autom Electron Syst (TODAES) 18(2):17:1–17:38

    Article  Google Scholar 

  • Kottenstette N, Hall JF III, Koutsoukos X, Antsaklis P (2011) Digital control of multiple discrete passive plants over networks. Int J Syst Control Commun 3(2):194–228

    Article  Google Scholar 

  • Krishna C, Koren I (2017) Thermal-aware management techniques for cyber-physical systems. Sustain Comput 15:39–51

    Google Scholar 

  • Long J, Memik SO, Memik G, Mukherjee R (2008) Thermal monitoring mechanisms for chip multiprocessors. ACM Trans Archit Code Optim (TACO) 5(2):9

    Google Scholar 

  • Mohaqeqi M, Kargahi M, Fouladi K (2016) Stochastic thermal control of a multicore real-time system. In: Proc. 24th Euromicro international conference on parallel, distributed, and network-based processing (PDP), pp 208–215

  • Pathan RM (2017) Real-time scheduling algorithm for safety-critical systems on faulty multicore environments. Real-Time Syst 53(1):45–81

    Article  Google Scholar 

  • Prakash A, Amrouch H, Shafique M, Mitra T, Henkel J (2016) Improving mobile gaming performance through cooperative cpu-gpu thermal management. In: Proceedings of the 53rd annual design automation conference

  • Rotem E, Hermerding J, Cohen A, Cain H (2006) Temperature measurement in the intel® coretm duo processor. In: Proc. international workshop on thermal investigations of ICs and systems (THERMINIC)

  • Saen M, Osada K, Misaka S, Yamada T, Tsujimoto Y, Kondoh Y, Kamei T, Yoshida Y, Nagahama E, Nitta Y, et al (2007) Embedded soc resource manager to control temperature and data bandwidth. In: Proc. international solid-state circuits conference (ISSCC)

  • Segovia VR, Kralmark M, Lindberg M, Årzén KE (2011) Processor thermal control using adaptive bandwidth resource management. In: Proc. World Congress of the international federation of automatic control (IFAC)

  • Sheikh HF, Ahmad I, Wang Z, Ranka S (2012) An overview and classification of thermal-aware scheduling techniques for multi-core processing systems. Sustain Comput 2(3):151–169

    Google Scholar 

  • Tsai TH, Chen YS, He XX, Li CY (2017) Stem: a thermal-constrained real-time scheduling for 3d heterogeneous-isa multicore processors. IEEE Trans Comput

  • Yue J, Zhang T, Liu Y, Quan B, Tianzhou C (2012) Thermal-aware feedback control scheduling for soft real-time systems. In: Proc. international conference on high performance computing and communications & international conference on embedded software and systems (HPCC-ICESS)

  • Yun B, Shin KG, Wang S (2013) Predicting thermal behavior for temperature management in time-critical multicore systems. In: Proc. real-time and embedded technology and applications symposium (RTAS)

  • Zhou J, Wei T (2015) Stochastic thermal-aware real-time task scheduling with considerations of soft errors. J Syst Softw 102:123–133

    Article  Google Scholar 

  • Zhou J, Wei T, Chen M, Yan J, Hu XS, Ma Y (2016) Thermal-aware task scheduling for energy minimization in heterogeneous real-time mpsoc systems. IEEE Trans Comput Aided Des Integr Circuits Syst 35(8):1269–1282

    Article  Google Scholar 

Download references

Acknowledgements

This work was partly supported by Institute for Information & communications Technology Promotion (IITP) grant funded by the Korea government (MSIP) (2014-0-00065, Resilient Cyber-Physical Systems Research), the DGIST R&D Program of the Ministry of Science and ICT (18-EE-01), and Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (NRF-2016R1C1B2007899).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Kyung-Joon Park or Yongsoon Eun.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kim, D., Lee, J., Park, KJ. et al. Analysis and elimination of noise-induced temperature error in processor thermal control. Real-Time Syst 56, 1–27 (2020). https://doi.org/10.1007/s11241-019-09342-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11241-019-09342-y

Keywords

Navigation