xref: /freebsd/sys/contrib/device-tree/Bindings/thermal/thermal.txt (revision 7fdf597e96a02165cfe22ff357b857d5fa15ed8a)
1* Thermal Framework Device Tree descriptor
2
3This file describes a generic binding to provide a way of
4defining hardware thermal structure using device tree.
5A thermal structure includes thermal zones and their components,
6such as trip points, polling intervals, sensors and cooling devices
7binding descriptors.
8
9The target of device tree thermal descriptors is to describe only
10the hardware thermal aspects. The thermal device tree bindings are
11not about how the system must control or which algorithm or policy
12must be taken in place.
13
14There are five types of nodes involved to describe thermal bindings:
15- thermal sensors: devices which may be used to take temperature
16  measurements.
17- cooling devices: devices which may be used to dissipate heat.
18- trip points: describe key temperatures at which cooling is recommended. The
19  set of points should be chosen based on hardware limits.
20- cooling maps: used to describe links between trip points and cooling devices;
21- thermal zones: used to describe thermal data within the hardware;
22
23The following is a description of each of these node types.
24
25* Thermal sensor devices
26
27Thermal sensor devices are nodes providing temperature sensing capabilities on
28thermal zones. Typical devices are I2C ADC converters and bandgaps. These are
29nodes providing temperature data to thermal zones. Thermal sensor devices may
30control one or more internal sensors.
31
32Required property:
33- #thermal-sensor-cells: Used to provide sensor device specific information
34  Type: unsigned	 while referring to it. Typically 0 on thermal sensor
35  Size: one cell	 nodes with only one sensor, and at least 1 on nodes
36			 with several internal sensors, in order
37			 to identify uniquely the sensor instances within
38			 the IC. See thermal zone binding for more details
39			 on how consumers refer to sensor devices.
40
41* Cooling device nodes
42
43Cooling devices are nodes providing control on power dissipation. There
44are essentially two ways to provide control on power dissipation. First
45is by means of regulating device performance, which is known as passive
46cooling. A typical passive cooling is a CPU that has dynamic voltage and
47frequency scaling (DVFS), and uses lower frequencies as cooling states.
48Second is by means of activating devices in order to remove
49the dissipated heat, which is known as active cooling, e.g. regulating
50fan speeds. In both cases, cooling devices shall have a way to determine
51the state of cooling in which the device is.
52
53Any cooling device has a range of cooling states (i.e. different levels
54of heat dissipation). For example a fan's cooling states correspond to
55the different fan speeds possible. Cooling states are referred to by
56single unsigned integers, where larger numbers mean greater heat
57dissipation. The precise set of cooling states associated with a device
58should be defined in a particular device's binding.
59For more examples of cooling devices, refer to the example sections below.
60
61Required properties:
62- #cooling-cells:	Used to provide cooling device specific information
63  Type: unsigned	while referring to it. Must be at least 2, in order
64  Size: one cell	to specify minimum and maximum cooling state used
65			in the reference. The first cell is the minimum
66			cooling state requested and the second cell is
67			the maximum cooling state requested in the reference.
68			See Cooling device maps section below for more details
69			on how consumers refer to cooling devices.
70
71* Trip points
72
73The trip node is a node to describe a point in the temperature domain
74in which the system takes an action. This node describes just the point,
75not the action.
76
77Required properties:
78- temperature:		An integer indicating the trip temperature level,
79  Type: signed		in millicelsius.
80  Size: one cell
81
82- hysteresis:		A low hysteresis value on temperature property (above).
83  Type: unsigned	This is a relative value, in millicelsius.
84  Size: one cell
85
86- type:			a string containing the trip type. Expected values are:
87	"active":	A trip point to enable active cooling
88	"passive":	A trip point to enable passive cooling
89	"hot":		A trip point to notify emergency
90	"critical":	Hardware not reliable.
91  Type: string
92
93* Cooling device maps
94
95The cooling device maps node is a node to describe how cooling devices
96get assigned to trip points of the zone. The cooling devices are expected
97to be loaded in the target system.
98
99Required properties:
100- cooling-device:	A list of phandles of cooling devices with their specifiers,
101  Type: phandle +	referring to which cooling devices are used in this
102    cooling specifier	binding. In the cooling specifier, the first cell
103			is the minimum cooling state and the second cell
104			is the maximum cooling state used in this map.
105- trip:			A phandle of a trip point node within the same thermal
106  Type: phandle of	zone.
107   trip point node
108
109Optional property:
110- contribution:		The cooling contribution to the thermal zone of the
111  Type: unsigned	referred cooling device at the referred trip point.
112  Size: one cell	The contribution is a ratio of the sum
113			of all cooling contributions within a thermal zone.
114
115Note: Using the THERMAL_NO_LIMIT (-1UL) constant in the cooling-device phandle
116limit specifier means:
117(i)   - minimum state allowed for minimum cooling state used in the reference.
118(ii)  - maximum state allowed for maximum cooling state used in the reference.
119Refer to include/dt-bindings/thermal/thermal.h for definition of this constant.
120
121* Thermal zone nodes
122
123The thermal zone node is the node containing all the required info
124for describing a thermal zone, including its cooling device bindings. The
125thermal zone node must contain, apart from its own properties, one sub-node
126containing trip nodes and one sub-node containing all the zone cooling maps.
127
128Required properties:
129- polling-delay:	The maximum number of milliseconds to wait between polls
130  Type: unsigned	when checking this thermal zone.
131  Size: one cell
132
133- polling-delay-passive: The maximum number of milliseconds to wait
134  Type: unsigned	between polls when performing passive cooling.
135  Size: one cell
136
137- thermal-sensors:	A list of thermal sensor phandles and sensor specifier
138  Type: list of		used while monitoring the thermal zone.
139  phandles + sensor
140  specifier
141
142- trips:		A sub-node which is a container of only trip point nodes
143  Type: sub-node	required to describe the thermal zone.
144
145Optional property:
146- cooling-maps:		A sub-node which is a container of only cooling device
147  Type: sub-node	map nodes, used to describe the relation between trips
148			and cooling devices.
149
150- coefficients:		An array of integers (one signed cell) containing
151  Type: array		coefficients to compose a linear relation between
152  Elem size: one cell	the sensors listed in the thermal-sensors property.
153  Elem type: signed	Coefficients defaults to 1, in case this property
154			is not specified. A simple linear polynomial is used:
155			Z = c0 * x0 + c1 * x1 + ... + c(n-1) * x(n-1) + cn.
156
157			The coefficients are ordered and they match with sensors
158			by means of sensor ID. Additional coefficients are
159			interpreted as constant offset.
160
161- sustainable-power:	An estimate of the sustainable power (in mW) that the
162  Type: unsigned	thermal zone can dissipate at the desired
163  Size: one cell	control temperature.  For reference, the
164			sustainable power of a 4'' phone is typically
165			2000mW, while on a 10'' tablet is around
166			4500mW.
167
168Note: The delay properties are bound to the maximum dT/dt (temperature
169derivative over time) in two situations for a thermal zone:
170(i)  - when passive cooling is activated (polling-delay-passive); and
171(ii) - when the zone just needs to be monitored (polling-delay) or
172when active cooling is activated.
173
174The maximum dT/dt is highly bound to hardware power consumption and dissipation
175capability. The delays should be chosen to account for said max dT/dt,
176such that a device does not cross several trip boundaries unexpectedly
177between polls. Choosing the right polling delays shall avoid having the
178device in temperature ranges that may damage the silicon structures and
179reduce silicon lifetime.
180
181* The thermal-zones node
182
183The "thermal-zones" node is a container for all thermal zone nodes. It shall
184contain only sub-nodes describing thermal zones as in the section
185"Thermal zone nodes". The "thermal-zones" node appears under "/".
186
187* Examples
188
189Below are several examples on how to use thermal data descriptors
190using device tree bindings:
191
192(a) - CPU thermal zone
193
194The CPU thermal zone example below describes how to setup one thermal zone
195using one single sensor as temperature source and many cooling devices and
196power dissipation control sources.
197
198#include <dt-bindings/thermal/thermal.h>
199
200cpus {
201	/*
202	 * Here is an example of describing a cooling device for a DVFS
203	 * capable CPU. The CPU node describes its four OPPs.
204	 * The cooling states possible are 0..3, and they are
205	 * used as OPP indexes. The minimum cooling state is 0, which means
206	 * all four OPPs can be available to the system. The maximum
207	 * cooling state is 3, which means only the lowest OPPs (198MHz@0.85V)
208	 * can be available in the system.
209	 */
210	cpu0: cpu@0 {
211		...
212		operating-points = <
213			/* kHz    uV */
214			970000  1200000
215			792000  1100000
216			396000  950000
217			198000  850000
218		>;
219		#cooling-cells = <2>; /* min followed by max */
220	};
221	...
222};
223
224&i2c1 {
225	...
226	/*
227	 * A simple fan controller which supports 10 speeds of operation
228	 * (represented as 0-9).
229	 */
230	fan0: fan@48 {
231		...
232		#cooling-cells = <2>; /* min followed by max */
233	};
234};
235
236ocp {
237	...
238	/*
239	 * A simple IC with a single bandgap temperature sensor.
240	 */
241	bandgap0: bandgap@0000ed00 {
242		...
243		#thermal-sensor-cells = <0>;
244	};
245};
246
247thermal-zones {
248	cpu_thermal: cpu-thermal {
249		polling-delay-passive = <250>; /* milliseconds */
250		polling-delay = <1000>; /* milliseconds */
251
252		thermal-sensors = <&bandgap0>;
253
254		trips {
255			cpu_alert0: cpu-alert0 {
256				temperature = <90000>; /* millicelsius */
257				hysteresis = <2000>; /* millicelsius */
258				type = "active";
259			};
260			cpu_alert1: cpu-alert1 {
261				temperature = <100000>; /* millicelsius */
262				hysteresis = <2000>; /* millicelsius */
263				type = "passive";
264			};
265			cpu_crit: cpu-crit {
266				temperature = <125000>; /* millicelsius */
267				hysteresis = <2000>; /* millicelsius */
268				type = "critical";
269			};
270		};
271
272		cooling-maps {
273			map0 {
274				trip = <&cpu_alert0>;
275				cooling-device = <&fan0 THERMAL_NO_LIMIT 4>;
276			};
277			map1 {
278				trip = <&cpu_alert1>;
279				cooling-device = <&fan0 5 THERMAL_NO_LIMIT>, <&cpu0 THERMAL_NO_LIMIT THERMAL_NO_LIMIT>;
280			};
281		};
282	};
283};
284
285In the example above, the ADC sensor (bandgap0) at address 0x0000ED00 is
286used to monitor the zone 'cpu-thermal' using its sole sensor. A fan
287device (fan0) is controlled via I2C bus 1, at address 0x48, and has ten
288different cooling states 0-9. It is used to remove the heat out of
289the thermal zone 'cpu-thermal' using its cooling states
290from its minimum to 4, when it reaches trip point 'cpu_alert0'
291at 90C, as an example of active cooling. The same cooling device is used at
292'cpu_alert1', but from 5 to its maximum state. The cpu@0 device is also
293linked to the same thermal zone, 'cpu-thermal', as a passive cooling device,
294using all its cooling states at trip point 'cpu_alert1',
295which is a trip point at 100C. On the thermal zone 'cpu-thermal', at the
296temperature of 125C, represented by the trip point 'cpu_crit', the silicon
297is not reliable anymore.
298
299(b) - IC with several internal sensors
300
301The example below describes how to deploy several thermal zones based off a
302single sensor IC, assuming it has several internal sensors. This is a common
303case on SoC designs with several internal IPs that may need different thermal
304requirements, and thus may have their own sensor to monitor or detect internal
305hotspots in their silicon.
306
307#include <dt-bindings/thermal/thermal.h>
308
309ocp {
310	...
311	/*
312	 * A simple IC with several bandgap temperature sensors.
313	 */
314	bandgap0: bandgap@0000ed00 {
315		...
316		#thermal-sensor-cells = <1>;
317	};
318};
319
320thermal-zones {
321	cpu_thermal: cpu-thermal {
322		polling-delay-passive = <250>; /* milliseconds */
323		polling-delay = <1000>; /* milliseconds */
324
325				/* sensor       ID */
326		thermal-sensors = <&bandgap0     0>;
327
328		trips {
329			/* each zone within the SoC may have its own trips */
330			cpu_alert: cpu-alert {
331				temperature = <100000>; /* millicelsius */
332				hysteresis = <2000>; /* millicelsius */
333				type = "passive";
334			};
335			cpu_crit: cpu-crit {
336				temperature = <125000>; /* millicelsius */
337				hysteresis = <2000>; /* millicelsius */
338				type = "critical";
339			};
340		};
341
342		cooling-maps {
343			/* each zone within the SoC may have its own cooling */
344			...
345		};
346	};
347
348	gpu_thermal: gpu-thermal {
349		polling-delay-passive = <120>; /* milliseconds */
350		polling-delay = <1000>; /* milliseconds */
351
352				/* sensor       ID */
353		thermal-sensors = <&bandgap0     1>;
354
355		trips {
356			/* each zone within the SoC may have its own trips */
357			gpu_alert: gpu-alert {
358				temperature = <90000>; /* millicelsius */
359				hysteresis = <2000>; /* millicelsius */
360				type = "passive";
361			};
362			gpu_crit: gpu-crit {
363				temperature = <105000>; /* millicelsius */
364				hysteresis = <2000>; /* millicelsius */
365				type = "critical";
366			};
367		};
368
369		cooling-maps {
370			/* each zone within the SoC may have its own cooling */
371			...
372		};
373	};
374
375	dsp_thermal: dsp-thermal {
376		polling-delay-passive = <50>; /* milliseconds */
377		polling-delay = <1000>; /* milliseconds */
378
379				/* sensor       ID */
380		thermal-sensors = <&bandgap0     2>;
381
382		trips {
383			/* each zone within the SoC may have its own trips */
384			dsp_alert: dsp-alert {
385				temperature = <90000>; /* millicelsius */
386				hysteresis = <2000>; /* millicelsius */
387				type = "passive";
388			};
389			dsp_crit: gpu-crit {
390				temperature = <135000>; /* millicelsius */
391				hysteresis = <2000>; /* millicelsius */
392				type = "critical";
393			};
394		};
395
396		cooling-maps {
397			/* each zone within the SoC may have its own cooling */
398			...
399		};
400	};
401};
402
403In the example above, there is one bandgap IC which has the capability to
404monitor three sensors. The hardware has been designed so that sensors are
405placed on different places in the DIE to monitor different temperature
406hotspots: one for CPU thermal zone, one for GPU thermal zone and the
407other to monitor a DSP thermal zone.
408
409Thus, there is a need to assign each sensor provided by the bandgap IC
410to different thermal zones. This is achieved by means of using the
411#thermal-sensor-cells property and using the first cell of the sensor
412specifier as sensor ID. In the example, then, <bandgap 0> is used to
413monitor CPU thermal zone, <bandgap 1> is used to monitor GPU thermal
414zone and <bandgap 2> is used to monitor DSP thermal zone. Each zone
415may be uncorrelated, having its own dT/dt requirements, trips
416and cooling maps.
417
418
419(c) - Several sensors within one single thermal zone
420
421The example below illustrates how to use more than one sensor within
422one thermal zone.
423
424#include <dt-bindings/thermal/thermal.h>
425
426&i2c1 {
427	...
428	/*
429	 * A simple IC with a single temperature sensor.
430	 */
431	adc: sensor@49 {
432		...
433		#thermal-sensor-cells = <0>;
434	};
435};
436
437ocp {
438	...
439	/*
440	 * A simple IC with a single bandgap temperature sensor.
441	 */
442	bandgap0: bandgap@0000ed00 {
443		...
444		#thermal-sensor-cells = <0>;
445	};
446};
447
448thermal-zones {
449	cpu_thermal: cpu-thermal {
450		polling-delay-passive = <250>; /* milliseconds */
451		polling-delay = <1000>; /* milliseconds */
452
453		thermal-sensors = <&bandgap0>,	/* cpu */
454				  <&adc>;	/* pcb north */
455
456		/* hotspot = 100 * bandgap - 120 * adc + 484 */
457		coefficients =		<100	-120	484>;
458
459		trips {
460			...
461		};
462
463		cooling-maps {
464			...
465		};
466	};
467};
468
469In some cases, there is a need to use more than one sensor to extrapolate
470a thermal hotspot in the silicon. The above example illustrates this situation.
471For instance, it may be the case that a sensor external to CPU IP may be placed
472close to CPU hotspot and together with internal CPU sensor, it is used
473to determine the hotspot. Assuming this is the case for the above example,
474the hypothetical extrapolation rule would be:
475		hotspot = 100 * bandgap - 120 * adc + 484
476
477In other context, the same idea can be used to add fixed offset. For instance,
478consider the hotspot extrapolation rule below:
479		hotspot = 1 * adc + 6000
480
481In the above equation, the hotspot is always 6C higher than what is read
482from the ADC sensor. The binding would be then:
483        thermal-sensors =  <&adc>;
484
485		/* hotspot = 1 * adc + 6000 */
486	coefficients =		<1	6000>;
487
488(d) - Board thermal
489
490The board thermal example below illustrates how to setup one thermal zone
491with many sensors and many cooling devices.
492
493#include <dt-bindings/thermal/thermal.h>
494
495&i2c1 {
496	...
497	/*
498	 * An IC with several temperature sensor.
499	 */
500	adc_dummy: sensor@50 {
501		...
502		#thermal-sensor-cells = <1>; /* sensor internal ID */
503	};
504};
505
506thermal-zones {
507	batt-thermal {
508		polling-delay-passive = <500>; /* milliseconds */
509		polling-delay = <2500>; /* milliseconds */
510
511				/* sensor       ID */
512		thermal-sensors = <&adc_dummy     4>;
513
514		trips {
515			...
516		};
517
518		cooling-maps {
519			...
520		};
521	};
522
523	board_thermal: board-thermal {
524		polling-delay-passive = <1000>; /* milliseconds */
525		polling-delay = <2500>; /* milliseconds */
526
527				/* sensor       ID */
528		thermal-sensors = <&adc_dummy     0>, /* pcb top edge */
529				  <&adc_dummy     1>, /* lcd */
530				  <&adc_dummy     2>; /* back cover */
531		/*
532		 * An array of coefficients describing the sensor
533		 * linear relation. E.g.:
534		 * z = c1*x1 + c2*x2 + c3*x3
535		 */
536		coefficients =		<1200	-345	890>;
537
538		sustainable-power = <2500>;
539
540		trips {
541			/* Trips are based on resulting linear equation */
542			cpu_trip: cpu-trip {
543				temperature = <60000>; /* millicelsius */
544				hysteresis = <2000>; /* millicelsius */
545				type = "passive";
546			};
547			gpu_trip: gpu-trip {
548				temperature = <55000>; /* millicelsius */
549				hysteresis = <2000>; /* millicelsius */
550				type = "passive";
551			}
552			lcd_trip: lcp-trip {
553				temperature = <53000>; /* millicelsius */
554				hysteresis = <2000>; /* millicelsius */
555				type = "passive";
556			};
557			crit_trip: crit-trip {
558				temperature = <68000>; /* millicelsius */
559				hysteresis = <2000>; /* millicelsius */
560				type = "critical";
561			};
562		};
563
564		cooling-maps {
565			map0 {
566				trip = <&cpu_trip>;
567				cooling-device = <&cpu0 0 2>;
568				contribution = <55>;
569			};
570			map1 {
571				trip = <&gpu_trip>;
572				cooling-device = <&gpu0 0 2>;
573				contribution = <20>;
574			};
575			map2 {
576				trip = <&lcd_trip>;
577				cooling-device = <&lcd0 5 10>;
578				contribution = <15>;
579			};
580		};
581	};
582};
583
584The above example is a mix of previous examples, a sensor IP with several internal
585sensors used to monitor different zones, one of them is composed by several sensors and
586with different cooling devices.
587