Good day,
I finally had a chance to spend a bit more time with my two Tezro machines and I noticed that the Odyssey V12 video boards seem to run a bit warm on both of them. Once the machines have been running for a period of time, I frequently get the following type of messages on the console and in the logs:
These occur with virtually no load on the video hardware, so on the surface it would seem to me that the V12 boards are running somewhat warmer than intended. Interestingly both machines are exhibiting exactly the same behaviour, so I doubt that it is a hardware issue. For reference, both V12 boards have the DCD option and both machines are equipped with the DM3 cards. Removing the DM3 card appears to mostly correct this, but I still have seen an occasional message:
With the DM3 board removed, it seems that the temperatures are just marginally below the advisory level:
I looked through the forums here and came across a number of posts with sample environmental monitor output on Tezros. They all seem lower than what I see on both of my machines. Can anyone offer any thoughts on this? Should I worry?
Thank you.
I finally had a chance to spend a bit more time with my two Tezro machines and I noticed that the Odyssey V12 video boards seem to run a bit warm on both of them. Once the machines have been running for a period of time, I frequently get the following type of messages on the console and in the logs:
Code:
Dec 23 20:41:41 4A:tezro unix: |$(0x15f)WARNING: 001c01 ATTN: ODY zone advisory limit reached 50 C/ 122 F Fan: 87
Dec 23 20:42:41 4A:tezro unix: |$(0x162)WARNING: 001c01 ATTN: Cooling system stabilized
Dec 23 21:12:03 4A:tezro unix: |$(0x15f)WARNING: 001c01 ATTN: ODY zone advisory limit reached 50 C/ 122 F Fan: 87
Dec 23 21:13:03 4A:tezro unix: |$(0x162)WARNING: 001c01 ATTN: Cooling system stabilized
Dec 23 20:42:41 4A:tezro unix: |$(0x162)WARNING: 001c01 ATTN: Cooling system stabilized
Dec 23 21:12:03 4A:tezro unix: |$(0x15f)WARNING: 001c01 ATTN: ODY zone advisory limit reached 50 C/ 122 F Fan: 87
Dec 23 21:13:03 4A:tezro unix: |$(0x162)WARNING: 001c01 ATTN: Cooling system stabilized
These occur with virtually no load on the video hardware, so on the surface it would seem to me that the V12 boards are running somewhat warmer than intended. Interestingly both machines are exhibiting exactly the same behaviour, so I doubt that it is a hardware issue. For reference, both V12 boards have the DCD option and both machines are equipped with the DM3 cards. Removing the DM3 card appears to mostly correct this, but I still have seen an occasional message:
Code:
Dec 26 16:05:17 4A:tezro unix: |$(0x15f)WARNING: 001c01 ATTN: ODY zone advisory limit reached 51 C/ 123 F Fan: 80
Dec 26 16:08:17 4A:tezro unix: |$(0x162)WARNING: 001c01 ATTN: Cooling system stabilized
Dec 26 23:11:55 4A:tezro unix: |$(0x15f)WARNING: 001c01 ATTN: ODY zone advisory limit reached 50 C/ 122 F Fan: 80
Dec 26 23:12:15 4A:tezro unix: |$(0x162)WARNING: 001c01 ATTN: Cooling system stabilized
Dec 26 16:08:17 4A:tezro unix: |$(0x162)WARNING: 001c01 ATTN: Cooling system stabilized
Dec 26 23:11:55 4A:tezro unix: |$(0x15f)WARNING: 001c01 ATTN: ODY zone advisory limit reached 50 C/ 122 F Fan: 80
Dec 26 23:12:15 4A:tezro unix: |$(0x162)WARNING: 001c01 ATTN: Cooling system stabilized
With the DM3 board removed, it seems that the temperatures are just marginally below the advisory level:
Code:
tezro 1# l1cmd env
Environmental monitoring is enabled and running.
Description State Warning Limits Fault Limits Current
-------------- ---------- ----------------- ----------------- -------
1.8V Enabled 10% 1.62/ 1.98 20% 1.44/ 2.16 1.875
12V Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.000
12V #2 Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.125
3.3V Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.474
2.5V Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.613
12V IO Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.063
5V AUX Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.070
3.3V AUX Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.268
5V Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.070
XIO 12V BIAS Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.063
XIO 5V Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.070
XIO 2.5V Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.574
XIO 3.3V AUX Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.285
IP53 3.3V AUX Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.302
IP53 5V AUX Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.044
IP53 12V Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 11.875
IP53 VCPU Enabled 10% 1.13/ 1.38 20% 1.00/ 1.50 1.283
IP53 SRAM Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.574
IP53 1.5V Enabled 10% 1.35/ 1.65 20% 1.20/ 1.80 1.551
Description State Warning RPM Current RPM
--------------- ---------- ----------- -----------
FAN 0 NODE 1 Enabled 1800 2109
FAN 1 NODE 2 Enabled 1800 2136
FAN 2 NODE 3 Enabled 1800 2149
FAN 3 PCI 1 Enabled 1350 1430
FAN 4 PCI 2 Enabled 1350 1493
FAN 5 HD Enabled 1620 4218
FAN 6 ODY 1 Enabled 1300 2220
FAN 7 ODY 2 Enabled 1300 2083
Advisory Critical Fault Current
Description State Temp Temp Temp Temp
----------------- ---------- --------- --------- --------- ---------
0 INTERFACE 0 Enabled [Autofan Control] 76C/168F 39C/102F
1 INTERFACE 1 Enabled [Autofan Control] 76C/168F 35C/ 95F
2 INTERFACE 2 Enabled [Autofan Control] 76C/168F 34C/ 93F
3 INTERFACE 3 Enabled [Autofan Control] 76C/168F 41C/105F
4 ODYSSEY Enabled [Autofan Control] 76C/168F 50C/122F
5 NODE Enabled [Autofan Control] 76C/168F 55C/131F
6 BEDROCK Enabled [Autofan Control] 85C/185F 55C/131F
Zone Temp Target Current Zone Fan Curr/Min
Zone Name State Sensors Average Average Index Fan %
--------- -------- ------------ -------- -------- --------- ---------
Node Enabled 5,6 62C/143F 55C/131F 0 46%/ 46%
PCI Enabled 0,1,2,3 45C/113F 37C/ 98F 3,4 57%/ 57%
ODY Enabled 4 50C/122F 50C/122F 6 78%/ 64%
HD Enabled 5 40C/104F 55C/131F 5 80%/ 38%
Environmental monitoring is enabled and running.
Description State Warning Limits Fault Limits Current
-------------- ---------- ----------------- ----------------- -------
1.8V Enabled 10% 1.62/ 1.98 20% 1.44/ 2.16 1.875
12V Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.000
12V #2 Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.125
3.3V Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.474
2.5V Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.613
12V IO Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.063
5V AUX Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.070
3.3V AUX Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.268
5V Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.070
XIO 12V BIAS Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.063
XIO 5V Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.070
XIO 2.5V Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.574
XIO 3.3V AUX Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.285
IP53 3.3V AUX Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.302
IP53 5V AUX Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.044
IP53 12V Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 11.875
IP53 VCPU Enabled 10% 1.13/ 1.38 20% 1.00/ 1.50 1.283
IP53 SRAM Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.574
IP53 1.5V Enabled 10% 1.35/ 1.65 20% 1.20/ 1.80 1.551
Description State Warning RPM Current RPM
--------------- ---------- ----------- -----------
FAN 0 NODE 1 Enabled 1800 2109
FAN 1 NODE 2 Enabled 1800 2136
FAN 2 NODE 3 Enabled 1800 2149
FAN 3 PCI 1 Enabled 1350 1430
FAN 4 PCI 2 Enabled 1350 1493
FAN 5 HD Enabled 1620 4218
FAN 6 ODY 1 Enabled 1300 2220
FAN 7 ODY 2 Enabled 1300 2083
Advisory Critical Fault Current
Description State Temp Temp Temp Temp
----------------- ---------- --------- --------- --------- ---------
0 INTERFACE 0 Enabled [Autofan Control] 76C/168F 39C/102F
1 INTERFACE 1 Enabled [Autofan Control] 76C/168F 35C/ 95F
2 INTERFACE 2 Enabled [Autofan Control] 76C/168F 34C/ 93F
3 INTERFACE 3 Enabled [Autofan Control] 76C/168F 41C/105F
4 ODYSSEY Enabled [Autofan Control] 76C/168F 50C/122F
5 NODE Enabled [Autofan Control] 76C/168F 55C/131F
6 BEDROCK Enabled [Autofan Control] 85C/185F 55C/131F
Zone Temp Target Current Zone Fan Curr/Min
Zone Name State Sensors Average Average Index Fan %
--------- -------- ------------ -------- -------- --------- ---------
Node Enabled 5,6 62C/143F 55C/131F 0 46%/ 46%
PCI Enabled 0,1,2,3 45C/113F 37C/ 98F 3,4 57%/ 57%
ODY Enabled 4 50C/122F 50C/122F 6 78%/ 64%
HD Enabled 5 40C/104F 55C/131F 5 80%/ 38%
I looked through the forums here and came across a number of posts with sample environmental monitor output on Tezros. They all seem lower than what I see on both of my machines. Can anyone offer any thoughts on this? Should I worry?
Thank you.