Getting Started, Documentation, Tips & Tricks

origin 300 fan issue

Hi all.

I've got a pair of 4x500mghz CPU 4gb ram 2x 73gb disk origin 300s, a numa cable and irix media. Trying to get this party started. :D Anyway, on console power up (see below) I am informed that 3 fans are failing and so the system shuts down. A look inside (see picture) shows missing fans. Same story on both nodes. How many do I really need? Where would I connected them if I had them? Can I disable this check and just put any old fans in there to keep it cool? Thoughts?

Code: Select all

001c16-L1>env
Environmental monitoring is enabled and running.

Description    State       Warning Limits     Fault Limits       Current
-------------- ----------  -----------------  -----------------  -------
12 IO   Wait Pwr  10%  10.80/ 13.20  20%   9.60/ 14.40    0.00
12 DIG   Wait Pwr  10%  10.80/ 13.20  20%   9.60/ 14.40    0.00
5V   Wait Pwr  10%   4.50/  5.50  20%   4.00/  6.00    0.00
3.3V   Wait Pwr  10%   2.97/  3.63  20%   2.64/  3.96    0.05
5V AUX   Wait Pwr  10%   4.50/  5.50  20%   4.00/  6.00    4.99
3.3V AUX   Wait Pwr  10%   2.97/  3.63  20%   2.64/  3.96    3.42
2.5V   Wait Pwr  10%   2.25/  2.75  20%   2.00/  3.00    0.00
VCPU   Wait Pwr  10%   1.44/  1.76  20%   1.28/  1.92    0.00
1.5V   Wait Pwr  10%   1.35/  1.65  20%   1.20/  1.80    0.00

Description     State       Warning RPM  Current RPM
--------------- ----------  -----------  -----------
FAN  0     LEFT   Wait Pwr         2160            0
FAN  1   CENTER   Wait Pwr         2160            0
FAN  2    RIGHT   Wait Pwr         2160            0
FAN  3       PS   Wait Pwr         2160            0
FAN  4      PS'   Wait Pwr         2160            0

Advisory   Critical   Fault      Current
Description       State       Temp       Temp       Temp       Temp
----------------- ----------  ---------  ---------  ---------  ---------
0 NODE 0           Wait Pwr   Disabled   Disabled   49C/120F   18C/ 64F
1 NODE 1           Wait Pwr   Disabled   Disabled   49C/120F   18C/ 64F

001c16-L1>power up
001c16-L1>
001c16 ATTN: FAN 2 warning limit reached @ 0 RPM.

001c16 ATTN: Environmental redundancy lost.

001c16 ATTN: FAN 0 warning limit reached @ 0 RPM.

001c16 ATTN: FAN 1 warning limit reached @ 0 RPM.

001c16 ATTN: FAN 1 fault limit reached @ 0 RPM.

001c16 ATTN: brick auto power down in 30 seconds

001c16 ATTN: brick auto power down in 25 seconds

001c16 ATTN: brick auto power down in 20 seconds

001c16 ATTN: brick auto power down in 15 seconds

001c16 ATTN: brick auto power down in 10 seconds

001c16 ATTN: brick auto power down in 5 seconds

001c16 ATTN: brick is powering down now!
env
Environmental monitoring is enabled and running.

Description    State       Warning Limits     Fault Limits       Current
-------------- ----------  -----------------  -----------------  -------
12 IO   Wait Pwr  10%  10.80/ 13.20  20%   9.60/ 14.40    0.00
12 DIG   Wait Pwr  10%  10.80/ 13.20  20%   9.60/ 14.40    0.00
5V   Wait Pwr  10%   4.50/  5.50  20%   4.00/  6.00    0.00
3.3V   Wait Pwr  10%   2.97/  3.63  20%   2.64/  3.96    0.05
5V AUX   Wait Pwr  10%   4.50/  5.50  20%   4.00/  6.00    4.97
3.3V AUX   Wait Pwr  10%   2.97/  3.63  20%   2.64/  3.96    3.42
2.5V   Wait Pwr  10%   2.25/  2.75  20%   2.00/  3.00    0.00
VCPU   Wait Pwr  10%   1.44/  1.76  20%   1.28/  1.92    0.00
1.5V   Wait Pwr  10%   1.35/  1.65  20%   1.20/  1.80    0.00

Description     State       Warning RPM  Current RPM
--------------- ----------  -----------  -----------
FAN  0     LEFT   Wait Pwr         2160            0
FAN  1   CENTER   Wait Pwr         2160            0
FAN  2    RIGHT   Wait Pwr         2160            0
FAN  3       PS   Wait Pwr         2160            0
FAN  4      PS'   Wait Pwr         2160            0

Advisory   Critical   Fault      Current
Description       State       Temp       Temp       Temp       Temp
----------------- ----------  ---------  ---------  ---------  ---------
0 NODE 0           Wait Pwr   Disabled   Disabled   49C/120F   21C/ 69F
1 NODE 1           Wait Pwr   Disabled   Disabled   49C/120F   20C/ 68F
tstover wrote: Can I disable this check and just put any old fans in there to keep it cool? Thoughts?
You could disable environmental monitoring, but I wouldn't recommend it.

It is possible to use other fans, viewtopic.php?f=3&t=16718040& but I'd suggest checking that the current set are tightly connected. If the connection isn't suspect you might want to test the fans with an external power source and the fan connections in the O300 with a DMM. Having the same fans fail in both O300s seems little unusual to me. What do you know about their history?
***********************************************************************
Welcome to ARMLand - 0/0x0d00
running...(sherwood-root 0607201829)
* InfiniteReality/Reality Software, IRIX 6.5 Release *
***********************************************************************
The picture is a little misleading. What you see is actually just the fan mounting bracket. The fans are absent. I bought these used with the claim that they were pulled from a working environment. I think either the fans were salvaged, or perhaps there were other fans in the cabinet or something. Is it possible to leave the environmental monitoring of the temperatures, just not the fans? That way as long as I was cooling it somehow it might run ok. I'm not sure where the fans would connect to. Is there a header on the motherboard, or power supply? Anyone know?
* The fans are connected to headers on the mainboard, under the black foam shroud.
* Recondas posted a link to a post by me, I'd just like to say the those fans are no longer sold. You'll find something similar. Just keep in mind the three wires are ordered differently vs. what you find in most PCs, so you'll have to pry the wires from the replacement fan out of it's connector and re-arrange them. I no longer have the O300 so I can't help there ...
* If you can get an L1 prompt on the console port, you can do 'env off' to switch of monitoring. I sure hope you won't do that without pointing some sort of blower at the open chassis, or you will fry the system

Good luck!
Now this is a deep dark secret, so everybody keep it quiet :)
It turns out that when reset, the WD33C93 defaults to a SCSI ID of 0, and it was simpler to leave it that way... -- Dave Olson, in comp.sys.sgi

Currently in commercial service: Image :Onyx2: (2x) :O3x02L:
In the museum : almost every MIPS/IRIX system.
Wanted : GM1 board for Professional Series GT graphics (030-0076-003, 030-0076-004)
So other than a different pinout, "regular" 3 wire pc fans can be used? I'll give this a try later. How did you figure out which pin is what? I suppose 2 of the 3 will be the DC power so it's probably not that hard to get it wrong. Now to dumpster dive for 6 fans...
Well, I had the original fans, and the wires are red, black & yellow in both cases so that was straight forward.

The fans have to move a certain volume of air (CFM) or you risk overheating the system, and they have to run at a minimum RPM or you'll trigger the environmental monitoring. The original fans are type NMB Minebea 3110KL-04W-B79.
Now this is a deep dark secret, so everybody keep it quiet :)
It turns out that when reset, the WD33C93 defaults to a SCSI ID of 0, and it was simpler to leave it that way... -- Dave Olson, in comp.sys.sgi

Currently in commercial service: Image :Onyx2: (2x) :O3x02L:
In the museum : almost every MIPS/IRIX system.
Wanted : GM1 board for Professional Series GT graphics (030-0076-003, 030-0076-004)
tstover wrote: I How many do I really need? Where would I connected them if I had them? Can I disable this check and just put any old fans in there to keep it cool? Thoughts?


My thought is you are entering brave new territory here. Sorry. I don't think you can put any old fans in there.

as the GOBI, recondas wrote: You could disable environmental monitoring, but I wouldn't recommend it.

Seconded.

tstover wrote: So other than a different pinout, "regular" 3 wire pc fans can be used? I'll give this a try later. How did you figure out which pin is what? I suppose 2 of the 3 will be the DC power so it's probably not that hard to get it wrong. Now to dumpster dive for 6 fans...


I am not an expert here but the Fuel is a workstation of the same era as your Origin 300s are as servers. I suspect that the way things work is that the power supplies of machines of that era (Fuel, Tezro, Origin/Onyx 3xx) have circuits leading back to the power supply and the L1 monitoring. It senses the load or rpm with pwm(*) and the designers knew that load X meant an rpm of Y on the fan. If you found fans with the same electronic characteristics as those NMB Minebea 3110KL-04W-B79 present in the Origin 300s and they fit, you are in business. In the nekochan wiki article on Fuel PSU repair ( here ).

In the wiki it is wrote: Pin 12: This is FANC, it should control fan speed. Read more below
Pin 24 This is FANM, it monitors fan speed. Connect to tachyometer pin on fan:
....
The FANC on pin 12 doesn't appear to work as expected. I expected that the wire would go directly to the fan header inside the PSU, but it looks like its going into one of the IC's, but I'm not sure. The FANC line stays put at +3.3V even when the system if powered down. This might be to low to start the fan from the PSU. Which might indicate that it is just a signal wire to a fan controller in the PSU.. but I don't know.. perhaps a good reason to get my oscilloscope out.


Image

The fuel is merely from the same era. It may not work that way. But I hope I give you the courage to investigate and document.

People here in the past have replaced fans in Octanes and the like merely because they are too loud.

This is an opportunity to learn something about Origin 3(x)0 fans and power supplies and share it as wiki article.

Good luck,

R.

* How? I don't know, resistance? pulse width modulation on something that rotates with the fan? I am just guessing here.
Here is an article on mode sense:
http://www.e-gizmo.com/ARTICLES/Sense/MAKESENS.HTM
死の神はりんごだけ食べる

開いた括弧は必ず閉じる -- あるプログラマー

:Tezro: :Tezro: :Onyx2R: :Onyx2RE: :Onyx2: :O3x04R: :O3x0: :O200: :Octane: :Octane2: :O2: :O2: :Indigo2IMP: :PI: :PI: :1600SW: :1600SW: :Indy: :Indy: :Indy: :Indy: :Indy:
:hpserv: J5600, 2 x Mac, 3 x SUN, Alpha DS20E, Alpha 800 5/550, 3 x RS/6000, Amiga 4000 VideoToaster, Amiga4000 -030, 733MHz Sam440 AmigaOS 4.1 update 1.

Sold: :Indy: :Indy: :Indy: :Indigo: Tandem Himalaya S-Series Nonstop S72000 ServerNet.

Twitter @PymbleSoftware
Current Apps (iOS) -> https://itunes.apple.com/au/artist/pymb ... d553990081
(Android) https://play.google.com/store/apps/deve ... +Ltd&hl=en
(Onyx2) Cortex ---> http://www.facebook.com/pages/Cortex-th ... 11?sk=info
(0300s) Minnie ---> http://www.facebook.com/pages/Minnie-th ... 02?sk=info
Github ---> https://github.com/pymblesoftware
If the O300 uses the same non-standard 3-pin fan connector header as the Fuel and Origin 350, there's a photo of a spliced fan wire connection at the bottom of the Fuel Hardware Aggregator (the Aggregator might be of further interest to you as the Fuel and O300 are both IP35 at the software level): viewtopic.php?t=9046&f=3#p136996

As jan-jaap already mentioned, when sourcing fans the other thing to check is the RPM rating of the replacement fan. The "Warning RPM" level for your O300s is 2160 RPM, if the replacement fans run any slower than that you'll still run into issues with the L1 controller automatically shutting down the system. I recently added some low-noise variable speed fans to an O350, even though the minimum speed rating for the variable speed fans was above the warning rpm level, they still generated fan warnings during a cold start because the replacement fans had an integral thermostat and didn't spin up until a certain temperature was reached.
***********************************************************************
Welcome to ARMLand - 0/0x0d00
running...(sherwood-root 0607201829)
* InfiniteReality/Reality Software, IRIX 6.5 Release *
***********************************************************************
Well the glacial pace of this project took a huge leap forward last night when I installed a fan set and got past this stage. I ended up just buying some used sgi ones.