SGI: Hardware

TFT display for Onyx2

Hi !.

In our lab we have octane 2 with IRIX 6.5. We are unable to boot into the system. We are getting following error after "Power-on Diagnostics".

Quote:
Internal SCSI Device/Cable Diagnostic Failed *FAILED*
External SCSI Device/Cable Diagnostic Failed *FAILED*

Check or replace : Disk, Floppy, CDROM, or SCSI Cable.
Diagnostics Failed
[ Press any key to continue. ]


After pressing 'enter', I get the menu(Start System, Install System Software, ...), when I Chose "Start System",

I get "Unrecovered Data block Error"(s) on following paths : /hw/node/xtalk/pci/0/scsi_ctrl/...

And then I get following Error

Quote:
PANIC: CPU 0 : Vfs_mountroot : no root found.
Reboot Started from CPU 0
CPU 0 rebooting


And no further response from the machine. I have following doubts.

Quote:
1. In case of complete Disk failure, How to check remaining disks Health from "Command Monitor"

2. If they are healthy can i install fresh IRIX on that Disk.

3. Is there any way to back up the current system, which fails even to enter "Single user" mode.

wishing your earnest reply,
kabees


Try checking your SCSI back pane .. seems pretty obvious where the issue lies.
Either that or your onboard controller gave up the ghost.



_________________
MAYA, nut-
:Octane2: :Octane2: Octane 2 R14k 600 V12 4GB, Octane2 R14K 600 V10 1GB ,
:Onyx2: :Onyx2: Onyx2 IR3 4GB Quad R14K 500 DIVO, Onyx2 IR Quad R12K 400 2GB,
:Indigo2: SGI Indigo 2 R8K75 TEAL Extreme 256MB,
:Indigo2IMP: SGI Indigo 2 R10K 195 Solid Impact 256MB, MAX Impact Pending
,
Apple G5 Quad, NV Quadro 4500 + 7800GT, 12GB RAM
Sun Blade 1000 Dual 900 XVR 1000
Given the first error messages from your post I would guess it is an issue with the scsi backplane or controller or maybe a bad external scsi cable and not a disk failure. If you have a second system you can check the drives by plugging them in as option drives and do an fdisk for instance.

If you don"t have a second system you could remove all drives, insert one of the option drives in the lowest slot and see where that gets you. It is unlikely all disks fail at once. You will obviously not be able to boot from the option drive but if the SCSI device / cable error messages remain it is very likely not a disk issue. If they go away you could do an fx and look at the partition table without making any changes, thus not destroying your data. If that looks good do the same with the root drive.

_________________
:Indigo: :Indy: :O2: :1600SW: :1600SW: :Octane: :Octane: :Octane: :Octane: :Octane2: :Fuel: :Onyx2: :Onyx2: :Onyx2: :O2000: :O3200:
Can you hear if your hard drive is actually spinning up? Oddly enough, just had a very similar error message like this on the week-end.

In my case the hard drive was not spinning up (i.e. were stuck) and an error message was given similar to yours. If you have a spare drive, perhaps changing over a hard drive to rule that out might help...
The fact that there are errors on both SCSI HBAs (internal and external) does point to something else besides a single disk failure. What is your setup? Are you using both SCSI channels?

Both SCSI HBAs on Octane are fed from the same BRIDGE chip and XIO connector, so you might want to try removing the IP30 board and gently blowing the compression connector (from the side and using dry air) then reseating to see if it's an odd intermittent connection thing. SCSI 0 goes via an internal connector to the drive backplane, and reseating the board will also reseat that. Bus 1 is soldered through to the external port - not sure what could be causing your error there. Is it in use?

_________________
Damn the torpedoes, full speed ahead!

:Indigo: :Octane: :Indigo2: :Indigo2IMP: :Indy: :PI: :O200: :ChallengeL:
SAQ wrote:
~ and gently blowing the compression connector ~


Yipe! try to avoid getting gob on the compression connector.Blowing not recommended :P
If all else fails, get a can of compressed air, and
some contact cleaner. Or just remove the compression connector from the board and soak in pure alcohol.
Did that on some dysfunctional node boards, worked like a charm. lol


_________________
MAYA, nut-
:Octane2: :Octane2: Octane 2 R14k 600 V12 4GB, Octane2 R14K 600 V10 1GB ,
:Onyx2: :Onyx2: Onyx2 IR3 4GB Quad R14K 500 DIVO, Onyx2 IR Quad R12K 400 2GB,
:Indigo2: SGI Indigo 2 R8K75 TEAL Extreme 256MB,
:Indigo2IMP: SGI Indigo 2 R10K 195 Solid Impact 256MB, MAX Impact Pending
,
Apple G5 Quad, NV Quadro 4500 + 7800GT, 12GB RAM
Sun Blade 1000 Dual 900 XVR 1000
Ryan Fox wrote:
SAQ wrote:
~ and gently blowing the compression connector ~


Yipe! try to avoid getting gob on the compression connector.Blowing not recommended :P
If all else fails, get a can of compressed air, and
some contact cleaner. Or just remove the compression connector from the board and soak in pure alcohol.
Did that on some dysfunctional node boards, worked like a charm. lol



Hence the admonition to use dry air (forgot to note that it shouldn't be high-pressure, so don't grab your tank compressor).

SGI has a manual on how to clean CPAPs on TechPubs - http://techpubs.sgi.com/library/dynaweb ... l/apb.html .

Try reseating your RAM as well, as not-too-good-but-good-enough-to-pass-the-RAM-diags connections in RAM can show up as many different errors.

_________________
Damn the torpedoes, full speed ahead!

:Indigo: :Octane: :Indigo2: :Indigo2IMP: :Indy: :PI: :O200: :ChallengeL:
It is all very nice that we present options and discuss pros and cons of "blowjobs" and alcohol. I wonder if the original poster will show up again and give us some feedback on his findings, though.

_________________
:Indigo: :Indy: :O2: :1600SW: :1600SW: :Octane: :Octane: :Octane: :Octane: :Octane2: :Fuel: :Onyx2: :Onyx2: :Onyx2: :O2000: :O3200:
rusti wrote:
pros and cons of "blowjobs" and alcohol.

:lol: :lol: :lol:

_________________
Now this is a deep dark secret, so everybody keep it quiet :)
It turns out that when reset, the WD33C93 defaults to a SCSI ID of 0, and it was simpler to leave it that way... -- Dave Olson, in comp.sys.sgi

Currently in commercial service: Image :Octane2: :Onyx2: (2x) :0300:
In the museum: almost every MIPS/IRIX system.
jan-jaap wrote:
rusti wrote:
pros and cons of "blowjobs" and alcohol.

:lol: :lol: :lol:




AHEM! cleanup on isle 3...

_________________
MAYA, nut-
:Octane2: :Octane2: Octane 2 R14k 600 V12 4GB, Octane2 R14K 600 V10 1GB ,
:Onyx2: :Onyx2: Onyx2 IR3 4GB Quad R14K 500 DIVO, Onyx2 IR Quad R12K 400 2GB,
:Indigo2: SGI Indigo 2 R8K75 TEAL Extreme 256MB,
:Indigo2IMP: SGI Indigo 2 R10K 195 Solid Impact 256MB, MAX Impact Pending
,
Apple G5 Quad, NV Quadro 4500 + 7800GT, 12GB RAM
Sun Blade 1000 Dual 900 XVR 1000
:D

Hi Mates, excuse me, I just abandoned the machine for some time. I took the machine today. And eventually i checked the thread. Wow !. I wondered at the response. Thanks for the contributors.

I will try all possible options, and keep the thread updated.

thanking u,
kabees.