Hi folks,
I've acquired another Fuel; this one has a 500MHz PIMM, 2GB RAM and, of course, just a V10.
A couple of things I noticed when I first powered it up:
L1 didn't echo any of my commands - it would just respond "ok" or print the expected output, i.e. there was no usual 001a01-L1> prompt.
Nothing shows up on serial port 1 - everything goes to the L1 port, so after booting I had to press CTRL-D to get into the PROM - which is fine by me.
I noticed that the PROM doesn't really hold settings (at least not all of them): settings like netaddr would survive a soft reset, but settings like srvaddr, tapedevice, fxaddr and notape (all the standard ones for installation) wouldn't (of course, all of them were set with -p).
I was able to start FX remotely, partition my drive, etc., but when I then started SASH and typed "install", it threw a TLB error at me:
Code:
A 000: *** TLB Refill Exception on node 0
A 000: *** EPC: 0xc00000001fc44de4 (0xc00000001fc44de4)
A 000: *** Press ENTER to continue.
(It may not be exactly the address I got - my scrollback had run out, so I copied it from someone else.)
Well, I thought, maybe that's because my console was set to g, so I changed it to d (which wouldn't survive a reboot) and, just in case, plugged in a keyboard and a mouse - who knows, maybe it's as picky as an O2.
That didn't help.
So then I went into L1 again, reset the NVRAM, checked the flash status, etc. I finally got the good ol' L1 prompt - but now it won't boot anymore; all I get is this:
Code:
escaping to L1 system controller
001a01-L1>power up
returning to console mode 001a01 console, <CTRL_T> to escape to L1
Starting PROM Boot process
IP35 PROM SGI Version 6.180 built 01:50:56 PM Nov 18, 2003
Running in DDR mode
Testing/Initializing memory ............... DONE
Copying PROM code to memory ............... DONE
Discovering local IO ...................... DONE
Discovering NUMAlink connectivity .........
Local hub NUMAlink is down.
*** Local network link down
DONE
Found 1 objects (1 hubs, 0 routers) in 5895 usec
Waiting for peers to complete discovery.... DONE
No other nodes present; becoming global master
Global master is /hw/rack/001/bay/01
Intializing any CPUless nodes.............. DONE
Checking partitioning information ......... DONE
No other nodes present; becoming partition master
Loading BASEIO prom ....................... DONE
BASEIO PROM Monitor SGI Version 6.180 built 01:47:37 PM Nov 18, 2003 (BE64)
1 CPUs on 1 nodes found.
NVRAM checksum is incorrect: reinitializing.
Automatic update of PROM environment disabled
PS/2 Keyboard & Mouse diagnostics
Found mouse on port 0
Found keyboard on port 1
PS/2 Keyboard & Mouse diagnostics passed
Graphics diagnostics
Odyssey board #0 found on nasid 0
Running Odyssey xtalk sanity diag...
Board version 1 - Buzz revision 2B
On board sdram size: 32 Mb
Cas latency: CAS 3
2 banks by sdram module
Running Odyssey Buzz registers diag...
Device passed diagnostics
Installing PROM Device drivers ............
Base I/O Ethernet set to /dev/ethernet/ef0
Installing Graphics Console...
graphics install: searching for pipe 0
Walking SCSI Adapter 0, (pci id 1)
1- 2- 3- 4- 5- 6- 7- 8- 9- 10- 11- 12- 13- 14- 15- = 0 device(s)
Walking SCSI Adapter 1, (pci id 1)
1- 2- 3- 4- 5- 6- 7- 8- 9- 10- 11- 12- 13- 14- 15- = 0 device(s)
Initializing PROM Device drivers .......... DONE
I believe it should go on to hardware discovery next - but it freezes at this point...
Some info from L1:
Code:
001a01-L1>flash status
Flash image B currently booted
Image Status Revision Built
----- ------------- ---------- -----
A valid 1.22.6 08/07/2003 12:56:31
B user default 1.22.6 08/07/2003 12:56:31
001a01-L1>
001a01-L1>serial all
Data Location Value
------------------------------ ------------ --------
Local System Serial Number EEPROM 08:00:69:10:26:C9
Local Brick Serial Number EEPROM MZL370
Reference Brick Serial Number NVRAM MZL370
EEPROM Product Name Serial Part Number Rev T/W
---------- -------------- ---------- -------------------- --- ------
NODE IP34 MZL370 030_1707_003 M 00
MAC MAC ADDRESS NA NA NA NA
PIMM IP34PIMM MEF168 030_1708_002 J 00
XIO ASTODYB MFB185 030_1725_001 F 00
EEPROM JEDEC-SPD Info Part Number Rev Speed SGI
---------- ------------------------ ------------------ ---- ------ --------
DIMM 0 CE0000000000000027F30900 M3 47L6510BT0-CA0 0B 10.0 N/A
DIMM 2 CE0000000000000028C4B801 M3 47L6510BT0-CA0 0B 10.0 N/A
DIMM 1 CE0000000000000027FA0900 M3 47L6510BT0-CA0 0B 10.0 N/A
DIMM 3 CE0000000000000028C2B801 M3 47L6510BT0-CA0 0B 10.0 N/A
001a01-L1>
001a01-L1>env
Environmental monitoring is enabled and running.
Description State Warning Limits Fault Limits Current
-------------- ---------- ----------------- ----------------- -------
12V Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.06
12V IO Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.12
5V Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.07
3.3V Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.32
2.5V Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.46
1.5V Enabled 10% 1.35/ 1.65 20% 1.20/ 1.80 1.47
5V aux Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.10
3.3V aux Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.27
PIMM0 12V bias Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.12
Fuel SRAM Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.54
Fuel CPU Enabled 10% 1.44/ 1.76 20% 1.28/ 1.92 1.61
PIMM0 1.5V Enabled 10% 1.35/ 1.65 20% 1.20/ 1.80 1.49
PIMM0 3.3V aux Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.27
PIMM0 5V aux Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.10
XIO 12V bias Enabled 10% 10.80/ 13.20 20% 9.60/ 14.40 12.00
XIO 5V Enabled 10% 4.50/ 5.50 20% 4.00/ 6.00 5.07
XIO 2.5V Enabled 10% 2.25/ 2.75 20% 2.00/ 3.00 2.47
XIO 3.3V aux Enabled 10% 2.97/ 3.63 20% 2.64/ 3.96 3.30
Description State Warning RPM Current RPM
-------------- ---------- ----------- -----------
FAN 0 EXHAUST Enabled 920 1357
FAN 1 HD Enabled 1560 2671
FAN 2 PCI Enabled 1120 1790
FAN 3 XIO 1 Enabled 1600 2200
FAN 4 XIO 2 Enabled 1600 2157
FAN 5 PS Enabled 1600 2531
Advisory Critical Fault Current
Description State Temp Temp Temp Temp
-------------- ---------- --------- --------- --------- ---------
NODE 0 Enabled 60C/140F 65C/149F 70C/158F 46C/114F
NODE 1 Enabled 60C/140F 65C/149F 70C/158F 44C/111F
NODE 2 Enabled 60C/140F 65C/149F 70C/158F 29C/ 84F
PIMM Enabled 60C/140F 65C/149F 70C/158F 60C/140F
ODYSSEY Enabled 60C/140F 65C/149F 70C/158F 35C/ 95F
BEDROCK Enabled 70C/158F 75C/167F 80C/176F 54C/129F
001a01-L1>
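For anyone who would rather scan the env dump above than eyeball it, here is a rough Python sketch that picks out the temperature rows and lists any sensor whose current reading is at or past its advisory limit. The function name, the regex and the "at or past advisory" rule are my own assumptions, not an SGI tool, and it only handles the row layout shown in this post.

```python
import re

# Assumed layout of a temperature row in the L1 "env" output, e.g.
#   PIMM Enabled 60C/140F 65C/149F 70C/158F 60C/140F
# (name, state, advisory, critical, fault, current). Voltage and fan rows
# do not have the "<n>C/<n>F" columns, so they simply fail to match.
TEMP_ROW = re.compile(
    r"^(?P<name>.+?)\s+Enabled\s+"
    r"(?P<advisory>\d+)C/\s*\d+F\s+"
    r"(?P<critical>\d+)C/\s*\d+F\s+"
    r"(?P<fault>\d+)C/\s*\d+F\s+"
    r"(?P<current>\d+)C/\s*\d+F\s*$"
)


def sensors_at_advisory(env_text):
    """Return (name, current_C, advisory_C) for sensors at or past advisory."""
    flagged = []
    for line in env_text.splitlines():
        m = TEMP_ROW.match(line.strip())
        if m and int(m["current"]) >= int(m["advisory"]):
            flagged.append((m["name"], int(m["current"]), int(m["advisory"])))
    return flagged


# Temperature rows copied from the env output in this post.
SAMPLE = """\
NODE 0 Enabled 60C/140F 65C/149F 70C/158F 46C/114F
NODE 1 Enabled 60C/140F 65C/149F 70C/158F 44C/111F
NODE 2 Enabled 60C/140F 65C/149F 70C/158F 29C/ 84F
PIMM Enabled 60C/140F 65C/149F 70C/158F 60C/140F
ODYSSEY Enabled 60C/140F 65C/149F 70C/158F 35C/ 95F
BEDROCK Enabled 70C/158F 75C/167F 80C/176F 54C/129F
"""

if __name__ == "__main__":
    for name, current, advisory in sensors_at_advisory(SAMPLE):
        print(f"{name}: {current}C (advisory limit {advisory}C)")
```

On the rows above this only lists the PIMM, which reads 60C against a 60C advisory limit. Whether that has anything to do with the hang is a guess on my part.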
I did notice it says the NVRAM checksum is incorrect; I wonder how much that is related...
Any ideas?
I do have another Fuel, and I might be able to try swapping the timekeeper chips (Dallas and ST), but I wonder if anything else could be done to find out what's going on...
Cheers