Hi all,
My TEZRO with IP59-PIMM had not started up Power On Diagnostic because of subjected reason.
Some days ago, my TEZRO came to be stopped and reported "TLB refilll exception" error many times.
When I checked status of components from PROM console(issued enableall or so),
system had suddenly stopped that reason, and after re-plug power cable, POD stopped staring up.
(The reason of TLB refill error may trouble of DIMMs.
Because when I inspected the DIMMs, I found some ceramic capcitor had removed from component side of DIMMs.
Now these DIMMs group had removed.)
When I issued "pwr u" from L1 console, fans, Hdds are spinned up, but POD is not started up.
And issued "leds" after that, system reported following messages.
001c01-L1>leds
CPU A: 0x9a: FLED_DISABLED: CPU is disabled by env variable.
CPU B: < CPU not present >
CPU C: 0x9a: FLED_DISABLED: CPU is disabled by env variable.
CPU D: < CPU not present >
Ofcause, all CPUs enabled in L1 level.
001c01-L1>cpu
CPU Present Enabled
--- ------- -------
0A 1 1
0B 0 0
0C 1 1
0D 0 0
I had also reflashed L1 firmware from software L2 emulator, but could not re-enable.
Here is my logs and related infomations.
001c01-L1>pwr u
ERROR: no response from 001c01
(wait 10 sec.)
001c01-L1>leds
CPU A: 0x9a: FLED_DISABLED: CPU is disabled by env variable.
CPU B: < CPU not present >
CPU C: 0x9a: FLED_DISABLED: CPU is disabled by env variable.
CPU D: < CPU not present >
001c01-L1>pwr d
001c01-L1>log
04/12/13 08:58:09 ChiWS IP59
04/12/13 08:58:11 USB0: waiting on open
04/15/13 09:00:47 L1 booting 1.40.6
04/15/13 09:00:47 ChiWS IP59
04/15/13 09:00:49 USB0: waiting on open
04/15/13 09:02:24 USB0: opened
04/15/13 09:02:24 USB0: registered as remote
04/15/13 09:02:24 USB0: registered for events
04/15/13 09:07:08 power up (COMMAND)
04/15/13 09:07:13 Node 0 XTalk clock 88
04/15/13 09:07:14 reset again MIPS
04/15/13 09:07:19 Node 0 XTalk clock 88
04/15/13 09:07:46 power down (COMMAND)
04/15/13 09:12:32 USB-R: USB:connection lost
04/15/13 09:12:32 UNREG: 300054c0 0 7
04/15/13 09:12:32 USB0: unregistered
04/15/13 09:12:33 USB0-R: IRouter:read failed - read error
04/15/13 09:12:33 USB0: waiting on open
04/15/13 22:11:19 USB0: opened
04/15/13 22:11:19 USB0: registered as remote
04/15/13 22:11:19 USB0: registered for events
04/15/13 22:27:13 USB-R: USB:connection lost
04/15/13 22:27:13 UNREG: 300054c0 0 7
04/15/13 22:27:13 USB0: unregistered
04/15/13 22:27:14 USB0-R: IRouter:read failed - read error
04/15/13 22:27:14 USB0: waiting on open
04/15/13 22:45:29 USB0: opened
04/15/13 22:45:29 USB0: registered as remote
04/15/13 22:45:29 USB0: registered for events
04/15/13 23:33:30 2MB flash part!!
04/15/13 23:43:19 L1 booting 1.48.1
04/15/13 23:43:19 ChiWS IP59
04/15/13 23:43:21 USB0: waiting on open
04/15/13 23:43:21 USB0: opened
04/15/13 23:43:21 USB0: registered as remote
04/15/13 23:43:21 USB0: registered for events
04/15/13 23:50:41 power up (COMMAND)
04/15/13 23:50:45 Node 0 XTalk clock 88
04/15/13 23:50:47 reset again MIPS
04/15/13 23:50:51 Node 0 XTalk clock 88
04/15/13 23:53:00 power down (COMMAND)
04/15/13 23:53:06 L1 booting 1.48.1
04/15/13 23:53:06 ChiWS IP59
04/15/13 23:53:08 USB0: waiting on open
04/15/13 23:53:08 USB0: opened
04/15/13 23:53:08 USB0: registered as remote
04/15/13 23:53:08 USB0: registered for events
04/16/13 00:01:42 power up (COMMAND)
04/16/13 00:01:46 Node 0 XTalk clock 88
04/16/13 00:01:48 reset again MIPS
04/16/13 00:01:52 Node 0 XTalk clock 88
04/16/13 00:02:12 power down (COMMAND)
001c01-L1>serial all
Data Location Value
------------------------------ ------------ --------
Local System Serial Number NVRAM P1003842
Reference System Serial Number NVRAM P1003842
Local Brick Serial Number EEPROM NWC610
Reference Brick Serial Number NVRAM NWC610
EEPROM Product Name Serial Part Number Rev T/W
---------- -------------- ------------- -------------------- --- ------
INTERFACE WS_INT_53 NWC610 030_1881_007 B 00
IO9 IO9 NWY514 030_1771_006 A 00
ODYSSEY ODY128B1_2 NWG705 030_1884_005 B 00
SNOWBALL no hardware detected
NODE IP59_2CPU NSD938 030_2059_002 C 00
IO DGHTR CHWS_IO_DAUG NVG234 030_1875_003 A 00
EEPROM JEDEC-SPD Info Part Number Rev Speed SGI
---------- ------------------------ ------------------ ---- ------ --------
DIMM 0 no hardware detected
DIMM 2 no hardware detected
DIMM 4 7F94FFFFFFFFFFFF937DE80D SM57264DSGI100C3 00FF 8.0 N/A
DIMM 6 7F94FFFFFFFFFFFFD37DE80D SM57264DSGI100C3 00FF 8.0 N/A
DIMM 1 no hardware detected
DIMM 3 no hardware detected
DIMM 5 7F94FFFFFFFFFFFF2B99700D SM57264DSGI100C2 00FF 8.0 N/A
DIMM 7 7F94FFFFFFFFFFFFB37DE80D SM57264DSGI100C3 00FF 8.0 N/A
001c01-L1>cpu
CPU Present Enabled
--- ------- -------
0A 1 1
0B 0 0
0C 1 1
0D 0 0
001c01-L1>flash status
Flash image A currently booted
Image Status Revision Built
----- ------------- ---------- -----
A user default 1.48.1 01/22/2007 11:34:34 (<- reflashed image)
B valid 1.40.6 01/06/2006 13:16:50
The reason of this error is some strange data had written to PROM or NVRAM because of brokened DIMM, I think.
Does anyone have any solution to re-enable the CPUs?
And sorry for my poor English, I'm not native speaker.
My TEZRO with IP59-PIMM had not started up Power On Diagnostic because of subjected reason.
Some days ago, my TEZRO came to be stopped and reported "TLB refilll exception" error many times.
When I checked status of components from PROM console(issued enableall or so),
system had suddenly stopped that reason, and after re-plug power cable, POD stopped staring up.
(The reason of TLB refill error may trouble of DIMMs.
Because when I inspected the DIMMs, I found some ceramic capcitor had removed from component side of DIMMs.
Now these DIMMs group had removed.)
When I issued "pwr u" from L1 console, fans, Hdds are spinned up, but POD is not started up.
And issued "leds" after that, system reported following messages.
001c01-L1>leds
CPU A: 0x9a: FLED_DISABLED: CPU is disabled by env variable.
CPU B: < CPU not present >
CPU C: 0x9a: FLED_DISABLED: CPU is disabled by env variable.
CPU D: < CPU not present >
Ofcause, all CPUs enabled in L1 level.
001c01-L1>cpu
CPU Present Enabled
--- ------- -------
0A 1 1
0B 0 0
0C 1 1
0D 0 0
I had also reflashed L1 firmware from software L2 emulator, but could not re-enable.
Here is my logs and related infomations.
001c01-L1>pwr u
ERROR: no response from 001c01
(wait 10 sec.)
001c01-L1>leds
CPU A: 0x9a: FLED_DISABLED: CPU is disabled by env variable.
CPU B: < CPU not present >
CPU C: 0x9a: FLED_DISABLED: CPU is disabled by env variable.
CPU D: < CPU not present >
001c01-L1>pwr d
001c01-L1>log
04/12/13 08:58:09 ChiWS IP59
04/12/13 08:58:11 USB0: waiting on open
04/15/13 09:00:47 L1 booting 1.40.6
04/15/13 09:00:47 ChiWS IP59
04/15/13 09:00:49 USB0: waiting on open
04/15/13 09:02:24 USB0: opened
04/15/13 09:02:24 USB0: registered as remote
04/15/13 09:02:24 USB0: registered for events
04/15/13 09:07:08 power up (COMMAND)
04/15/13 09:07:13 Node 0 XTalk clock 88
04/15/13 09:07:14 reset again MIPS
04/15/13 09:07:19 Node 0 XTalk clock 88
04/15/13 09:07:46 power down (COMMAND)
04/15/13 09:12:32 USB-R: USB:connection lost
04/15/13 09:12:32 UNREG: 300054c0 0 7
04/15/13 09:12:32 USB0: unregistered
04/15/13 09:12:33 USB0-R: IRouter:read failed - read error
04/15/13 09:12:33 USB0: waiting on open
04/15/13 22:11:19 USB0: opened
04/15/13 22:11:19 USB0: registered as remote
04/15/13 22:11:19 USB0: registered for events
04/15/13 22:27:13 USB-R: USB:connection lost
04/15/13 22:27:13 UNREG: 300054c0 0 7
04/15/13 22:27:13 USB0: unregistered
04/15/13 22:27:14 USB0-R: IRouter:read failed - read error
04/15/13 22:27:14 USB0: waiting on open
04/15/13 22:45:29 USB0: opened
04/15/13 22:45:29 USB0: registered as remote
04/15/13 22:45:29 USB0: registered for events
04/15/13 23:33:30 2MB flash part!!
04/15/13 23:43:19 L1 booting 1.48.1
04/15/13 23:43:19 ChiWS IP59
04/15/13 23:43:21 USB0: waiting on open
04/15/13 23:43:21 USB0: opened
04/15/13 23:43:21 USB0: registered as remote
04/15/13 23:43:21 USB0: registered for events
04/15/13 23:50:41 power up (COMMAND)
04/15/13 23:50:45 Node 0 XTalk clock 88
04/15/13 23:50:47 reset again MIPS
04/15/13 23:50:51 Node 0 XTalk clock 88
04/15/13 23:53:00 power down (COMMAND)
04/15/13 23:53:06 L1 booting 1.48.1
04/15/13 23:53:06 ChiWS IP59
04/15/13 23:53:08 USB0: waiting on open
04/15/13 23:53:08 USB0: opened
04/15/13 23:53:08 USB0: registered as remote
04/15/13 23:53:08 USB0: registered for events
04/16/13 00:01:42 power up (COMMAND)
04/16/13 00:01:46 Node 0 XTalk clock 88
04/16/13 00:01:48 reset again MIPS
04/16/13 00:01:52 Node 0 XTalk clock 88
04/16/13 00:02:12 power down (COMMAND)
001c01-L1>serial all
Data Location Value
------------------------------ ------------ --------
Local System Serial Number NVRAM P1003842
Reference System Serial Number NVRAM P1003842
Local Brick Serial Number EEPROM NWC610
Reference Brick Serial Number NVRAM NWC610
EEPROM Product Name Serial Part Number Rev T/W
---------- -------------- ------------- -------------------- --- ------
INTERFACE WS_INT_53 NWC610 030_1881_007 B 00
IO9 IO9 NWY514 030_1771_006 A 00
ODYSSEY ODY128B1_2 NWG705 030_1884_005 B 00
SNOWBALL no hardware detected
NODE IP59_2CPU NSD938 030_2059_002 C 00
IO DGHTR CHWS_IO_DAUG NVG234 030_1875_003 A 00
EEPROM JEDEC-SPD Info Part Number Rev Speed SGI
---------- ------------------------ ------------------ ---- ------ --------
DIMM 0 no hardware detected
DIMM 2 no hardware detected
DIMM 4 7F94FFFFFFFFFFFF937DE80D SM57264DSGI100C3 00FF 8.0 N/A
DIMM 6 7F94FFFFFFFFFFFFD37DE80D SM57264DSGI100C3 00FF 8.0 N/A
DIMM 1 no hardware detected
DIMM 3 no hardware detected
DIMM 5 7F94FFFFFFFFFFFF2B99700D SM57264DSGI100C2 00FF 8.0 N/A
DIMM 7 7F94FFFFFFFFFFFFB37DE80D SM57264DSGI100C3 00FF 8.0 N/A
001c01-L1>cpu
CPU Present Enabled
--- ------- -------
0A 1 1
0B 0 0
0C 1 1
0D 0 0
001c01-L1>flash status
Flash image A currently booted
Image Status Revision Built
----- ------------- ---------- -----
A user default 1.48.1 01/22/2007 11:34:34 (<- reflashed image)
B valid 1.40.6 01/06/2006 13:16:50
The reason of this error is some strange data had written to PROM or NVRAM because of brokened DIMM, I think.
Does anyone have any solution to re-enable the CPUs?
And sorry for my poor English, I'm not native speaker.