SGI: Hardware

O300 success and disappointment - memory issue

So my 1GB memory kit arrived today: two 512MB DIMMs (holy cow, are they huge!). As soon as I got home I prepped an old Linux laptop with minicom and a Laplink cable and proceeded to install the DIMMs into the first unit in the pile. I typed "power up" and the first O300 came to life, awesome! After starting the system I switched to console mode by hitting Ctrl-D and watched as the memory check reported an error. The error is "swapping bank 0 with bank 1". Rats, do I have a bad DIMM? So I opened the system, swapped the memory modules and powered it back up. Same message.
Further down the console boot log it says:
memory enabled = 512MB
memory disabled = 256MB

Thinking I might have been sent two mismatched DIMMs, I took them out and inspected them carefully, even comparing chip part numbers. Identical. Okay, let's try the next system: power up, and what do you know, no memory errors and "memory enabled = 1024MB". I went back to the first unit and checked the DIMM sockets for dirt. Nothing, clean as a whistle. I tried installing the memory again, wiggling the DIMMs to make sure they were firmly seated, and still got the same memory error messages.

Now here is where it gets interesting. The first unit I opened still had the two screws securing the access lid. The other four units were missing those screws from when their memory was pulled. Whoever pulled the memory apparently knew the first unit had none and never bothered to open it. This leads me to believe that its memory controller had gone bad before the system was decommissioned. From what I know, nodes can be operated without memory when linked to a unit or units with memory, correct? They probably just ran the unit without memory.

I have not yet tested the other 3 systems, but I will tomorrow when I have some more time. Looks like one system is a partial lemon :cry: . Is there any way to fix this problem, or is this a bad logic board?

_________________
Coming soon! :O3x05R: On their way: :Onyx2: :Octane:
Before panicking, try resetting the POD (resetall, enableall; do a search for "enableall" "POD" "SGI" and you'll get instructions), provided that the O3x follows the same convention as the O2x.

_________________
Damn the torpedoes, full speed ahead!

Systems available for remote access on request.

:Indigo: :Octane: :Indigo2: :Indigo2IMP: :Indy: :PI: :O200: :ChallengeL: :O2000R: (single-CM)
SAQ wrote:
Before panicking, try resetting the POD (resetall, enableall; do a search for "enableall" "POD" "SGI" and you'll get instructions), provided that the O3x follows the same convention as the O2x.


Similar: I had a stick of O300 RAM that would be disabled every so often, and some variation of "enable" worked. I don't recall the exact syntax. It may even have been enableall, from "5. Enter Command Monitor" on the O300 menu.

R.

_________________
Gods of death eat only apples

Arege seems to be nothing more than arege -- an arege researcher

:Tezro: :Tezro: :Onyx2R: :Onyx2RE: :Onyx2: :O3x04R: :O3x0: :O200: :Octane: :Octane2: :O2: :O2: :Indigo2IMP: :PI: :PI: :1600SW: :1600SW: :Indy: :Indy: :Indy: :Indy: :Indy:
:hpserv: J5600, 2 x Mac, 3 x SUN, Alpha DS20E, Alpha 800 5/550, 3 x RS/6000, Amiga 4000 VideoToaster, Amiga4000 -030, 733MHz Sam440 AmigaOS 4.1 update 1. Tandem Himalaya S-Series Nonstop S72000 ServerNet.

Sold: :Indy: :Indy: :Indy: :Indigo:

Cortex ---> http://www.facebook.com/pages/Cortex-th ... 11?sk=info
Minnie ---> http://www.facebook.com/pages/Minnie-th ... 02?sk=info
Book ----> http://pymblesoftware.com/book/
Github ---> https://github.com/pymblesoftware
Visit http://www.pymblesoftware.com
Search for "Pymble", "InstaElf", "CryWhy" or "Cricket Score Sheet" in the iPad App store or search for "Pymble" or "CryWhy" in the iPhone App store.
Perhaps not your issue here, but I've had the 030-1060-xxx DIMMs show up with SMT components knocked off the PCB. I haven't tried plugging them into anything yet...

_________________
Then? :IRIS3130: ... Now? :O3x02L: :1600SW: +MLA :Fuel: :Octane2: :Octane: :Indigo2IMP: ... Other: DEC :BA213: :BA123: Sun , DG AViiON , NeXT :Cube:
The suggestions to clear the power-on diagnostic (aka POD) logs are good ones.

To access POD mode, stop at the PROM command line (item 5 in the PROM menu list), and sequentially run the following commands from the command line:
  • pod
  • go cac
  • clearalllogs
  • initalllogs
  • flush
  • reset (the system will restart)

After the system restarts, go back into the PROM monitor and execute:
  • enableall
  • update
  • reset (the system will restart)
If the memory errors don't reappear, you're probably safe.

If they do reappear, then I'd suggest trying that pair of DIMMs in the second set of memory slots - slots 1 and 3 normally hold the first pair, slots 2 and 4 the second pair. The error message you received, "swapping bank 0 with bank 1", makes it sound like the system has disabled bank 0 and is expecting to find the memory in bank 1. It's a long shot that may not work, but if it does it beats replacing the logic board.
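For anyone who wants both phases in one place, here is a minimal Python sketch that simply prints the steps above as a numbered checklist. It does not talk to the machine at all; the command names are copied from this post, while the phase titles and the names POD_PHASE, PROM_PHASE and checklist are only illustrative.

```python
# Command sequences copied from the post above.
# Nothing here touches real hardware; it only formats a checklist.
POD_PHASE = ["pod", "go cac", "clearalllogs", "initalllogs", "flush", "reset"]
PROM_PHASE = ["enableall", "update", "reset"]

# Phase titles are my own wording (an assumption), not PROM output.
PHASES = [
    ("Phase 1: at the PROM command line, enter POD and clear the logs", POD_PHASE),
    ("Phase 2: after the restart, back at the PROM command line", PROM_PHASE),
]


def checklist(phases):
    """Return checklist lines: one title per phase, then numbered commands."""
    lines = []
    for title, commands in phases:
        lines.append(title)
        for step, command in enumerate(commands, start=1):
            lines.append(f"  {step}. {command}")
    return lines


if __name__ == "__main__":
    print("\n".join(checklist(PHASES)))
```

Both phases end with reset, so expect two restarts before the memory check runs with clean logs.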

_________________
***********************************************************************
Welcome to ARMLand - 0/0x0d00
running...(sherwood-root 0607201829)
* InfiniteReality/Reality Software, IRIX 6.5 Release *
***********************************************************************
recondas wrote:
The suggestions to clear the power on diagnostic (aka POD) logs are good ones. [...]

You sir are MY HERO!

I went from this: :(
Code:
**** System Configuration and Diagnostics Summary ****
CONFIG:
No. of NODEs enabled    = 1
No. of NODEs disabled   = 0
No. of CPUs enabled     = 4
No. of CPUs disabled    = 0
Mem enabled             = 512 MB
Mem disabled            = 256 MB
No. of RTRs enabled     = 0
No. of RTRs disabled    = 0

DIAG RESULTS:
/hw/module/001c16/node/mem: MEMBANK(S) 0  disabled
Reason:
Bank 0: Some DIMMs failed mem test.
**** End System Configuration and Diagnostics Summary ****


To this! :D

Code:
**** System Configuration and Diagnostics Summary ****
CONFIG:
No. of NODEs enabled    = 1
No. of NODEs disabled   = 0
No. of CPUs enabled     = 4
No. of CPUs disabled    = 0
Mem enabled             = 1024 MB
Mem disabled            = 0 MB
No. of RTRs enabled     = 0
No. of RTRs disabled    = 0

DIAG RESULTS:
ALL DIAGS PASSED.
**** End System Configuration and Diagnostics Summary ****


Thank you for the clear and precise instructions.
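If you capture the console output to a file (minicom can log a session), a summary like the ones above is easy to check with a script. This is only a rough Python sketch: the function names parse_config and disabled_banks are made up for illustration, the sample text is copied from the first summary in this post, and I am assuming that several disabled banks would be listed as numbers on the same MEMBANK(S) line, which I have not verified.

```python
import re

# Sample copied from the "before" summary in this post.
BEFORE = """\
**** System Configuration and Diagnostics Summary ****
CONFIG:
No. of NODEs enabled    = 1
No. of NODEs disabled   = 0
No. of CPUs enabled     = 4
No. of CPUs disabled    = 0
Mem enabled             = 512 MB
Mem disabled            = 256 MB
No. of RTRs enabled     = 0
No. of RTRs disabled    = 0

DIAG RESULTS:
/hw/module/001c16/node/mem: MEMBANK(S) 0  disabled
Reason:
Bank 0: Some DIMMs failed mem test.
**** End System Configuration and Diagnostics Summary ****
"""


def parse_config(text):
    """Collect 'name = number [MB]' lines into a dict of integers."""
    config = {}
    for line in text.splitlines():
        match = re.match(r"^\s*(.+?)\s*=\s*(\d+)(?:\s*MB)?\s*$", line)
        if match:
            config[match.group(1)] = int(match.group(2))
    return config


def disabled_banks(text):
    """Return bank numbers from 'MEMBANK(S) ... disabled' lines.

    Assumption: multiple banks appear as numbers on the same line.
    """
    banks = []
    for match in re.finditer(r"MEMBANK\(S\)\s+(.*?)\s+disabled", text):
        banks.extend(int(number) for number in re.findall(r"\d+", match.group(1)))
    return banks


if __name__ == "__main__":
    config = parse_config(BEFORE)
    if config.get("Mem disabled", 0) > 0:
        print("Disabled memory:", config["Mem disabled"], "MB",
              "in bank(s)", disabled_banks(BEFORE))
    else:
        print("All memory enabled:", config.get("Mem enabled"), "MB")
```

Run against the "after" summary, the same check should fall through to the "all memory enabled" branch, since Mem disabled is 0 MB there.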

_________________
ThaddeusW wrote:
Thank you for the clear and precise instructions.
You're welcome - I'm glad to see it had a happy ending.

_________________
It's a good idea to go through the enable/clear stuff every time you change the hardware. You can technically add hardware and the Origin will find it no problem, but if you substitute hardware or remove hardware there can be difficulties. The PROM/system controller "remembers" what it saw, and if things are different it can lock stuff out as failed or not correctly identify the new hardware.
