ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Jeff
What do you make of the following in the event log of the second dual Xeon server I just had set up?
QUOTE
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 11:44:58 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 11:44:58 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 13:24:58 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 13:24:58 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 14:19:59 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 14:19:59 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
ajz4221
I think you need to submit a ticket...
(I know you know that is the obvious answer, but I decided to say it anyway.)

Be ready for them to have your server for a while. One of my previous P4 servers had some issues and was "under tests" (unavailable) for about 27 hours.
I know how bad it feels when you are excited to bring a new server into your fleet and something is wrong with it (after you have spent days getting it set up).

See if you can get the Dell OpenManage tools installed to run a diag from Windows.
Jeff
Thanks for the reply,

OK, submitted a ticket.

Better to find out early on than later on I suppose.

It's a Linux server... will do some reading on the OpenManage tools for Linux.
ajz4221
Agreed.
Even if they don't find anything, ask them why you are receiving these errors, because there shouldn't be any reason for them.
The techs are there for hardware, so better to deal with it now than have the server go offline and lose customers' data.
Jeff
I ran memtester overnight and it generated around 100 more errors in the event log matching the above.

Tech support volunteered to replace the RAM in the ticket.
ajz4221
That memtest probably sped things up.
Glad they were willing to agree to a replacement quickly.
Jeff
Memory swapped. Running a final memtester now just to be sure.

The memory fault was a hurdle I didn't expect when setting up this server, but I have to say I was very impressed with tech support's direct and on-target support to resolve the issue quickly and efficiently.
James Jhurani
QUOTE (Jeff @ Dec 15 2007, 10:32 PM) *
Memory swapped. Running a final memtester now just to be sure.

The memory fault was a hurdle I didn't expect when setting up this server, but I have to say I was very impressed with tech support's direct and on-target support to resolve the issue quickly and efficiently.


So what were the results of the memory test? Did the RAM swap resolve it?
Jeff
Yes, the RAM swap resolved it.
And no more bad entries in the system's event log.

QUOTE
Event Log

Fri Dec 14 02:08:03 2007 0x10 Non-Critical - Log cleared
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 11:44:58 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 11:44:58 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 13:24:58 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 13:24:58 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 14:19:59 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 14:19:59 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:09:38 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:09:38 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:13:22 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:13:22 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:17:45 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:17:45 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:17:45 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:17:45 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:27:41 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:27:41 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:27:53 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:27:53 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:33:15 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:33:15 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:33:15 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:33:15 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:46:37 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:46:37 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:57:59 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:57:59 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:58:32 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:58:32 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:59:39 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:59:39 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:03:16 2007 0x5a0010 Non-Critical - 5V AUX voltage sensor detected a warning (5.668 V)
Sat Dec 15 04:04:31 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:04:31 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:06:03 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:06:03 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:06:25 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:06:25 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:07:10 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:07:10 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:09:12 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:09:12 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:09:42 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:09:42 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:13:11 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:13:11 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:14:40 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:14:40 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:14:40 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:14:40 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:25:13 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:25:13 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:27:42 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:27:42 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:31:04 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:31:04 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:35:02 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:35:02 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:57:42 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:57:42 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:01:03 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:01:03 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:02:25 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:02:25 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:05:32 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:05:32 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:05:39 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:05:39 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:06:16 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:06:16 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:06:16 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:06:16 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:07:35 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:07:35 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:08:08 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:08:08 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:20:50 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:20:50 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:22:28 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:22:28 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:31:41 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 05:31:41 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sun Dec 16 05:33:33 2007 0x5a0010 Non-Critical - 5V AUX voltage sensor detected a warning (5.668 V)

Memory swapped December 15th, 2007, 5:23 PM - no new ECC faults detected after that to date :)
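
For anyone who would rather tally entries like these than count them by eye, here is a minimal Python sketch. It assumes the event log has been saved as plain text in exactly the line format quoted above (timestamp, hex code, severity, message); the function names and the shortened SAMPLE text are made up for illustration and are not part of any Dell or OpenManage tool.

```python
import re
from collections import Counter
from datetime import datetime

# Matches lines shaped like the event-log entries quoted in this thread:
# "<Day Mon DD HH:MM:SS YYYY> <hex code> <severity> - <message>"
LINE_RE = re.compile(
    r"^(?P<ts>\w{3} \w{3} \d{1,2} \d{2}:\d{2}:\d{2} \d{4}) "
    r"(?P<code>0x[0-9a-fA-F]+) "
    r"(?P<severity>.+?) - (?P<message>.+)$"
)


def parse_event_log(text):
    """Return (timestamp, code, severity, message) tuples for matching lines."""
    events = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip blank lines, headers, anything in another format
        # Assumes English day/month abbreviations (the default C locale).
        ts = datetime.strptime(match.group("ts"), "%a %b %d %H:%M:%S %Y")
        events.append(
            (ts, match.group("code"), match.group("severity"), match.group("message"))
        )
    return events


def ecc_faults_per_day(events):
    """Count ECC single-bit fault entries per calendar day."""
    counts = Counter()
    for ts, _code, _severity, message in events:
        if "ECC Single Bit Fault" in message:
            counts[ts.date().isoformat()] += 1
    return dict(counts)


# Hypothetical shortened sample built from lines in the log above.
SAMPLE = """\
Fri Dec 14 02:08:03 2007 0x10 Non-Critical - Log cleared
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Fri Dec 14 05:25:01 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 03:09:38 2007 0x24 OK - ECC Single Bit Fault detected Memory board A - Bank 1, DIMM B
Sat Dec 15 04:03:16 2007 0x5a0010 Non-Critical - 5V AUX voltage sensor detected a warning (5.668 V)
"""

if __name__ == "__main__":
    print(ecc_faults_per_day(parse_event_log(SAMPLE)))
```

Run against the full log text, a per-day count that drops to zero after the swap date would back up the "no new ECC faults" observation; the voltage warnings are deliberately left out of the tally.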
James Jhurani
good to hear!