I typed apt-get update
and then I saw a long list of
EDAC i7Core: Lost 127 memory errors
Please help me understand what has happened.
I am using ubuntu server edition 12.04 LTS
UPDATE:
I followed Gilles answer. I got the following when I do a dmesg
[113893.215234] EDAC MC0: CE error on CPU#0Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0)
[113893.215237] EDAC MC0: CE error on CPU#0Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0)
[113893.215240] EDAC MC0: CE error on CPU#0Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0)
[113893.215243] EDAC MC0: CE error on CPU#0Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0)
[113893.215246] EDAC MC0: CE error on CPU#0Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0)
[113893.215248] EDAC MC0: CE error on CPU#0Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0)
[113893.215251] EDAC MC0: CE error on CPU#0Channel#1_DIMM#0 (channel:1 slot:0 page:0x0 offset:0x0 grain:8 syndrome:0x0)
EDAC is the kernel component that watches for memory errors. You get a message about lost memory errors when so many errors occurred in a short interval that the kernel was unable to log them all.
A memory error is a bad thing. Check your kernel logs (
/var/log/kern.log
) for messages like this:CE
indicates an error that was corrected. If your RAM has error correction, it's ok to have a corrected error now and then. But when you get to the point where memory errors get lost due to their number, it's high time to replace your RAM. The message indicates which RAM module (DIMM) is faulty.If you see
UE
instead ofCE
, it means that an error was detected but not corrected. You may have corrupt data.If your RAM and CPU don't support error correction, then the first sign you'll get that your RAM is defective is when you realize that your data is corrupted.
This is completely unrelated to
apt-get update
, that's just the command you happened to be executing when the errors happened.Replace your RAM yesterday.