JCEUSA
Apr 27 2003, 02:58 PM
We've been getting this email once a night. Rackshack replaced the hard drive yesterday because of this, and we got another email last night after the old drive was replaced.
Is anyone else getting these, or does anyone have any information on the credibility of this email notice :
Subject : [smartcheck] Hard Drive Failure Soon
IMPORTANT: Do not ignore this email.
You should backup all the data on the hard drives listed below and replace them as soon as possible.
S.M.A.R.T has detected that they are not peforming within normal operating paramaters.
Excessive ATA Errors on disk /dev/hda. Please consider replacing this drive. Some Errors may be normal due to not 100% compatible IDE controllers and may be ignored.
SMART Error Log:
SMART Error Logging Version: 1
Error Log Data Structure Pointer: 05
ATA Error Count: 107
Non-Fatal Count: 0
Error Log Structure 1:
DCR FR SC SN CL SH D/H CR Timestamp
08 00 08 66 df 22 e0 ca 13984
08 00 08 66 df 22 e0 ca 13984
08 00 08 66 df 22 e0 ca 13984
08 00 08 0e cf 22 e0 ca 13984
08 00 05 01 00 00 a0 a1 14064
00 04 05 01 00 00 a0 51 51223
Error Log Structure 2:
DCR FR SC SN CL SH D/H CR Timestamp
08 00 08 66 df 22 e0 ca 15224
08 00 08 66 df 22 e0 ca 15224
08 00 08 66 df 22 e0 ca 15224
08 00 08 0e cf 22 e0 ca 15224
08 00 05 01 00 00 a0 a1 15239
00 04 05 01 00 00 a0 51 51218
Error Log Structure 3:
DCR FR SC SN CL SH D/H CR Timestamp
08 00 08 66 df 22 e0 ca 16746
08 00 08 66 df 22 e0 ca 16746
08 00 08 66 df 22 e0 ca 16746
08 00 08 0e cf 22 e0 ca 16746
08 00 05 01 00 00 a0 a1 16761
00 04 05 01 00 00 a0 51 51233
Error Log Structure 4:
DCR FR SC SN CL SH D/H CR Timestamp
08 00 08 66 df 22 e0 ca 24052
08 00 08 66 df 22 e0 ca 24052
08 00 08 66 df 22 e0 ca 24052
08 00 08 0e cf 22 e0 ca 24052
08 00 05 01 00 00 a0 a1 24067
00 04 05 01 00 00 a0 51 51199
Error Log Structure 5:
DCR FR SC SN CL SH D/H CR Timestamp
08 00 08 9e 08 93 e4 c8 50050
08 00 08 a6 3b 23 e3 c8 50050
08 00 18 e6 3b 23 e3 c8 50050
08 00 38 ae 3b 23 e3 c8 50050
08 da 00 00 4f c2 e0 b0 50050
00 04 00 0b 4f c2 e0 51 51231
Cpanoz
Apr 27 2003, 04:33 PM
JCEUSA , i have a q. that is off the topic, but i can't help it :
did RS make a backup for you or asked you to do it....or did they just replace the HDD without warnning and you got plane CPanel as u purchased the server?
sorry again.
ericfire
Apr 27 2003, 09:41 PM
I'm receiving the same email and have been working with RackShack on this issue for more than two weeks. I posted this here a couple weeks ago. The particular box receiving this email is a Duron as well. I'll let you know how it goes.
ThaPhantom
Apr 27 2003, 09:50 PM
notice: Some Errors may be normal due to not 100% compatible IDE controllers and may be ignored.
ericfire
Apr 27 2003, 09:55 PM
^Correct, but I've been having additional problems. I have a couple cPanel boxes, and the only one that acts up oddly is the one that receives this email. Rackshack support initially suspected a hard drive error a day before I even started receiving this email. The ATA error count has been steadily increasing these past couple of days as well.
Erwin
Apr 27 2003, 11:43 PM
SMART errors may mean that the hard disk is about to die (at least this is what happened to my home PC). Maybe ask RS to take a look.
jaredweb
Jul 27 2003, 04:12 PM
I submitted a trouble ticket to rackshack. Here it is.
ME: keep recieving emails from Cpanel (SnmartCheck) saying that /dev/hdb (the drive you just installed for me is very close to failing and has many errors on it.
RS: There is a known issue with the SMART utilities and Seagate drives. The utility reports issues that arent actual ATA errors. Theres no need to worry, this is only an inconvenience (excessive warning emails) and not a real hardware problem. We hope this issue will be resolved in a later release of the SMART utilites.
Thank you,
Rackshack Support
chapsrulez
Jul 28 2003, 12:23 AM
I recived the following email.
IMPORTANT: Do not ignore this email.
You should backup all the data on the hard drives listed below and replace them as soon as possible.
S.M.A.R.T has detected that they are not peforming within normal operating paramaters.
Excessive ATA Errors on disk /dev/hda. Please consider replacing this drive. Some Errors may be normal due to not 100% compatible IDE controllers and may be ignored.
SMART Error Log:
SMART Error Logging Version: 1
Error Log Data Structure Pointer: 03
ATA Error Count: 128
Non-Fatal Count: 0
Error Log Structure 1:
DCR FR SC SN CL SH D/H CR Timestamp
00 00 08 2e cf fc e6 20 11703
00 00 08 2e cf fc e6 20 11704
00 00 3f 00 00 00 e0 10 11704
00 00 08 2e cf fc e6 20 11704
00 00 08 2e cf fc e6 20 11704
00 10 08 2e cf fc e6 51 655
Error condition: 161 Error State: 3
Number of Hours in Drive Life: 87 (life of the drive in hours)
Error Log Structure 2:
DCR FR SC SN CL SH D/H CR Timestamp
00 00 08 2e cf fc e6 20 11704
06 00 00 00 00 00 00 00 11705
00 00 3f 3f 4d c6 ef 91 11705
00 00 3f 00 00 00 e0 10 11705
00 00 08 2e cf fc e6 20 11705
00 10 08 2e cf fc e6 51 655
Error condition: 161 Error State: 3
Number of Hours in Drive Life: 87 (life of the drive in hours)
Error Log Structure 3:
DCR FR SC SN CL SH D/H CR Timestamp
00 00 08 15 42 1f e2 c8 45
00 00 08 1d 42 1f e2 c8 45
00 00 08 25 42 1f e2 c8 45
00 00 08 2d 42 1f e2 c8 45
00 00 08 35 42 1f e2 c8 45
00 40 00 38 42 1f e2 51 1124073
Error condition: 161 Error State: 3
Number of Hours in Drive Life: 4320 (life of the drive in hours)
Error Log Structure 4:
DCR FR SC SN CL SH D/H CR Timestamp
06 00 00 00 00 00 00 00 11703
00 00 3f 3f 4d c6 ef 91 11703
00 00 3f 00 00 00 e0 10 11703
00 00 08 2e cf fc e6 20 11703
00 00 08 2e cf fc e6 20 11704
00 10 08 2e cf fc e6 51 655
Error condition: 161 Error State: 3
Number of Hours in Drive Life: 87 (life of the drive in hours)
Error Log Structure 5:
DCR FR SC SN CL SH D/H CR Timestamp
00 00 3f 00 00 00 e0 10 11703
00 00 08 2e cf fc e6 20 11703
00 00 08 2e cf fc e6 20 11704
00 00 3f 00 00 00 e0 10 11704
00 00 08 2e cf fc e6 20 11704
00 10 08 2e cf fc e6 51 655
Error condition: 161 Error State: 3
Number of Hours in Drive Life: 87 (life of the drive in hours).
After that, one day from WHM i tried to boot the server, and it never came back, i opened a TT asking RS to try to manually boot my server, and they couldnt do that. That took my server 2 days offline. After that RS said that there was a disk failure, and that the disk was going to be changed, and the server was going to be restored. They asked for my aproval and told me that i should do a manual backup. I did the backup, RS changed the disk, and restored the server, and everything was back to normal.
And the restore wasnt charged to my account.
Fallout2man
Aug 27 2003, 11:19 PM
I've had the same errors as the above. WHM and Cpanel fail to respond and I got a hard disk error email from SMART.
I sent RS a trouble ticket about it and they said it was beyond the scope of their support and asked for me to find a solution on the forum, even referring me to this thread. Seems a bit odd as so far the only "solution" I see here is to get the hardware replaced which would probably not be free like the above poster got, and I've no money for another restore.
shazy
Aug 29 2003, 07:54 AM
QUOTE
RS: There is a known issue with the SMART utilities and Seagate drives. The utility reports issues that arent actual ATA errors. Theres no need to worry, this is only an inconvenience (excessive warning emails) and not a real hardware problem. We hope this issue will be resolved in a later release of the SMART utilites.
Thank you,
Rackshack Support
This is quite true BUT search your /var/log/messages and see if you see any "actual" error messages...
Dave
Feb 4 2004, 09:52 PM
QUOTE
Originally posted by shazy
This is quite true BUT search your /var/log/messages and see if you see any "actual" error messages...
any idea of some keyword that can be searched?
thanks
-- Dave
benito
Feb 5 2004, 01:42 PM
I have the same error when i buy my box. RS changed the disk, the error happend again, RS changed another disk... then error ... again, RS changed my box from DURON to CELERON and the disk... error again.
Now i´m running my box with the same disk, receiving warnings every night and nothing bad happend.
tazdeveloper
Feb 16 2004, 09:41 PM
I was getting the same errors, and opened a TT. The reply back from RS is below.
2/15/04 12:55:12 PM
Dear Customer,
We are running badblocks test on your server and will update you as soon as the test is over. In the meantime, do not reboot the server.
Thankyou,
Ev1servers support team.
2/15/04 3:33:25 PM
Dear Customer,
Badblocks test complete, showed no sign of harddisk errors. You can safely ignore the error messages sent by Cpanel.
Please check the same.
Thankyou,
Ev1servers support team.
SMART technology does not check for bad blocks. It monitors the drive for the following predictive failure behavior:
Head Flying Height
A downward trend in flying height will often presage a head crash.
Number of Remapped Sectors:
If the drive is remapping many sectors due to internally-detected errors, this can mean the drive is starting to go.
ECC Use and Error Counts:
The number of errors encountered by the drive, even if corrected internally, often signal problems developing with the drive. The trend is in some cases more important than the actual count.
Spin-Up Time: Changes in spin-up time can reflect problems with the spindle motor.
Temperature:
Increases in drive temperature often signal spindle motor problems.
Data Throughput:
Reduction in the transfer rate of the drive can signal various internal problems.
I could be wrong, but I think that it wrong for RS Techs to say that there are no issues and to ignore the software checks from cPanel.
Just my opinion...
This is a "lo-fi" version of our main content. To view the full version with more information, formatting and images, please
click here.