Dell PowerEdge 2800 Logical Disk Failure: Any Possible Recovery?

8,745

Solution 1

well, 2800 is rather old now, but still, it would help if you posted the raid configuration and the state the disks are reported in, in the scsi controller bios. In general, the drives might be ok (they did pass the diags, right? you ran the dell pediags/onlineDiags from support.dell.com?) and you might just need to recover the array and possibly update the controller and disk firmware, but without any information, it's very hard to guide you through the required steps

Solution 2

I find it pretty implausible that a Windows Update caused the type of hardware problem you're experiencing, unless the Windows Update updated the firmware of the RAID controller or the HDD's themselves, which isn't likely as I've never seen a firmware update come through Windows Updates. If Windows Update updated the RAID controller driver then at it's worst it would cause Windows not to boot but wouldn't cause the POST errors you're having.

Did you run the diags from Windows? If so, you might want to download the Dell bootable diags, burn a CD, boot to it, and run them. That eliminates the OS from having any influence on the diags.

http://support.dell.com/support/downloads/download.aspx?c=us&cs=04&l=en&s=bsd&releaseid=R212797&SystemID=PWE_PNT_2800&servicetag=&os=WNET&osl=en&deviceid=196&devlib=0&typecnt=0&vercnt=16&catid=13&impid=-1&formatcnt=0&libid=13&typeid=-1&dateid=-1&formatid=-1&source=-1&fileid=301057

Solution 3

Call dell tech support (or someone else who knows about RAID recovery) if you value your data.

Share:
8,745
user779887
Author by

user779887

Updated on September 18, 2022

Comments

  • user779887
    user779887 almost 2 years

    I was noticing a issues with the server.

    Usually Microsoft Updates that have downloaded, but are waiting to be installed can cause this.

    Before applying the updates and restarting, I ran several diagnostic tests on the hardware and everything came up green. Both hard drives also displayed green lights

    I then chose 'Install updates and Restart'.

    Upon reboot, immediately after the 'SCSI scanning' hits 100%, both hard drives go from green to blinking amber and I get a '1 logical drive found/failed' error message.

    I've tried reseating the drives without any change.

    I've mounted two new drives in slots 2 and 3 (The two original drives are still in slots 0 and 1)

    Would it be possible to rebuild/recover anything at this point?

    If there is any possible step I'm missing before reinstalling the OS on the two new drives and starting from scratch, I'd REALLY REALLY appreciate someone schooling me on it before I kill any chance of recovery.

    Thanks.

  • Aaron
    Aaron about 13 years
    Agreed, what is the scsi controller bios reporting?
  • user779887
    user779887 about 13 years
    Yes, the 2800 has done its duty. We're ordering a new server but I's love to have this one as a dev server. Its a RAID1 (two mirrored drives) I have two new drives now too. The original drives were offline after the restart, but I was able to force them back online and it's currently checking them for consistency.
  • David Corsalini
    David Corsalini about 13 years
    ok, for a raid1, I'd try to recover one disk (leaving the other outside the machine), so that I'd get a degraded array - this way I'd still have the data. If the disk doesn't recover, well, you have the other one to try with. Once the array recovers, add the second disk and rebuild, then run diags again, and make sure you update the firmware.
  • KCotreau
    KCotreau about 13 years
    I could not agree more. I feel like I know a lot about this stuff, but whenever I am in a similar situation, I still call. A second set of eyes, one that does nothing but that kind of problem all day long, cannot hurt.
  • user779887
    user779887 about 13 years
    I didn't necessarily mean to imply that the windows updates caused the disk errors, I was just describing what happened before the crash. Though I have experienced OS issues with windows updates that were sitting in the queue. I have the 32 Diag disk, but the consistency check was still running when I left. I won't be able to run the diag again until tomorrow morning. I'll post the results then. Thanks everyone.
  • user779887
    user779887 about 13 years
    I did try for about 40 minutes with Dell support. But since the service plan has expired, he couldn't go too much further than giving me a few links to creating a boot disk etc. He said it's beyond their support level. And that's perfectly fine, the server is almost 7 years old.
  • user779887
    user779887 about 13 years
    I've copied the .bkf that I currently have access to onto one of the workstations. So at least I have that much so far. If I could get it to boot into the other original drive (if the data wasn't overwritten by this one, I should be able to copy the more recent.bkf)