• catloaf@lemm.ee
    link
    fedilink
    English
    arrow-up
    8
    ·
    1 month ago

    That’s a pretty common failure scenario in SANs. If you buy a bunch of drives, they’re almost guaranteed to come from the same batch, meaning they’re likely to fail around the same time. The extra load of a rebuild can kill drives that are already close to failure.

    Which is why SANs have hot spares that can be allocated instantly on failure. And you should use a RAID level with enough redundancy to meet your reliability needs. And RAID is not backup, you should have backups too.

    • danA
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 month ago

      If you buy a bunch of drives, they’re almost guaranteed to come from the same batch,

      That’s why you should always buy your drives from multiple stores, rather than getting all of them from one stores.

    • kalleboo@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 month ago

      Also why you need to schedule periodical parity scrubs, then the “extra load of a rebuild” is exercised regularly so weak drives will be found long before a rebuild is needed.