Have we just all become googlets. Or something? How does a person get hard copies of stuff for science and other things for a fiscal year? I can further explain if need be.

  • Don_DickleOP
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 days ago

    You had me going until hashes no clue what that is. But without using a ton of ink how do i do multiple copies in different places? Especially dealing with the government info?

    • Em Adespoton@lemmy.ca
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      1
      ·
      2 days ago

      I have an array of 8TB hard drives, and cycling 1TB drives that are rotated through a safe deposit box.

      These use a file system that has built in encryption and integrity protection, as well as rollback functionality.

      That means that I know immediately if some data has changed unintentionally, and can compare the file hashes (MD5, SHA1, SHA256 or all of the above) across devices to see if the files match. Since I always have at least 3 copies, if someone modified one of them, I’d know which one changed.

      8TB is around 4.4 billion pages of typed text.

      Assuming each hardback contains 10,000 pages (just to be generous), that’s 440,000 hardbacks stored in triplicate, fully and (relatively) instantly searchable.

    • howrar@lemmy.ca
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 days ago

      It takes the same amount of ink to print regardless of whether you do it yourself or if someone else does it. You can also just store it in its original digital form. Put it on your hard drive, or an external drive, or on magnetic tape, or on the cloud, or anywhere else.

      A hash is any kind of mapping from the raw data into a fixed length piece of data. This is usually preferable when you don’t want to store the actual data but you want to be able to verify with some degree of certainty that the data hasn’t changed. As an example, for tabular data, you can hash each column by computing the mean. If the data changes, then that mean will likely change too.