This workflow describes the process of transferring content to secure storage and preserving it. If there are terms that you are unfamiliar with on this page, please refer to the glossary for the most common terms used in digital preservation.
3.1 Storage
This is an essential step.
- The NDSA Levels of Digital Preservation are useful for planning storage – in particular the sections on ‘storage’ and ‘control’. Think about creating several copies, in different physical locations and using different storage technologies.
- If you currently have no storage, think about some practical solutions. For example, as an interim approach you could use your organisation’s storage network (see the guidance below for more information).
- Think carefully about who in your organisation is allowed to access the digital content and the type of access that they have (e.g. read, write, move, delete). Keep a record of who has access.
Further guidance
- DPC Handbook: Storage
- Bit by bit: Processing Born Digital Accessions at National Records of Scotland (includes a description of their interim solution).
- Digital Preservation Guidance for Scottish Local Authorities (includes an overview of storage options).
- Digital preservation recommendations for small museums (Canadian Heritage Information Network – includes practical storage solutions).
- Digital Preservation for Small Businesses: An Introductory Guide (Archives and Records Council Wales – includes practical storage solutions).
3.2 Move to storage
This is an essential step.
- Before moving the content to the storage, check that the documentation you created in step 1 and step 2 has been saved in the ‘metadata’ folder.
- If you use a collection management system, you may wish to add some of this documentation to it (e.g. the accession record or catalogue) or record where it is stored.
- Some archives will package the content and metadata in a ‘bag’ using software such as Bagger.
- Move the content to the storage. You could use copying software (see below) to do this to ensure date information and other file attributes are preserved.
- Some software will also check the copied content to ensure it is identical. If not, use checksum software to check this (see step 1.4).
- If applicable, you may decide to keep the original storage media or photograph it.
Software
- Bagger (packages files for transfer and storage).
- Teracopy Pro (copies content and ensures that they are identical USD$35.94).
- Robocopy command line (for copying).
Further guidance
- Bagger Tutorial (State Archives of Carolina videos)
- Teracopy Guide (Digital Preservation Coalition)
- Running the robocopy command (Canadian Heritage Information Network)
3.3 Check checksums and access
This is an essential step.
- Use checksum software (see below) to carry out regular integrity checks of the content.
- Keep a record of when you carry these out.
- If checksums of content do change then investigate. For example, if the content is corrupt or has been accidentally changed, it may need to be replaced.
- Ideally, you should keep logs of actions performed on content and carry out periodic reviews of these logs.
Software
- AVP Fixity Pro ($47.90 per year)
- CSV Validator and integrity schema (can be used to check checksums created in DROID)
- Checksum by Corz
- Jacksum
Further guidance
- Checking Your Digital Content: How, What and When to Check Fixity? (NDSA – Draft Fact sheet)
- DPC Handbook: Fixity and checksums
- DPC Handbook: Fixity Pro
- DPC Handbook: CSV Validator
- DPC Handbook: Checksum by Corz
- Fixity Pro videos and webinars tutorial (video)
- DROID report as basis for collection integrity checks (article by The National Archives)
- Checksum or Fixity? Which tool is for me? (University of York blog)
3.4 Monitor storage
This is an essential step.
- The lifetime of storage can be short – it can fail or corrupt the content.
- You will need to review your storage every 3-5 years and move content onto new storage.
- Create multiple copies and use a mix of different types of storage technologies if you can.
- For hard drives there is software that can help you with this (see below).
Software
- Windows 10: Built in tools for hard drive health check (includes undertaking a ‘S.M.A.R.T analysis’ of hard drives)
Further guidance
- DPC Handbook: Storage
- Digital preservation recommendations for small museums (Canadian Heritage Information Network. Includes section on refreshing storage including some practical tips)
- How to Check Your Hard Drive’s Health (PC Magazine article)
3.5 Monitor content
- You should monitor your content to understand if any of the file formats you hold, or the software/technology needed to access them, are at risk of becoming obsolete (outdated or no longer used).
- One solution is format migration where a file format is converted into a new file format. However if you do this it is important to also keep the original content.
- Some archives undertake format migration during step 2 (Ingest) and convert particular types of content into a preferred file format (called normalisation). Others wait until the risk of the content becoming obsolete is high.
- One low-cost option is to only migrate the content when someone wants to access it.
- Several types of software can carry out format migration and some are listed below.
- Emulation is an alternative to format migration and attempts to recreate the functionality of the original software or technology.
Software
- FFmpeg (for audio and video).
- ImageMagick (for images)
- Ghostscript (for pdfs)
- LibreOffice or Apache Open Office (for word processing documents)
- Joyce (an Amstrad emulator)
Further guidance
- DPC Handbook: Preservation Actions
- Library of Congress Recommended Formats Statement
- Sustainability of Digital Formats: Planning for Library of Congress Collections
- Practical Digital Preservation (blog by Tyler McNally which includes a section on normalisation software).
- File migration formats (blog by University of Glasgow).
- Bodleian Libraries: Introduction to Digital Preservation: Emulation
- ffmprovisr (a good resource for using FFmpeg).
For the next stage of the digital preservation workflow, head over to the Access page.