Skip to content

Ingesting Image Collections OLD

California Historical Society edited this page Sep 20, 2019 · 1 revision

Upload files

Upload images and MODS to "quarantine" server CHS-ISL002, preferably via the OwnCloud UI.

Image folder should be named like the prefix of the digital IDs, e.g., PC-RM-Stereos; PC-008

MODS folder should be named similarly, but with ".MODS" appended to the end, e.g., PC-RM-Stereos.MODS; PC-008.MODS

Create bags

Once files are uploaded to "quarantine", SSH into CHS-ISL002 (for Windows users, use PuTTY) and navigate to /data/quarantine

Follow the Basic Workflow outlined here: https://github.com/calhist/dam_workflow_scripts

This includes running the following scripts: create-bags.sh; copy-to-archive.sh; copy-to-production.sh

Cheatsheet

$ sudo -u www-data create-bags.sh -i collection_name
$ copy-to-archive.sh -i collection_name.bags
$ copy-to-production.sh -i collection_name.bags

Create new collection

In Islandora, create a new collection at the top-level of the repository: http://digitallibrary.californiahistoricalsociety.org/islandora

Go to Manage, then click + Add an object to this Collection

Complete Collection PID as namespace:collection, where namespace = chs. For example, for the California Agriculture collection, enter chs:agriculture.

Uncheck "Inherit collection policy?" and select the Allowable content models you which to set for this collection. Usually it will be "islandora:sp_large_image_cmodel ~ Islandora Large Image Content Model"

Click Next, then Next again.

Enter metadata for collection and save. Make a note of the PID of this new collection. You'll need that when creating an ingest batch.

Set metadata display

We're using Islandora Context to assign metadata display profiles based on collection type. For example, our Maps collection uses a different display profile than do our photograph collections.

Go to Structure > Context, then edit the Display Photos context.

Under Conditions, click on Islandora Collection Membership and select the new collection from the list. Save.

Create ingest batch

Once bags have been copied to production, SSH into the production server, CHS-ISL001, navigate to /data/quarantine and run the create-batch.sh script

Cheatsheet

$ create-batch.sh -i collection_name.bags -c chs:collection_name -m islandora:sp_large_image_cmodel

Process the batch

In Islandora go to Reports > Islandora Batch Ingest Sets (http://digitallibrary.californiahistoricalsociety.org/admin/reports/islandora_batch_sets)

Identify your new batch set and click "View items in set" at right

Click + Process Set, then Start Batch Processing

Add a collection thumbnail

Once assets have been ingested, find a good representative thumbnail image for the collection by going to an individual image's datastreams and downloading the JPG. Resize the JPG to 293px wide. Go to the collection, click Manage > Datastreams, and replace the TN datastream with the newly resized image. Remember to acquire the lock.

Regenerate DC datastream

Go to new collection, Manage > Collection > Regenerate child DC metadata > Regenerate DC metadata for all collection objects

Validate bags on Archive server

Make sure the bags were successfully copied to the Archive server.

SSH into CHS-ISL003 and navigate to /data

Run:

$ validate-bags.sh -i collection_name.bags

Look for any errors. If you do see an error, go back to the copy-to-archive.sh step and repeat.

Cleanup

Once assets are ingested in Production and successfully validated on the Archive server, we can delete them from the Quarantine server to free up space for the next collection. You can do this right from the OwnCloud UI.

Clone this wiki locally