AdViews Ingest - 2017-12-05 Meeting notes

Date

Attendees

Goals

Discussion items

Item / Notes
Background
  • DC-71
    • There are more mp4s than MOVs, so in some cases (7 files) the mp4 is the preservation copy
Size
  • How big are the files? Guesstimate: 1-2 GB is typical
    • One file is 113 GB, but this is an anomaly; 10 files are over 10 GB
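A rough way to confirm the size guesstimate once the storage volume is reachable. This is a minimal sketch, not an agreed procedure: the function name `files_over` is made up for illustration, and GB is assumed to mean 1024^3 bytes.

```python
from pathlib import Path

GB = 1024 ** 3  # assumption: binary gigabytes; the notes do not say which unit was meant


def files_over(root, threshold_bytes):
    """Return (path, size_in_bytes) pairs for files under root that are
    larger than threshold_bytes, largest first."""
    root = Path(root)
    sizes = [(p, p.stat().st_size) for p in root.rglob("*") if p.is_file()]
    big = [(p, size) for p, size in sizes if size > threshold_bytes]
    return sorted(big, key=lambda pair: pair[1], reverse=True)
```

Calling `files_over(some_folder, 10 * GB)` should list the outliers, with the 113 GB file first if the notes are right.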
Ingest

Primary ingest: MOVs + 7 mp4 masters

  • 1 item is 1 file so folder ingest would probably work
  • Storage volume needs to be mounted on the repository
    • Molly to add file location here and contact core services (Jack, David, Chris?) to ask for permissions for Jim, Moira and Susan
    • Will require checksum files - 1 for the full folder
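One possible way to produce a single checksum file for the full folder is sketched below. Assumptions, none of them from the meeting: SHA-1 as the algorithm, a `digest  relative/path` line layout, and the file name `manifest-sha1.txt`. Whatever the repository's folder ingest actually expects should take precedence.

```python
import hashlib
from pathlib import Path


def file_digest(path, algorithm="sha1", chunk_size=1024 * 1024):
    """Hash a file in chunks so multi-GB masters are never read into memory at once."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def write_manifest(folder, manifest_name="manifest-sha1.txt", algorithm="sha1"):
    """Write one manifest covering every file under folder; return its path.

    The manifest name and line layout are hypothetical, not a confirmed
    requirement of the ingest tooling.
    """
    folder = Path(folder)
    manifest = folder / manifest_name
    lines = []
    for p in sorted(folder.rglob("*")):
        if p.is_file() and p != manifest:  # do not checksum the manifest itself
            rel = p.relative_to(folder).as_posix()
            lines.append(f"{file_digest(p, algorithm)}  {rel}")
    manifest.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return manifest
```

Hashing in chunks matters here because of the 113 GB outlier.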

Derivative file ingest - mp4s:

  • streamable media upload - mp4s
  • The 7 files that have mp4 only will need to be ingested both during the master file ingest and the derivative ingest.
  • Derivative durations may not match the masters, as color bars have been stripped out. 

66 Caption files - VTT:

  • the names all match

Thumbnails for poster frames - jpgs

  • Sean to produce - names will match derivatives

We shouldn't have to rename anything within the collection prior to ingest.
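The "names all match" assumption could be double-checked before ingest with something like the sketch below. It assumes matching means identical base names (file name minus extension) and that each file type lives in its own folder; both are guesses, and the function names are made up.

```python
from pathlib import Path


def stems(folder, suffix):
    """Base names (without extension) of files in folder that have the given suffix."""
    folder = Path(folder)
    return {
        p.stem
        for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() == suffix
    }


def unmatched(derivative_dir, other_dir, other_suffix, derivative_suffix=".mp4"):
    """Base names in other_dir (e.g. .vtt captions or .jpg thumbnails)
    that have no derivative with the same base name."""
    missing = stems(other_dir, other_suffix) - stems(derivative_dir, derivative_suffix)
    return sorted(missing)
```

An empty list from `unmatched(mp4_dir, vtt_dir, ".vtt")` would back up the no-renaming conclusion for the captions; the same call with `".jpg"` covers the poster frames.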

Ingest Logistics
  • We could use some more workers to process the AdViews job - we have 4 workers configured currently. 
    • Batches in progress will block other uploads 
    • Difficult to predict how long the ingest will take. 
  • Previous ingest of 200-300 GB processed within about 2 hours, which is encouraging.
  • Ideally ingest should be complete by end of June 2018.
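For a back-of-envelope estimate, the previous ingest works out to roughly 100-150 GB per hour. The sketch below scales that rate linearly, which is an assumption: as noted above, ingest time is hard to predict, and worker count and blocked batches will affect it.

```python
# Rates implied by the previous ingest (200-300 GB in about 2 hours).
SLOW_RATE_GB_PER_HOUR = 200 / 2  # 100 GB/hour
FAST_RATE_GB_PER_HOUR = 300 / 2  # 150 GB/hour


def estimate_hours(size_gb, rate_gb_per_hour):
    """Hours to ingest size_gb at a constant rate (assumes linear scaling)."""
    if rate_gb_per_hour <= 0:
        raise ValueError("rate must be positive")
    return size_gb / rate_gb_per_hour


def estimate_range(size_gb):
    """Return (best_case_hours, worst_case_hours) for a batch of size_gb."""
    return (
        estimate_hours(size_gb, FAST_RATE_GB_PER_HOUR),
        estimate_hours(size_gb, SLOW_RATE_GB_PER_HOUR),
    )
```

On these numbers the 113 GB file alone would take somewhere between about 0.75 and 1.13 hours.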
Poster frames
  • Sean needs access to the AdViews server to create poster frames
Cleaning up the AdViews Server
  • Will happen after ingest 
  • Dupes will be moved into dark storage, hopefully the same way normal files are moved
Side note
  • Molly and Alex to confirm w/ Hartman re: the "expert interviews"
DDR space

Only about 5 TB of capacity - Molly to talk w/ John about that.

FITS

Can look at the technical metadata and drop it into descriptive metadata. We are not currently displaying technical metadata.
Next Steps
  • Create tickets:
    • Give permissions to Sean, Moira (r/w), Susan (r/w), David (r) and Hydra; all need at least read access - core services
    • Mount the AdViews server to the repository - core services
  • Sean will extract thumbnails from the derivatives
  • Talk to John about space
  • Once the permissions and server access are set, work w/ Susan and Moira on ingest logistics
  • Following ingest:
    • the usual digital collections stuff - metadata, making it pretty, etc
    • move dupes and other files to dark storage as needed

Action items

  • See Next Steps above