Link Search Menu Expand Document

Editor’s Draft

Profile: Film

The film profile supports the ingest of digitised film stored on one or more image and/or audio reels. This profile dictates how the media files (in file formats such as MKV, MOV, JPEG and PDF), their metadata, and the relationships between them, should be expressed and organized.

It mainly applies the DCTERMS metadata schema for descriptive metadata and allows extensions using Schema.org, thereby resembling the Basic profile.

Its additions lie in the introduction of a separate PREMIS representation to denote the physical carrier(s) (a so-called ‘carrier representation’) and custom film-specific metadata (using a <premis:significantPropertiesExtension> element in the package PREMIS file) to describe physical aspects of this/these carrier(s).

This carrier representation was added to facilitate the description of the physical carrier(s), since the PREMIS metadata schema itself doesn’t offer this possibility directly. Please note that, as a result, the carrier representation as such is not reflected by a representation folder in the representations directory, given that it is used purely for the addition of descriptive metadata about the carrier(s) and does not contain any files itself. Use the relationship sub types has carrier copy and is carrier copy of between the IE and the carrier representation.

Direction Relationship type Relationship subtype Reciprocal/inverse relationship Description
From IE to carrier structural has carrier copy is carrier copy of A IE is represented by a carrier representation.

Permalink: https://data.hetarchief.be/id/sip/2.1/film

Example Directory structure

root_directory
├── METS.xml
├── metadata
│   ├── descriptive
│   │   └── dc+schema.xml                               # descriptive metadata about the content of the film
│   └── preservation
│       └── premis.xml                                  # package PREMIS
│
└── representations
    │
    ├── representation_1
    │   ├── METS.xml
    │   ├── data
    │   │   └── master.mkv                              # archive master
    │   └── metadata
    │       └── preservation
    │           └── premis.xml
    │
    ├── representation_2
    │   ├── METS.xml
    │   ├── data
    │   │   └── mezzanine.mov                           # mezzanine
    │   └── metadata
    │       └── preservation
    │           └── premis.xml
    │
    ├── representation_3
    │   ├── METS.xml
    │   ├── data
    │   │   └── scan.jpeg                               # scan(s) of the reel's container (if present) in JPEG
    │   └── metadata
    │       └── preservation
    │           └── premis.xml
    │
    └── representation_4
        ├── METS.xml
        ├── data
        │   └── scan.pdf                                # scan(s) of the reel's container (if present) in PDF
        └── metadata
            └── preservation
                └── premis.xml

Requirements

General

  • A SIP MUST contain content of exactly one digitised film, consisting of one or more image and/or audio reels.
  • Each MKV, MOV, or set of scans (either in JPG and/or in PDF) contained in their respective representation directories MUST represent exactly one image or audio reel.
  • There MUST be exactly one IE present in the SIP, i.e. the digitised film.
  • There MUST be preservation metadata at the package level in the preservation/premis.xml file.
  • There MUST be preservation metadata at the representation level in the respective preservation/premis.xml files.
  • Preservation metadata in the SIP MUST be limited to the PREMIS metadata schema.
  • Fixity MUST be calculated using the MD5 hashing algorithm, thus:
    • The value of element premis:premis/premis:object[@xsi:type="premis:file"]/premis:objectCharacteristics/premis:fixity/premis:messageDigestAlgorithm MUST be set to MD5.
    • The value of attribute premis:premis/premis:object[@xsi:type="premis:file"]/premis:objectCharacteristics/premis:fixity/premis:messageDigestAlgorithm/@valueURI MUST be set to "http://id.loc.gov/vocabulary/preservation/cryptographicHashFunctions/md5".
    • The value of all //*/@CHECKSUMTYPE attributes in the METS.xml files MUST be set to MD5.
  • Descriptive metadata about the IE MUST be specified at the package level in the dc+schema.xml file.
  • Descriptive metadata about the carrier(s) (i.e. the reel(s)) MUST be specified at the package level in the preservation/premis.xml file.

Package METS

  • The /mets/@TYPE attribute MUST be set to Video – File-based and Physical Media.
  • The csip:CONTENTINFORMATIONTYPE attribute MUST be set to OTHER and the csip:OTHERCONTENTINFORMATIONTYPE attribute MUST be set to https://data.hetarchief.be/id/sip/2.1/film.
  • The mets/dmdSec/mdRef/@MDTYPE attribute MUST be set to OTHER and the mets/dmdSec/mdRef/@OTHERMDTYPE MUST be set to dc+schema.

Package Descriptive Metadata

  • A descriptive metadata file descriptive/dc+schema.xml describing the IE MUST be present at the package level.
  • Descriptive metadata in the descriptive/dc+schema.xml MUST be limited to the DCTERMS and SCHEMA elements outlined in the basic profile.
  • The DCTERMS and SCHEMA metadata in the descriptive/dc+schema.xml file MUST follow the basic profile requirements regarding the use of elements and attributes.
  • In addition to the elements and requirements outlined in the basic profile, the film profile allows the use of the elements outlined below:
Element metadata/schema:countryOfOrigin
Name Country of origin
Description The country of origin of the film, i.e. where the film was created.
Datatype ISO 3166
Cardinality 0..1
Obligation MAY
Element metadata/schema:creditText
Name Credit text
Description Text that can be used to credit person(s) and/or organization(s) associated with a film.
Datatype String
Cardinality 0..*
Obligation MAY
Element metadata/schema:genre
Name Genre
Description Genre of the film.
Datatype String
Cardinality 0..1
Obligation MAY

Package Preservation Metadata

The addition of a separate PREMIS representation for the carrier(s) (i.e. the carrier representation) leads to a number of additional requirements in the package premis.xml file. The section below outlines the high level requirements, while the section Describing a carrier within the carrier representation contains a more detailed discussion of the possibilities offered by the carrier representation.

  • The following relationships MUST be present between the <premis:object> of the intellectual entity and that of the carrier representation (see also Overview of relevant PREMIS relationships for more information):
    • A structural <premis:relationship> of type ‘has carrier copy’;
    • A structural <premis:relationship> of type ‘is carrier copy of’.

Example 1: an example <premis:object> of a carrier representation together the relationships between the Intellectual Entity and the carrier representation


<!-- IE for the film as a whole -->
<premis:object
    xsi:type="premis:intellectualEntity">

    <premis:objectIdentifier>
        <premis:objectIdentifierType>UUID</premis:objectIdentifierType>
        <premis:objectIdentifierValue>uuid-intellectual-entity</premis:objectIdentifierValue>
    </premis:objectIdentifier>

    <!-- relationship between the IE and its carrier representation -->
    <premis:relationship>
        <premis:relationshipType>structural</premis:relationshipType>
        <premis:relationshipSubType>has carrier copy</premis:relationshipSubType>
        <premis:relatedObjectIdentifier>
            <premis:relatedObjectIdentifierType>UUID</premis:relatedObjectIdentifierType>
            <premis:relatedObjectIdentifierValue>uuid-carrier-representation</premis:relatedObjectIdentifierValue>
        </premis:relatedObjectIdentifier>
    </premis:relationship>

</premis:object>

<!-- PREMIS object for the carrier representation -->
<premis:object xsi:type="premis:representation">
    <premis:objectIdentifier>
        <premis:objectIdentifierType>UUID</premis:objectIdentifierType>
        <premis:objectIdentifierValue>uuid-carrier-representation</premis:objectIdentifierValue>
    </premis:objectIdentifier>

    <!-- relationship between the carrier representation and its IE -->
    <premis:relationship>
        <premis:relationshipType>structural</premis:relationshipType>
        <premis:relationshipSubType>is carrier copy of</premis:relationshipSubType>
        <premis:relatedObjectIdentifier>
            <premis:relatedObjectIdentifierType>UUID</premis:relatedObjectIdentifierType>
            <premis:relatedObjectIdentifierValue>uuid-intellectual-entity</premis:relatedObjectIdentifierValue>
        </premis:relatedObjectIdentifier>
    </premis:relationship>
</premis:object>

Describing a carrier within the carrier representation

The carrier representation is described using the <premis:significantPropertiesExtension> element. A custom schema is defined inside this extension to capture structural and descriptive metadata of the carrier. The schema uses the hasip namespace and is specified below.

Example 2: hierarchical listing of a package premis.xml file with an Intellectual Entity and a Carrier Representation.

premis:premis
│
├── premis:object xsi:type="premis:intellectualEntity"    # Intellectual Entity
│
└── premis:object xsi:type="premis:representation"        # Carrier Representation
    └── premis:significantProperties
        └── premis:significantPropertiesExtension
            └── ...                                       # Carrier representation description
Element premis:premis/premis:object[@xsi:type='premis:representation']/premis:significantProperties/premis:significantPropertiesExtension[@xmlns:hasip='https://data.hetarchief.be/ns/sip/']
Name PREMIS significant properties extension
Description This extension contains structural and descriptive metadata of the carrier representation. It must declare the hasip namespace.
Cardinality 1..1
Obligation MUST

The following elements paths are relative to the <premis:significantPropertiesExtension> element.

Element premis:significantPropertiesExtension/hasip:numberOfReels
Name Number of reels
Description The number of image and audio reels that make up the carrier.
Datatype xsd:nonNegativeInteger
Cardinality 0..1
Obligation MAY
Element premis:significantPropertiesExtension/hasip:hasMissingAudioReels
Name Has missing audio reels
Description Whether there are one or more audio reel missing.
Datatype xsd:boolean
Cardinality 0..1
Obligation MAY
Element premis:significantPropertiesExtension/hasip:hasMissingImageReels
Name Has missing image reels
Description This element indicates whether there are one or more image reel missing.
Datatype xsd:boolean
Cardinality 0..1
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt
Name Stored at
Description This element contains the reels making up the carrier.
Cardinality 1..*
Obligation MUST
Element premis:significantPropertiesExtension/hasip:storedAt/hasip:imageReel
Name Image reel
Description A reel with (part of) the carrier image on it
Cardinality 0..*
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/hasip:audioReel
Name Audio reel
Description A reel with (part of) the carrier audio on it
Cardinality 0..*
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/(hasip:imageReel|hasip:audioReel)/hasip:identifier
Name Identifier
Description The identifier of the image or audio reel
Datatype String
Cardinality 1..1
Obligation MUST
Element premis:significantPropertiesExtension/hasip:storedAt/(hasip:imageReel|hasip:audioReel)/hasip:medium
Name Storage medium
Description The physical medium on which the Object is stored (e.g., magnetic tape, hard disk, CD-ROM, DVD).
Datatype String
Cardinality 1..1
Obligation MUST
Element premis:significantPropertiesExtension/hasip:storedAt/(hasip:imageReel|hasip:audioReel)/hasip:aspectRatio
Name Aspect ratio
Description The aspect ratio of the reel. Not to be confused with the aspect ratio of the digitized film.
Datatype String
Cardinality 0..1
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/(hasip:imageReel|hasip:audioReel)/hasip:material
Name Material
Description The base material that makes up the carrier.
Datatype String
Cardinality 0..1
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/(hasip:imageReel|hasip:audioReel)/hasip:preservationProblem
Name Preservation problem
Description Any preservation problems encountered during the digitization of the film.
Datatype String
Cardinality 0..*
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/(hasip:imageReel|hasip:audioReel)/hasip:stockType
Name Stock type
Description The stock type refers to the specific kind of film material used—such as negative, positive, interpositive, or internegative—each designed for a particular stage of image/sound capture, duplication, or projection.
Datatype String
Cardinality 0..1
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/hasip:imageReel/hasip:coloringType
Name Coloring type
Description Indication of the coloring of the image reel.
Datatype String; fixed vocabulary
Vocabulary BandW, Color, Colorized, Composite, UnknownColorType
Cardinality 0..*
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/hasip:imageReel/hasip:hasCaptioning
Name Has captioning
Description This element contains the open captions of the film.
Cardinality 0..1
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/hasip:imageReel/hasip:hasCaptioning/hasip:openCaptions
Name Open captions
Description Indicates the (embedded) captioning of an image reel.
Cardinality 0..*
Obligation MAY
Element premis:significantPropertiesExtension/hasip:storedAt/hasip:imageReel/hasip:hasCaptioning/hasip:openCaptions/hasip:inLanguage
Name Open captions language
Description The language the open captions are in.
Datatype BCP47
Cardinality 0..*
Obligation MAY
Normative summary
  • There MUST be a carrier representation in the package premis.xml, reflected by a <premis:object>;
  • Any descriptive metadata about the physical film’s reel(s) MUST be included as part of the carrier representation <premis:object>;
  • Each <premis:significantPropertiesExtension> element MUST use the schema described above;
  • If a <premis:significantPropertiesExtension> element is used, it MUST declare the namespaces of the external metadata schemas using the @xmlns attribute;
  • Each digitized reel in the SIP MUST be reflected in the carrier representation <premis:object> by using separate <premis:storageMedium> elements;
  • Each <premis:storageMedium> element MUST contain a <premis:storage> element with the specific carrier type of a reel;
  • Any events related to the handling of the real-life, physical carrier(s) MUST refer to the carrier representation <premis:object> with a <premis:linkingObjectIdentifier> element (see Adding provenance of representations;

Example 4 below contains an illustration of a simplified carrier representation (preceded by its intellectual entity) and a registration event involving the carrier representation in the package premis.xml file:

<?xml version="1.0" encoding="UTF-8"?>
<premis:premis version="3.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xmlns:premis="http://www.loc.gov/premis/v3" xmlns:haObj="https://data.hetarchief.be/ns/object/"
    xsi:schemaLocation="http://www.loc.gov/premis/v3 https://www.loc.gov/standards/premis/premis.xsd">

    <!-- IE for the film as a whole -->
    <premis:object
        xsi:type="premis:intellectualEntity">

        <premis:objectIdentifier>
            <premis:objectIdentifierType>UUID</premis:objectIdentifierType>
            <premis:objectIdentifierValue>uuid-b1c5fa36-6bb6-460b-836b-ade5541fe89e</premis:objectIdentifierValue>
        </premis:objectIdentifier>

        <!-- relationship between the IE and its carrier representation -->
        <premis:relationship>
            <premis:relationshipType authority="relationshipType"
                authorityURI="http://id.loc.gov/vocabulary/preservation/relationshipType"
                valueURI="http://id.loc.gov/vocabulary/preservation/relationshipType/str">structural</premis:relationshipType>
            <premis:relationshipSubType authority="relationshipSubType"
                authorityURI="http://id.loc.gov/vocabulary/preservation/relationshipSubType"
                valueURI="http://id.loc.gov/vocabulary/preservation/relationshipSubType/isr">is
                represented
                by</premis:relationshipSubType>
            <premis:relatedObjectIdentifier>
                <premis:relatedObjectIdentifierType>UUID</premis:relatedObjectIdentifierType>
                <premis:relatedObjectIdentifierValue>uuid-e2f092de-f800-486c-a291-3160ce740544</premis:relatedObjectIdentifierValue>
            </premis:relatedObjectIdentifier>
        </premis:relationship>

    </premis:object>

    <!-- PREMIS object for the carrier representation itself -->
    <premis:object xsi:type="premis:representation">

        <premis:objectIdentifier>
            <premis:objectIdentifierType>UUID</premis:objectIdentifierType>
            <premis:objectIdentifierValue>uuid-e2f092de-f800-486c-a291-3160ce740544</premis:objectIdentifierValue>
        </premis:objectIdentifier>

        <!-- descriptive metadata in several <premis:significantProperties> elements -->
        <premis:significantProperties>
            <premis:significantPropertiesType>barcode_image_reels</premis:significantPropertiesType>
            <premis:significantPropertiesValue>AFLM_FEL_001392</premis:significantPropertiesValue>
        </premis:significantProperties>
        <premis:significantProperties>
            <premis:significantPropertiesType>material_type</premis:significantPropertiesType>
            <premis:significantPropertiesValue>Original positive</premis:significantPropertiesValue>
        </premis:significantProperties>
        <premis:significantProperties>
            <premis:significantPropertiesType>num_reels</premis:significantPropertiesType>
            <premis:significantPropertiesValue>1</premis:significantPropertiesValue>
        </premis:significantProperties>
        <premis:significantProperties>
            <premis:significantPropertiesType>film_base</premis:significantPropertiesType>
            <premis:significantPropertiesValue>acetate</premis:significantPropertiesValue>
        </premis:significantProperties>

        <!-- descriptive metadata using Schema in a nested <premis:significantPropertiesExtension> element -->
        <premis:significantProperties>
            <premis:significantPropertiesExtension xmlns:schema='https://schema.org'>
                <schema:duration>0:04:55</schema:duration>
                <schema:material>Original positive</schema:material>
                <schema:size>
                    <schema:unitCode>MTR</schema:unitCode>
                    <schema:QuantitativeValue>135</schema:QuantitativeValue>
                </schema:size>
            </premis:significantPropertiesExtension>
        </premis:significantProperties>

        <!-- indication of the carrier type -->
        <premis:storage>
            <premis:storageMedium>
                super8mmfilm
            </premis:storageMedium>
        </premis:storage>

        <!-- relationship between the carrier representation and its IE -->
        <premis:relationship>
            <premis:relationshipType authority="relationshipType"
                authorityURI="http://id.loc.gov/vocabulary/preservation/relationshipType"
                valueURI="http://id.loc.gov/vocabulary/preservation/relationshipType/str">structural</premis:relationshipType>
            <premis:relationshipSubType authority="relationshipSubType"
                authorityURI="http://id.loc.gov/vocabulary/preservation/relationshipSubType"
                valueURI="http://id.loc.gov/vocabulary/preservation/relationshipSubType/rep">
                represents</premis:relationshipSubType>
            <premis:relatedObjectIdentifier>
                <premis:relatedObjectIdentifierType>UUID</premis:relatedObjectIdentifierType>
                <premis:relatedObjectIdentifierValue>uuid-b1c5fa36-6bb6-460b-836b-ade5541fe89e</premis:relatedObjectIdentifierValue>
            </premis:relatedObjectIdentifier>
        </premis:relationship>

    </premis:object>

    <!-- registration event -->
    <premis:event>
        <premis:eventIdentifier>

            <premis:eventIdentifierType>UUID</premis:eventIdentifierType>
            <premis:eventIdentifierValue>uuid-aba62b7b-dd7a-43cf-b077-45f8b96deae8</premis:eventIdentifierValue>

        </premis:eventIdentifier>
        <premis:eventType valueURI="https://data.hetarchief.be/id/event-type/registration">
      registration
    </premis:eventType>
        <premis:eventDateTime>
      2021-04-02T09:04:04
    </premis:eventDateTime>
        <premis:eventDetailInformation>
            <premis:eventDetail>Base Scratching remarks: Light scratches, lines and stripes. Some cables. vinegar date: 2021-06-30 pH value:PH 4.8</premis:eventDetail>
            <premis:eventDetailExtension xmlns:schema="https://schema.org/">
                <schema:name>estimate_preparation_time_for_digitisation</schema:name>
                <schema:value>1:30:00</schema:value>
            </premis:eventDetailExtension>
            <premis:eventDetailExtension xmlns:schema="https://schema.org/">
                <schema:name>estimate_manual_cleaning_time</schema:name>
                <schema:value>0:00:00</schema:value>
            </premis:eventDetailExtension>
            <premis:eventDetailExtension xmlns:schema="https://schema.org/">
                <schema:name>physical_state_film</schema:name>
                <schema:value>film in good state</schema:value>
            </premis:eventDetailExtension>
        </premis:eventDetailInformation>

        <premis:eventOutcomeInformation>
            <premis:eventOutcome
                valueURI="http://id.loc.gov/vocabulary/preservation/eventOutcome/suc">success</premis:eventOutcome>
        </premis:eventOutcomeInformation>
        
        <premis:linkingAgentIdentifier>
            <premis:linkingAgentIdentifierType>MEEMOO-OR-ID</premis:linkingAgentIdentifierType>
            <premis:linkingAgentIdentifierValue>OR-jw86m54</premis:linkingAgentIdentifierValue>
            <premis:linkingAgentRole
                valueURI="http://id.loc.gov/vocabulary/preservation/eventRelatedAgentRole/imp">implementer</premis:linkingAgentRole>
        </premis:linkingAgentIdentifier>

        <!-- reference to the premis:Representation object of the carrier representation -->
        <premis:linkingObjectIdentifier>
            <premis:linkingObjectIdentifierType>UUID</premis:linkingObjectIdentifierType>
            <premis:linkingObjectIdentifierValue>uuid-e2f092de-f800-486c-a291-3160ce740544</premis:linkingObjectIdentifierValue>
            <premis:linkingObjectRole
                valueURI="http://id.loc.gov/vocabulary/preservation/eventRelatedObjectRole/sou">source</premis:linkingObjectRole>
        </premis:linkingObjectIdentifier>
    </premis:event>

</premis:premis>

Validation

The XML files that are required by this profile can be validated using the following XML schema definitions:

File Format XML Schema
METS.xml METS v1.12.1 mets.xsd
premis.xml PREMIS v3.0 premis-v3-0.xsd
dc+schema.xml Dublin Core with Schema.org dc+schema.xsd (not yet available)

Use Cases

Some use cases that implement this profile are: