Table of Contents |
---|
...
<ARGUMENT> | Value Associated | Purpose | Accepted File Types | Additional Notes | ||||
---|---|---|---|---|---|---|---|---|
MODS | MODS XML file name | provide MODS metadata for an asset | xml | Can be used for publication/series-level assets, book and issue-level assets. | ||||
TN | thumbnail image file name | provide a thumbnail picture for an asset | png, jpg, jpeg | Can be used for publication/series-level assets, book and issue-level assets. If no thumbnail is provided during batch ingest, the DAMS will copy the thumbnail image of the first page of the asset to the book/issue level asset. | ||||
LANG | three-letter language code | instruct the DAMS software to perform OCR for each page | N/A | Can be used for book/issue-level assets. See page _Text extraction in DAMS for the list of languages for which the DAMS software supports OCR processing.
| ||||
PAGE<NUMBER> | name of the file with the page image | provide page content, in sequential order | tiff, tif, jp2 | Can be used for book/issue-level assets. Replace <NUMBER> with a number for each page that indicates the page's sequential order, for example:
Pad the number with zeroes. The number of zeroes for padding is up to you. | ||||
PAGE<NUMBER>_OCR_CUSTOM | name of externally generated OCR file for that page | allows you to provide your own OCR datastream for each page | txt | Can be used for book/issue-level assets. | ||||
PAGE<NUMBER>_<CUSTOM_DATASTREAM> | name of additional file | allows you to add custom datastreams to page-level assets | * | Replace <CUSTOM_DATASTREAM> with a datastream label. The label should correspond to one of the recommended datastream types listed on page Anatomy of DAMS digital assets. If you wish to ingest additional files that do not match any of the listed datastream types, please contact the DAMS managers for consultation (click here to submit a DAMS service request).
| ||||
FULL_TEXT_CUSTOM | name of text file with externally created full text (text extracted from PDF) | allows you to provide your own FULL_TEXT datastream for a book/issue | txt | Can be used for book/issue-level assets.
| ||||
name of your pdf file | PDF for resource | Can be used for book/issue-level assets. Use to add an externally created PDF document to an asset.
| ||||||
HOSTPUBLICATION | PID without namespace ID | Add issue(s) to publication | text | Can be used for book/issue-level assets. Use to specify which publication/series-level asset an issue shold be added to. PID without namespace ID is the part of a PID after the colon (UUID), e.g. | ||||
HOSTISSUE | PID without namespace ID | Add pages to an issue | text | Can be used with sets of page images. Use to specify which issue-level asset a set of page images should be added to. PID without namespace ID is the part of a PID after the colon (UUID), e.g. | ||||
HOSTBOOK | PID without namespace ID | Add pages to a book | text | Can be used with sets of page images. Use to specify which book-level asset a set of page images should be added to. PID without namespace ID is the part of a PID after the colon (UUID), e.g. |
Folder naming conventions and folder hierarchy
...
Code Block |
---|
eid1234_example-batch-submission/ (batch job folder) ├── grapes_of_wrath_BOOK/ │ ├── datastreams.txt │ ├── modsfile.xml │ ├── book_level_custom_ocr.txt │ ├── book_level_pdf.pdf │ ├── page01.tif │ └── page02.tif ├── wall_street_journal_PUBLICATION/ │ ├── datastreams.txt │ ├── modsfile.xml │ ├── wsj_jan_2016_ISSUE/ │ │ ├── datastreams.txt │ │ ├── modsfile.xml │ │ ├── page01.tif │ │ └── page02.tif │ └── wsj_feb_2016_ISSUE/ │ ├── datastreams.txt │ ├── modsfile.xml │ ├── page01.tif │ └── page02.tif ├── ascii_art_monthly_july_2021_ISSUE/ │ ├── datastreams.txt │ ├── modsfile.xml │ ├── page01.tif │ ├── page01_custom_ocr.txt │ ├── page02.tif │ └── page02_custom_ocr.txt └── nyt_2020-11-04_PAGES/ ├── datastreams.txt ├── issue_level_custom_ocr.txt ├── issue_level_pdf.pdf ├── page01.tif ├── page01_custom_ocr.txt ├── page02.tif └── page02_custom_ocr.txt |
Step 2: Upload batch job to Jscape
Multiexcerpt include | ||||
---|---|---|---|---|
|
Step 3: Set up collection and submit form in DAMS interface
Multiexcerpt include | ||||
---|---|---|---|---|
|