Skip to end of metadata
Go to start of metadata

This page briefly documents the interfaces currently available at .

Additional information  of  interfaces of the National Library can be found at


Digi interfaces are available for the metadata of the material, which is available from the network. I.e. for newspapers and journals until year 1949 (as digitization progresses). 

Interfaces can be used according to the terms of use of .

Data content

Metadata of the digitized works (newspapers, journals, books, sheet music, etc.). For newspapers it is possible to get publication time, issn of each digitized binding. For books there is all metadata of books  obtained from the description work.

02/2022: Currently ongoing: Modifying the marc21 format of books to offer enriched digital record of a book, which is easily processed by other national library information systems.


OpenURL links you to the page image based on the date information on the URL parameters. The parameters, which are available can be seen from example below:

  • genre ( journal, no need to change)
  • date (YYYY-MM-DD)
  • ISSN (the identifier of the newspaper or journal)
  • spage  (page number)

The above url returns the page image of Aamulehti 3.1.1888 page 2.


  • Use OpenURL if you want permanent reference to the newspaper, without using the binding id.

Getting started with OAI-PMH

OAI-PMH is a harvesting interface via with you can get metadata records of a specific service. Most often you will want to harvest metadata for a specific collection, which is in OAI-PMH vocabulary set .   The sets of OAI-PMH corresponds in Digi the collection identifier. The collection idenfier can be seen from the homepage of a collection  within Digi. For example for previous collection the identifier is 681, and the set number for OAI-PMH is col-681. So you can access all metadata of that collection via accessing: .

Quite often you can use specific OAI-PMH library from a programming language. Eg. Sickle for Python is one option.  There you set up the OAI-PMH server connection details, set up collection and you get back a handle which gives you the next batch of records.

from sickle import Sickle

URL =''

sickle = Sickle(URL)

records = sickle.ListRecords(
    **{'metadataPrefix': 'oai_dc'
       'set': colid,


for record in records:

    # do sth fun with a record.


Basic information about the service

Which formats offered: (dublin core , qdc_finna, marc21)

Which sets are offered:

The collections of digi can be retrieved by prepending 'col-' to the collection number, which can be seen at the url of the collection. For example for collection the OAI-PMH is :

Return the first 100 records, at the end is the resumption token to get next batch

The resumption token by which you can continue the first batch of results is given at the end of the first batch and so on until end of batches.

Returns different material types:

Returns just identifiers and datestamp:

Return 1 specific record (the identifier is created with help of binding id):

Usage of richer QDC_finna metadata format:

Filter records by date


  • Use OAI-PMH if you want the basic binding level metadata. The querying of all records will take some time as batch size via OAI-PMH is limited.

OAI-PMH for Books

The books operate in the same way as the newspaper and journal materials. In the books , the 'set' is the collection of books, you can find the collection id either from the collection page or by observing the ListSet verb results.

Getting all collections of books

Getting the 'Geography and Travel' collection:

Getting a subcollection (subcollections are separated via colon ((smile) , :

Getting a particular record:

(A quick way is just to replace the binding id of a desired binding to the example above).

Getting a particular record in marc21 format:


Available for the newspaper and journal title information (core metadata):


See instructions of using via SFX:

Sub components of individual binding available

The different sub components of post-processing can be accessed via , where <bindingid> is the unique local id of a particular issue number.   the access page image (.jpg)  the thumbnail of the page (.jpg)  the whole binding as pdf  the page text as it is.  returns the ALTO XML,which contains also the layout information. sample of an error page, if binding doesn't have that many pages   mets file for the whole binding, true=unfiltered, false=filters away some amdsec parts.


For all questions or comments, please use the Feedback-functionality of  .










  • No labels