This page briefly documents the interfaces currently available at digi.kansalliskirjasto.fi .
Additional information about the interfaces of the National Library can be found at https://data.nationallibrary.fi
Access
The Digi interfaces provide metadata for the material that is freely available online, i.e. newspapers and journals until the year 1949 (as digitization progresses).
The interfaces can be used in accordance with the terms of use of digi.nationallibrary.fi .
Data content
Metadata of the digitized works (newspapers, journals, books, sheet music, etc.). For newspapers, the publication time and ISSN of each digitized binding are available. For books, all the metadata produced by the description (cataloguing) work is available.
02/2022: Currently ongoing: modifying the MARC21 format of books to offer an enriched digital record of a book that is easily processed by other National Library information systems.
OpenURL
OpenURL links you to a page image based on the date information in the URL parameters. The available parameters are:
- genre (journal; no need to change)
- date (YYYY-MM-DD)
- ISSN (the identifier of the newspaper or journal)
- spage (page number)
The above URL returns the image of page 2 of Aamulehti from 3.1.1888 (3 January 1888).
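As a rough illustration, the four parameters listed above can be assembled into a query string as sketched below. The endpoint path in `OPENURL_BASE`, the lowercase `issn` key, and the ISSN value are assumptions made for the example; they are not taken from this page and should be checked against the service.

```python
from urllib.parse import urlencode

# Hypothetical endpoint path: this page does not spell it out, adjust as needed.
OPENURL_BASE = "https://digi.kansalliskirjasto.fi/openurl"


def build_openurl(issn: str, date: str, spage: int, base: str = OPENURL_BASE) -> str:
    """Build an OpenURL query from the four parameters listed above."""
    params = {
        "genre": "journal",  # fixed value, no need to change
        "date": date,        # YYYY-MM-DD
        "issn": issn,        # identifier of the newspaper or journal (key case assumed)
        "spage": spage,      # page number
    }
    return f"{base}?{urlencode(params)}"


# "0000-0000" is a placeholder, not the ISSN of a real title.
print(build_openurl("0000-0000", "1888-01-03", 2))
```

Replace the placeholder ISSN with the ISSN of the title you want to link to.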
Applicability
- Use OpenURL if you want a permanent reference to the newspaper without using the binding id.
Getting started with OAI-PMH
OAI-PMH is a harvesting interface through which you can get the metadata records of a service. Most often you will want to harvest the metadata of a specific collection, which in OAI-PMH vocabulary is a set. In Digi, the OAI-PMH sets correspond to collection identifiers. The collection identifier can be seen in the URL of the collection's home page within Digi, for example https://digi.kansalliskirjasto.fi/collections?id=681 . For that collection the identifier is 681 and the OAI-PMH set name is col-681, so you can access all metadata of the collection at: https://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?verb=ListRecords&set=col-681&metadataPrefix=oai_dc .
Quite often you can use an OAI-PMH library for your programming language, e.g. Sickle for Python. You set up the OAI-PMH server connection details and the collection, and you get back a handle that gives you the next batch of records.
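    from sickle import Sickle

    URL = 'https://digi.kansalliskirjasto.fi/interfaces/OAI-PMH'
    colid = 'col-681'  # set name: 'col-' + collection identifier

    sickle = Sickle(URL)
    # ListRecords returns an iterator that fetches the next batch as needed
    records = sickle.ListRecords(**{
        'metadataPrefix': 'oai_dc',
        'set': colid,
    })
    for record in records:
        # do something fun with a record
        print(record.header.identifier)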
OAI-PMH
Basic information about the service
Formats offered: Dublin Core (oai_dc), qdc_finna, marc21
Sets offered:
The collections of Digi can be retrieved by prepending 'col-' to the collection number, which can be seen in the URL of the collection. For example, for the collection https://digi.kansalliskirjasto.fi/collections?id=41 the OAI-PMH request is:
- http://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?verb=ListRecords&metadataPrefix=qdc_finna&set=col-41
Return the first 100 records; the resumption token for fetching the next batch is at the end of the response:
- http://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?verb=ListRecords&metadataPrefix=oai_dc&set=sanomalehti
- http://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?verb=ListRecords&resumptionToken=s9bbfLpHRi+100
- http://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?verb=ListRecords&metadataPrefix=oai_dc&set=aikakauslehti
Each batch ends with the resumption token for requesting the next batch, and so on until the last batch is reached.
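The batching described above can be followed by reading the `resumptionToken` element from each response. The sketch below only shows the parsing step, using the standard OAI-PMH 2.0 namespace; the sample response is a trimmed, made-up document for illustration, not real output from the service.

```python
import xml.etree.ElementTree as ET
from typing import Optional

# Namespace defined by the OAI-PMH 2.0 specification.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def next_resumption_token(response_xml: str) -> Optional[str]:
    """Return the resumption token of a response, or None if this was the last batch."""
    root = ET.fromstring(response_xml)
    token = root.find(f".//{OAI_NS}resumptionToken")
    # A missing or empty element means there are no more batches.
    if token is None or not (token.text or "").strip():
        return None
    return token.text.strip()


# Trimmed, made-up response used only to exercise the parser.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <resumptionToken>s9bbfLpHRi+100</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

print(next_resumption_token(SAMPLE))
```

A harvesting loop would keep requesting `verb=ListRecords&resumptionToken=...` with the returned value until the function returns `None`. Libraries such as Sickle handle this loop for you.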
Returns different material types:
Returns just identifiers and datestamp:
Return one specific record (the identifier is built from the binding id):
- https://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:digi.kansalliskirjasto.fi:1976463
- https://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:digi.kansalliskirjasto.fi:2606
Usage of richer QDC_finna metadata format:
Filter records by date
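In the OAI-PMH specification, date filtering is done with the optional `from` and `until` arguments, which select records by their datestamp in the repository (not by publication date), and the `ListIdentifiers` verb returns only identifiers and datestamps. The sketch below builds such a request; that this server accepts these arguments in this form is an assumption, and the helper name `list_identifiers_url` and the example dates are made up for illustration.

```python
from typing import Optional
from urllib.parse import urlencode

BASE = "https://digi.kansalliskirjasto.fi/interfaces/OAI-PMH"


def list_identifiers_url(set_spec: str,
                         date_from: Optional[str] = None,
                         until: Optional[str] = None,
                         metadata_prefix: str = "oai_dc") -> str:
    """Build a ListIdentifiers request, optionally limited by record datestamp."""
    params = {
        "verb": "ListIdentifiers",
        "metadataPrefix": metadata_prefix,
        "set": set_spec,
    }
    # 'from' and 'until' are standard OAI-PMH arguments (YYYY-MM-DD granularity).
    if date_from:
        params["from"] = date_from
    if until:
        params["until"] = until
    return f"{BASE}?{urlencode(params)}"


# Example dates are arbitrary.
print(list_identifiers_url("col-681", "2021-01-01", "2021-12-31"))
```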
Applicability
- Use OAI-PMH if you want the basic binding-level metadata. Querying all records takes some time, because the batch size of OAI-PMH responses is limited.
OAI-PMH for Books
Books work in the same way as the newspaper and journal materials. For books, the 'set' is a collection of books; you can find the collection id either on the collection page or in the results of the ListSets verb.
Getting all collections of books
Getting the 'Geography and Travel' collection:
Getting a subcollection (subcollections are separated by a colon, ':'):
Getting a particular record:
(A quick way is to substitute the binding id of the desired binding into the example above.)
Getting a particular record in marc21 format:
- https://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?verb=GetRecord&metadataPrefix=marc21&identifier=oai:digi.kansalliskirjasto.fi:1927004
- The marc21 format returns both the MARC record of the digital copy and the MARC record of the printed binding, as-is.
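Since the record identifier is the prefix `oai:digi.kansalliskirjasto.fi:` followed by the binding id, a GetRecord request for any binding can be derived from the examples above. The following is a minimal sketch; the helper names `record_identifier` and `get_record_url` are made up for illustration.

```python
from urllib.parse import urlencode

BASE = "https://digi.kansalliskirjasto.fi/interfaces/OAI-PMH"


def record_identifier(binding_id: int) -> str:
    """OAI identifier of a binding, following the pattern of the examples above."""
    return f"oai:digi.kansalliskirjasto.fi:{binding_id}"


def get_record_url(binding_id: int, metadata_prefix: str = "marc21") -> str:
    """Build a GetRecord request for one binding in the given metadata format."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": record_identifier(binding_id),
    }
    # safe=':' keeps the colons of the identifier readable instead of %3A.
    return f"{BASE}?{urlencode(params, safe=':')}"


print(get_record_url(1927004))
```

Passing `metadata_prefix="oai_dc"` or `"qdc_finna"` requests the same record in the other formats listed earlier.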
JSON
Available for the newspaper and journal title information (core metadata):
https://digi.kansalliskirjasto.fi/api/newspaper/titles?language=fi
SFX
See the instructions for use via SFX: https://www.kiwi.fi/pages/viewpage.action?pageId=103187594
Sub components of individual binding available
The sub-components produced by post-processing can be accessed via https://digi.kansalliskirjasto.fi , where <bindingid> is the unique local id of a particular issue (1426186 in the examples below).
- https://digi.kansalliskirjasto.fi/sanomalehti/binding/1426186/image/1 : the access page image (.jpg)
- https://digi.kansalliskirjasto.fi/sanomalehti/binding/1426186/thumbnail/1 : the thumbnail of the page (.jpg)
- https://digi.kansalliskirjasto.fi/sanomalehti/binding/1426186/pdf : the whole binding as PDF
- https://digi.kansalliskirjasto.fi/sanomalehti/binding/1426186/page-1.txt : the page text as it is
- https://digi.kansalliskirjasto.fi/sanomalehti/binding/1426186/page-1.xml : the ALTO XML, which also contains the layout information
- https://digi.kansalliskirjasto.fi/sanomalehti/binding/1426186/image/100 : example of the error page returned when the binding does not have that many pages
- https://digi.kansalliskirjasto.fi/sanomalehti/binding/1426186/mets.xml?full=true : the METS file for the whole binding; full=true returns it unfiltered, full=false filters out some amdSec parts
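The example URLs above follow a regular pattern, so they can be generated for a given binding id and page number. The sketch below assumes the pattern holds for other newspaper bindings and pages in the same way as for binding 1426186; the `sanomalehti` path segment is taken from the examples, and other material types may use a different one. The function name `binding_urls` and its dictionary keys are made up for illustration.

```python
# Path prefix taken from the newspaper examples above.
BINDING_BASE = "https://digi.kansalliskirjasto.fi/sanomalehti/binding"


def binding_urls(binding_id: int, page: int = 1, full_mets: bool = True) -> dict:
    """Return the sub-component URLs of one binding, keyed by a descriptive name."""
    root = f"{BINDING_BASE}/{binding_id}"
    return {
        "image": f"{root}/image/{page}",          # access page image (.jpg)
        "thumbnail": f"{root}/thumbnail/{page}",  # page thumbnail (.jpg)
        "pdf": f"{root}/pdf",                     # whole binding as PDF
        "text": f"{root}/page-{page}.txt",        # plain page text
        "alto": f"{root}/page-{page}.xml",        # ALTO XML with layout information
        "mets": f"{root}/mets.xml?full={'true' if full_mets else 'false'}",
    }


for name, url in binding_urls(1426186).items():
    print(name, url)
```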
References
- OAI-PMH Specification at OpenArchives.org http://www.openarchives.org/OAI/openarchivesprotocol.html .
For all questions or comments, please use the feedback function of http://digi.nationallibrary.fi .