JSON Format for Duplicate Books

This page explains the JSON schema for the duplicate book data. This is an example of the data for a single book:

{
    "canonBooks": [
        {
            "canon": "urn:cts:latinLit:phi0914.phi0012.perseus-eng1",
            "title": " The History of Rome, Book 2",
            "author": "Livy",
            "ocrBooks": [
                "livy01livy",
                "livywithenglisht01livyuoft"
            ]
        }
    ]
}

The cannon attribute is the URI identifier of the canon work within the Perseus Digital Library. Please see this Perseus Catalog Blog entry about canonical URIs.

The title and author attributes are self evident, and were extracted from the capabilities XML file which is a list of all the texts hosted by the Perseus Digital Library.

The ocrBooks attribute is a list of Internet Archive book identifiers. To see the book, append the Archive ID to the URL: https://archive.org/details/.