Book SP

  • Islandora Paged Content
gs -v
GPL Ghostscript 9.18 (2015-10-05)
Copyright (C) 2015 Artifex Software, Inc.  All rights reserved.

apt-get install poppler-utils

cd /var/www/
git clone git://
cd /var/www/
drush -u 1 en islandora_paged_content

Browse to config: admin/islandora/solution_pack_config/paged_content
	gs (GhostScript) = /usr/bin/gs
        pdfinfo = /usr/bin/pdfinfo
        pdftotext = /usr/bin/pdftotext
        Allow Extraction of Raw Text = NO
 	djatoka URL =
 	Solr page sequence number field = RELS_EXT_isSequenceNumber_literal_ms
        Use Solr to derive pages and sequence numbers = NO
        Set page labels to sequence numbers = NO
        Hide Page Objects From Search Results = NO
  • Tesseract
apt-get install tesseract-ocr tesseract-ocr-eng tesseract-ocr-ita tesseract-ocr-fra tesseract-ocr-spa tesseract-ocr-deu

tesseract -v
tesseract 3.04.01
  libgif 5.1.2 : libjpeg 8d (libjpeg-turbo 1.4.2) : libpng 1.2.54 : libtiff 4.0.6 : zlib 1.2.8 : libwebp 0.4.4 : libopenjp2 2.1.0

tesseract --list-langs
List of available languages (7):
  • Islandora OCR
cd /var/www/
git clone git://
cd /var/www/
drush -u 1 en islandora_ocr

Browse to config: admin/islandora/tools/ocr
     Tesseract = /usr/bin/tesseract
     Languages available for OCR 
       English = YES
       Italian = YES
     Enable Solr Fast Vector Highlighting = YES
     Solr field containing OCR text = text_nodes_HOCR_hlt
     Maximum number of results to return in a Solr query = 32
  • Islandora Internet Archive Bookreader
cd /var/www/
git clone git://
mv internet_archive_bookreader bookreader

cd /var/www/
git clone git://
  • Islandora Internet Archive Bookreader UPDATE
  • Diego git + IABR v2 (many many thanks Diego!!)
cd /var/www/
git clone git:// (branc Master, Commits on Jul 28, 2017)

cd /var/www/
git clone -b 7.x-2dev git:// (branch 7.x-2dev latest Commits on Apr 26, 2017)

Hardcoded redirect tile source to Fedora.

nano -w islandora_internet_archive_bookreader/js/islandora_book_reader.js
  IslandoraBookReader.prototype.getDjatokaUri = function(resource_uri) {
    var base_uri = this.settings.djatokaUri;
    //Do some sanitation on that base uri.
    //Since it comes from an admin form, let's make sure there's a '/' at the
    //end of it.
    if (base_uri.charAt(base_uri.length) != '/') {
      base_uri += '/';
+    var new_resource_uri = resource_uri.replace('', '');
//if djatoka on the same server than Fedora you can use instead
+    var new_resource_uri = resource_uri.replace('', '');
+    new_resource_uri = new_resource_uri.replace('datastream/JP2/view', 'datastreams/JP2/content');
    var params = $.param({
-      'rft_id': resource_uri,
+      'rft_id': new_resource_uri,
      'url_ver': 'Z39.88-2004',
      'svc_id': 'info:lanl-repo/svc/getRegion',
      'svc_val_fmt': 'info:ofi/fmt:kev:mtx:jpeg2000',
      'svc.format': 'image/jpeg',
      'svc.level': this.settings.compression,
      'svc.rotate': 0
    return (base_uri + 'resolver?' + params);
  • Colorbox
cd /var/www/
drush dl colorbox
drush en colorbox
cd ~
mv colorbox-1.x colorbox
mv colorbox /var/www/

Configure colorbox: admin/config/media/colorbox

  • Enable Colorbox inline YES
cd /var/www/
drush -u 1 en islandora_internet_archive_bookreader
  • Islandora Book SP
cd /var/www/
git clone git://
cd /var/www/
drush -u 1 en islandora_book

Browse to config: admin/islandora/solution_pack_config/book

  • PDF datastream. Requires ImageMagick = YES
  • Image datastreams (TN, JPEG, JP2). Requires Large Image Solution Pack = YES
  • OCR datastreams (OCR, HOCR). Requires Islandora OCR module = YES
  • Parent Solr Field = RELS_EXT_isMemberOf_uri_ms
  • Display object metadata = YES
  • Book Viewers = Internet Archive BookReader
  • Page Viewers = OpenSeadragon
isla7x/book.txt ยท Last modified: 2017/08/01 15:28 by giancarlo

Developers: CNR IRCrES IT Office and Library
Giancarlo Birello (giancarlo.birello _@_ and Anna Perin (anna.perin _@_
ASA@TO.CNR is licensed under: Creative Commons License
Recent changes RSS feed Creative Commons License Valid XHTML 1.0 Valid CSS Driven by DokuWiki
Drupal Garland Theme for Dokuwiki