Example Test Publication

Prototype publication (work in progress) Computational Publishing for Collections
GitHub repository
Project tasks

Work Plan - Oct 22

Use case

Proof of concept publication on a computational publishing platform.

Motivation: Can 'computational publishing platform' act as a authoring and querying tools for LOD sources. Most cultural digital databases are presented in the mode of a database engineering - can the platforms act as story telling (presentation) layers directly drawing from and connected to the LOD sources.

Presentation of CdBB content based on LOD work by NFDI4Culture 'Semantic annotation for 3D cultural artefacts' MVP content: Baroque ceiling painting and frescos Germany from the existing barocken Deckenmalerei in Deutschland (CbDD) collection.

Baroque ceiling painting in Germany

https://www.deckenmalerei.eu/

https://deckenmalerei.badw.de/das-projekt.html

CdBB NFDI4Culture MVP: 'Semantic annotation for 3D cultural artefacts'

Video and slides

Video - MVP

Video - https://youtu.be/FvU6O23Ozyc

Slides - https://zenodo.org/record/5628847

About MVP and data - Semantic annotation for 3D cultural artefacts: About our MVP

GitLab - https://gitlab.com/nfdi4culture/ta1-data-enrichment/kompakkt-docker

Source content: The Dinning Room - https://www.deckenmalerei.eu/42d06165-58e7-4653-bfe4-3d5f7091fc33#7fb9a718-7e18-4053-81d9-3689f3f65548

Goals

Produce a proof of concept publication for Feb '23 from CdBB MVP content
Present the PoC Publication at COPIM Feb 23 conference
Hold workshop with cultural historians to see how Notebook tools could help them tell their stories

Questions

What computational platforms work for publishers?
What's interesting for readers in computational publishing platform?
Road testing platforms: using digital objects; performing LOD queries and visualization; usability, suitability for publishers, readers, and authors?

Testing

Computational Platforms

Work on both at the same time as code should work in each platform set:

Jupyter Notebooks, Jupyter Hub, MyBinder
Curvenote

Objects and processes

Prioritize the following:

Text
Citations
PIDS: ORCID, DOI, ROR
Image
3D Model
Audio
TIB AV video
LOD for the above

Round one content processing would be so that we can demo the platforms to CbDD researcher and get their feedback:

Source content: The Dinning Room - https://www.deckenmalerei.eu/42d06165-58e7-4653-bfe4-3d5f7091fc33#7fb9a718-7e18-4053-81d9-3689f3f65548

Resources

Data model - https://wikibase.wbworkshop.tibwiki.io/wiki/Data_Model

All items with LoD - stored by team MVP

3D models - https://kompakkt.wbworkshop.tibwiki.io/explore

Manual, bulk, automatic LOD and media upload process - get info from

Open Refine term disambiguation guide - https://marketplace.sshopencloud.eu/training-material/90ekdY

Semantics Explainer - https://semanticclimate.github.io/p/en/posts/oaweek_semantic/

Blogpost from #semanticClimate for LOD explainer - https://blogs.tib.eu/wp/tib/2022/10/24/semantifying-the-ipcc-reports-a-hackathon/

Computational Platforms Comparison

The idea is to see if computational publishing platforms could be used as presentation platforms for authors to create publication from LoD collections.

The use case is in architecture with content from the collection of Baroque frescos and ceiling painting in Germany held by the Barocke Deckenmalerei in Deutschland CbDD.

This test use case is based on work being done by Task Area 1 and 5 - TIB/OSL team who are working with the CbDD.

Currently most digital heritage archives are obscured by often necessary database models of presentation - adding an autoring layer to these collection using computational publishing would add useful way to work with these collections.

The antcipated outcomes are:

Using the platforms is a way of rapid prototyping the features you would want in a real-time editor that could access LoD APIs, or / and
The platforms could be used in their own rights as editors.

Give example and make demo:

First as a dummy doc in ADA Pipeline to show what is meant
- The content would be based around replicated an existing work on a (Die Tafelstube / The Dining Room) as there are existing digital objects for this room and there are written descriptions.
Then in one of the platforms below: probably Curvenote, maybe Jupyter Book?
Document and present as computational publication with view on how could be implemented and integrated with the ADA Pipeline.

Platforms

Computational Publishing platform comparison 2022
Platform name	Web address	Example with Die Tafelstube
Colab	https://colab.research.google.com/	Colab
Jupyter Notebook	https://jupyter.org/
JupyterHub
JupyterLab
Binder	https://mybinder.org/	Binder
Voilà
Curvenote	https://curvenote.com/	Rendered Curvenote and edit mode
Quarto	https://quarto.org/	To Do
Jupyter Book / Executable Book	https://jupyterbook.org/ and https://executablebooks.org/	To Do
DeepNote	https://deepnote.com/

GitHub demo page

Collection working notes - https://github.com/mrchristian/computational-publishing-for-culture

Platform 'Academic Markup' tests

These are simple but important tests to see if platforms can handle basic academic markup. A benchmark is already in place here with the ADA Pipeline.

Markup
H 1, 2, 3, 4, 5, 6
Text markup: bold, emphasis, etc
List types and styles
Footnote
Citation / change CSL
Image

Test: Digital Objects

Objects and formatting (need to separate out)		Note
Retrieve amount of text from LOD source.
Text from CbDD webpage	Die Tafelstube
DOI / CSL	Not used here - but we could show off this feature.	Need to find out more about their note and citation style. As it looks mixed.
Thoth biblio info via API	Simon Bowie GitHub	Example outputs	Open Refine disabiguate / reconcile service? Note: not working on localhost.
Footnote		Currently, look hard coded
ORCID info retrieval for authors, add more roles.	Not used at present
Images	LoD storage MVP Images also stored with BAdW Source
Figure, Figure caption, List of figures	Currently images have LOD drop down
Image caption
Deep Image zoom	Source
Compakt model	https://kompakkt.wbworkshop.tibwiki.io/explore
TIB AV Portal video	https://av.tib.eu/	A related art history video segment could be found.
Leaflet map	Currently Open Street Map is used
Wikidata LOD info: Painting (customize)	https://query.wbworkshop.tibwiki.io/

Test: Interactive Queries

Source	Example
Wikidata / WikiBase	Show nine images from specific painter. Display as grid. List specific LoD info fields.
Wikidata / WikiBase	Show all castles on Leaflet map with Baroque paintings as per CbDD entries. Allow for change of artists.
Wikidata Term Annotation - AMI (semanticClimate)	CoLab instructions

Resources

Project

Sandbox Wikibase - https://gitlab.com/paulduchesne/wikibase-sandbox

On the web

Tools list LOD UCLA Library (Oct 22)

Work plan

The idea with the work plan is to deal with small exercised, like adding text and image, then a video and 3D model - and then to work up through from simple objects to more complex objects.

The starting point would be to replicate parts of the catalogue web page Die Tafelstube.

Words 3,800. Characters with blanks 26,976. 14/15 images, LOD per image. Footnotes and references, geolocations.

Die Tafelstube
- Belagerungsszenen des Langen Türkenkriegs an der Decke
- Programm und Synthese der einstigen Tafelstube

The first goal would be to have a presentation for CbDD so that they can be engaged in the project, and we can get their input and find out what interests them.

Evaluate platform for Academic Markup tests.
Make a small report on platforms - The platforms have different functionality, some with presentation formats, PDF generation.
Make a 'dummy prototype' in ADA Pipeline showing the overall idea and examples for different objects and the publication. Make as slide presentation too.

Text and images
Room metadata, LOD
Add Compakt models
Add deep image zoom
Add TIB AV Portal videos
Add PIDs
Annotate terms in Wikidata - Term stemming in Wikidata results in a supporting multilingual discovery
Show bibliographic record from Thoth
Support dynamic citation and footnoting, CSL
Try out queries

Computational Publishing for Architecture

An X-Sketchbook Research Work Plan

Simon Worthington Email: simon.worthington@tib.eu (NextGen Books – making the future book, TIB )

The work plan is for an experimental publishing prototype as part of the X-Sketchbook project exploring digital publishing for architecture. The project is organised by TIB Open Science Lab and the Bartlett School. The aim of the work programme is to produce a small proof of concept publication containing a set of sample digital objects. It is being conducted in the context of NFDI4Culture — the German Consortium for Research Data on Material and Immaterial Cultural Heritage in collaboration with COPIM the Community-Led Open Publication Infrastructures for Monographs project, an international research partnership working on open infrastructures for monographs.

The central question for the project is how can an existing publisher's infrastructures and workflows incorporate computational publishing — the combinations of text and executable code — as applicable to the topic area of architecture.

In addition we will be looking at how 'enhanced publication principles' for open access and FAIR‑publishing can be applied to computational publishing.

#XSketchbook – https://github.com/TIBHannover/xsketch

Keywords:

computational publishingenhanced publishingopen accessopen science publishingacademic publishinginfrastructureNFDI4CultureNFDIarchitectureopen sourceopen standardslinked open datapublishing from archives

Date: 2022-05-10

NFDI4Culture – Consortium for Research Data on Material and Immaterial Cultural Heritage. NFDI4Culture is the consortium within the Nationale Forschungsdateninfrastruktur (NFDI).
COPIM – Community-led Open Publication Infrastructures for Monographs.

Working definitions

Enhanced publication principles for open access and FAIR‑publishing: For enhanced publications the question is how to make a publication open in the context of modern computational and networked systems, and what additional functionality and enrichment can be added. What types of requirements are made of scholarly publishing to be fully open, and how these are implemented in systems or infrastructures. Example features are: PIDs, expanding roles and attribution, linking outputs, etc. The Enhanced Publications project is an activity of NFDI4Culture, Task Area 4. A working paper will be published in June 2022 for consultation on the implementation of enhanced publications. Contributors are: Alexandra Büttner, Ruprecht-Karls-Universität Heidelberg; Matthias Arnold, Ruprecht-Karls-Universität Heidelberg; Jörg Heseler, Sächsische Landesbibliothek – Staats- und Universitätsbibliothek Dresden, and; Simon Worthington, TIB – Leibniz-Informationszentrum Technik und Naturwissenschaften.
Computational publishing (Literate programming): (Knuth, 1992) Computational publishing means to combine both text and advanced computational functionality on a single hypertext document. Jupyter Notebook is one such tool where code, data, and findings can be presented in one instance for speedy verification (the replication crisis in science literature is one reason for Jupiter Notebook's current popularity).

Computational publishing is not new in computing, in ideas of knowledge management, or in the history or imaginations of book technologists and thinkers. In modern computing Alan Kay articulated it as the Dynabook (Odewahn, 2021) in his paper A Personal Computer for Children of All Ages (Kay, 1972) and went on to implement the concept, that same year, in the programming language for children's education called Smalltalk which allowed the editing of parameters to control the behaviour of objects on a screen for teaching concepts in geometry, music, literature, or maths. Kay added an interactive layer to existing ideas of networked knowledge publishing systems, and extensively acknowledges Douglas Engelbart's 'The Mother of All Demos' (Engelbert, 1968). Which in turn can be seen as the idea of the Memex introduced in the 1945 Vannevar Bush essay As We May Think from The Atlantic magazine (Bush, 1945) and then realised by Engelbert as the first networked desktop digital workstation. Engelbart and Bush do not explicitly propose computational publishing, but instead are more focused on ideas of networked knowledge and new interfaces based on their available 'knowledge galaxies'. Earlier analog, pre-computing, networked knowledge systems are for example Paul Otlet's 1934 Traité de Documentation (Otlet, 1934) which describes a paper based and telegraphic systems that we're put in place for cataloguing part of publications, 'Inspired by the arrival of radio, phonograph, cinema, and television' (Reagle, 2012). It has to be noted that Otlet's project sits under a perverse context that was heavily contested even at the time — on the one hand the project was part of a utopian vision of world peace and equality and closely aligned to the formation of the League of Nations, while at the same time directly funded from the profits from the profits of King Leopold II of Belgium's brutal regime in the Congo Free State. Mark Twain wrote at the time, the famous pamphlet King Leopold's Soliloquy (Twain, 1905) which grapically documented the killling and torture in the Congo.

For this project the following computational publishing tools will be used: Jupyter Notebook, Jupyter Book, and Curvenote.
Architecture: We are focusing on architecture as it is the specialist subject of the TIB Library as well an area of research that TIB is contributing to for the NFDI4Culure research programme. NFDI4Culture concerns predominantly relate to the history of architecture and its preservation in digital heritage archives such as Research Group Urban History 4D and historical buildings of Dresden.

In relation to contemporary practice architecture this is being explored in terms of modern studio practice through the X-Sketchbook experimental publishing project which is looking at the architects 'sketchbook' in studio practice and the issues of making a digital skectbook that can record and catalogue heterogeneous digital objects generated from 3D platforms like Sketchfab or game engines like Unity, to objects taken from social media, to robotics, sensors, or AI assisted CNC modelling.
Publication infrastructures: For open publishing using computational publishing and enhanced publishing principles the main questions are how computational books can be integrated and made compatible with existing publishing infrastructures and workflows. The publishing infrastructures being covered will be those that apply to conventional monographs and research papers. The enhanced publication features would be, for example: real-time collaborative editing platforms, linked open data, 3D models, and semantic video, etc. Some of the infrastructures that we will want to integrate or test with the proof-of-concepts will be:
- Open Book Publishers (OBP) COPIM member – conventional open-source publishing infrastructure.
- ADA Pipeline – Multi-format publishing pipeline from the Open Science Lab, TIB – The ADA Pipeline supports the TIB Service – NextGen Books.
- TIB AV Portal – Scholarly semantic video repository and open-source toolkit from TIB
- MVP Semantic annotation for 3D cultural artefacts – WikiData / Wikibase / Kompakkt / OpenRefine – Team NFDI4Culture TA 1 (TIB) – Demonstration video
- Thoth – COPIM partner. Book metadata platform
- Unity – Game engine and 3D Software for Architecture, Engineering & Construction

Motivation

The utopian vision of computational publishing has inspired us with its promise of a better world through the use of universally interconnected knowledge and learning, and how this might potentially be modelled in forms of digital publishing. Work at PARC (Palo Alto Research Center) and on Kay's Dynabook was taken and made into the products of the personal computer and later the tablet computer. But at the same time, in its more than seventy-five years history, computational publishing itself, as a vision and paradigm, has failed to be realised.

The recent prominence of Jupyter Notebook has shown a promising route for exploring computational publishing further as it offers a substantial and flexible publishing framework that a large number of stakeholders have bought into. Yet applying this to a 'traditional' digital and print book publishing workflow will be challenging, on both technological and socio-cultural fronts. This is one of the aspects we would like to explore in this project.

Architecture offers exciting opportunities for computational publishing and many computational features are already being explored within the field. These include: data visualisations and simulations, for the manipulation of design tools for robotics such as component fabrication, and in the presentation and exchange of ideas in 3D multi-modal-media and on social media platforms.

The following 2019 conference video Ubiquity and Autonomy from the Association for Computer Aided Design in Architecture gives a clear idea of modern design challenges for architecture.

Enhanced Publishing principles and testing implementation plans can play an important role in contributing to the sustainability and to real-world working models of computational publishing to support its use on monographs and research publications.

Work plan

The overall purpose of the work plan is to produce a sample book for further research of the types of computational objects that could be used in architecture and open up questions for publishers and publishing technologists.

Set up work plan with the communities involved: NFDI4Culture TA4 Data Publishing, COPIM, TIB NFDI4Culture team including NFDI4Culture TA1 'Semantic annotation for 3D cultural artefacts MVP', and others.

Scope: Review the proposed steps with the community.
1. Select enhanced publishing features to test.
1. Select architectural computational objects to include.
2. Define uses cases and personas (users). The use cases will from NFDI4Culture architectural digital heritage projects, and from X-Sketchbook developed with the Bartlett School and TIB Open Science Lab.
3. Define workflow for publishing infrastructures in use at OBP, and at the ADA Pipeline.
4. Write an initial blogpost outlining project and possibilities of and issues in computational publishing for monograph publishers (COPIM).
5. Write a blogpost on 'What is a Computational Book' (COPIM).
Proof-of-concept demo:
1. Execute part one 'Scope' in the OBP and ADA publishing infrastructures and produce a demonstration of the workflows and example publication outputs. The contents would be technical proof-of-concept examples for demonstration purposes.
2. The proof-of-concept would be run as a public-facing open demonstration for the purpose of community engagement.
3. Write blogpost on the role of the publisher in Computational Publishing (COPIM)
Demonstration mockups:
1. Bring on board two partner projects, the Bartlett and NFDI4Culture from architecture to create demonstration mockups with real publication content.
  1. One mockup would be for contemporary architecture – the Bartlett
  2. One mockup for historical architecture – NFDI4Culture
  3. Write blogpost reflecting on experiences of authors/communities around Computational Publishing (COPIM)
  4. Either organise a stand alone workshop or as part of an NFDI4Culture or COPIM bigger workshop

Schedule

April/May 2022

Finalise work plan

June/July 2022

Finalise scope and use cases

September 2022

Produce Proof-of-concept publication

Oct / Nov / Dec 2022

Demonstrations; Organise or participate in workshop

Communities

COPIM
NFDI4Culture
Open Source in Architecture
Single Source Publishing Community (SSPC)
NFDI4Culture
The Bartlett
Curvenote
Jupyter Notebook
Jupyter Book

Planned outputs

Project outline: Work plan
Blogposts: Announce, and; others
Public engagement using mockup building: Proof-of-concept; Demonstrations
Preprint and paper: Wikiversity
Workshop (participation and planning) NFDI4Culure – TIB or NFDI context Winter 2022
Paper: The Journal of Open Source Software, or Journal of Open Source Education

Prototyping Computational Publishing for Culture

An NFDI4Culture and COPIM collaboration

by Simon Worthington

v0.3

Use case

Baroque ceiling painting in Germany

CdBB NFDI4Culture MVP: 'Semantic annotation for 3D cultural artefacts'

Video and slides

Goals

Questions

Testing

Computational Platforms

Objects and processes

Resources

Give example and make demo:

Platforms

GitHub demo page

Platform 'Academic Markup' tests

Test: Digital Objects

Test: Interactive Queries

Resources

Project

On the web

Work plan

An X-Sketchbook Research Work Plan

Working definitions

Motivation

Work plan

Schedule

Communities

Planned outputs

Bibliography