-
Prototype publication (work in progress) Computational Publishing for Collections
Published by: https://write.handbuch.io/books/computational-publishing/
Last updated: 2023-02-22
Created: 2022-10-11
Language: English (United Kingdom)
Created by: Simon Worthington
Prototype publication (work in progress) Computational Publishing for Collections
Proof of concept publication on a computational publishing platform.
Motivation: Can 'computational publishing platform' act as a authoring and querying tools for LOD sources. Most cultural digital databases are presented in the mode of a database engineering - can the platforms act as story telling (presentation) layers directly drawing from and connected to the LOD sources.
Presentation of CdBB content based on LOD work by NFDI4Culture 'Semantic annotation for 3D cultural artefacts' MVP content: Baroque ceiling painting and frescos Germany from the existing barocken Deckenmalerei in Deutschland (CbDD) collection.
https://deckenmalerei.badw.de/das-projekt.html
Video - MVP
Video - https://youtu.be/FvU6O23Ozyc
Slides - https://zenodo.org/record/5628847
About MVP and data - Semantic annotation for 3D cultural artefacts: About our MVP
GitLab - https://gitlab.com/nfdi4culture/ta1-data-enrichment/kompakkt-docker
Source content: The Dinning Room - https://www.deckenmalerei.eu/42d06165-58e7-4653-bfe4-3d5f7091fc33#7fb9a718-7e18-4053-81d9-3689f3f65548
Produce a proof of concept publication for Feb '23 from CdBB MVP content
Present the PoC Publication at COPIM Feb 23 conference
Hold workshop with cultural historians to see how Notebook tools could help them tell their stories
What computational platforms work for publishers?
What's interesting for readers in computational publishing platform?
Road testing platforms: using digital objects; performing LOD queries and visualization; usability, suitability for publishers, readers, and authors?
Work on both at the same time as code should work in each platform set:
Jupyter Notebooks, Jupyter Hub, MyBinder
Curvenote
Prioritize the following:
Text
Citations
PIDS: ORCID, DOI, ROR
Image
3D Model
Audio
TIB AV video
LOD for the above
Round one content processing would be so that we can demo the platforms to CbDD researcher and get their feedback:
Source content: The Dinning Room - https://www.deckenmalerei.eu/42d06165-58e7-4653-bfe4-3d5f7091fc33#7fb9a718-7e18-4053-81d9-3689f3f65548
Data model - https://wikibase.wbworkshop.tibwiki.io/wiki/Data_Model
All items with LoD - stored by team MVP
3D models - https://kompakkt.wbworkshop.tibwiki.io/explore
Manual, bulk, automatic LOD and media upload process - get info from
Open Refine term disambiguation guide - https://marketplace.sshopencloud.eu/training-material/90ekdY
Semantics Explainer - https://semanticclimate.github.io/p/en/posts/oaweek_semantic/
Blogpost from #semanticClimate for LOD explainer - https://blogs.tib.eu/wp/tib/2022/10/24/semantifying-the-ipcc-reports-a-hackathon/
The idea is to see if computational publishing platforms could be used as presentation platforms for authors to create publication from LoD collections.
The use case is in architecture with content from the collection of Baroque frescos and ceiling painting in Germany held by the Barocke Deckenmalerei in Deutschland CbDD.
This test use case is based on work being done by Task Area 1 and 5 - TIB/OSL team who are working with the CbDD.
Currently most digital heritage archives are obscured by often necessary database models of presentation - adding an autoring layer to these collection using computational publishing would add useful way to work with these collections.
The antcipated outcomes are:
Using the platforms is a way of rapid prototyping the features you would want in a real-time editor that could access LoD APIs, or / and
The platforms could be used in their own rights as editors.
First as a dummy doc in ADA Pipeline to show what is meant
The content would be based around replicated an existing work on a (Die Tafelstube / The Dining Room) as there are existing digital objects for this room and there are written descriptions.
Then in one of the platforms below: probably Curvenote, maybe Jupyter Book?
Document and present as computational publication with view on how could be implemented and integrated with the ADA Pipeline.
Platform name |
Web address |
Example with Die Tafelstube |
Comments |
---|---|---|---|
Colab |
|||
Jupyter Notebook |
|||
JupyterHub |
|||
JupyterLab |
|||
Binder |
|||
Voilà |
|||
Curvenote |
|||
Quarto |
To Do |
||
Jupyter Book / Executable Book |
To Do |
||
DeepNote |
|||
Collection working notes - https://github.com/mrchristian/computational-publishing-for-culture
These are simple but important tests to see if platforms can handle basic academic markup. A benchmark is already in place here with the ADA Pipeline.
Markup |
|||
---|---|---|---|
H 1, 2, 3, 4, 5, 6 |
|||
Text markup: bold, emphasis, etc |
|||
List types and styles |
|||
Footnote |
|||
Citation / change CSL |
|||
Image |
|||
Objects and formatting (need to separate out) |
Note |
||
---|---|---|---|
Retrieve amount of text from LOD source. |
|||
Text from CbDD webpage |
|||
DOI / CSL |
Not used here - but we could show off this feature. |
Need to find out more about their note and citation style. As it looks mixed. |
|
Thoth biblio info via API |
Open Refine disabiguate / reconcile service? Note: not working on localhost. |
||
Footnote |
Currently, look hard coded |
||
ORCID info retrieval for authors, add more roles. |
Not used at present |
||
Images |
Images also stored with BAdW Source |
||
Figure, Figure caption, List of figures |
Currently images have LOD drop down |
||
Image caption |
|||
Deep Image zoom |
|||
Compakt model |
|||
TIB AV Portal video |
A related art history video segment could be found. |
||
Leaflet map |
Currently Open Street Map is used |
||
Wikidata LOD info: Painting (customize) |
|||
Source |
Example |
||
---|---|---|---|
Wikidata / WikiBase |
Show nine images from specific painter. Display as grid. List specific LoD info fields. |
||
Wikidata / WikiBase |
Show all castles on Leaflet map with Baroque paintings as per CbDD entries. Allow for change of artists. |
||
Wikidata Term Annotation - AMI (semanticClimate) |
|||
Sandbox Wikibase - https://gitlab.com/paulduchesne/wikibase-sandbox
Tools list LOD UCLA Library (Oct 22)
The idea with the work plan is to deal with small exercised, like adding text and image, then a video and 3D model - and then to work up through from simple objects to more complex objects.
The starting point would be to replicate parts of the catalogue web page Die Tafelstube.
Words 3,800. Characters with blanks 26,976. 14/15 images, LOD per image. Footnotes and references, geolocations.
Belagerungsszenen des Langen Türkenkriegs an der Decke
Belagerung I: „Vestung Tottis, wie die von den Christen bei der Nacht erobert wo
Belagerung II: „Vestung Gran wie die von Christen belegert gewesen. 1594“
Belagerung III: „Vestung Raab, wie die vom Türcken belegert gewesen. A[nn]o 1594
Belagerung IV: „Vestung Comorna wie die vom Türckn belegert gewe[sen] 1594“
Belagerung V: „Vestung Gran wie die von den Christen wider erobert worden. A[nn]
Belagerung VI: “Vestung Vizzegrad wie die von Christen belegert gewesen Anno 159
Belagerung VII: „Statt Waitzen wie die von vom Türcken belegert gewesen 1597“
Belagerung VIII: „Vestung Raab, die Christen beÿ der Nacht wider erobert. A[nn]o
Belagerung IX: „Hauptstatt Offen. wie die von Christen belegert gewesen. 1598.“
Belagerung X: „Hauptstatt Offen, wie die von Christen belegert gewesen. Anno 160
Belagerung XII: „Vestung Gran wie die vom Türcken belegert gewesen A[nn]o 1604“
Belagerung XI: „Hauptstatt Offen, wie die von Christn belegert gewesen, ein Schä
The first goal would be to have a presentation for CbDD so that they can be engaged in the project, and we can get their input and find out what interests them.
Evaluate platform for Academic Markup tests.
Make a small report on platforms - The platforms have different functionality, some with presentation formats, PDF generation.
Make a 'dummy prototype' in ADA Pipeline showing the overall idea and examples for different objects and the publication. Make as slide presentation too.
Text and images
Room metadata, LOD
Add Compakt models
Add deep image zoom
Add TIB AV Portal videos
Add PIDs
Annotate terms in Wikidata - Term stemming in Wikidata results in a supporting multilingual discovery
Show bibliographic record from Thoth
Support dynamic citation and footnoting, CSL
Try out queries
The work plan is for an experimental publishing prototype as part of the X-Sketchbook project exploring digital publishing for architecture. The project is organised by TIB Open Science Lab and the Bartlett School. The aim of the work programme is to produce a small proof of concept publication containing a set of sample digital objects. It is being conducted in the context of NFDI4Culture — the German Consortium for Research Data on Material and Immaterial Cultural Heritage in collaboration with COPIM the Community-Led Open Publication Infrastructures for Monographs project, an international research partnership working on open infrastructures for monographs.
The central question for the project is how can an existing publisher's infrastructures and workflows incorporate computational publishing — the combinations of text and executable code — as applicable to the topic area of architecture.
In addition we will be looking at how 'enhanced publication principles' for open access and FAIR‑publishing can be applied to computational publishing.
#XSketchbook – https://github.com/TIBHannover/xsketch
Keywords:
Date: 2022-05-10
© The Authors, Creative Commons: Attribution-ShareAlike 4.0 International (CC BY SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
NFDI4Culture – Consortium for Research Data on Material and Immaterial Cultural Heritage. NFDI4Culture is the consortium within the Nationale Forschungsdateninfrastruktur (NFDI).
COPIM – Community-led Open Publication Infrastructures for Monographs.
Enhanced publication principles for open access and FAIR‑publishing: For enhanced publications the question is how to make a publication open in the context of modern computational and networked systems, and what additional functionality and enrichment can be added. What types of requirements are made of scholarly publishing to be fully open, and how these are implemented in systems or infrastructures. Example features are: PIDs, expanding roles and attribution, linking outputs, etc. The Enhanced Publications project is an activity of NFDI4Culture, Task Area 4. A working paper will be published in June 2022 for consultation on the implementation of enhanced publications. Contributors are: Alexandra Büttner, Ruprecht-Karls-Universität Heidelberg; Matthias Arnold, Ruprecht-Karls-Universität Heidelberg; Jörg Heseler, Sächsische Landesbibliothek – Staats- und Universitätsbibliothek Dresden, and; Simon Worthington, TIB – Leibniz-Informationszentrum Technik und Naturwissenschaften.
Computational publishing (Literate programming): (Knuth, 1992) Computational publishing means to combine both text and advanced computational functionality on a single hypertext document. Jupyter Notebook is one such tool where code, data, and findings can be presented in one instance for speedy verification (the replication crisis in science literature is one reason for Jupiter Notebook's current popularity).
Computational publishing is not new in computing, in ideas of knowledge management, or in the history or imaginations of book technologists and thinkers. In modern computing Alan Kay articulated it as the Dynabook (Odewahn, 2021) in his paper A Personal Computer for Children of All Ages (Kay, 1972) and went on to implement the concept, that same year, in the programming language for children's education called Smalltalk which allowed the editing of parameters to control the behaviour of objects on a screen for teaching concepts in geometry, music, literature, or maths. Kay added an interactive layer to existing ideas of networked knowledge publishing systems, and extensively acknowledges Douglas Engelbart's 'The Mother of All Demos' (Engelbert, 1968). Which in turn can be seen as the idea of the Memex introduced in the 1945 Vannevar Bush essay As We May Think from The Atlantic magazine (Bush, 1945) and then realised by Engelbert as the first networked desktop digital workstation. Engelbart and Bush do not explicitly propose computational publishing, but instead are more focused on ideas of networked knowledge and new interfaces based on their available 'knowledge galaxies'. Earlier analog, pre-computing, networked knowledge systems are for example Paul Otlet's 1934 Traité de Documentation (Otlet, 1934) which describes a paper based and telegraphic systems that we're put in place for cataloguing part of publications, 'Inspired by the arrival of radio, phonograph, cinema, and television' (Reagle, 2012). It has to be noted that Otlet's project sits under a perverse context that was heavily contested even at the time — on the one hand the project was part of a utopian vision of world peace and equality and closely aligned to the formation of the League of Nations, while at the same time directly funded from the profits from the profits of King Leopold II of Belgium's brutal regime in the Congo Free State. Mark Twain wrote at the time, the famous pamphlet King Leopold's Soliloquy (Twain, 1905) which grapically documented the killling and torture in the Congo.
For this project the following computational publishing tools will be used: Jupyter Notebook, Jupyter Book, and Curvenote.
Architecture: We are focusing on architecture as it is the specialist subject of the TIB Library as well an area of research that TIB is contributing to for the NFDI4Culure research programme. NFDI4Culture concerns predominantly relate to the history of architecture and its preservation in digital heritage archives such as Research Group Urban History 4D and historical buildings of Dresden.
In relation to contemporary practice architecture this is being explored in terms of modern studio practice through the X-Sketchbook experimental publishing project which is looking at the architects 'sketchbook' in studio practice and the issues of making a digital skectbook that can record and catalogue heterogeneous digital objects generated from 3D platforms like Sketchfab or game engines like Unity, to objects taken from social media, to robotics, sensors, or AI assisted CNC modelling.
Publication infrastructures: For open publishing using computational publishing and enhanced publishing principles the main questions are how computational books can be integrated and made compatible with existing publishing infrastructures and workflows. The publishing infrastructures being covered will be those that apply to conventional monographs and research papers. The enhanced publication features would be, for example: real-time collaborative editing platforms, linked open data, 3D models, and semantic video, etc. Some of the infrastructures that we will want to integrate or test with the proof-of-concepts will be:
Open Book Publishers (OBP) COPIM member – conventional open-source publishing infrastructure.
ADA Pipeline – Multi-format publishing pipeline from the Open Science Lab, TIB – The ADA Pipeline supports the TIB Service – NextGen Books.
TIB AV Portal – Scholarly semantic video repository and open-source toolkit from TIB
MVP Semantic annotation for 3D cultural artefacts – WikiData / Wikibase / Kompakkt / OpenRefine – Team NFDI4Culture TA 1 (TIB) – Demonstration video
Thoth – COPIM partner. Book metadata platform
Unity – Game engine and 3D Software for Architecture, Engineering & Construction
The utopian vision of computational publishing has inspired us with its promise of a better world through the use of universally interconnected knowledge and learning, and how this might potentially be modelled in forms of digital publishing. Work at PARC (Palo Alto Research Center) and on Kay's Dynabook was taken and made into the products of the personal computer and later the tablet computer. But at the same time, in its more than seventy-five years history, computational publishing itself, as a vision and paradigm, has failed to be realised.
The recent prominence of Jupyter Notebook has shown a promising route for exploring computational publishing further as it offers a substantial and flexible publishing framework that a large number of stakeholders have bought into. Yet applying this to a 'traditional' digital and print book publishing workflow will be challenging, on both technological and socio-cultural fronts. This is one of the aspects we would like to explore in this project.
Architecture offers exciting opportunities for computational publishing and many computational features are already being explored within the field. These include: data visualisations and simulations, for the manipulation of design tools for robotics such as component fabrication, and in the presentation and exchange of ideas in 3D multi-modal-media and on social media platforms.
The following 2019 conference video Ubiquity and Autonomy from the Association for Computer Aided Design in Architecture gives a clear idea of modern design challenges for architecture.
Enhanced Publishing principles and testing implementation plans can play an important role in contributing to the sustainability and to real-world working models of computational publishing to support its use on monographs and research publications.
The overall purpose of the work plan is to produce a sample book for further research of the types of computational objects that could be used in architecture and open up questions for publishers and publishing technologists.
Set up work plan with the communities involved: NFDI4Culture TA4 Data Publishing, COPIM, TIB NFDI4Culture team including NFDI4Culture TA1 'Semantic annotation for 3D cultural artefacts MVP', and others.
Scope: Review the proposed steps with the community.
Select enhanced publishing features to test.
Select architectural computational objects to include.
Define uses cases and personas (users). The use cases will from NFDI4Culture architectural digital heritage projects, and from X-Sketchbook developed with the Bartlett School and TIB Open Science Lab.
Define workflow for publishing infrastructures in use at OBP, and at the ADA Pipeline.
Write an initial blogpost outlining project and possibilities of and issues in computational publishing for monograph publishers (COPIM).
Write a blogpost on 'What is a Computational Book' (COPIM).
Proof-of-concept demo:
Execute part one 'Scope' in the OBP and ADA publishing infrastructures and produce a demonstration of the workflows and example publication outputs. The contents would be technical proof-of-concept examples for demonstration purposes.
The proof-of-concept would be run as a public-facing open demonstration for the purpose of community engagement.
Write blogpost on the role of the publisher in Computational Publishing (COPIM)
Demonstration mockups:
Bring on board two partner projects, the Bartlett and NFDI4Culture from architecture to create demonstration mockups with real publication content.
One mockup would be for contemporary architecture – the Bartlett
One mockup for historical architecture – NFDI4Culture
Write blogpost reflecting on experiences of authors/communities around Computational Publishing (COPIM)
Either organise a stand alone workshop or as part of an NFDI4Culture or COPIM bigger workshop
April/May 2022
Finalise work plan
June/July 2022
Finalise scope and use cases
September 2022
Produce Proof-of-concept publication
Oct / Nov / Dec 2022
Demonstrations; Organise or participate in workshop
Single Source Publishing Community (SSPC)
Project outline: Work plan
Blogposts: Announce, and; others
Public engagement using mockup building: Proof-of-concept; Demonstrations
Preprint and paper: Wikiversity
Workshop (participation and planning) NFDI4Culure – TIB or NFDI context Winter 2022
Paper: The Journal of Open Source Software, or Journal of Open Source Education