Category Archives: Knowledge Systems

Datapkg 0.7 Released

A major new release (v0.7) of datapkg is out! Release: Docs: There’s a quick getting started section below (also see the docs). About the release This release brings major new functionality to datapkg especially in regard to its integration with CKAN. datapkg now supports uploading as well as downloading and can now be […]

Also posted in Code, Open Data, Open Knowledge Foundation, Python, Shuttleworth Fellow | 1 Response

Versioning / Revisioning for Data, Databases and Domain Models: Copy-on-Write and Diffs

There are several ways to implement revisioning (versioning) of domain model and Databases and data generally): Copy on write – so one has a ‘full’ copy of the model/DB at each version. Diffs: store diffs between versions (plus, usually, a full version of the model at a given point in time e.g. store HEAD) In […]

Also posted in Code, Open Knowledge Foundation, Python, RDF, Software | 1 Response

Author “Significance” From Catalogue Data

Continues the series of post related to analyzing catalogue data, here are some stats on author “significance” as measured by the number of book entries (‘items’) for that author in the Cambridge University Library catalogue from 1400-1960 (there being 1m+ such entries). I’ve termed this measure “significance” (with intentional quotes) as it co-mingles a variety […]

Also posted in Culture and Society, Data Digging, Economics, EUPD, Own Work | 2 Responses

Exploring Patterns of Knowledge Production

I’m posting up some work-in-progress entitled Exploring Patterns of Knowledge Production (link to full pdf) that follows up to my earlier post of a year and a bit ago. Below I’ve excerpted the introduction plus list of motivational questions. Comments (and critique) very welcome! Exploring Patterns of Knowledge Production Paper ‘Alpha’ (pdf) Introduction In what […]

Also posted in Academic, Economics, Innovation and Intellectual Property, Own Work, Papers, Work In Progress | Leave a comment

Exploring Patterns of Knowledge Production

A definition: the term ‘knowledge’ is here used broadly to signify all forms of information production including those involved in technological innovation, cultural creativity and academic advance. Largely as a result of better ICT we now have available some very substantial datasets regarding both the extent and structure of knowledge production across different jurisdictions and […]

Also posted in Academic, Economics, Innovation and Intellectual Property, Own Work, Papers | 1 Response

Overlord: D-Day and the Battle for Normandy 1944 by Max Hastings

7.5/10. Finished a few weeks ago this is another (rather earlier) example of Hastings’ skill in writing penetrating and engaging military history, as well as his willingness to be critical of existing ‘sacred cows’. Among other things Hastings: Argues that the famous Mulberrys were probably a waste of time and resources. Shows how the Air […]

Also posted in Books, History | 1 Response

Path-Dependent vs. Ergodic Systems

Consider a metal arm fixed by a pin. If it is hung vertically then the arm, no matter where it starts, will always end up in the same position. However, if you fix the arm (perfectly) horizontally it will stay forever in its initial position. The first case is ergodic: we converge independent of the […]

Also posted in Economics, History, Musings | Leave a comment

Versioned Domain Models

I’ve been thinking about how to have a versioned domain model similar to the way we have versioned filesystems (e.g. subversion) for over two years. Over the last few months whatever bits of free time I’ve had have gone into developing a prototype built on top of sqlobject and I’ve now got a rough and […]

Also posted in Python, Software | Leave a comment

The Robustness Principle

2.10. Robustness Principle TCP implementations will follow a general principle of robustness: be conservative in what you do, be liberal in what you accept from others. Source: rfc793: specification for TCP date: 1981 editor: Jon Postel url:

Also posted in Miscellaneous | Leave a comment

Thinking about Annotation

Annotation means the adding of comments/notes/etc to an underlying resource. For the present I’ll focus on the situation where the underlying resource is textual (as opposed to being an image, or a piece of film or some data). Various things to consider when implementing an annotation/comment system: Addressing and atomisation: Are annotations specific to particular […]

Also posted in Software | Leave a comment