Found in a review by Gary Will’s of Taylor Branch’s At Canaan’s Edge: America in the King Years, 1965-1968 in the NYRB (2006-04-06, p. 20):
It is amazing how Branch can marshal so much material along so many tracks, moving it ahead stage by stage in coordination with King’s actions. The I saw Branch [...]
We believe that we invent symbols. The truth is that they invent us. Gene. M. Wolfe, The Book of the New Sun.
1. Uncodified knowledge cannot be transferred except by f2f interaction (apprenticeship etc)
2. But knowledge codification is very time and space consuming (and much still remains implicit)
3. As the amount of codified knowledge grows it becomes harder to find what you want
Hypothesis: Value of Information in Databank = Value of Information if it could be Accessed Perfectly x Ease of Finding Any Particular Item
Plausible to assume Ease of Finding Information = h(Amount) where h’ < 0. * Let Amount = n * In standard Computer Science if we could sort items in some manner (by which we could also search). h(N) = log(n) (and sorting costs are n log(n) – bubble sort) [[TODO: check this]]
We are moving towards a world in which all information is software and all software is information
Plan
- We process information linearly. This is a fundamental fact. (Aside: example of polyphonic music and the Glenn Gould radio program). Symbol processing in home sapiens is serial and cannot manage either parallel or non-linear presentation. Particularly textual symbol processing. This is not only related to the methods by which humans obtain sensory input but derives from the very structure or high level information processing in the brain. This is manifested very clearly in language.
- thus even where information is presented non-linearly, or more commonly in parallel, we still create our own linear thread as we progress through it. A concrete example is given by the internet or by encylcopedias. Though both examples present a web of information rather than an explicit linear narrative the human mind cannot branch multiply in any literal sense. Thus as I progress through a website or an encylcopedia though I may branch I then leave the original line of investigation – perhaps to return later.
- Given this fact that we can only read along one dimension at once we see the great challenge or all analytical writing, namely to present in single-dimensional linear form, that which is always multidimensional and non-linear.
- Thus we are presented with a dilemma. Much knowledge and information is multi-faceted, approachable from many different angles simultaneously, yet if it is to be understood and processed by humans it must be presented serially, that is to say linearly along a single path. Now I do not suggest that we can overcome these inherent limitations but I do suggest that we can approach knowledge storage and categorization in such a way as to impose the minimal limits on the possible methods of presentation.
The Metaphor
We can imagine the building blocks, the factlets, as pearls, little pearls of knowledge. We can then imagine the creation of an expository line, or narrative if we allow ourselves to abuse terminology, as the stringing of these pearls onto the thread – the thread of narrative – which when complete provides a ‘necklace’ of exposition (NB: though we should avoid seeing any cyclical structure in analogy with the circular necklace as it is more usual for a exposition to resemble an interval with a beginning and end and a direction of progression).
Other Items
The multiple classification problem. Analogies and examples:
Introduction
I propose the following hierarchy: data — information — knowledge. Where items in one category are refined and filtered in the process of going to the next.
Short Quotes
Information is not knowledge. Knowledge is not wisdom. Wisdom is not truth. Truth is not beauty. Beauty is not love. Love is not music. Music is the best.
Source: Frank Zappa, Album: Joe’s Garage, Track: Packard Goose
<
blockquote>
Where is the wisdom we have lost in knowledge?
Where is the knowledge we have in information?
Is there a standard data format for taxonomies/classification systems. Should include a specification of text encoding (like LDIF but for taxonomies). If there is I would guess there will be open source implementations (and if not won’t be that hard to write one’s own).
Requirements:
-
Type of taxonmy:
- Enumerations (flat)
- Tree (single parent)
- Lattice (multiple parent)
- Identifiers. Support for at least 10 million possible elements in taxonomy. Optional: Identifiers should be portable across systems (i.e. you can plug different taxonomies together without recoding identifiers). This means probably want a GUID based id system). Required: basic int32 or int64 based identifiers.
Found So Far
- DELTA http://biodiversity.uno.edu/delta/. Seem to be primarily for standard tree taxonomies for animals and plants.
Written Myself
Two taxonomy systems with gui editors and serialization to xml. One in C# and the other in java. Major issue is non-stdness.
Categories
- *nix (9)
- Academic (73)
- Activity Updates (20)
- Books (55)
- Cinema (112)
- Code (15)
- Command Line (9)
- Copyright (23)
- Culture and Society (27)
- Data Digging (4)
- Economics (155)
- EUPD (10)
- External (3)
- Filesharing (11)
- Governance (14)
- Hacks (21)
- Happiness (16)
- Hardware (1)
- History (15)
- Innovation and Intellectual Property (66)
- Intellectual Myths (12)
- Javascript (3)
- Knowledge Systems (17)
- Miscellaneous (17)
- Musings (15)
- Notes (14)
- Open Bibliographic Data (2)
- Open Data (29)
- Open Knowledge Foundation (66)
- Openness (41)
- Own Work (59)
- Papers (4)
- People (1)
- Photos (1)
- Platforms (1)
- Poetry (1)
- Policy (18)
- PSI (8)
- Python (24)
- Quote (1)
- RDF (2)
- Shuttleworth Fellow (56)
- Software (47)
- Sysadmin (2)
- Talks (34)
- Transaction Costs (5)
- Work In Progress (2)


