Gladney Preserving Digital Information
2007
ISBN: 978-3-540-37887-7
Verlag: Springer
Format: PDF
Kopierschutz: 1 - PDF Watermark
E-Book, Englisch, 326 Seiten, eBook
ISBN: 978-3-540-37887-7
Verlag: Springer
Format: PDF
Kopierschutz: 1 - PDF Watermark
Cultural history enthusiasts have asserted the urgent need to protect digital information from imminent loss. This book describes methodology for long-term preservation of all kinds of digital documents. It justifies this methodology using 20 century theory of knowledge communication, and outlines the requirements and architecture for the software needed. The author emphasizes attention to the perspectives and the needs of end users.
Zielgruppe
Professional/practitioner
Autoren/Hrsg.
Weitere Infos & Material
Part I: Digital Preservation - Why Do We Need it ? 1) State of the Art - 2) Economic Trends and Social Issues Part II: Information Object Structure 3) Introduction to Knowledge Theory - 4) Lessons from Scientific Philosophy - 5) Truth and Authenticity - 6) Describing Information Structure Part III: Distributed Content Management 7) Digital Object Formats - 8) Archiving Practices - 9) Everyday Digital Content Management Part IV: Digital Object Architecture for the Long Term 10) Durable Bit-Strings and Catalogs - 11) Durable Evidence - 12) Durable Representation Part V: Peroration 11) Assessment of the Future
12 Durable Representation (p. 235-236)
We want unambiguous communication with future generations with whom dialog is impossible, without restricting what today’s authors can communicate. For this, we need language that we can confidently expect our descendants to understand easily. This challenge is the kind of language problem that has been central to computer science since it emerged as a discipline in the 1960s. Its core can be restated as, "ensure that an arbitrary computer program will execute correctly on a machine whose architecture is unknown when the program is saved."
The English logician A. M. Turing showed in 1937 (and various computing machine experts have put this into practice since then in various particular ways) that it is possible to develop code instruction systems for a computing machine which cause it to behave as if it were another, specified, computing machine. …
A code, which according to Turing's schema is supposed to make one machine behave as if it were another specific machine … must do the following things. It must contain, in terms that the machine will understand and (purposively obey), instructions … that will cause the machine to examine every order it gets and determine whether this order has the structure appropriate to an order of the second machine. It must then contain, in terms of the order system of the first machine, sufficient orders to make the machine cause the actions to be taken that the second machine would have taken under the influence of the order in question.
The important result of Turing's is that in this way the first machine can be caused to imitate the behavior of any other machine. von Neumann 1956, The Computer and the Brain, pp.70–71
Durable encoding, described in this chapter, represents difficult content types with the aid of programs written in virtual machine code - the code of a machine we call a UVC (Universal Virtual Computer). This Turing- Machine-equivalent virtual machine is simple compared to the designs of practical hardware. Its design can be specified completely, concisely, and unambiguously for future interpretation.
Objects to be preserved might consist of several source files, each represented as a bit-stream in a Fig. 32 digital object collection, with labeled links between parts of the complete package. Much of each TDO will be encoded using XML, relations, encryption algorithms, and identifiers. These are governed by relatively simple standards that are widely used - standards that we can be reasonably confident will be completely and correctly understood many years into the future. As described in §11.1, metadata can, and should, record the representation of each TDO component. The means for making each Fig. 32 content blob interpretable forever remains to be provided. What follows describes how this can be accomplished for a single content blob.
12.1 Representation Alternatives
We want information representation methods that can be embodied in tools whose use would be practical for information producers and consumers who do not have specialized skills or equipment.




