humankind creates 5 exabytes of stored data — in paper, the equivalent of 500,000 new Libraries of Congress each year. More than 90 percent of those 5 exabytes were stored on a hard disk.
Designing and Developing Digital Repositories Using Open Solutions A Strategy for Success
Given the increasing volume of digital content coming under the administration of libraries and information organizations, new solutions are called for to enable its management. Carl Grant considers some of these, along with strategies that are useful in the planning and development of digital collections in his white paper Delivering Digital Repositories with Open Solutions.
Grant cites an estimate that puts the current digital collections management issue in clear perspective: "According to the one estimate 'humankind creates 5 exabytes of stored data — in paper, the equivalent of 500,000 new Libraries of Congress each year. More than 90 percent of those 5 exabytes were stored on a hard disk.'"
The management of large scale repositories holding these materials involves meeting several criteria that Grant outlines:
1. Objects must be always retrievable (Access).
2. Cost and complexity as systems scale (Economic sustainability).
3. Finding data through sophisticated metadata handling (Discovery).
4. Data integrity must be assured (Trust).
5. Seamless scaling must be provided (Scale & extensibility).
According to Grant, in implementing repository technology, libraries need to address four areas of preservation and archiving that have been identified by Wheatley (2004) as essential, particularly in planning stages:
1. Data can be maintained without being lost, damaged or altered,
2. Data can be found and extracted for or by a user
3. Data can be interpreted and understood by the end user.
4. That objectives 1-3 can be achieved in perpetuity.
Meeting these criteria consistently can be threatened by such factors as media failure, hardware and software failure and operator error, though Grant notes that Sun StorageTek 5800 array incorporates technology to address both media and hardware failure possibilities.
Grant lists several steps that he recommends institutions follow in architecting a digital repository solution:
1. Understand the community of users; examine budgeting necessary to fulfill the users' needs; understand rights management issues; understand the metadata that will be utilized and develop an overall marketing and launch plan for when repository services are in place. Sun offers consultative services in this area, Grant adds.
2. Match the established functional requirements to the architectural requirements for hardware and software; select server, operating system, data center and mid-range or workgroup disk.
3. Consider the Sun StorageTek 5800 system when repository architecture calls for addressable storage solutions.
4. Download any open source solutions to be used, or work with vendors in the case of any proprietary solutions.
5. Configure the hardware/software using the built-in capabilities, start the repository and begin deploying digital repository services to the organization per the marketing/roll out plan developed earlier.
[...read more...]
Customized news reports about Sun Microsystems. Just the news you need, none of what you don't. 50,000+ Members. 20,000+ Articles Published since 1998.