Detecting data and schema changes in scientific documents [electronic resource].
- Published
- Washington, D.C : United States. Dept. of Energy. Office of the Assistant Secretary for Defense Programs, 1999.
Oak Ridge, Tenn. : Distributed by the Office of Scientific and Technical Information, U.S. Dept. of Energy.
- Physical Description
- 1.3 Megabytes pages : digital, PDF file
- Additional Creators
- Lawrence Livermore National Laboratory, United States. Department of Energy. Office of the Assistant Secretary for Defense Programs, and United States. Department of Energy. Office of Scientific and Technical Information
Access Online
- Restrictions on Access
- Free-to-read Unrestricted online access
- Summary
- Data stored in a data warehouse must be kept consistent and up-to-date with the underlying information sources. By providing the capability to identify, categorize and detect changes in these sources, only the modified data needs to be transferred and entered into the warehouse. The alternative, periodically reloading from scratch, is inefficient. When the schema of an information source changes, all components that interact with, or make use of, data originating from that source must be updated to conform to the new schema. In this paper, the authors present an approach to detecting data and schema changes in scientific documents. Scientific data is of particular interest because it is normally stored as semi-structured documents, and it incurs frequent schema updates. The authors address the change detection problem by detecting data and schema changes between two versions of the same semi-structured document. The paper presents a graph representation of semi-structured documents and their schema before describing the authors' approach to detecting changes while parsing the document. It also discusses how analysis of a collection of schema changes, obtained from comparing several individual documents, can be used to detect complex schema changes.
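The distinction the summary draws between data changes and schema changes can be illustrated with a minimal sketch. This is not the authors' algorithm: the paper describes a graph representation and detection during parsing, whereas the sketch below assumes each document version has already been parsed into nested Python dicts and simply compares leaf paths. The function names `flatten` and `detect_changes`, and the sample records, are hypothetical.

```python
from typing import Any, Dict, List, Tuple

Path = Tuple[str, ...]


def flatten(doc: Any, prefix: Path = ()) -> Dict[Path, Any]:
    """Map every leaf path of a nested-dict document to its value.

    Assumption: only dicts are treated as structure; any other value
    (including a list) is treated as a leaf.
    """
    if isinstance(doc, dict):
        leaves: Dict[Path, Any] = {}
        for key, child in doc.items():
            leaves.update(flatten(child, prefix + (str(key),)))
        return leaves
    return {prefix: doc}


def detect_changes(old: Any, new: Any) -> Dict[str, Any]:
    """Compare two versions of one document.

    Schema changes: paths added, removed, or whose value type changed.
    Data changes: paths present in both versions with the same type
    but a different value.
    """
    old_leaves, new_leaves = flatten(old), flatten(new)
    common = old_leaves.keys() & new_leaves.keys()

    retyped: List[Path] = sorted(
        p for p in common if type(old_leaves[p]) is not type(new_leaves[p])
    )
    data: List[Path] = sorted(
        p for p in common
        if type(old_leaves[p]) is type(new_leaves[p])
        and old_leaves[p] != new_leaves[p]
    )
    return {
        "schema": {
            "added": sorted(new_leaves.keys() - old_leaves.keys()),
            "removed": sorted(old_leaves.keys() - new_leaves.keys()),
            "retyped": retyped,
        },
        "data": data,
    }


# Hypothetical sample versions of one record.
old_version = {"gene": {"id": "G1", "length": 1200}, "organism": "E. coli"}
new_version = {
    "gene": {"id": "G1", "length": "1200", "strand": "+"},
    "organism": "E. coli K-12",
}

if __name__ == "__main__":
    print(detect_changes(old_version, new_version))
```

In this sketch only the paths reported under "data" would need to be transferred to a warehouse, while anything under "schema" signals that downstream components may need updating.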
- Report Numbers
- E 1.99:ucrl-jc-134444
E 1.99: yn0100000
E 1.99: 97-erd-033
97-erd-033
yn0100000
ucrl-jc-134444
- Subject(s)
- Other Subject(s)
- Note
- Published through SciTech Connect.
06/08/1999.
16th International Conference on Data Engineering, San Diego, CA (US), 02/28/2000--03/03/2000.
Musick, R; Critchlow, T; Adiwijaya, I.
- Funding Information
- W-7405-ENG-48
LDRD