Preparing Data for DLXS Indexing and Online Presentation
This document describes in general some of the issues involved in data preparation for all classes. This document does not address XPAT indexing, a description of which can be found in Indexing.
For specific information on data preparation for individual classes, check the following links:
- Data preparation environment: You may or may not have a machine, separate from your production machine (server), on which to prepare data. It is best to prepare data separate from any version of the collection that is in production and available to users. Unlike files in other directories in the DLXS file system (bin, obj, idx), everything in prep should be ultimately expendable in the production environment.
- Directories used in data preparation: We suggest a directory structure such as the following. Note: $DLXSROOT here refers to whatever path you have set up as the $DLXSROOT (see the $DLXSROOT environment variable).
- $DLXSROOT/prep/c/collid/ to contain
the data itself in its various versions as it is transformed from its original source structure into the structure needed for XPAT indexing and final delivery.
- $DLXSROOT/bin/c/collid/ to contain scripts and other files needed for processing the data (e.g., Perl scripts, Makefiles, XSLT stylesheets, etc.).
See documentation on directory conventions for more information on the overall DLXS directory structure.