Tentative Schedule
(each morning session runs from 9 am to 12 pm with a break,
each afternoon session from 1:30 pm to 4:30 pm with a break)
For a more detailed view of topics, see the Course Outline.
Day 1
Morning
- Introductions and course objectives
- Overview
- Process Overview
- Environment
- Document Classes
- Directory Structure
- Data Preparation and XPat Overview
- Text Class Components
- Image Class
- Environment details
- Directory Structure details
Afternoon
- Image Class
- Installation and Configuration
- Image Class Access Restrictions
- Discussion of Approaches to Batch Image Processing
- Image Processing Software Links
Day 2
Morning
- Data Preparation (Part 1): Encoding & Transformation
- Data Sources
- GUMS/TextClass
- Inline markup (text munging vs. new XPat functionality)
- Unnumbered, nested, identical elements
- SGML Tools
- Transformation
- Normalization
- TermMapper & Fabricated regions
- Levels of Encoding
- Installation of DLXS TextClass Middleware & Content
- extract tar files (Hands On)
- edit configuration files (Hands On)
Afternoon
- Makefile
- Normalization (Hands On)
- XPat Search Engine
- History of the software
- Indexing
- SGML text indexing (Hands On)
- Region Indexing (Hands On)
- Query Language (Hands On)
- Fabricated regions (Hands On)
Day 3
Morning
- Related / Derivative Data
- TextClass
- CollDb
- Mapper
- Pageview Data Preparation (Hands On)
- Background
- pageview.dat files
- WordWheel Data Preparation (Hands On)
- History
- WordWheel data creation
- Program Architecture
- text-idx
- Functional Requirements
- configuration files
Afternoon
- Program Architecture continued
- text-idx continued
- Objects used
- URL parameters
- text-idx walkthrough
Day 4
Morning
- Program Architecture continued
- text-idx continued
- walkthrough as needed
- User Interface issues
- TBD
- pageviewer-idx
- Background and overview
- PageView object
- Creation of pageview.dat file (Hands On)
- walkthrough
Afternoon
- ww-idx
- WW object and others
- XPat indexed word data
- walkthrough
- Subclassing the TextClass
- examples
Day 5
Morning
- Q&A
- The Future