DIATHESIS
An Information System for documentation, management
and promotion of historical documents
| Description |
DIATHESIS is an information system for documentation, management
and promotion of historical documents that supports both digital
library functionality and archival management of the original
documents. It includes OCR-based page analysis and subject clipping,
subject-level metadata generation, semantic indexing and multifaceted
classification of subjects using built-in thesauri. The data
produced by the OCR processing of the scanned material are used
for the creation of a highly flexible annotation interface which
allows users to perform hybrid annotations upon the digitized
material assigning semantic properties to specific regions of
text that represent a subject. The goal of the documentation
process is the creation of a coherent semantic backbone that
can be easily enriched with semantic relations. It is not meant
to be a complete semantic structure that includes all the semantic
relationships and entities (Actors, Places) described in the
text. DIATHESIS consists of three lightweight, easily deployable
and highly configurable Web applications, namely the administration,
the documentation, and the querying applications, which allow
data import and monitoring, classification and indexing, and
search and presentation respectively. |