- ホーム
- > 洋書
- > 英文書
- > Computer / General
Full Description
Document Processing and Retrieval: TEXPROS focuses on the design and implementation of a personal, customizable office information and document processing system called TEXPROS (a TEXt PROcessing System). TEXPROS is a personal, intelligent office information and document processing system for text-oriented documents. This system supports the storage, classification, categorization, retrieval and reproduction of documents, as well as extracting, browsing, retrieving and synthesizing information from a variety of documents. When using TEXPROS in a multi-user or distributed environment, it requires specific protocols for extracting, storing, transmitting and exchanging information.
The authors have used a variety of techniques to implement TEXPROS, such as Object-Oriented Programming, Tcl/Tk, X-Windows, etc. The system can be used for many different purposes in many different applications, such as digital libraries, software documentation and information delivery.
Audience: Provides in-depth, state-of-the-art coverage of information processing and retrieval, and documentation for such professionals as database specialists, information systems and software developers, and information providers.
Contents
1 Introduction.- 1.1 Texpros: An Overall Organization.- 1.2 Organization of the Book.- 2 Data Model and Algebra for Office Document.- 2.1 Related Work.- 2.2 Formal Framework of the D_model.- 2.3 Formalism of the D_algebra.- 2.4 Discussion.- 2.5 Summary.- 3 Document Categorization.- 3.1 Data Model Concepts.- 3.2 The Reconstruction Problem.- 3.3 Agent-Based Filing Architecture.- 3.4 Summary.- 4 Document Classification and Information Extraction.- 4.1 Document Classification and Information Extraction Techniques.- 4.2 Document Structures.- 4.3 Organization of Document Classification and Information Extraction Components.- 4.4 Document Layout Analysis.- 4.5 Conceptual Analysis on Structured Part of Document.- 4.6 Content Analysis on Unstructured Part of Document.- 4.7 Summary.- 5 Knowledge-Based Document Classification.- 5.1 Architecture of Knowledge-Based Document Classification.- 5.2 Knowledge Acquisition Tool (KAT).- 5.3 Document Type Tree Inference Engine.- 5.4 Classification Handler (CH).- 5.5 Summary.- 6 Document Retrieval.- 6.1 Document Retrieval Techniques for TEXPROS.- 6.2 Current Research on Document Retrieval.- 6.3 Overall Architecture of Retrieval System.- 6.4 Summary.- 7 Query Transformation.- 7.1 System Catalog — The Representation of Domain Knowledge and Meta-data Knowledge.- 7.2 Query Transformation Mechanism.- 7.3 Summary.- 8 Browser.- 8.1 Object Network.- 8.2 Architecture of Browser.- 8.3 Browsing in TEXPROS.- 8.4 Topic Interpreter.- 8.5 Object Network Constructor.- 8.6 Examples.- 8.7 Summary.- 9 Generalizer.- 9.1 Introduction to Generalizer.- 9.2 Generalization and Substitution Concepts.- 9.3 Generalization Algorithm for Detecting Erroneous Presuppositions.- 9.4 Giving Cooperative Responses by Substitutions.- 9.5 Summary.- References.