eXtensible Text Framework (XTF)

The eXtensible Text Framework (XTF) is a flexible indexing and query tool that supports searching across collections of heterogeneous data and presents results in a highly configurable manner. The highlights of the XTF system are described in an online brochure (PDF).

Downloads and Documentation


Technical Overview

The system is divided into four components:
  1. crossQuery: The front-end to the collection search system.
  2. dynaXML: Interface to individual documents.
  3. Text Engine: Used by crossQuery and dynaXML to perform text searches.
  4. Indexer: Full-text indexer based on Lucene.
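The division of labor among these four components can be illustrated with a toy model. This is only a sketch under stated assumptions: every class and function name below is hypothetical and is not part of XTF's actual API, and a plain in-memory dictionary stands in for the Lucene-based index.

```python
from collections import defaultdict


class Indexer:
    """Toy stand-in for the Indexer role: builds an inverted index
    (term -> set of document ids). Hypothetical, not XTF's API."""

    def __init__(self):
        self.postings = defaultdict(set)
        self.documents = {}

    def add(self, doc_id, text):
        # Store the document and record every lowercased term it contains.
        self.documents[doc_id] = text
        for term in text.lower().split():
            self.postings[term].add(doc_id)


class TextEngine:
    """Toy stand-in for the Text Engine role: answers single-term queries
    from the index. Shared by both front ends below."""

    def __init__(self, indexer):
        self.indexer = indexer

    def search(self, term, doc_id=None):
        hits = self.indexer.postings.get(term.lower(), set())
        if doc_id is not None:
            # Restrict the search to a single document.
            hits = hits & {doc_id}
        return sorted(hits)


def cross_query(engine, term):
    """Collection-level search (the crossQuery role): list matching documents."""
    return engine.search(term)


def dyna_xml(engine, doc_id, term=None):
    """Single-document view (the dynaXML role): return the document text,
    bracketing any word that matches the search term."""
    text = engine.indexer.documents[doc_id]
    if term and engine.search(term, doc_id):
        text = " ".join(
            f"[{word}]" if word.lower() == term.lower() else word
            for word in text.split()
        )
    return text


# Illustrative data only.
indexer = Indexer()
indexer.add("doc1", "Lucene powers the index")
indexer.add("doc2", "Results are configurable")
engine = TextEngine(indexer)

print(cross_query(engine, "lucene"))
print(dyna_xml(engine, "doc1", "index"))
```

The point of the sketch is the dependency direction: both front ends call the same search layer, and the search layer reads only what the indexer has already built.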
The following diagrams, which are somewhat outdated, give a general overview of how XTF indexes, stores, queries, retrieves, and displays documents.

Who uses XTF?

The California Digital Library (CDL) uses XTF as a building block for new services and has used it to replace several systems previously used for text searching (namely DLXS, Greenstone, and DynaWeb). As of 2008, the CDL has deployed XTF in the following ways:

XTF is also extensively used outside the CDL:

Other institutions exploring XTF include the University of Sydney, OhioLink, the University of Texas at Austin, the University of Virginia, the University of Denver, and University of Kansas Digital Initiatives.

Support

Implementers

While CDL does not directly support XTF implementers, we do make a good-faith effort to address the needs of the XTF community through the following resources on SourceForge:

Developers

The following SourceForge resources are for XTF developers and others who are interested in contributing to the architecture: