org.cdlib.xtf.textIndexer
Class MSWordIndexSource

Object
  extended by IndexSource
      extended by XMLIndexSource
          extended by MSWordIndexSource

public class MSWordIndexSource
extends XMLIndexSource

Transforms a Microsoft Word file to a single-record XML file.

Author:
Martin Haye

Field Summary
private  File msWordFile
          Source of MS Word document data
 
Constructor Summary
MSWordIndexSource(File msWordFile, String key, Templates[] preFilters, Templates displayStyle, StructuredStore lazyStore)
          Constructor -- initializes all the fields
 
Method Summary
protected  InputSource filterInput()
          Transform the MS Word file to XML data
 
Methods inherited from class XMLIndexSource
displayStyle, key, nextRecord, normalize, path, preFilters, removeDoctypeDecl, totalSize
 
Methods inherited from class Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

msWordFile

private File msWordFile
Source of MS Word document data

Constructor Detail

MSWordIndexSource

public MSWordIndexSource(File msWordFile,
                         String key,
                         Templates[] preFilters,
                         Templates displayStyle,
                         StructuredStore lazyStore)
Constructor -- initializes all the fields

Method Detail

filterInput

protected InputSource filterInput()
                           throws IOException
Transform the MS Word file to XML data

Overrides:
filterInput in class XMLIndexSource
Throws:
IOException