org.cdlib.xtf.textIndexer
Class PDFIndexSource

Object
  extended by IndexSource
      extended by XMLIndexSource
          extended by PDFIndexSource

public class PDFIndexSource
extends XMLIndexSource

Transforms a PDF file to a single-record XML file.

Author:
Martin Haye

Field Summary
private  File pdfFile
          Source of PDF data
 
Constructor Summary
PDFIndexSource(File pdfFile, String key, Templates[] preFilters, Templates displayStyle, StructuredStore lazyStore)
          Constructs a PDFIndexSource and initializes all the fields
 
Method Summary
protected  InputSource filterInput()
          Transform the PDF file to XML data
 
Methods inherited from class XMLIndexSource
displayStyle, key, nextRecord, normalize, path, preFilters, removeDoctypeDecl, totalSize
 
Methods inherited from class Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

pdfFile

private File pdfFile
Source of PDF data

Constructor Detail

PDFIndexSource

public PDFIndexSource(File pdfFile,
                      String key,
                      Templates[] preFilters,
                      Templates displayStyle,
                      StructuredStore lazyStore)
Constructs a PDFIndexSource and initializes all the fields

Method Detail

filterInput

protected InputSource filterInput()
                           throws IOException
Transform the PDF file to XML data

Overrides:
filterInput in class XMLIndexSource
Throws:
IOException
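
Example (illustrative only):
The sketch below shows one way a PDF-to-XML step of this shape could hand its result to an XML parser: text already extracted from a PDF is escaped, wrapped in a single root element, and exposed as an InputSource. It uses only JDK classes. The class name SingleRecordSketch, the helper names, and the <pdf> root element are assumptions made for this example and are not taken from the XTF source; the actual PDF text extraction is out of scope here.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

// Hypothetical stand-in for illustration; not part of org.cdlib.xtf.textIndexer.
class SingleRecordSketch {

    /** Escapes the characters that are special in XML text content. */
    static String escapeXml(String text) {
        // '&' must be replaced first so the entities added below are not re-escaped.
        return text.replace("&", "&amp;")
                   .replace("<", "&lt;")
                   .replace(">", "&gt;");
    }

    /** Wraps extracted text in one root element, giving a single-record document. */
    static String toSingleRecordXml(String extractedText) {
        // The <pdf> element name is an assumption made for this sketch.
        return "<pdf>" + escapeXml(extractedText) + "</pdf>";
    }

    /** Exposes the XML string as an InputSource backed by a character stream. */
    static InputSource toInputSource(String extractedText) {
        return new InputSource(new StringReader(toSingleRecordXml(extractedText)));
    }

    public static void main(String[] args) throws Exception {
        InputSource source = toInputSource("Fish & chips < 5");

        // Parse the result to confirm it is well-formed XML with a single root.
        Document doc = DocumentBuilderFactory.newInstance()
                                             .newDocumentBuilder()
                                             .parse(source);
        System.out.println(doc.getDocumentElement().getTagName());
        System.out.println(doc.getDocumentElement().getTextContent());
    }
}
```

Returning a character-stream InputSource keeps the converted document in memory, so no temporary XML file is needed; whether PDFIndexSource itself works this way is not stated on this page.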