DLESE Tools
v1.2

org.dlese.dpc.index.reader
Class FileIndexingServiceDocReader

java.lang.Object
  extended by org.dlese.dpc.index.reader.DocReader
      extended by org.dlese.dpc.index.reader.FileIndexingServiceDocReader
All Implemented Interfaces:
Serializable
Direct Known Subclasses:
ErrorDocReader, SimpleFileIndexingServiceDocReader, XMLDocReader

public abstract class FileIndexingServiceDocReader
extends DocReader
implements Serializable

An abstract class that provides getter methods for reading data from a Lucene Document that was created by a FileIndexingServiceWriter. The getter methods can then be accessed from (Struts) beans that need the data. This class may be extended for each Document type that might be returned in a search. After implementing a new FileIndexingServiceDocReader, add a new switch for it to class ResultDoc.
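
The extension pattern described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the DLESE implementation: a plain Map stands in for the Lucene Document, and SketchReader, HtmlSketchReader, SimpleSketchReader, ResultDocSketch, and the "doctype" and "title" field names are all hypothetical. How ResultDoc actually selects a reader is not documented on this page; the doctype-based switch below is only one plausible shape.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-ins only: none of these types are DLESE or Lucene classes.
abstract class SketchReader {
    // A plain map stands in for the Lucene Document being wrapped.
    protected final Map<String, String> doc;

    protected SketchReader(Map<String, String> doc) {
        this.doc = doc;
    }

    // Shared getter, available for every document type.
    public String getDoctype() {
        String value = doc.get("doctype");
        return value == null ? "" : value;
    }

    // Each concrete reader reports its own type.
    public abstract String getReaderType();
}

// A reader for one document type, adding a type-specific getter.
final class HtmlSketchReader extends SketchReader {
    HtmlSketchReader(Map<String, String> doc) {
        super(doc);
    }

    public String getTitle() {
        String value = doc.get("title");
        return value == null ? "" : value;
    }

    @Override
    public String getReaderType() {
        return "HtmlSketchReader";
    }
}

// Fallback reader for document types without a dedicated class.
final class SimpleSketchReader extends SketchReader {
    SimpleSketchReader(Map<String, String> doc) {
        super(doc);
    }

    @Override
    public String getReaderType() {
        return "SimpleSketchReader";
    }
}

// Stands in for the switch that the class description says belongs in ResultDoc.
final class ResultDocSketch {
    static SketchReader readerFor(Map<String, String> doc) {
        if ("html".equals(doc.get("doctype"))) {
            return new HtmlSketchReader(doc);
        }
        return new SimpleSketchReader(doc);
    }

    public static void main(String[] args) {
        Map<String, String> doc = new HashMap<>();
        doc.put("doctype", "html");
        doc.put("title", "Example page");
        System.out.println(readerFor(doc).getReaderType());
    }
}
```

In this sketch, supporting a new document type means adding one subclass and one branch in readerFor, which mirrors the two steps named in the class description.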

Author:
John Weatherley
See Also:
Serialized Form

Field Summary
 
Fields inherited from class org.dlese.dpc.index.reader.DocReader
conf, doc, score
 
Constructor Summary
protected FileIndexingServiceDocReader()
          Constructor that initializes an empty DocReader.
protected FileIndexingServiceDocReader(Document doc)
          Constructor that may be used programmatically to wrap a reader around a Lucene Document that was created by a DocWriter.
protected FileIndexingServiceDocReader(Document doc, float score, ResultDocConfig conf)
          Constructor that is used by ResultDoc at search time to create a new instance of a DocReader.
 
Method Summary
 boolean fileExists()
          Determine whether the file associated with this Document exists.
protected static String getDateStamp()
          Return a string for the current time and date, suitable for display in log files and output to standard out.
 String getDeleted()
          Determine whether the status of this Document is deleted, indicated by a return value of "true".
 String getDocDir()
          Gets the absolute path of the directory that contained the File used to index the Document.
 String getDocsource()
          Gets the absolute path of the file that was used to index the Document.
 String getDocsourceEncoded()
          Gets the absolute path of the file that was used to index the Document, encoded.
 String getDoctype()
          Gets the doctype associated with the Document, for example 'dlese_ims', 'adn', or 'html'.
 String getFileExists()
          Determine whether the file associated with this Document exists, indicated by a return value of "true".
 String getFileName()
          Gets the name of the File that was used to index the Document.
 String getFullContent()
          Gets the full content of the file that was used to index the Document.
 long getLastModified()
          Gets the File modification time of the File used to index the Document.
 String getLastModifiedString()
          Gets a String representation of the File modification time of the File used to index the Document.
 File getSourceFile()
          Gets the File that was used to index the Document.
 boolean isDeleted()
          Determine whether the status of this Document is deleted.
protected static void prtln(String s)
          Output a line of text to standard out, with datestamp, if debug is set to true.
protected static void prtlnErr(String s)
          Output a line of text to error out, with datestamp.
protected static void setDebug(boolean db)
          Sets the debug attribute.
 
Methods inherited from class org.dlese.dpc.index.reader.DocReader
getDocument, getIndex, getQuery, getReaderType, getScore, setDoc
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

FileIndexingServiceDocReader

protected FileIndexingServiceDocReader(Document doc,
                                       float score,
                                       ResultDocConfig conf)
Constructor that is used by ResultDoc at search time to create a new instance of a DocReader.

Parameters:
doc - The Lucene Document that is read.
score - The rank of the Document in a set of results.

FileIndexingServiceDocReader

protected FileIndexingServiceDocReader(Document doc)
Constructor that may be used programmatically to wrap a reader around a Lucene Document that was created by a DocWriter.

Parameters:
doc - A Lucene Document.
See Also:
DocWriter

FileIndexingServiceDocReader

protected FileIndexingServiceDocReader()
Constructor that initializes an empty DocReader.

Method Detail

getFullContent

public final String getFullContent()
Gets the full content of the file that was used to index the Document. This includes all XML or HTML tags, etc.

Returns:
The full content as text, or empty string if unable to process.

getDoctype

public String getDoctype()
Gets the doctype associated with the Document, for example 'dlese_ims', 'adn', or 'html'. Note that to support wildcard searching, the doctype is indexed with a '0' prepended. This method strips the leading zero before returning.

Returns:
The doctype value.
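
The leading-zero handling described above can be illustrated with a small sketch. DoctypeSketch and both of its methods are hypothetical names introduced here; the real method reads the value from the Lucene Document, and the null handling below is an assumption.

```java
// Hypothetical sketch of the leading-zero convention; not the DLESE code.
final class DoctypeSketch {
    // Form assumed to be stored in the index, e.g. "adn" becomes "0adn".
    static String toIndexedForm(String doctype) {
        return "0" + doctype;
    }

    // Strips a single leading '0', if present, before returning the doctype.
    static String fromIndexedForm(String indexed) {
        if (indexed == null) {
            return ""; // assumption: a missing value is reported as an empty string
        }
        return indexed.startsWith("0") ? indexed.substring(1) : indexed;
    }

    public static void main(String[] args) {
        System.out.println(fromIndexedForm(toIndexedForm("adn")));
    }
}
```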

getDeleted

public String getDeleted()
Determine whether the status of this Document is deleted, indicated by a return value of "true". This does not necessarily mean the file has been deleted.

Returns:
The String "true" if the status is deleted, else "false".

isDeleted

public boolean isDeleted()
Determine whether the status of this Document is deleted. This does not necessarily mean the file has been deleted.

Returns:
True if the status is deleted.

getFileExists

public String getFileExists()
Determine whether the file associated with this Document exists, indicated by a return value of "true".

Returns:
The String "true" if the file exists, else "false".

fileExists

public boolean fileExists()
Determine whether the file associated with this Document exists.

Returns:
True if the file exists, else false.
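
Several methods in this class come in pairs: a boolean method (fileExists, isDeleted) and a String getter (getFileExists, getDeleted) that reports the same state as "true" or "false". A minimal sketch of that pairing follows, assuming the String form simply delegates to the boolean form. FileStatusSketch is a hypothetical class, and the check against a live java.io.File is an assumption about how existence is determined.

```java
import java.io.File;

// Hypothetical sketch of the paired boolean/String accessors; not the DLESE code.
final class FileStatusSketch {
    private final File source;

    FileStatusSketch(File source) {
        this.source = source;
    }

    // Boolean form, convenient for Java callers.
    public boolean fileExists() {
        return source != null && source.exists();
    }

    // String form, exposed as a "get" property so bean-based views can read it.
    public String getFileExists() {
        return String.valueOf(fileExists());
    }

    public static void main(String[] args) {
        System.out.println(new FileStatusSketch(new File(".")).getFileExists());
    }
}
```

Because getFileExists follows the getter naming convention, it is visible as a "fileExists" bean property, whereas the plain fileExists() method is not.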

getLastModifiedString

public String getLastModifiedString()
Gets a String representation of the File modification time of the File used to index the Document. Note that while this represents the File modification time, this date stamp does not get updated until the File is re-indexed by the indexer.

Returns:
The File modification time.

getLastModified

public long getLastModified()
Gets the File modification time of the File used to index the Document. Note that while this represents the File modification time, this date stamp does not get updated until the File is re-indexed by the indexer.

Returns:
The File modification time.
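
Because the modification time is captured at index time, the stored value can lag behind the file on disk. The sketch below illustrates that relationship. LastModifiedSketch and isStale are hypothetical, and the date pattern and UTC time zone are assumptions, since this page does not specify the format returned by getLastModifiedString.

```java
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

// Hypothetical sketch: the modification time is a snapshot taken when the file was indexed.
final class LastModifiedSketch {
    private final long indexedLastModified; // milliseconds since the epoch

    LastModifiedSketch(long indexedLastModified) {
        this.indexedLastModified = indexedLastModified;
    }

    public long getLastModified() {
        return indexedLastModified;
    }

    // Assumed display format; the real format is not documented here.
    public String getLastModifiedString() {
        SimpleDateFormat format = new SimpleDateFormat("yyyy-MM-dd'T'HH:mm:ss'Z'", Locale.ROOT);
        format.setTimeZone(TimeZone.getTimeZone("UTC"));
        return format.format(new Date(indexedLastModified));
    }

    // True when the file on disk is newer than the snapshot held in the index.
    public boolean isStale(long currentFileLastModified) {
        return currentFileLastModified > indexedLastModified;
    }

    public static void main(String[] args) {
        LastModifiedSketch reader = new LastModifiedSketch(0L);
        System.out.println(reader.getLastModifiedString());
    }
}
```

A caller that needs the live value would compare against getSourceFile().lastModified() rather than rely on the indexed snapshot.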

getSourceFile

public File getSourceFile()
Gets the File that was used to index the Document.

Returns:
The source File.

getFileName

public String getFileName()
Gets the name of the File that was used to index the Document.

Returns:
The source File name.

getDocsource

public String getDocsource()
Gets the absolute path of the file that was used to index the Document.

Returns:
The absolute path to the underlying file.

getDocsourceEncoded

public String getDocsourceEncoded()
Gets the absolute path of the file that was used to index the Document, encoded.

Returns:
The encoded absolute path to the underlying file.

getDocDir

public String getDocDir()
Gets the absolute path of the directory that contained the File used to index the Document.

Returns:
The docDir value.
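
The relationship between getSourceFile, getFileName, getDocsource, and getDocDir can be sketched with java.io.File alone. SourcePathSketch is a hypothetical class that derives every value from a single File. The real reader presumably returns values stored in the index, so this only illustrates how the four results relate to each other.

```java
import java.io.File;

// Hypothetical sketch: all four path getters derived from one source File.
final class SourcePathSketch {
    private final File source;

    SourcePathSketch(File source) {
        // Resolve once so every getter reports a consistent absolute location.
        this.source = source.getAbsoluteFile();
    }

    public File getSourceFile() {
        return source;
    }

    // File name only, without any directory part.
    public String getFileName() {
        return source.getName();
    }

    // Absolute path of the file itself.
    public String getDocsource() {
        return source.getAbsolutePath();
    }

    // Absolute path of the directory that contains the file.
    public String getDocDir() {
        String parent = source.getParent();
        return parent == null ? "" : parent;
    }

    public static void main(String[] args) {
        SourcePathSketch paths = new SourcePathSketch(new File("records", "a.xml"));
        System.out.println(paths.getDocDir() + File.separator + paths.getFileName());
    }
}
```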

getDateStamp

protected static final String getDateStamp()
Return a string for the current time and date, suitable for display in log files and output to standard out.

Returns:
The dateStamp value

setDebug

protected static final void setDebug(boolean db)
Sets the debug attribute.

Parameters:
db - The new debug value

prtlnErr

protected static final void prtlnErr(String s)
Output a line of text to error out, with datestamp.

Parameters:
s - The text that will be output to error out.

prtln

protected static final void prtln(String s)
Output a line of text to standard out, with datestamp, if debug is set to true.

Parameters:
s - The String that will be output.
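
The four logging helpers work together: setDebug toggles a flag, prtln writes only when that flag is set, and prtlnErr always writes. A minimal sketch of that behavior follows. DebugLogSketch is a hypothetical class, and the date stamp pattern is an assumption, since this page does not specify the format.

```java
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;

// Hypothetical sketch of debug-gated logging with a date stamp; not the DLESE code.
final class DebugLogSketch {
    private static boolean debug = false;

    static void setDebug(boolean db) {
        debug = db;
    }

    // Assumed stamp format; the real format is not documented here.
    static String getDateStamp() {
        return new SimpleDateFormat("MMM d, yyyy h:mm:ss a zzz", Locale.ENGLISH).format(new Date());
    }

    // Always written to standard error.
    static void prtlnErr(String s) {
        System.err.println(getDateStamp() + " " + s);
    }

    // Written to standard out only while debug is enabled.
    static void prtln(String s) {
        if (debug) {
            System.out.println(getDateStamp() + " " + s);
        }
    }

    public static void main(String[] args) {
        prtln("suppressed while debug is off");
        setDebug(true);
        prtln("printed while debug is on");
    }
}
```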
