BmpProcessor |
Used to create crawl summary information
for BMP and ICO files |
CompressedProcessor |
Used to create crawl summary information
for a gz-compressed file whose uncompressed form is
a type that one of the processors can index. |
DocProcessor |
Used to create crawl summary information
for binary DOC files |
DocxProcessor |
Used to create crawl summary information
for DOCX files |
EpubProcessor |
Used to create crawl summary information
for EPUB files (those served as application/epub+zip) |
GifProcessor |
Used to create crawl summary information
for GIF files |
GitXmlProcessor |
Parent class common to all processors used to create crawl summary
information from data that is mainly text |
GopherProcessor |
Used to create crawl summary information
for gopher protocol pages |
HtmlProcessor |
Used to create crawl summary information
for HTML files |
ImageProcessor |
Abstract base class common to all processors used to create crawl summary
information from images |
JavaProcessor |
Used to create crawl summary information
for Java source files |
JpgProcessor |
Used to create crawl summary information
for JPEG files |
PageProcessor |
Base class common to all processors of web page data |
PngProcessor |
Used to create crawl summary information
for PNG files |
PptProcessor |
Used to create crawl summary information
for PPT files |
PptxProcessor |
Used to create crawl summary information
for PPTX files |
PythonProcessor |
Used to create crawl summary information
for Python source files |
RobotProcessor |
Processor class used to extract information from robots.txt files |
RssProcessor |
Used to create crawl summary information
for RSS or Atom files |
RtfProcessor |
Used to create crawl summary information
for RTF files |
SitemapProcessor |
Used to create crawl summary information
for sitemap files |
SvgProcessor |
Used to create crawl summary information
for SVG files. This class is unusual in that it
generates thumbnails like the image processor
classes but, when it gives up on the data,
falls back to text processor handling. |
TextProcessor |
Parent class common to all processors used to create crawl summary
information from data that is mainly text |
XlsxProcessor |
Used to create crawl summary information
for XLSX files |
XmlProcessor |
Used to create crawl summary information
for XML files (those served as text/xml) |