\seekquarry\yioop\libraryprocessors

Classes

BmpProcessor Used to create crawl summary information for BMP and ICO files
CompressedProcessor Used to create crawl summary information for a gz compressed file whose uncompressed form has a processor we index.
DocProcessor Used to create crawl summary information for binary DOC files
DocxProcessor Used to create crawl summary information for DOCX files
EpubProcessor Used to create crawl summary information for XML files (those served as application/epub+zip)
GifProcessor Used to create crawl summary information for GIF files
GitXmlProcessor Parent class common to all processors used to create crawl summary information that involves basically text data
GopherProcessor Used to create crawl summary information for gopher protocol pages
HtmlProcessor Used to create crawl summary information for HTML files
ImageProcessor Base abstract class common to all processors used to create crawl summary information from images
JavaProcessor Parent class common to all processors used to create crawl summary information that involves basically text data
JpgProcessor Used to create crawl summary information for JPEG files
PageProcessor Base class common to all processors of web page data
PngProcessor Used to create crawl summary information for PNG files
PptProcessor Used to create crawl summary information for PPT files
PptxProcessor Used to create crawl summary information for PPTX files
PythonProcessor Parent class common to all processors used to create crawl summary information that involves basically text data
RobotProcessor Processor class used to extract information from robots.txt files
RssProcessor Used to create crawl summary information for RSS or Atom files
RtfProcessor Used to create crawl summary information for RTF files
SitemapProcessor Used to create crawl summary information for sitemap files
SvgProcessor Used to create crawl summary information for SVG files. This class is a little bit weird in that it generates thumbs like the image processor classes, but when it gives up on the data it falls back to text processor handling.
TextProcessor Parent class common to all processors used to create crawl summary information that involves basically text data
XlsxProcessor Used to create crawl summary information for xlsx files
XmlProcessor Used to create crawl summary information for XML files (those served as text/xml)