Attachment Indexing
How Indexing Works
Zextras Powerstore has a new External Content Extractor to index attachment contents. This way, server resources do not have to be spent reading the attachments.
The external content extractor works together with Zimbra’s default engine. The main Zimbra indexing process analyzes the contents of an item, splitting it into several parts based on the MIME parts of the object. Next, Zimbra handles the indexing of known contents — plaintext — and passes the datastream on to the Zextras Powerstore handlers for all other content.
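The hand-off described above can be pictured with a small Python sketch. It is only an illustration of the idea, not Zimbra or Zextras code: the handler labels `default-engine` and `external-extractor`, the `route_parts` function, and the sample message are all hypothetical, and only plaintext is treated as "known content", as the text states.

```python
from email.message import EmailMessage

# Content types the default engine is assumed to index itself in this sketch.
DEFAULT_ENGINE_TYPES = {"text/plain"}


def route_parts(message):
    """Return (content_type, handler) pairs for every leaf MIME part."""
    routed = []
    for part in message.walk():
        if part.is_multipart():
            continue  # container parts only group other parts
        content_type = part.get_content_type()
        if content_type in DEFAULT_ENGINE_TYPES:
            handler = "default-engine"      # hypothetical label
        else:
            handler = "external-extractor"  # hypothetical label
        routed.append((content_type, handler))
    return routed


def build_example_message():
    """Build a message with a plaintext body and one PDF attachment."""
    msg = EmailMessage()
    msg["Subject"] = "Quarterly report"
    msg.set_content("Report attached.")
    msg.add_attachment(
        b"%PDF-1.4 placeholder",
        maintype="application",
        subtype="pdf",
        filename="report.pdf",
    )
    return msg


if __name__ == "__main__":
    for content_type, handler in route_parts(build_example_message()):
        print(content_type, "->", handler)
```

In this sketch the plaintext body stays with the default engine while the PDF attachment is passed on, which mirrors the split described in the paragraph above.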
The indexing engine includes a cache that speeds up the indexing of any content that has already been analyzed. By default, datastreams over 10 KB are cached and the cache holds 10000 entries. Smaller datastreams are not cached, because caching only benefits large datastreams.
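A size-gated cache of this kind could look roughly like the following Python sketch. Several details are assumptions rather than documented behavior: the size threshold is taken as 10 × 1024 bytes, entries are keyed by a SHA-256 digest of the datastream, and the oldest unused entry is evicted once the limit is reached. The class name `IndexingCache` and its methods are hypothetical.

```python
import hashlib
from collections import OrderedDict

MIN_CACHED_BYTES = 10 * 1024  # assumed reading of the default size threshold
MAX_ENTRIES = 10000           # default number of entries stated in the text


class IndexingCache:
    """Cache extracted text for large datastreams only (illustrative)."""

    def __init__(self, min_bytes=MIN_CACHED_BYTES, max_entries=MAX_ENTRIES):
        self.min_bytes = min_bytes
        self.max_entries = max_entries
        self._entries = OrderedDict()

    def get_or_extract(self, data, extract):
        """Return extracted text, reusing a cached result when possible."""
        if len(data) <= self.min_bytes:
            return extract(data)  # small datastream: never cached
        key = hashlib.sha256(data).hexdigest()  # assumed cache key
        if key in self._entries:
            self._entries.move_to_end(key)  # mark as recently used
            return self._entries[key]
        text = extract(data)
        self._entries[key] = text
        if len(self._entries) > self.max_entries:
            self._entries.popitem(last=False)  # assumed eviction: oldest first
        return text

    def __len__(self):
        return len(self._entries)
```

Calling `get_or_extract` twice with the same large datastream runs the extractor only once, while a small datastream is extracted on every call.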
Indexed Formats
Web
| Extension | Parser | Content-type |
|---|---|---|
| | | application/x-asp |
| | | application/xhtml+xml |
| | | application/xhtml+xml, text/html |
| | | application/xhtml+xml |
| | | application/xhtml+xml |
Documents
| Extension | Parser | Content-type |
|---|---|---|
| | | application/rtf |
| | | application/pdf |
| | | application/x-mspublisher |
| | | application/vnd.ms-excel |
| | | application/vnd.ms-excel |
| | | application/vnd.ms-excel |
| | | application/vnd.ms-powerpoint |
| | | application/vnd.ms-powerpoint |
| | | application/vnd.ms-project |
| | | application/msword |
| | | application/msword |
| | | application/vnd.ms-outlook |
| | | application/vnd.visio |
| | | application/vnd.visio |
| | | application/vnd.visio |
| | | application/vnd.visio |
| | | application/vnd.ms-excel.sheet.macroenabled.12 |
| | | application/vnd.ms-powerpoint.presentation.macroenabled.12 |
| | | application/vnd.openxmlformats-officedocument.spreadsheetml.template |
| | | application/vnd.openxmlformats-officedocument.wordprocessingml.document |
| | | application/vnd.openxmlformats-officedocument.presentationml.template |
| | | application/vnd.openxmlformats-officedocument.spreadsheetml.sheet |
| | | application/vnd.openxmlformats-officedocument.presentationml.presentation |
| | | application/vnd.ms-excel.addin.macroenabled.12 |
| | | application/vnd.ms-word.document.macroenabled.12 |
| | | application/vnd.ms-excel.template.macroenabled.12 |
| | | application/vnd.openxmlformats-officedocument.wordprocessingml.template |
| | | application/vnd.ms-powerpoint.slideshow.macroenabled.12 |
| | | application/vnd.ms-powerpoint.addin.macroenabled.12 |
| | | application/vnd.ms-word.template.macroenabled.12 |
| | | application/vnd.openxmlformats-officedocument.presentationml.slideshow |
| | | application/vnd.oasis.opendocument.text |
| | | application/vnd.oasis.opendocument.spreadsheet |
| | | application/vnd.oasis.opendocument.presentation |
| | | application/vnd.oasis.opendocument.graphics |
| | | application/vnd.oasis.opendocument.chart |
| | | application/vnd.oasis.opendocument.formula |
| | | application/vnd.oasis.opendocument.image |
| | | application/vnd.oasis.opendocument.text-master |
| | | application/vnd.oasis.opendocument.text-template |
| | | application/vnd.oasis.opendocument.spreadsheet-template |
| | | application/vnd.oasis.opendocument.presentation-template |
| | | application/vnd.oasis.opendocument.graphics-template |
| | | application/vnd.oasis.opendocument.chart-template |
| | | application/vnd.oasis.opendocument.formula-template |
| | | application/vnd.oasis.opendocument.image-template |
| | | application/vnd.oasis.opendocument.text-web |
| | | application/vnd.sun.xml.writer |
Packages and Archives
| Extension | Parser | Content-type |
|---|---|---|
| | | application/x-compress |
| | | application/x-bzip |
| | | application/x-bzip2 |
| | | application/x-bzip2 |
| | | application/gzip |
| | | application/x-gzip |
| | | application/x-gzip |
| | | application/x-xz |
| | | application/x-tar |
| | | application/java-archive |
| | | application/x-7z-compressed |
| | | application/x-cpio |
| | | application/zip |
| | | application/x-rar-compressed |
| | | text/plain |
Parser Controls
Parsers can be turned on or off by changing the related value to true or false via the zxsuite config CLI command.
| Attribute | Parsers |
|---|---|
| pdfParsingEnabled | PDFParser |
| odfParsingEnabled | OpenDocumentParser |
| archivesParsingEnabled | CompressorParser, PackageParser, RarParser |
| microsoftParsingEnabled | OfficeParser, OOXMLParser, OldExcelParser |
| rtfParsingEnabled | RTFParser |
For example, to disable PDF parsing, run:
zxsuite config server set server.example.com attribute pdfParsingEnabled value false
By default, all parsers are active.
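When several parsers need to be toggled across servers, it can help to generate the commands rather than type them. The Python sketch below only builds the argument list, following the syntax of the example command in this section. It does not run anything, and the helper name `parser_toggle_command` is hypothetical. The attribute names are copied from the Parser Controls table.

```python
# Attribute -> parsers, copied from the Parser Controls table.
PARSER_ATTRIBUTES = {
    "pdfParsingEnabled": ["PDFParser"],
    "odfParsingEnabled": ["OpenDocumentParser"],
    "archivesParsingEnabled": ["CompressorParser", "PackageParser", "RarParser"],
    "microsoftParsingEnabled": ["OfficeParser", "OOXMLParser", "OldExcelParser"],
    "rtfParsingEnabled": ["RTFParser"],
}


def parser_toggle_command(server, attribute, enabled):
    """Build the zxsuite argument list that sets one parser attribute."""
    if attribute not in PARSER_ATTRIBUTES:
        raise ValueError("unknown parser attribute: " + attribute)
    value = "true" if enabled else "false"
    return [
        "zxsuite", "config", "server", "set", server,
        "attribute", attribute, "value", value,
    ]


if __name__ == "__main__":
    # Prints the same command as the PDF example in this section.
    print(" ".join(parser_toggle_command(
        "server.example.com", "pdfParsingEnabled", False)))
```

Rejecting unknown attribute names up front avoids sending a mistyped attribute to the CLI. The resulting list could be passed to `subprocess.run` on a host where `zxsuite` is available.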