ZeXtras Powerstore: Attachment Indexing
From ZeXtras Suite Wiki
Language: | English • español • português |
---|
![]() |
Warning! This wiki is outdated and not maintained anymore, please refer to the new Zextras Documentation Hub at https://docs.zextras.com |
![]() |
This documentation is outdated. |
Please refer to https://docs.zextras.com |
How indexing works
Starting from version 2.4.0, an Indexing Engine has been added to Zextras Powerstore to index attachment contents.
It works together with Zimbra's own engine: the main Zimbra indexing process analyzes the content of an item, splitting it into several parts based on the MIME parts of the object, then handles the indexing of "known" contents - plaintext - and passes the datastream on to the Powerstore's handlers for all other content.
It includes an indexing cache, that speeds up the indexing process of any content that has already been analyzed. Datastreams over 10Kb are cached by default and the cache hold 10000 entries, while smaller datastreams are not cached as the cache benefits only apply to large datastreams.
This feature is currently only available on Zimbra Open Source Edition.
Indexed formats
Web
Extension | Parser | Content-type |
---|---|---|
"asp" | "HtmlParser" | //application/x-asp |
"htm" | "HtmlParser" | //application/xhtml+xml |
"html" | "HtmlParser" | text/html |
"shtml" | "HtmlParser" | //application/xhtml+xml |
"xhtml" | "HtmlParser" | //application/xhtml+xml |
Documents
Extension | Parser | Content-type |
---|---|---|
"rtf" | "RTFParser" | //application/rtf |
"pdf" | "PDFParser" | //application/pdf |
"pub" | "OfficeParser" | //application/x-mspublisher |
"xls" | "OfficeParser" | //application/vnd.ms-excel |
"xlt" | "OfficeParser" | //application/vnd.ms-excel |
"xlw" | "OfficeParser" | //application/vnd.ms-excel |
"ppt" | "OfficeParser" | //application/vnd.ms-powerpoint |
"pps" | "OfficeParser" | //application/vnd.ms-powerpoint |
"mpp" | "OfficeParser" | //application/vnd.ms-project |
"doc" | "OfficeParser" | //application/msword |
"dot" | "OfficeParser" | //application/msword |
"msg" | "OfficeParser" | //application/vnd.ms-outlook |
"vsd" | "OfficeParser" | //application/vnd.visio |
"vst" | "OfficeParser" | //application/vnd.visio |
"vss" | "OfficeParser" | //application/vnd.visio |
"vsw" | "OfficeParser" | //application/vnd.visio |
"xlsm" | "OOXMLParser" | //application/vnd.ms-excel.sheet.macroenabled.12 |
"pptm" | "OOXMLParser" | //application/vnd.ms-powerpoint.presentation.macroenabled.12 |
"xltx" | "OOXMLParser" | //application/vnd.openxmlformats-officedocument.spreadsheetml.template |
"docx" | "OOXMLParser" | //application/vnd.openxmlformats-officedocument.wordprocessingml.document |
"potx" | "OOXMLParser" | //application/vnd.openxmlformats-officedocument.presentationml.template |
"xlsx" | "OOXMLParser" | //application/vnd.openxmlformats-officedocument.spreadsheetml.sheet |
"pptx" | "OOXMLParser" | //application/vnd.openxmlformats-officedocument.presentationml.presentation |
"xlam" | "OOXMLParser" | //application/vnd.ms-excel.addin.macroenabled.12 |
"docm" | "OOXMLParser" | //application/vnd.ms-word.document.macroenabled.12 |
"xltm" | "OOXMLParser" | //application/vnd.ms-excel.template.macroenabled.12 |
"dotx" | "OOXMLParser" | //application/vnd.openxmlformats-officedocument.wordprocessingml.template |
"ppsm" | "OOXMLParser" | //application/vnd.ms-powerpoint.slideshow.macroenabled.12 |
"ppam" | "OOXMLParser" | //application/vnd.ms-powerpoint.addin.macroenabled.12 |
"dotm" | "OOXMLParser" | //application/vnd.ms-word.template.macroenabled.12 |
"ppsx" | "OOXMLParser" | //application/vnd.openxmlformats-officedocument.presentationml.slideshow |
"odt" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.text |
"ods" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.spreadsheet |
"odp" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.presentation |
"odg" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.graphics |
"odc" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.chart |
"odf" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.formula |
"odi" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.image |
"odm" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.text-master |
"ott" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.text-template |
"ots" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.spreadsheet-template |
"otp" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.presentation-template |
"otg" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.graphics-template |
"otc" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.chart-template |
"otf" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.formula-template |
"oti" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.image-template |
"oth" | "OpenDocumentParser" | //application/vnd.oasis.opendocument.text-web |
"sxw" | "OpenDocumentParser" | //application/vnd.sun.xml.writer |
Packages and Archives
Extension | Parser | Content-Type |
---|---|---|
"z" | "CompressorParser" | //application/x-compress |
"bz" | "CompressorParser" | //application/x-bzip |
"boz" | "CompressorParser" | //application/x-bzip2 |
"bz2" | "CompressorParser" | //application/x-bzip2 |
"gz" | "CompressorParser" | //application/gzip |
"gz" | "CompressorParser" | application/x-gzip |
"gzip" | "CompressorParser" | //application/x-gzip |
"xz" | "CompressorParser" | //application/x-xz |
"tar" | "PackageParser" | //application/x-tar |
"jar" | "PackageParser" | //application/java-archive |
"7z" | "PackageParser" | //application/x-7z-compressed |
"cpio" | "PackageParser" | //application/x-cpio |
"zip" | "PackageParser" | //application/zip |
"rar" | "RarParser" | //application/x-rar-compressed |
"txt" | "TXTParser" | //text/plain |
Parser Controls
Parsers can be turned on or off by changing the related value to "true" or "false" via the `zxsuite config` Zextras CLI command.
Attribute | Parsers |
---|---|
pdfParsingEnabled | PDFParser |
odfParsingEnabled | OpenDocumentParser |
archivesParsingEnabled | CompressorParser, PackageParser, RarParser |
microsoftParsingEnabled | OfficeParser, OOXMLParser, OldExcelParser |
rtfParsingEnabled | RTFParser |
e.g. to disable PDF parsing: zxsuite config server set `zmhostname` attribute pdfParsingEnabled value false
By default, all parsers are active.