ZeXtras Powerstore: Attachment Indexing

From ZeXtras Suite Wiki

Jump to: navigation, search
Language: English  • español • português
ZxPowerStore logo box.png
Available since version: 1.3.0
Latest Version: 2.12.2
Released on: January 2nd, 2019
Compatibility List
Admin Guide
FAQ
Troubleshooting
ZspPowerstore logo box.png

How indexing works

Starting from version 2.4.0, an Indexing Engine has been added to Zextras Powerstore to index attachment contents.

It works together with Zimbra's own engine: the main Zimbra indexing process analyzes the content of an item, splitting it into several parts based on the MIME parts of the object, then handles the indexing of "known" contents - plaintext - and passes the datastream on to the Powerstore's handlers for all other content.

It includes an indexing cache, that speeds up the indexing process of any content that has already been analyzed. Datastreams over 10Kb are cached by default and the cache hold 10000 entries, while smaller datastreams are not cached as the cache benefits only apply to large datastreams.

This feature is currently only available on Zimbra Open Source Edition.

Indexed formats

Web

Extension Parser Content-type
"asp" "HtmlParser" //application/x-asp
"htm" "HtmlParser" //application/xhtml+xml
"html" "HtmlParser" text/html
"shtml" "HtmlParser" //application/xhtml+xml
"xhtml" "HtmlParser" //application/xhtml+xml

Documents

Extension Parser Content-type
"rtf" "RTFParser" //application/rtf
"pdf" "PDFParser" //application/pdf
"pub" "OfficeParser" //application/x-mspublisher
"xls" "OfficeParser" //application/vnd.ms-excel
"xlt" "OfficeParser" //application/vnd.ms-excel
"xlw" "OfficeParser" //application/vnd.ms-excel
"ppt" "OfficeParser" //application/vnd.ms-powerpoint
"pps" "OfficeParser" //application/vnd.ms-powerpoint
"mpp" "OfficeParser" //application/vnd.ms-project
"doc" "OfficeParser" //application/msword
"dot" "OfficeParser" //application/msword
"msg" "OfficeParser" //application/vnd.ms-outlook
"vsd" "OfficeParser" //application/vnd.visio
"vst" "OfficeParser" //application/vnd.visio
"vss" "OfficeParser" //application/vnd.visio
"vsw" "OfficeParser" //application/vnd.visio
"xlsm" "OOXMLParser" //application/vnd.ms-excel.sheet.macroenabled.12
"pptm" "OOXMLParser" //application/vnd.ms-powerpoint.presentation.macroenabled.12
"xltx" "OOXMLParser" //application/vnd.openxmlformats-officedocument.spreadsheetml.template
"docx" "OOXMLParser" //application/vnd.openxmlformats-officedocument.wordprocessingml.document
"potx" "OOXMLParser" //application/vnd.openxmlformats-officedocument.presentationml.template
"xlsx" "OOXMLParser" //application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
"pptx" "OOXMLParser" //application/vnd.openxmlformats-officedocument.presentationml.presentation
"xlam" "OOXMLParser" //application/vnd.ms-excel.addin.macroenabled.12
"docm" "OOXMLParser" //application/vnd.ms-word.document.macroenabled.12
"xltm" "OOXMLParser" //application/vnd.ms-excel.template.macroenabled.12
"dotx" "OOXMLParser" //application/vnd.openxmlformats-officedocument.wordprocessingml.template
"ppsm" "OOXMLParser" //application/vnd.ms-powerpoint.slideshow.macroenabled.12
"ppam" "OOXMLParser" //application/vnd.ms-powerpoint.addin.macroenabled.12
"dotm" "OOXMLParser" //application/vnd.ms-word.template.macroenabled.12
"ppsx" "OOXMLParser" //application/vnd.openxmlformats-officedocument.presentationml.slideshow
"odt" "OpenDocumentParser" //application/vnd.oasis.opendocument.text
"ods" "OpenDocumentParser" //application/vnd.oasis.opendocument.spreadsheet
"odp" "OpenDocumentParser" //application/vnd.oasis.opendocument.presentation
"odg" "OpenDocumentParser" //application/vnd.oasis.opendocument.graphics
"odc" "OpenDocumentParser" //application/vnd.oasis.opendocument.chart
"odf" "OpenDocumentParser" //application/vnd.oasis.opendocument.formula
"odi" "OpenDocumentParser" //application/vnd.oasis.opendocument.image
"odm" "OpenDocumentParser" //application/vnd.oasis.opendocument.text-master
"ott" "OpenDocumentParser" //application/vnd.oasis.opendocument.text-template
"ots" "OpenDocumentParser" //application/vnd.oasis.opendocument.spreadsheet-template
"otp" "OpenDocumentParser" //application/vnd.oasis.opendocument.presentation-template
"otg" "OpenDocumentParser" //application/vnd.oasis.opendocument.graphics-template
"otc" "OpenDocumentParser" //application/vnd.oasis.opendocument.chart-template
"otf" "OpenDocumentParser" //application/vnd.oasis.opendocument.formula-template
"oti" "OpenDocumentParser" //application/vnd.oasis.opendocument.image-template
"oth" "OpenDocumentParser" //application/vnd.oasis.opendocument.text-web
"sxw" "OpenDocumentParser" //application/vnd.sun.xml.writer

Packages and Archives

Extension Parser Content-Type
"z" "CompressorParser" //application/x-compress
"bz" "CompressorParser" //application/x-bzip
"boz" "CompressorParser" //application/x-bzip2
"bz2" "CompressorParser" //application/x-bzip2
"gz" "CompressorParser" //application/gzip
"gz" "CompressorParser" application/x-gzip
"gzip" "CompressorParser" //application/x-gzip
"xz" "CompressorParser" //application/x-xz
"tar" "PackageParser" //application/x-tar
"jar" "PackageParser" //application/java-archive
"7z" "PackageParser" //application/x-7z-compressed
"cpio" "PackageParser" //application/x-cpio
"zip" "PackageParser" //application/zip
"rar" "RarParser" //application/x-rar-compressed
"txt" "TXTParser" //text/plain

Parser Controls

Parsers can be turned on or off by changing the related value to "true" or "false" via the `zxsuite config` Zextras CLI command.

Attribute Parsers
pdfParsingEnabled PDFParser
odfParsingEnabled OpenDocumentParser
archivesParsingEnabled CompressorParser, PackageParser, RarParser
microsoftParsingEnabled OfficeParser, OOXMLParser, OldExcelParser
rtfParsingEnabled RTFParser
    e.g. to disable PDF parsing: 
    zxsuite config server set `zmhostname` attribute pdfParsingEnabled value false

By default, all parsers are active.

Personal tools