Search Indexes

Revision as of 17:01, 1 June 2011

Administration of the Full-Text Search

MailStore Server offers users an extremely fast full-text search. All emails a user has read access to are searched, in most cases within a fraction of a second. To achieve this speed, MailStore Server builds so-called search indexes during archiving. They work much like the index at the back of a book: looking something up in an index yields results significantly faster than searching every single page.
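
The book-index analogy can be made concrete with a minimal sketch of an inverted index. This only illustrates the general technique and is not MailStore Server's actual index format; the function names and the sample emails are hypothetical.

```python
from collections import defaultdict


def build_index(emails):
    """Map each lowercase word to the set of email IDs that contain it."""
    index = defaultdict(set)
    for email_id, text in emails.items():
        for word in text.lower().split():
            index[word].add(email_id)
    return index


def search(index, word):
    """Look the word up directly instead of scanning every email."""
    return sorted(index.get(word.lower(), set()))


# Hypothetical sample data for illustration only.
emails = {
    1: "Quarterly invoice attached",
    2: "Meeting notes from Monday",
    3: "Invoice reminder for March",
}
index = build_index(emails)
print(search(index, "invoice"))  # [1, 3]
```

The cost of building the index is paid once, during archiving in this analogy, so that each later search is a single lookup rather than a scan over all stored text.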

MailStore Server maintains one index file each

  • per file group and
  • per user.

Normally, the creation, maintenance and usage of search indexes are completely transparent, meaning that neither administrators nor MailStore users need to know of their existence or their internal workings. In some cases, however, maintenance may become necessary, for example:

  • because of a power outage,
  • because of an unexpected termination of the MailStore Server service,
  • because of missing network connectivity (only with storage on a NAS) or
  • because of changes to the index configuration (see below).

In these cases, the archive, index and settings may no longer be in sync, and individual indexes must be rebuilt. Generally, a corresponding error message will be displayed.

Typical tasks regarding indexes are described in the following sections.

Setting Up Indexing for the Contents of File Attachments

In the standard configuration, MailStore Server includes the file names of file attachments in the search indexes but not their contents. To enable MailStore Server to search the contents of file attachments, it has to be configured accordingly. Please proceed as follows:

  • Start MailStore Client and log on as administrator (admin).
  • Click on Administrative Tools.
  • Open the page Search Indexes.
  • Click on Options.
  • Enter the file types (file extensions) whose contents MailStore Server is to include in the search indexes, separated by spaces.
  • Click on OK to save the settings.
  • The new settings apply to all emails archived from now on. To apply them to emails that are already archived, rebuild the appropriate search indexes as described below.
[Screenshot: Tech index 02.png]
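
To illustrate how a space-separated list of file extensions might be interpreted, here is a minimal sketch. It is not MailStore Server code: the functions `parse_extensions` and `contents_indexed` and the sample setting are hypothetical, and case-insensitive matching is an assumption.

```python
from pathlib import PurePath


def parse_extensions(setting):
    """Turn a space-separated setting such as "doc docx xls" into a set."""
    cleaned = (ext.lower().lstrip(".") for ext in setting.split())
    return {ext for ext in cleaned if ext}


def contents_indexed(filename, extensions):
    """Return True if the attachment's extension is in the configured set."""
    suffix = PurePath(filename).suffix.lower().lstrip(".")
    return suffix in extensions


# Hypothetical setting, as it might be typed into the Options dialog.
configured = parse_extensions("doc docx xls pdf")
print(contents_indexed("Report.DOCX", configured))  # True
print(contents_indexed("photo.jpg", configured))    # False
```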

MailStore Server can index all file types for which a so-called IFilter driver is installed. When archiving, the drivers must be installed on the MailStore Client machine; when rebuilding indexes, they must be installed on the MailStore Server machine. Typically, IFilter drivers exist at least for all applications that are installed on the respective machines.

If, for example, Microsoft Office 2007 is installed, the corresponding IFilter drivers for Microsoft Office documents are installed as well. IFilter drivers can also be installed independently of their applications. To learn more, please enter IFilter into any search engine.

For reasons of stability and performance, MailStore Server processes the following file types directly, regardless of the IFilter drivers that are installed:

  • Text files (TXT),
  • HTML files (HTM and HTML) and
  • PDF files (PDF).
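
One way to read the rule above is as a simple decision per file type. The sketch below is only an interpretation of the text, not MailStore Server's implementation; `choose_handler` and the `installed` set are hypothetical, and a real system would have to query the operating system for registered IFilter drivers.

```python
# File types the text says are processed directly, without IFilter drivers.
DIRECT_TYPES = {"txt", "htm", "html", "pdf"}


def choose_handler(extension, ifilter_types):
    """Return "direct", "ifilter", or None if the contents cannot be read.

    ifilter_types is a hypothetical set of extensions for which an IFilter
    driver is installed on the machine doing the indexing.
    """
    ext = extension.lower().lstrip(".")
    if ext in DIRECT_TYPES:
        return "direct"
    if ext in ifilter_types:
        return "ifilter"
    return None


installed = {"doc", "docx", "xls"}  # assumed example, not a real driver query
print(choose_handler("PDF", installed))   # direct
print(choose_handler("docx", installed))  # ifilter
print(choose_handler("dwg", installed))   # None
```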

Rebuilding Search Indexes

If new indexing settings are to be applied to all existing emails or a message appears indicating that search indexes need to be rebuilt, please proceed as follows:

  • Start MailStore Client and log on as administrator (admin).
  • Click on Administrative Tools.
  • Open the page Search Indexes.
  • Check all search indexes to be rebuilt. Indexes with the status Please Rebuild are already checked for your convenience.
  • Click on Rebuild Search Indexes.
[Screenshot: Tech index 01.png]

Frequently Asked Questions Regarding Search Indexes

Why are the contents of file attachments not indexed in the standard configuration?

Indexing the contents of file attachments slows down the archiving process and increases the size of the search indexes significantly. Because of this, the contents of file attachments are not indexed in the standard configuration.

Everything was set up correctly but I still cannot find some text within the archived file attachments. What could be the reason?

First, for reasons of performance and stability, MailStore Server only indexes the first 10,000 words (not characters!) of file attachments. In most cases, the keywords are contained within the first 10,000 words. If an email has multiple file attachments, at most 10,000 words are indexed in total. Of course, the file attachments themselves are archived in their entirety and can be searched in full with the appropriate application.

Another reason could be that MailStore Server was unable to read the corresponding file because no IFilter driver was available or because the file was DRM protected (copy protection), for example.
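
The effect of the shared 10,000-word limit can be illustrated with a short sketch. It assumes that attachments are processed in order and that words are separated by whitespace; the text states neither, and `words_to_index` is a hypothetical helper, not part of MailStore Server.

```python
WORD_LIMIT = 10_000  # per email, shared across all attachments (from the text)


def words_to_index(attachment_texts, limit=WORD_LIMIT):
    """Collect words from attachments in order until the shared limit is hit."""
    indexed = []
    for text in attachment_texts:
        remaining = limit - len(indexed)
        if remaining <= 0:
            break
        indexed.extend(text.split()[:remaining])
    return indexed


# Two hypothetical attachments of 7,000 and 5,000 words.
first = "alpha " * 7_000
second = "beta " * 5_000
result = words_to_index([first, second])
print(len(result), result.count("beta"))  # 10000 3000
```

Under these assumptions, the last 2,000 words of the second attachment never reach the index, which is why a keyword near the end of a long attachment may not be found by the full-text search.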

In the Status column, OK (Compatibility mode) is displayed. What should be done?

The search index was created with an older version of MailStore Server and has an internal structure different from indexes which were built using the current version. No action is required. However, these indexes can be upgraded by rebuilding them. This significantly increases indexing and search performance.