Search Index

Last published at: August 5th, 2024
Delete

The Search Index is available in the Advanced and Premium editions.

Delete

When creating a search index, Duplicate Check creates a key (hash) for every record in your object. The key is created based on fields that are defined in your scenario. Duplicate Check uses this key to compare records. It is advised to create a search index when using custom fields in your scenario, and/or when working with our fuzzy matching methods.  Having a search index improves results and scoring of duplicate records.

Duplicate Check uses the Search Index to find potential duplicate records in Salesforce. This Search Index is a string of values and numbers which is created based on the scenario that you use.

When searching with the Search Index, potential duplicates are returned. By default, about 15 records are returned. As a next step, the scenario calculates matching percentages for those 15 records. Every pair with matching percentages that are equal to, or higher than the defined threshold level, are defined as duplicate records and returned in the UI.

To use a search index for an Object, first enable and configure the search index for that Object in the *Object* Settings in DC Setup. Then create the Search Index for the Object on tab Index Batch. Don't forget to re-create the search index every now and then, to keep it up-to-date.

Configuring the Search Index

To set up the Search Index for an Object:

  1. Go to DC Setup.
  2. On the left hand side, under Object Setup, select the Object you want to create a Search Index for.
  3. On the *Object* Settings tab, enable the Search Index option.

Then, configure the Search Index in the advanced *Object* Settings:

  1. At top right, enable Show Advanced Settings.
  2. Scroll down to the Index section.

Search Index with Wild Card If enabled, Duplicate Check index search will return partial matches. 
Index Creation for single records

Determines when the Index creation will start. You can let DC decide when by choosing:

"DC Determines the best option" - Duplicate Check decides when the index is updated.
"Directly. Index is directly available" - The index is updated right away.
"In the background. There is a delay in Index creation" - Index is updated in an apex job, started whenever Salesforce allows it to start.

Number of records returned in index search Decides how many records Duplicate Check returns in the first step of the search procedure. Do not change this, unless you get instructions from the Plauti Support team.
Index Queue - Batch Chunk Size Determines the chunk size of the index creation job. Do not change this, unless you get instructions from the Plauti Support team.
Index location    The location where the Search Index will be stored. 

Search Index Location

You can choose between two options for storing the indexes. 

Index Object

The default setting is storing the index in the Index Object. When you create a Search Index, the indexes are stored in a custom object called "Index Object".

Each record that is indexed takes 2kb of storage space.

Within the Object itself

Another option is to store the index in the Object itself. This will store the index in a field in your records, which takes no additional storage space. As a result of updating the records, the last modified date will change. 

Delete

Incremental indexing

Incremental indexing is only available when the search index is stored in the Index Object, not when the search index is stored within the object itself.

Creating the Search Index

To create or re-create the Search Index for an object:

  1. Go to DC Setup.
  2. On the left hand side, under Object Setup, select the Object you want to create a Search Index for.
  3. On the Index Batch tab, click the Start button.
  4. Select one of the Create Search Index options.

Note: after selecting it may take some time before the Search Index is created.

For large indexes you have the option to run the search index creation on DC Local instead of within Salesforce. In that case:

  1. At Start, choose Create Search Index (DC Local)
  2. Open DC Local.
  3. Start the index job.

Alternatively, you can also run the search index creation on Plauti Cloud‍, saving you the hassle of running and maintaining an instance of DC Local. 

  1. At Start, choose Create Search Index (Plauti Cloud).
  2. Log into Plauti Cloud.
  3. Go to Duplicate Check > Jobs.
  4. Start the Create Search Index job.

Re-creating the Search Index

It is recommended to recreate an object's search index every once in a while, for example after you made a change to a scenario.

Recreating the entire search index is done simply by selecting Create Search Index again. This is recommended for instance after you created a new scenario, made changes to an existing scenario (e.g. a different matching method, new frequent words, or a new field), added or changed an object filter, or changed the index location.

To keep existing search indexes up-to-date, you can create an incremental search index. This option only recreates the search index for those records that were updated after the last full search index creation was started. Incremental search index creation is quicker than a full index re-creation.
The "Create Incremental Search Index" option will appear after a full search index was created, and only when the index is stored on the Index Object (see the "Search Index Location" section above).

To recreate the search index:

  • Go to DC Setup.
  • On the left hand side, under Object Setup, select the Object you want to recreate a Search Index for.
  • On the Index Batch tab, click the Start button.
  • Select one of the Create Search Index or Create Incremental Search Index options.
Delete

A search index for all your Objects

It is recommended to create an index for every Object that you work with. Especially when using any custom fields in your scenario, a search index is needed to take them into account when finding duplicates. 

Click here to watch a short video of how to enable and create the Search Index.

Delete

Please note, that if you choose to store the Search Index on the object itself, the "last modified" date of your records will change after creating or updating the search index. This might trigger workflows you have configured.

Delete

When making a change to your scenario, it's always a good idea to re-create your search index - making sure the search index in all records is up to date.