Deduping Settings
Important: This feature is currently supported only for customers who use SQL Server as the DBMS for Data Platform.
On the Deduping tab, you can supply product-specific override settings for Technopedia deduplication rules. You can also export the current deduplication rules from this page.
• To set custom deduplication rules for particular software assets, upload a CSV that uses the Dedupe Setting flags described below.
• To download a copy of the current custom deduplication rules (if any), click Export.
For information about the deduping flags, how they alter default deduping behavior, and how to structure the CSV file to be uploaded, see Understanding Custom Dedupe Rule Flags, below.
Setting Custom Dedupe Rules
On the Deduping tab, you can upload a CSV file that lists one or more software products by their product IDs and sets deduping flag values to override the default deduping behavior. (These instructions presume you have already created a CSV file with dedupe override settings to upload. See Understanding Custom Dedupe Rule Flags, below, for an explanation of how the deduping flags alter deduping behavior and how to structure the CSV file.)
To set custom dedupe rules, do the following:
1. Click Upload CSV. The Upload dialog box opens.
2. In the Upload dialog box, locate the CSV file with your deduping rule override settings and click Upload.
The CSV file you upload overrides the default dedupe settings for the products included in it. If you previously uploaded a CSV file with custom dedupe settings, the most recent upload overrides the previously uploaded settings.
Exporting Custom Dedupe Rules
If you have previously uploaded custom dedupe settings, you can download a copy of the existing override settings as a CSV file from the Deduping tab.
To export the current custom dedupe rules (if any), do the following:
• Click Export.
The most recently uploaded dedupe rule override settings are downloaded as a CSV file.
Understanding Custom Dedupe Rule Flags
To successfully override default deduping rules, it is important to understand how the dedupe rule flags alter deduping behavior as well as how to structure the CSV file that determines your dedupe rule override settings.
Dedupe Rule Override Flags
COEXIST_EDITION permits multiple editions of the same software product to coexist on a device. By default, dedupe rules roll up multiple editions of the same software product on a given device to the most robust (most fully featured) edition.
COEXIST_VERSION permits multiple versions of the same software product to coexist on a device. By default, dedupe rules roll up multiple versions of the same software product on a given device to the most recent version.
DEDUPE_ADVANCED takes the installation directory into account so that multiple installs of the same software product on a single device can be treated as unique. By default, dedupe rules roll up multiple installs of a given software product on a single device to one preferred install. In addition to the DEDUPE_ADVANCED flag, a dedicated configuration property, RegexList, is included in Norm.Configuration.config. This property accepts a regular expression value. Only installation paths that match this regular expression are considered for the DEDUPE_ADVANCED override.
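As a rough illustration of how a path-matching regular expression narrows the installs considered for DEDUPE_ADVANCED, here is a minimal Python sketch. The pattern, the directory name ExampleDB, and the installation paths are all hypothetical. Python's re module stands in for whatever regular expression engine the product uses, so syntax details may differ. The actual value is set in the RegexList property in Norm.Configuration.config.

```python
import re

# Hypothetical pattern (not a product default): only installs located under
# an "ExampleDB" directory at the root of any drive are considered.
PATH_PATTERN = re.compile(r"^[A-Za-z]:\\ExampleDB\\", re.IGNORECASE)

def paths_considered(install_paths):
    """Return the installation paths that match the pattern."""
    return [path for path in install_paths if PATH_PATTERN.search(path)]

# Hypothetical installation paths discovered on one device.
paths = [
    r"C:\ExampleDB\product\12.1\home_1",
    r"D:\exampledb\product\19.0\home_2",
    r"C:\Program Files\Other\App",
]

matching = paths_considered(paths)
print(matching)
```

In this sketch the first two paths match and the third does not, so only the first two installs would be candidates for being treated as unique.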
ENABLED controls whether the dedupe rule override flags are in effect for the software product. This allows you to keep dedupe override settings in the CSV file while preventing them from operating for software product rows in which ENABLED=0.
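The effect of ENABLED can be sketched in Python as a filter over the rows of an override file. This is an interpretation of the description above, not the product's implementation. The product IDs in the sample are made up, and the column names follow the heading row described under CSV File Structure for Dedupe Rule Override Settings.

```python
import csv
import io

# Hypothetical sample file; the product IDs are invented for illustration.
SAMPLE = """CAT_SW_PRODUCT_ID,COEXIST_EDITION,COEXIST_VERSION,DEDUPE_ADVANCED,ENABLED
1001,1,0,0,1
1002,0,1,1,0
"""

FLAGS = ("COEXIST_EDITION", "COEXIST_VERSION", "DEDUPE_ADVANCED")

def active_overrides(csv_text):
    """Map each product ID with ENABLED=1 to the override flags set to 1."""
    result = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        if row["ENABLED"] != "1":
            continue  # settings are kept in the file but do not operate
        result[row["CAT_SW_PRODUCT_ID"]] = [f for f in FLAGS if row[f] == "1"]
    return result

print(active_overrides(SAMPLE))
```

Here product 1002 keeps its flag values in the file, but because ENABLED=0 they are skipped; only product 1001 has an override in effect.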
CSV File Structure for Dedupe Rule Override Settings
The CSV file you upload to override deduping rules must include a heading row as shown below plus one row for each software product to which you want to apply dedupe rule override flags.
Heading Row: CAT_SW_PRODUCT_ID,COEXIST_EDITION,COEXIST_VERSION,DEDUPE_ADVANCED,ENABLED
Software Product Row: Software Product ID (CAT_SW_PRODUCT_ID),[0/1],[0/1],[0/1],[0/1]
For each software product, add a row after the heading row that includes the Software Product ID (CAT_SW_PRODUCT_ID) and a value of 0 or 1 in each subsequent named column. A value of 1 enables the particular dedupe rule override flag for the current column. A value of 0 disables that flag.
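A file in this structure can be generated with a short script. The following Python sketch is one possible approach under stated assumptions: the function name build_dedupe_csv and the product IDs 1001 and 1002 are hypothetical, and the line ending and quoting conventions the upload expects are not specified here, so plain comma-separated lines are assumed.

```python
import csv
import io

HEADER = [
    "CAT_SW_PRODUCT_ID",
    "COEXIST_EDITION",
    "COEXIST_VERSION",
    "DEDUPE_ADVANCED",
    "ENABLED",
]

def build_dedupe_csv(rows):
    """Build CSV text from (product_id, edition, version, advanced, enabled) tuples."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(HEADER)
    for product_id, *flags in rows:
        # Each of the four flag columns must hold 0 or 1.
        if len(flags) != 4 or any(flag not in (0, 1) for flag in flags):
            raise ValueError(f"product {product_id}: expected four 0/1 flag values")
        writer.writerow([product_id, *flags])
    return buf.getvalue()

# Hypothetical product IDs: 1001 allows coexisting editions; 1002 keeps its
# settings in the file but is disabled (ENABLED=0).
csv_text = build_dedupe_csv([(1001, 1, 0, 0, 1), (1002, 0, 1, 1, 0)])
print(csv_text)
```

Writing the returned text to a .csv file produces a heading row followed by one row per product, which can then be uploaded from the Deduping tab.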