Could Ex Libris regularly update the list of common titles used by the Primo duplicate detection vector?
Primo uses the duplicate detection vector to determine whether or not records will be deduped and merged in Primo.
One of the match criteria is full title. If two records have the same title and the title is not in the list of common titles, the records are assigned 600 points. (800 points or higher means the records will be deduped/merged.)
The list of common titles is provided here: https://knowledge.exlibrisgroup.com/Primo/ProductDocumentation/TechnicalGuide/180FilesUsedbytheDedupAlgorithm#wwconnectheader
It includes common titles such as annual report, bulletin, transactions... Unfortunately it does not include other common titles such as news, poems, plays... (See attached Excel file.)
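To make the title criterion concrete, here is a rough Python sketch of the rule as I understand it. It is not Ex Libris code. The 600 and 800 figures come from the description above; the function names, the normalization, and the score of 0 for a common-title match are my own assumptions, and the sample list holds only the three titles mentioned above rather than the full documented list.

```python
# Illustrative sketch only; models just the full-title criterion described above.
TITLE_MATCH_POINTS = 600  # same title that is not a common title
MERGE_THRESHOLD = 800     # 800 points or higher means dedup/merge

# Tiny sample; the real list is in the linked documentation.
COMMON_TITLES = {"annual report", "bulletin", "transactions"}


def normalize(title):
    # Assumed normalization (lowercase, collapse whitespace); Primo's rules may differ.
    return " ".join(title.lower().split())


def title_points(title_a, title_b, common_titles=COMMON_TITLES):
    """Return the points contributed by the full-title criterion."""
    a, b = normalize(title_a), normalize(title_b)
    if a != b:
        return 0
    if a in common_titles:
        # Placeholder: the score for a common-title match is not stated above.
        return 0
    return TITLE_MATCH_POINTS


def would_merge(total_points):
    """True when the total across all criteria reaches the merge threshold."""
    return total_points >= MERGE_THRESHOLD


print(title_points("News", "News"))                             # "news" not in the list
print(title_points("News", "News", COMMON_TITLES | {"news"}))   # "news" added to the list
```

With "news" missing from the list, the title alone supplies 600 of the 800 points needed, so only 200 more points from other criteria are required for a merge.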
My request is for Ex Libris to maintain and update the list of common titles to include additional, frequently occurring titles.
I performed an analysis in Analytics of the most common one- or two-word titles in our repository. (See attached png file.)
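For anyone who wants to repeat this kind of count outside Analytics, a minimal Python sketch follows. It assumes you already have the titles as a list of strings (for example, from an export); the function name and the sample titles are made up for illustration and are not my actual results.

```python
from collections import Counter


def common_short_titles(titles, max_words=2, top=10):
    """Count titles of at most max_words words, ignoring case and extra spaces."""
    counts = Counter()
    for title in titles:
        key = " ".join(title.lower().split())
        if key and len(key.split()) <= max_words:
            counts[key] += 1
    return counts.most_common(top)


# Made-up sample data, not taken from our repository.
sample = ["News", "news", "Poems", "Plays", "Poems", "News", "A very long title here"]
for title, count in common_short_titles(sample):
    print(title, count)
```

Titles that rank high in such a count but are absent from the common titles list are the candidates for addition.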
We have six titles in Alma that are being deduped/merged into one Primo record. The bib records' OCLC numbers are 984755026, 994298106, 984751244, 993601092, 984751722, 994301130. They are newsletters published by various academic departments at our university.
I'm guessing this is because they all have the same 245 $a title (News) and other similarities, such as the 710 corporate name (California State University, Northridge...) and the place of publication (Northridge).
Unfortunately, this can cause confusing displays of physical inventory. There are six holdings records in the "get it" section but it's difficult to determine which holdings record belongs to which academic department.
Therefore, it would be best if these six records were not deduped/merged and were instead displayed as separate records in Primo.
Laura Akerman commented
What about the Client Common Titles.txt file? Could you use that? (https://knowledge.exlibrisgroup.com/Primo/Product_Documentation/Technical_Guide/180Files_Used_by_the_Dedup_Algorithm#ClientCommonTitles.txt all the way at the bottom)
Unfortunately, it appears to be used by the Serials matching profile only; there's no corresponding file for monographs.