Make hyphenated compound searchable as closed compound
Hyphenated compounds are indexed for search as open compounds in Primo when "remove hyphens" option is enabled.
In German language it is also very common to concatenate them as closed compounds, thus creating a single word without spaces or hyphens. This variant should be indexed for search as well.
Example:
"Passagen-Werk" (hyphenated compound)
• at the moment also finds "Passagen Werk" (open compound) when "remove hyphens" option is enabled
• at the moment does not find "Passagenwerk" (closed compound) but should be able to find this!!
Thank you for this idea. We have improved the search with hyphens for Supported hyphenated terms.. it is released with Primo May 2021 release and Primo VE July release.
-
Jack commented
Is the terms list accessible by us or is this an internal Ex Libris list? If the latter, you haven't solved this yet. And I agree with others that a terms list solution is not a great solution.
-
Lars Iselid commented
The ExL support tells me it's not an improvement of the remove hyphens feature Nili talks about for VE (and maybe not for BO either?), just an implementation of a terms list in Primo VE (BO?):
"...right now we only have the option for closed list of terms in Primo VE.
Additionally, there is an option to suggest new terms for the current synonyms file, and we would consider adding them." Primo BO has a remove hyphens-feature but not VE. -
Manu Schwendener commented
> a list solution is not satisfactory
I completely agree!
Maybe voting here will help move this forward: https://ideas.exlibrisgroup.com/forums/308176-primo/suggestions/44442042-enable-retrieval-with-without-compound-hyphens-in
-
Kenny Oliver Eckart commented
We have been waiting a long time for an improvement in hyphenated searches. Now I have read the following article in the current release notes for Primo VE:
Improve Search Queries with HyphensJuly 2021 - NERS #4457
Local repository searches that include a search term with a hyphen will now return additional results by including the term's compound word in the search. For example, searches for the term chat-room, will also include results for chat room and chatroom. Previously, the system only added results for chat room. The supported terms are based on the same terms used for CDI results (see Supported Compound Words).In this list there are only English search terms. What about an improvement for the German language? Currently, I can't find anything for "Vier-Länder-Vergleich", for example. You only get hits when you search for "Vier Länder Vergleich".
Even if a list for German search terms is planned, a list solution is not satisfactory because new terms constantly appear that are then missing from the list. How often should these lists be updated?
-
Laszlo commented
This was working in Aleph. The index had tripple posted terms - as separate words - as hyphenated words - and as one word joined up without the hyphen(s).
-
Manu Schwendener commented
Will be done as part of the NERS improvements, but only via a list of terms.
(Info via Primo Working Group Webinar at Igelu 2020)
-
Manu Schwendener commented
NERS voting 2nd round open until 31.8.2020
4457 – Fix indexing of words with hyphens (e.g. hand-book, etc.)
-
Manu Schwendener commented
+1
-
Laurence Lockton commented
Very common in English too, e.g. co-operation, ultra-violet, micro-organisms, multi-disciplinary, hand-book, real-time, x-ray. I have no evidence for this, but I guess it's more common that the alternative spellings of hyphenated compounds are closed compounds than open compounds, so it would be more useful if "remove hyphens" did just that, rather than replace with a space. But even better if it did both, so it works for blue-green algae, mothers-in-law, etc.
-
Annette Seethaler commented
Since quite some long time all possible catalogues ISBNs can be searched with or without hyphens - a relief for all the users. Why go backwards?
-
Martin Gränicher commented
This could also improve ISBN search in a catalog where a majority of records contain ISBNs without hyphens: As it is now, users pasting hyphenated ISBNs e.g. from Amazon come up empty, even if the work is in the catalog.