From 4f6b54c65f9ac5b0760bc11eb51747857c1d5589 Mon Sep 17 00:00:00 2001 From: Adam Dickmeiss Date: Wed, 14 Sep 2011 12:40:33 +0200 Subject: [PATCH] Document new setting icu_chain for service/server --- doc/pazpar2_conf.xml | 98 +++++++++++++++++++++++++++++++++++++------------- 1 file changed, 74 insertions(+), 24 deletions(-) diff --git a/doc/pazpar2_conf.xml b/doc/pazpar2_conf.xml index 8ae9b97..9605df3 100644 --- a/doc/pazpar2_conf.xml +++ b/doc/pazpar2_conf.xml @@ -124,15 +124,25 @@ - + - relevance / sort / mergekey / facet + icu_chain Specifies character set normalization for relevancy / sorting / mergekey and facets - for the server. These definitions serves as default for services that don't have these given. For the meaning - of these settings refer to the "relevance" element inside service. + of these settings refer to the + "icu_chain" element inside service. + + + + + + relevance / sort / mergekey / facet + + + Obsolete. Use element icu_chain instead. @@ -346,15 +356,24 @@ - + - relevance + icu_chain - Specifies ICU tokenization and transformation rules - for tokens that are used in Pazpar2's relevance ranking. - The 'id' attribute is currently not used, and the 'locale' - attribute must be set to one of the locale strings + Specifies a named ICU rule set. The icu_chain element must include + attribute 'id' which specifies the identifier (name) for the ICU + rule set. + Pazpar2 uses the particular rule sets for particular purposes. + Rule set 'relevance' is used to normalize + terms for relevance ranking. Rule set 'sort' is used to + normalize terms for sorting. Rule set 'mergekey' is used to + normalize terms for making a mergekey and, finally, 'facet' + is used to normalize facet terms (AKA termlists). + + + The icu_chain element must also include a 'locale' + attribute which must be set to one of the locale strings defined in ICU. The child elements listed below can be in any order, except the 'index' element which logically belongs to the end of the list. The stated tokenization, @@ -411,12 +430,33 @@ + relevance + + + Specifies the ICU rule set used for relevance ranking. + The child element of 'relevance' must be 'icu_chain' and the + 'id' attribute of the icu_chain is ignored. This + definition is obsolete and should be replaced by the equivalent + construct: + + <icu_chain id="relevance" locale="en">..<icu_chain> + + + + + + sort - Specifies ICU tokenization and transformation rules - for tokens that are used in Pazpar2's sorting. The contents - is similar to that of relevance. + Specifies the ICU rule set used for sorting. + The child element of 'sort' must be 'icu_chain' and the + 'id' attribute of the icu_chain is ignored. This + definition is obsolete and should be replaced by the equivalent + construct: + + <icu_chain id="sort" locale="en">..<icu_chain> + @@ -426,8 +466,14 @@ Specifies ICU tokenization and transformation rules - for tokens that are used in Pazpar2's mergekey. The contents - is similar to that of relevance. + for tokens that are used in Pazpar2's mergekey. + The child element of 'mergekey' must be 'icu_chain' and the + 'id' attribute of the icu_chain is ignored. This + definition is obsolete and should be replaced by the equivalent + construct: + + <icu_chain id="mergekey" locale="en">..<icu_chain> + @@ -437,8 +483,14 @@ Specifies ICU tokenization and transformation rules - for tokens that are used in Pazpar2's facets. The contents - is similar to that of relevance. + for tokens that are used in Pazpar2's facets. + The child element of 'facet' must be 'icu_chain' and the + 'id' attribute of the icu_chain is ignored. This + definition is obsolete and should be replaced by the equivalent + construct: + + <icu_chain id="facet" locale="en">..<icu_chain> + @@ -506,14 +558,12 @@ merge="longest" rank="2"/> - - - - - - - - + + + + + + -- 1.7.10.4