Reduces the importance of the
CmsSearchSimilarity.lengthNorm(String,int) factor
for the
CmsSearchField.FIELD_CONTENT field, while
keeping the Lucene default for all other fields.
This implementation was added since apparently the default length norm is heavily biased
for small documents. In the default, even if a term is found in 2 documents the same number of
times, the smaller document (containing less terms) will have a score easily 3x as high as
the longer document. Using this implementation the importance of the term number is reduced.
Inspired by Chuck Williams WikipediaSimilarity.
author: Alexander Kandzior version: $Revision: 1.9 $ since: 6.0.0 |