- All Implemented Interfaces:
Accountable
Query that treats multiple fields as a single stream and scores terms as if you had
indexed them as a single term in a single field.
The query works as follows:
- Given a list of fields and weights, it pretends there is a synthetic combined field where all terms have been indexed. It computes new term and collection statistics for this combined field.
- It uses a disjunction iterator and
IndexSearcher.getSimilarity()to score documents.
In order for a similarity to be compatible, Similarity.computeNorm(org.apache.lucene.index.FieldInvertState) must be additive:
the norm of the combined field is the sum of norms for each individual field. The norms must also
be encoded using SmallFloat.intToByte4(int). These requirements hold for all similarities that
compute norms the same way as SimilarityBase.computeNorm(org.apache.lucene.index.FieldInvertState), which includes BM25Similarity and DFRSimilarity. Per-field similarities are not supported.
The query also requires that either all fields or no fields have norms enabled. Having only some fields with norms enabled can result in errors.
The scoring is based on BM25F's simple formula described in:
http://www.staff.city.ac.uk/~sb317/papers/foundations_bm25_review.pdf. This query implements the
same approach but allows other similarities besides BM25Similarity.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classA builder forCombinedFieldQuery.private static class(package private) class(package private) static classprivate static class -
Field Summary
FieldsModifier and TypeFieldDescriptionprivate static final longprivate final TreeMap<String, CombinedFieldQuery.FieldAndWeight> private final Term[]private final longprivate final BytesRef[]Fields inherited from interface org.apache.lucene.util.Accountable
NULL_ACCOUNTABLE -
Constructor Summary
ConstructorsModifierConstructorDescriptionprivateCombinedFieldQuery(TreeMap<String, CombinedFieldQuery.FieldAndWeight> fieldAndWeights, BytesRef[] terms) -
Method Summary
Modifier and TypeMethodDescriptioncreateWeight(IndexSearcher searcher, ScoreMode scoreMode, float boost) Expert: Constructs an appropriate Weight implementation for this query.booleanOverride and implement query instance equivalence properly in a subclass.getTerms()inthashCode()Override and implement query hash code properly in a subclass.longReturn the memory usage of this object in bytes.rewrite(IndexSearcher indexSearcher) Expert: called to re-write queries into primitive queries.private BooleanQueryPrints a query to a string, withfieldassumed to be the default field and omitted.private voidvalidateConsistentNorms(IndexReader reader) voidvisit(QueryVisitor visitor) Recurse through the query tree, visiting any child queries.Methods inherited from class org.apache.lucene.search.Query
classHash, rewrite, sameClassAs, toStringMethods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, waitMethods inherited from interface org.apache.lucene.util.Accountable
getChildResources
-
Field Details
-
BASE_RAM_BYTES
private static final long BASE_RAM_BYTES -
fieldAndWeights
-
terms
-
fieldTerms
-
ramBytesUsed
private final long ramBytesUsed
-
-
Constructor Details
-
CombinedFieldQuery
private CombinedFieldQuery(TreeMap<String, CombinedFieldQuery.FieldAndWeight> fieldAndWeights, BytesRef[] terms)
-
-
Method Details
-
getTerms
-
toString
Description copied from class:QueryPrints a query to a string, withfieldassumed to be the default field and omitted. -
equals
Description copied from class:QueryOverride and implement query instance equivalence properly in a subclass. This is required so thatQueryCacheworks properly.Typically a query will be equal to another only if it's an instance of the same class and its document-filtering properties are identical to those of the other instance. Utility methods are provided for certain repetitive code.
-
hashCode
public int hashCode()Description copied from class:QueryOverride and implement query hash code properly in a subclass. This is required so thatQueryCacheworks properly. -
ramBytesUsed
public long ramBytesUsed()Description copied from interface:AccountableReturn the memory usage of this object in bytes. Negative values are illegal.- Specified by:
ramBytesUsedin interfaceAccountable
-
rewrite
Description copied from class:QueryExpert: called to re-write queries into primitive queries. For example, a PrefixQuery will be rewritten into a BooleanQuery that consists of TermQuerys.Callers are expected to call
rewritemultiple times if necessary, until the rewritten query is the same as the original query.The rewrite process may be able to make use of IndexSearcher's executor and be executed in parallel if the executor is provided.
However, if any of the intermediary queries do not satisfy the new API, parallel rewrite is not possible for any subsequent sub-queries. To take advantage of this API, the entire query tree must override this method.
- Overrides:
rewritein classQuery- Throws:
IOException- See Also:
-
visit
Description copied from class:QueryRecurse through the query tree, visiting any child queries. -
rewriteToBoolean
-
createWeight
public Weight createWeight(IndexSearcher searcher, ScoreMode scoreMode, float boost) throws IOException Description copied from class:QueryExpert: Constructs an appropriate Weight implementation for this query.Only implemented by primitive queries, which re-write to themselves.
- Overrides:
createWeightin classQuery- Parameters:
scoreMode- How the produced scorers will be consumed.boost- The boost that is propagated by the parent queries.- Throws:
IOException
-
validateConsistentNorms
-