# Neural sparse query
**Introduced 2.11**
Use the `neural_sparse` query for vector field search in neural sparse search.
You can run the query in the following ways:
- Provide sparse vector embeddings for matching. For more information, see Neural sparse search using raw vectors:

  ```json
  "neural_sparse": { "<vector_field>": { "query_tokens": { "<token>": <weight>, ... } } }
  ```
- Provide text to tokenize and use for matching. To tokenize the text, you can use one of the following components:

  - A built-in DL model analyzer:

    ```json
    "neural_sparse": { "<vector_field>": { "query_text": "<input text>", "analyzer": "bert-uncased" } }
    ```

  - A tokenizer model:

    ```json
    "neural_sparse": { "<vector_field>": { "query_text": "<input text>", "model_id": "<model ID>" } }
    ```

  For more information, see Generating sparse vector embeddings automatically.
## Request body fields
The top-level `vector_field` specifies the vector field against which to run a search query. You must specify either `query_text` or `query_tokens` to define the input. The following fields can be used to configure the query:
| Field | Data type | Required/Optional | Description |
| :--- | :--- | :--- | :--- |
| `query_text` | String | Optional | The query text to convert into sparse vector embeddings. Either `query_text` or `query_tokens` must be specified. |
| `analyzer` | String | Optional | Used with `query_text`. Specifies a built-in DL model analyzer for tokenizing the query text. Valid values are `bert-uncased` and `mbert-uncased`. Default is `bert-uncased`. If neither `model_id` nor `analyzer` is specified, the default analyzer (`bert-uncased`) is used to tokenize the text. Cannot be specified at the same time as `model_id`. For more information, see DL model analyzers. |
| `model_id` | String | Optional | Used with `query_text`. The ID of the sparse encoding model (for bi-encoder mode) or tokenizer (for doc-only mode) used to generate vector embeddings from the query text. The model/tokenizer must be deployed in OpenSearch before it can be used in neural sparse search. For more information, see Using custom models within OpenSearch and Generating sparse vector embeddings automatically. For information about setting a default model ID in a neural sparse query, see `neural_query_enricher`. Cannot be specified at the same time as `analyzer`. |
| `query_tokens` | Map of token (string) to weight (float) | Optional | A raw sparse vector in the form of tokens and their weights. Used as an alternative to `query_text` for direct vector input. Either `query_text` or `query_tokens` must be specified. |
| `max_token_score` | Float | Optional | (Deprecated) This parameter has been deprecated since OpenSearch 2.12. It is maintained only for backward compatibility and no longer affects functionality. The parameter can still be provided in requests, but its value has no impact. Previously used as the theoretical upper bound of the score for all tokens in the vocabulary. |
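For example, a legacy request that still includes the deprecated `max_token_score` parameter is accepted, but the value is ignored. This sketch reuses the index name and model ID from the examples in the next section:

```json
GET my-nlp-index/_search
{
  "query": {
    "neural_sparse": {
      "passage_embedding": {
        "query_text": "Hi world",
        "model_id": "aP2Q8ooBpBj3wT4HVS8a",
        "max_token_score": 2.0
      }
    }
  }
}
```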
## Examples
To run a search using text tokenized by an analyzer, specify the `analyzer` in the request. The analyzer must be compatible with the model that you used for text analysis at ingestion time:
```json
GET my-nlp-index/_search
{
  "query": {
    "neural_sparse": {
      "passage_embedding": {
        "query_text": "Hi world",
        "analyzer": "bert-uncased"
      }
    }
  }
}
```
For more information, see DL model analyzers.
If you don’t specify an analyzer, the default `bert-uncased` analyzer is used:
```json
GET my-nlp-index/_search
{
  "query": {
    "neural_sparse": {
      "passage_embedding": {
        "query_text": "Hi world"
      }
    }
  }
}
```
To search using text tokenized by a tokenizer model, provide the model ID in the request:
```json
GET my-nlp-index/_search
{
  "query": {
    "neural_sparse": {
      "passage_embedding": {
        "query_text": "Hi world",
        "model_id": "aP2Q8ooBpBj3wT4HVS8a"
      }
    }
  }
}
```
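If you don't want to pass a `model_id` in every request, the request body fields table references the `neural_query_enricher` search request processor, which sets a default model ID. A minimal sketch, assuming a hypothetical pipeline name `default_model_pipeline` and the model ID from the preceding example:

```json
PUT /_search/pipeline/default_model_pipeline
{
  "request_processors": [
    {
      "neural_query_enricher": {
        "default_model_id": "aP2Q8ooBpBj3wT4HVS8a"
      }
    }
  ]
}
```

You can then set this pipeline as the search default for the index (for example, through the `index.search.default_pipeline` setting) so that queries omitting `model_id` use the default model.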
To search using a sparse vector, provide the sparse vector in the `query_tokens` parameter:
```json
GET my-nlp-index/_search
{
  "query": {
    "neural_sparse": {
      "passage_embedding": {
        "query_tokens": {
          "hi": 4.338913,
          "planets": 2.7755864,
          "planet": 5.0969057,
          "mars": 1.7405145,
          "earth": 2.6087382,
          "hello": 3.3210192
        }
      }
    }
  }
}
```
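Because `neural_sparse` is a standard query clause, it can also be combined with other query DSL clauses, for example, inside a `bool` query. The following sketch assumes a hypothetical `category` keyword field exists in the index and uses it to filter sparse vector matches:

```json
GET my-nlp-index/_search
{
  "query": {
    "bool": {
      "must": [
        {
          "neural_sparse": {
            "passage_embedding": {
              "query_text": "Hi world",
              "analyzer": "bert-uncased"
            }
          }
        }
      ],
      "filter": [
        { "term": { "category": "greetings" } }
      ]
    }
  }
}
```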
## Next steps
- For more information about neural sparse search, see Neural sparse search.