docs/reference.asciidoc
Lines changed: 40 additions & 7 deletions
@@ -1734,7 +1734,7 @@ client.search({ ... })
1734
1734
** *`profile` (Optional, boolean)*: Set to `true` to return detailed timing information about the execution of individual components in a search request. NOTE: This is a debugging tool and adds significant overhead to search execution.
** *`rescore` (Optional, { window_size, query, learning_to_rank } | { window_size, query, learning_to_rank }[])*: Can be used to improve precision by reordering just the top (for example 100 - 500) documents returned by the `query` and `post_filter` phases.
1737
-
** *`retriever` (Optional, { standard, knn, rrf, text_similarity_reranker, rule })*: A retriever is a specification to describe top documents returned from a search. A retriever replaces other elements of the search API that also return top documents such as `query` and `knn`.
1737
+
** *`retriever` (Optional, { standard, knn, rrf, text_similarity_reranker, rule, rescorer, linear, pinned })*: A retriever is a specification to describe top documents returned from a search. A retriever replaces other elements of the search API that also return top documents such as `query` and `knn`.
1738
1738
** *`script_fields` (Optional, Record<string, { script, ignore_failure }>)*: Retrieve a script evaluation (based on different fields) for each hit.
1739
1739
** *`search_after` (Optional, number | number | string | boolean | null | User-defined value[])*: Used to retrieve the next page of hits using a set of sort values from the previous page.
1740
1740
** *`size` (Optional, number)*: The number of hits to return, which must not be negative. By default, you cannot page through more than 10,000 hits using the `from` and `size` parameters. To page through more hits, use the `search_after` property.
@@ -7231,9 +7231,45 @@ Changes dynamic index settings in real time.
7231
7231
For data streams, index setting changes are applied to all backing indices by default.
7232
7232
7233
7233
To revert a setting to the default value, use a null value.
7234
-
The list of per-index settings that can be updated dynamically on live indices can be found in index module documentation.
7234
+
The list of per-index settings that can be updated dynamically on live indices can be found in index settings documentation.
7235
7235
To preserve existing settings from being updated, set the `preserve_existing` parameter to `true`.
7236
7236
7237
+
There are multiple valid ways to represent index settings in the request body. You can specify only the setting, for example:
7238
+
7239
+
----
7240
+
{
7241
+
"number_of_replicas": 1
7242
+
}
7243
+
----
7244
+
7245
+
Or you can use an `index` setting object:
7246
+
----
7247
+
{
7248
+
"index": {
7249
+
"number_of_replicas": 1
7250
+
}
7251
+
}
7252
+
----
7253
+
7254
+
Or you can use dot notation:
7255
+
----
7256
+
{
7257
+
"index.number_of_replicas": 1
7258
+
}
7259
+
----
7260
+
7261
+
Or you can embed any of the aforementioned options in a `settings` object. For example:
7262
+
7263
+
----
7264
+
{
7265
+
"settings": {
7266
+
"index": {
7267
+
"number_of_replicas": 1
7268
+
}
7269
+
}
7270
+
}
7271
+
----
7272
+
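The four request-body shapes above all describe the same setting. The sketch below illustrates that equivalence with a hypothetical helper, `normalizeIndexSettings`, which is not part of the client or of Elasticsearch. It is an assumption made only for illustration: it unwraps an optional `settings` object, flattens nested objects into dotted keys, and adds the `index.` prefix where it is missing.

```typescript
// Hypothetical helper for illustration only; not part of the Elasticsearch client.
type SettingsBody = { [key: string]: unknown };

function isPlainObject(value: unknown): value is SettingsBody {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// Recursively turn nested objects into dotted keys, e.g. { index: { a: 1 } } -> { "index.a": 1 }.
function flatten(obj: SettingsBody, prefix: string, out: Record<string, unknown>): void {
  for (const [key, value] of Object.entries(obj)) {
    const path = prefix ? `${prefix}.${key}` : key;
    if (isPlainObject(value)) {
      flatten(value, path, out);
    } else {
      out[path] = value;
    }
  }
}

function normalizeIndexSettings(body: SettingsBody): Record<string, unknown> {
  // Unwrap an optional top-level `settings` object.
  const wrapped = body["settings"];
  const inner = isPlainObject(wrapped) ? wrapped : body;

  const flat: Record<string, unknown> = {};
  flatten(inner, "", flat);

  // Add the `index.` prefix to keys that were given without it.
  const normalized: Record<string, unknown> = {};
  for (const [key, value] of Object.entries(flat)) {
    normalized[key.startsWith("index.") ? key : `index.${key}`] = value;
  }
  return normalized;
}

// The four shapes from the text above.
const bareSetting = { number_of_replicas: 1 };
const indexObject = { index: { number_of_replicas: 1 } };
const dotNotation = { "index.number_of_replicas": 1 };
const settingsWrapper = { settings: { index: { number_of_replicas: 1 } } };
```

Under these assumptions, passing any of the four objects to `normalizeIndexSettings` yields the same flat map with the single key `index.number_of_replicas`.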
7237
7273
NOTE: You can only define new analyzers on closed indices.
7238
7274
To add an analyzer, you must close the index, define the analyzer, and reopen the index.
7239
7275
You cannot close the write index of a data stream.
8016
-
The chat completion inference API enables real-time responses for chat completion tasks by delivering answers incrementally, reducing response times during computation.
8052
+
The chat completion inference API enables real-time responses for chat completion tasks by delivering answers incrementally, reducing response times during computation.
8017
8053
It only works with the `chat_completion` task type for `openai` and `elastic` inference services.
8018
8054
8019
-
IMPORTANT: The inference APIs enable you to use certain services, such as built-in machine learning models (ELSER, E5), models uploaded through Eland, Cohere, OpenAI, Azure, Google AI Studio, Google Vertex AI, Anthropic, Watsonx.ai, or Hugging Face.
8020
-
For built-in models and models uploaded through Eland, the inference APIs offer an alternative way to use and manage trained models. However, if you do not plan to use the inference APIs to use these models or if you want to use non-NLP models, use the machine learning trained model APIs.
8021
-
8022
8055
NOTE: The `chat_completion` task type is only available within the _stream API and only supports streaming.
8023
8056
The Chat completion inference API and the Stream inference API differ in their response structure and capabilities.
8024
8057
The Chat completion inference API provides more comprehensive customization options through more fields and function calling support.
@@ -8421,7 +8454,7 @@ client.inference.putGooglevertexai({ task_type, googlevertexai_inference_id, ser
8421
8454
==== Arguments
8422
8455
8423
8456
* *Request (object):*
8424
-
** *`task_type` (Enum("rerank" | "text_embedding"))*: The type of the inference task that the model will perform.
8457
+
** *`task_type` (Enum("rerank" | "text_embedding" | "completion" | "chat_completion"))*: The type of the inference task that the model will perform.
8425
8458
** *`googlevertexai_inference_id` (string)*: The unique identifier of the inference endpoint.
8426
8459
** *`service` (Enum("googlevertexai"))*: The type of service supported for the specified task type. In this case, `googlevertexai`.
8427
8460
** *`service_settings` ({ location, model_id, project_id, rate_limit, service_account_json })*: Settings used to install the inference model. These settings are specific to the `googlevertexai` service.
src/api/api/indices.ts
Lines changed: 1 addition & 1 deletion
@@ -1854,7 +1854,7 @@ export default class Indices {
1854
1854
}
1855
1855
1856
1856
/**
1857
-
* Update index settings. Changes dynamic index settings in real time. For data streams, index setting changes are applied to all backing indices by default. To revert a setting to the default value, use a null value. The list of per-index settings that can be updated dynamically on live indices can be found in index module documentation. To preserve existing settings from being updated, set the `preserve_existing` parameter to `true`. NOTE: You can only define new analyzers on closed indices. To add an analyzer, you must close the index, define the analyzer, and reopen the index. You cannot close the write index of a data stream. To update the analyzer for a data stream's write index and future backing indices, update the analyzer in the index template used by the stream. Then roll over the data stream to apply the new analyzer to the stream's write index and future backing indices. This affects searches and any new data added to the stream after the rollover. However, it does not affect the data stream's backing indices or their existing data. To change the analyzer for existing backing indices, you must create a new data stream and reindex your data into it.
1857
+
* Update index settings. Changes dynamic index settings in real time. For data streams, index setting changes are applied to all backing indices by default. To revert a setting to the default value, use a null value. The list of per-index settings that can be updated dynamically on live indices can be found in index settings documentation. To preserve existing settings from being updated, set the `preserve_existing` parameter to `true`. There are multiple valid ways to represent index settings in the request body. You can specify only the setting, for example: ``` { "number_of_replicas": 1 } ``` Or you can use an `index` setting object: ``` { "index": { "number_of_replicas": 1 }} ``` Or you can use dot notation: ``` { "index.number_of_replicas": 1 } ``` Or you can embed any of the aforementioned options in a `settings` object. For example: ``` { "settings": { "index": { "number_of_replicas": 1 }}} ``` NOTE: You can only define new analyzers on closed indices. To add an analyzer, you must close the index, define the analyzer, and reopen the index. You cannot close the write index of a data stream. To update the analyzer for a data stream's write index and future backing indices, update the analyzer in the index template used by the stream. Then roll over the data stream to apply the new analyzer to the stream's write index and future backing indices. This affects searches and any new data added to the stream after the rollover. However, it does not affect the data stream's backing indices or their existing data. To change the analyzer for existing backing indices, you must create a new data stream and reindex your data into it.
1858
1858
* @see {@link https://www.elastic.co/guide/en/elasticsearch/reference/8.19/indices-update-settings.html | Elasticsearch API documentation}
src/api/api/inference.ts
Lines changed: 1 addition & 1 deletion
@@ -45,7 +45,7 @@ export default class Inference {
45
45
}
46
46
47
47
/**
48
-
* Perform chat completion inference The chat completion inference API enables real-time responses for chat completion tasks by delivering answers incrementally, reducing response times during computation. It only works with the `chat_completion` task type for `openai` and `elastic` inference services. IMPORTANT: The inference APIs enable you to use certain services, such as built-in machine learning models (ELSER, E5), models uploaded through Eland, Cohere, OpenAI, Azure, Google AI Studio, Google Vertex AI, Anthropic, Watsonx.ai, or Hugging Face. For built-in models and models uploaded through Eland, the inference APIs offer an alternative way to use and manage trained models. However, if you do not plan to use the inference APIs to use these models or if you want to use non-NLP models, use the machine learning trained model APIs. NOTE: The `chat_completion` task type is only available within the _stream API and only supports streaming. The Chat completion inference API and the Stream inference API differ in their response structure and capabilities. The Chat completion inference API provides more comprehensive customization options through more fields and function calling support. If you use the `openai` service or the `elastic` service, use the Chat completion inference API.
48
+
* Perform chat completion inference The chat completion inference API enables real-time responses for chat completion tasks by delivering answers incrementally, reducing response times during computation. It only works with the `chat_completion` task type for `openai` and `elastic` inference services. NOTE: The `chat_completion` task type is only available within the _stream API and only supports streaming. The Chat completion inference API and the Stream inference API differ in their response structure and capabilities. The Chat completion inference API provides more comprehensive customization options through more fields and function calling support. If you use the `openai` service or the `elastic` service, use the Chat completion inference API.
49
49
* @see {@link https://www.elastic.co/guide/en/elasticsearch/reference/8.19/chat-completion-inference-api.html | Elasticsearch API documentation}