ES|QL multivalue functions

Serverless Stack

ES|QL supports these multivalue functions:

MV_APPEND
MV_AVG
MV_CONCAT
MV_CONTAINS Stack Preview Serverless Preview
MV_COUNT
MV_DEDUPE
MV_FIRST
MV_LAST
MV_MAX
MV_MEDIAN
MV_MEDIAN_ABSOLUTE_DEVIATION
MV_MIN
MV_PERCENTILE
MV_PSERIES_WEIGHTED_SUM
MV_SORT
MV_SLICE
MV_SUM
MV_ZIP

`MV_APPEND`

Syntax

Parameters

field1
field2

Description

Concatenates values of two multi-value fields.

Supported types

field1	field2	result
boolean	boolean	boolean
cartesian_point	cartesian_point	cartesian_point
cartesian_shape	cartesian_shape	cartesian_shape
date	date	date
date_nanos	date_nanos	date_nanos
double	double	double
geo_point	geo_point	geo_point
geo_shape	geo_shape	geo_shape
geohash	geohash	geohash
geohex	geohex	geohex
geotile	geotile	geotile
integer	integer	integer
ip	ip	ip
keyword	keyword	keyword
keyword	text	keyword
long	long	long
text	keyword	keyword
text	text	keyword
unsigned_long	unsigned_long	unsigned_long
version	version	version

Example

		FROM employees
| WHERE emp_no == 10039 OR emp_no == 10040
| SORT emp_no
| EVAL dates = MV_APPEND(birth_date, hire_date)
| KEEP emp_no, birth_date, hire_date, dates
		
	

emp_no:integer	birth_date:date	hire_date:date	dates:date
10039	1959-10-01T00:00:00Z	1988-01-19T00:00:00Z	[1959-10-01T00:00:00Z, 1988-01-19T00:00:00Z]
10040	null	1993-02-14T00:00:00Z	null

`MV_AVG`

Syntax

Parameters

number: Multivalue expression.

Description

Converts a multivalued field into a single valued field containing the average of all of the values.

Supported types

number	result
double	double
integer	double
long	double
unsigned_long	double

Example

		ROW a=[3, 5, 1, 6]
| EVAL avg_a = MV_AVG(a)

a:integer	avg_a:double
[3, 5, 1, 6]	3.75

`MV_CONCAT`

Syntax

Parameters

string: Multivalue expression.
delim: Delimiter.

Description

Converts a multivalued string expression into a single valued column containing the concatenation of all values separated by a delimiter.

Supported types

string	delim	result
keyword	keyword	keyword
keyword	text	keyword
text	keyword	keyword
text	text	keyword

Examples

		ROW a=["foo", "zoo", "bar"]
| EVAL j = MV_CONCAT(a, ", ")

a:keyword	j:keyword
["foo", "zoo", "bar"]	"foo, zoo, bar"

To concat non-string columns, call TO_STRING first:

		ROW a=[10, 9, 8]
| EVAL j = MV_CONCAT(TO_STRING(a), ", ")

a:integer	j:keyword
[10, 9, 8]	"10, 9, 8"

`MV_CONTAINS`

Stack Preview 9.2.0

Syntax

Parameters

superset: Multivalue expression.
subset: Multivalue expression.

Description

Checks if all values yielded by the second multivalue expression are present in the values yielded by the first multivalue expression. Returns a boolean. Null values are treated as an empty set.

Supported types

superset	subset	result
boolean	boolean	boolean
cartesian_point	cartesian_point	boolean
cartesian_shape	cartesian_shape	boolean
date	date	boolean
date_nanos	date_nanos	boolean
double	double	boolean
geo_point	geo_point	boolean
geo_shape	geo_shape	boolean
geohash	geohash	boolean
geohex	geohex	boolean
geotile	geotile	boolean
integer	integer	boolean
ip	ip	boolean
keyword	keyword	boolean
keyword	text	boolean
long	long	boolean
text	keyword	boolean
text	text	boolean
unsigned_long	unsigned_long	boolean
version	version	boolean

Examples

		ROW set = ["a", "b", "c"], element = "a"
| EVAL set_contains_element = mv_contains(set, element)

set:keyword	element:keyword	set_contains_element:boolean
[a, b, c]	a	true

		ROW setA = ["a","c"], setB = ["a", "b", "c"]
| EVAL a_subset_of_b = mv_contains(setB, setA)
| EVAL b_subset_of_a = mv_contains(setA, setB)
		
	

setA:keyword	setB:keyword	a_subset_of_b:boolean	b_subset_of_a:boolean
[a, c]	[a, b, c]	true	false

		FROM airports
| WHERE mv_contains(type, ["major","military"]) AND scalerank == 9
| KEEP scalerank, name, country
		
	

scalerank:integer	name:text	country:keyword
9	Chandigarh Int'l	India

`MV_COUNT`

Syntax

Parameters

field: Multivalue expression.

Description

Converts a multivalued expression into a single valued column containing a count of the number of values.

Supported types

field	result
boolean	integer
cartesian_point	integer
cartesian_shape	integer
date	integer
date_nanos	integer
double	integer
geo_point	integer
geo_shape	integer
geohash	integer
geohex	integer
geotile	integer
integer	integer
ip	integer
keyword	integer
long	integer
text	integer
unsigned_long	integer
version	integer

Example

		ROW a=["foo", "zoo", "bar"]
| EVAL count_a = MV_COUNT(a)

a:keyword	count_a:integer
["foo", "zoo", "bar"]	3

`MV_DEDUPE`

Syntax

Parameters

field: Multivalue expression.

Description

Remove duplicate values from a multivalued field.

Note

MV_DEDUPE may, but won’t always, sort the values in the column.

Supported types

field	result
boolean	boolean
cartesian_point	cartesian_point
cartesian_shape	cartesian_shape
date	date
date_nanos	date_nanos
double	double
geo_point	geo_point
geo_shape	geo_shape
geohash	geohash
geohex	geohex
geotile	geotile
integer	integer
ip	ip
keyword	keyword
long	long
text	keyword
unsigned_long	unsigned_long
version	version

Example

		ROW a=["foo", "foo", "bar", "foo"]
| EVAL dedupe_a = MV_DEDUPE(a)

a:keyword	dedupe_a:keyword
["foo", "foo", "bar", "foo"]	["foo", "bar"]

`MV_FIRST`

Syntax

Parameters

field: Multivalue expression.

Description

Converts a multivalued expression into a single valued column containing the first value. This is most useful when reading from a function that emits multivalued columns in a known order like SPLIT.

The order that multivalued fields are read from underlying storage is not guaranteed. It is frequently ascending, but don’t rely on that. If you need the minimum value use MV_MIN instead of MV_FIRST. MV_MIN has optimizations for sorted values so there isn’t a performance benefit to MV_FIRST.

Supported types

field	result
boolean	boolean
cartesian_point	cartesian_point
cartesian_shape	cartesian_shape
date	date
date_nanos	date_nanos
double	double
geo_point	geo_point
geo_shape	geo_shape
geohash	geohash
geohex	geohex
geotile	geotile
integer	integer
ip	ip
keyword	keyword
long	long
text	keyword
unsigned_long	unsigned_long
version	version

Example

		ROW a="foo;bar;baz"
| EVAL first_a = MV_FIRST(SPLIT(a, ";"))

a:keyword	first_a:keyword
foo;bar;baz	"foo"

`MV_LAST`

Syntax

Parameters

field: Multivalue expression.

Description

Converts a multivalue expression into a single valued column containing the last value. This is most useful when reading from a function that emits multivalued columns in a known order like SPLIT.

The order that multivalued fields are read from underlying storage is not guaranteed. It is frequently ascending, but don’t rely on that. If you need the maximum value use MV_MAX instead of MV_LAST. MV_MAX has optimizations for sorted values so there isn’t a performance benefit to MV_LAST.

Supported types

field	result
boolean	boolean
cartesian_point	cartesian_point
cartesian_shape	cartesian_shape
date	date
date_nanos	date_nanos
double	double
geo_point	geo_point
geo_shape	geo_shape
geohash	geohash
geohex	geohex
geotile	geotile
integer	integer
ip	ip
keyword	keyword
long	long
text	keyword
unsigned_long	unsigned_long
version	version

Example

		ROW a="foo;bar;baz"
| EVAL last_a = MV_LAST(SPLIT(a, ";"))

a:keyword	last_a:keyword
foo;bar;baz	"baz"

`MV_MAX`

Syntax

Parameters

field: Multivalue expression.

Description

Converts a multivalued expression into a single valued column containing the maximum value.

Supported types

field	result
boolean	boolean
date	date
date_nanos	date_nanos
double	double
integer	integer
ip	ip
keyword	keyword
long	long
text	keyword
unsigned_long	unsigned_long
version	version

Examples

		ROW a=[3, 5, 1]
| EVAL max_a = MV_MAX(a)

a:integer	max_a:integer
[3, 5, 1]	5

It can be used by any column type, including keyword columns. In that case it picks the last string, comparing their utf-8 representation byte by byte:

		ROW a=["foo", "zoo", "bar"]
| EVAL max_a = MV_MAX(a)

a:keyword	max_a:keyword
["foo", "zoo", "bar"]	"zoo"

`MV_MEDIAN`

Syntax

Parameters

number: Multivalue expression.

Description

Converts a multivalued field into a single valued field containing the median value.

Supported types

number	result
double	double
integer	integer
long	long
unsigned_long	unsigned_long

Examples

		ROW a=[3, 5, 1]
| EVAL median_a = MV_MEDIAN(a)

a:integer	median_a:integer
[3, 5, 1]	3

If the row has an even number of values for a column, the result will be the average of the middle two entries. If the column is not floating point, the average rounds down:

		ROW a=[3, 7, 1, 6]
| EVAL median_a = MV_MEDIAN(a)

a:integer	median_a:integer
[3, 7, 1, 6]	4

`MV_MEDIAN_ABSOLUTE_DEVIATION`

Syntax

Parameters

number: Multivalue expression.

Description

Converts a multivalued field into a single valued field containing the median absolute deviation. It is calculated as the median of each data point’s deviation from the median of the entire sample. That is, for a random variable X, the median absolute deviation is median(|median(X) - X|).

Note

If the field has an even number of values, the medians will be calculated as the average of the middle two values. If the value is not a floating point number, the averages are rounded towards 0.

Supported types

number	result
double	double
integer	integer
long	long
unsigned_long	unsigned_long

Example

		ROW values = [0, 2, 5, 6]
| EVAL median_absolute_deviation = MV_MEDIAN_ABSOLUTE_DEVIATION(values), median = MV_MEDIAN(values)

values:integer	median_absolute_deviation:integer	median:integer
[0, 2, 5, 6]	2	3

`MV_MIN`

Syntax

Parameters

field: Multivalue expression.

Description

Converts a multivalued expression into a single valued column containing the minimum value.

Supported types

field	result
boolean	boolean
date	date
date_nanos	date_nanos
double	double
integer	integer
ip	ip
keyword	keyword
long	long
text	keyword
unsigned_long	unsigned_long
version	version

Examples

		ROW a=[2, 1]
| EVAL min_a = MV_MIN(a)

a:integer	min_a:integer
[2, 1]	1

It can be used by any column type, including keyword columns. In that case, it picks the first string, comparing their utf-8 representation byte by byte:

		ROW a=["foo", "bar"]
| EVAL min_a = MV_MIN(a)

a:keyword	min_a:keyword
["foo", "bar"]	"bar"

`MV_PERCENTILE`

Syntax

Parameters

number: Multivalue expression.
percentile: The percentile to calculate. Must be a number between 0 and 100. Numbers out of range will return a null instead.

Description

Converts a multivalued field into a single valued field containing the value at which a certain percentage of observed values occur.

Supported types

number	percentile	result
double	double	double
double	integer	double
double	long	double
integer	double	integer
integer	integer	integer
integer	long	integer
long	double	long
long	integer	long
long	long	long

Example

		ROW values = [5, 5, 10, 12, 5000]
| EVAL p50 = MV_PERCENTILE(values, 50), median = MV_MEDIAN(values)

values:integer	p50:integer	median:integer
[5, 5, 10, 12, 5000]	10	10

`MV_PSERIES_WEIGHTED_SUM`

Syntax

Parameters

number: Multivalue expression.
p: It is a constant number that represents the p parameter in the P-Series. It impacts every element’s contribution to the weighted sum.

Description

Converts a multivalued expression into a single-valued column by multiplying every element on the input list by its corresponding term in P-Series and computing the sum.

Supported types

number	p	result
double	double	double

Example

		ROW a = [70.0, 45.0, 21.0, 21.0, 21.0]
| EVAL sum = MV_PSERIES_WEIGHTED_SUM(a, 1.5)
| KEEP sum
		
	

sum:double
94.45465156212452

`MV_SLICE`

Syntax

Parameters

field: Multivalue expression. If null, the function returns null.
start: Start position. If null, the function returns null. The start argument can be negative. An index of -1 is used to specify the last value in the list.
end: End position(included). Optional; if omitted, the position at start is returned. The end argument can be negative. An index of -1 is used to specify the last value in the list.

Description

Returns a subset of the multivalued field using the start and end index values. This is most useful when reading from a function that emits multivalued columns in a known order like SPLIT or MV_SORT.

The order that multivalued fields are read from underlying storage is not guaranteed. It is frequently ascending, but don’t rely on that.

Supported types

field	start	end	result
boolean	integer	integer	boolean
cartesian_point	integer	integer	cartesian_point
cartesian_shape	integer	integer	cartesian_shape
date	integer	integer	date
date_nanos	integer	integer	date_nanos
double	integer	integer	double
geo_point	integer	integer	geo_point
geo_shape	integer	integer	geo_shape
geohash	integer	integer	geohash
geohex	integer	integer	geohex
geotile	integer	integer	geotile
integer	integer	integer	integer
ip	integer	integer	ip
keyword	integer	integer	keyword
long	integer	integer	long
text	integer	integer	keyword
unsigned_long	integer	integer	unsigned_long
version	integer	integer	version

Examples

		row a = [1, 2, 2, 3]
| eval a1 = mv_slice(a, 1), a2 = mv_slice(a, 2, 3)

a:integer	a1:integer	a2:integer
[1, 2, 2, 3]	2	[2, 3]

		row a = [1, 2, 2, 3]
| eval a1 = mv_slice(a, -2), a2 = mv_slice(a, -3, -1)

a:integer	a1:integer	a2:integer
[1, 2, 2, 3]	2	[2, 2, 3]

`MV_SORT`

Syntax

Parameters

field: Multivalue expression. If null, the function returns null.
order: Sort order. The valid options are ASC and DESC, the default is ASC.

Description

Sorts a multivalued field in lexicographical order.

Supported types

field	order	result
boolean	keyword	boolean
date	keyword	date
date_nanos	keyword	date_nanos
double	keyword	double
integer	keyword	integer
ip	keyword	ip
keyword	keyword	keyword
long	keyword	long
text	keyword	keyword
version	keyword	version

Example

		ROW a = [4, 2, -3, 2]
| EVAL sa = mv_sort(a), sd = mv_sort(a, "DESC")

a:integer	sa:integer	sd:integer
[4, 2, -3, 2]	[-3, 2, 2, 4]	[4, 2, 2, -3]

`MV_SUM`

Syntax

Parameters

number: Multivalue expression.

Description

Converts a multivalued field into a single valued field containing the sum of all of the values.

Supported types

number	result
double	double
integer	integer
long	long
unsigned_long	unsigned_long

Example

		ROW a=[3, 5, 6]
| EVAL sum_a = MV_SUM(a)

a:integer	sum_a:integer
[3, 5, 6]	14

`MV_ZIP`

Syntax

Parameters

string1: Multivalue expression.
string2: Multivalue expression.
delim: Delimiter. Optional; if omitted, , is used as a default delimiter.

Description

Combines the values from two multivalued fields with a delimiter that joins them together.

Supported types

string1	string2	delim	result
keyword	keyword	keyword	keyword
keyword	keyword	text	keyword
keyword	keyword		keyword
keyword	text	keyword	keyword
keyword	text	text	keyword
keyword	text		keyword
text	keyword	keyword	keyword
text	keyword	text	keyword
text	keyword		keyword
text	text	keyword	keyword
text	text	text	keyword
text	text		keyword

Example

		ROW a = ["x", "y", "z"], b = ["1", "2"]
| EVAL c = mv_zip(a, b, "-")
| KEEP a, b, c
		
	

a:keyword	b:keyword	c:keyword
[x, y, z]	[1 ,2]	[x-1, y-2, z]