contextby

Syntax

contextby(func, funcArgs, groupingCol, [sortingCol], [semanticFilter=1])

or

funcArg func:X groupingCol

or

func:X(funcArgs, groupingCol, [sortingCol], [semanticFilter=1])

Arguments

func is a function. For the second use case, func can only have one parameter (funcArg).

funcArgs are the parameters of func. It is a tuple if there are more than 1 parameter of func.

groupingCol is the grouping variable(s). It can be one or multiple vectors.

sortingCol is an optional argument for within group sorting before applying func.

semanticFilter (optional) is a positive integer indicating which columns to include in calculations when funcArgs is a table. The possible values are:

  • 0 - All columns
  • 1 (default) - Columns of FLOATING, INTEGRAL, and DECIMAL categories, excluding the COMPRESSED data type.
  • 2 - Columns of TEMPORAL category.
  • 3 - Columns of LITERAL category, excluding the BLOB data type.
  • 4 - Columns of FLOATING, INTEGRAL, DECIMAL, and TEMPORAL categories, excluding the COMPRESSED data type.

The vectors in groupingCol, sortingCol and each of the function argument in funcArgs all have the same size.

Details

Calculate func(funcArgs) for each groupingCol group. The result is a vector of the same size as each of the input arguments other than func. If func is an aggregate function, all elements within the same group have identical result. We can use sortingCol to sort the within-group data before the calculation.

Note: The keyword defg must be used to declare an aggregate function.

Examples

sym=`IBM`IBM`IBM`MS`MS`MS
price=172.12 170.32 175.25 26.46 31.45 29.43
qty=5800 700 9000 6300 2100 5300
trade_date=2013.05.08 2013.05.06 2013.05.07 2013.05.08 2013.05.06 2013.05.07;
contextby(avg, price, sym);
// output: [172.563,172.563,172.563,29.113,29.113,29.113]

price avg :X sym;
// output: [172.563,172.563,172.563,29.113,29.113,29.113]

price at price>contextby(avg, price,sym);
// output: [175.25,31.45,29.43]

price at price>price avg :X sym;
// output: [175.25,31.45,29.43]

sym at price>contextby(avg, price,sym);
// output: ["IBM","MS","MS"]

// calculate volume-weighted average price (vwap)
contextby(wavg, [price, qty], sym);
// output: [173.856,173.856,173.856,28.374,28.374,28.374]

contextby(ratios, price, sym, trade_date) - 1;
// output: [-0.01786,,0.028946,-0.100917,,-0.064229]

Multiple vectors for groupingCol :

sym=`IBM`IBM`IBM`IBM`IBM`IBM`MS`MS`MS`MS`MS`MS
date=2020.12.01 + 0 0 0 1 1 1 0 0 0 1 1 1
qty=5800 700 9000 1000 3500 3900 6300 2100 5300 7800 1200 4300
contextby(cumsum, qty, [sym,date]);
// output: [5800,6500,15500,1000,4500,8400,6300,8400,13700,7800,9000,13300]

We can embed a contextby template into a SQL query:

t1=table(trade_date,sym,qty,price);
t1;
trade_date sym qty price
2013.05.08 IBM 5800 172.12
2013.05.06 IBM 700 170.32
2013.05.07 IBM 9000 175.25
2013.05.08 MS 6300 26.46
2013.05.06 MS 2100 31.45
2013.05.07 MS 5300 29.43
// select orders with a price higher than the average price of the group
select trade_date, sym, qty, price from t1 where price > contextby(avg, price,sym);
trade_date sym qty price
2013.05.07 IBM 9000 175.25
2013.05.06 MS 2100 31.45
2013.05.07 MS 5300 29.43
// add 1 day to all trade dates in t1
contextby(temporalAdd{,1d}, t1, t1.sym,,2)

trade_date

sym

qty

price

2013.05.09 IBM 5,800 172.12
2013.05.07 IBM 700 170.32
2013.05.08 IBM 9,000 175.25
2013.05.09 MS 6,300 26.46
2013.05.07 MS 2,100 31.45
2013.05.08 MS 5,300 29.43