Druid Default Distinct Approximation Algorithm


Is there a way to replace the default HLL approximation algorithm with ThetaSketch in Druid, so that count-distinct queries use ThetaSketch by default instead of HLL?


There is 1 answer

Peter Marshall On

I believe you need to be explicit at query time, e.g. by using APPROX_COUNT_DISTINCT_DS_THETA rather than APPROX_COUNT_DISTINCT_DS_HLL.

https://druid.apache.org/docs/latest/querying/sql.html#aggregation-functions
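
To make the "explicit at query time" suggestion concrete, here is a minimal sketch that builds the SQL text and the JSON body you would POST to Druid's SQL endpoint (/druid/v2/sql). The datasource "events", the column "user_id", and the helper names are hypothetical, chosen only for illustration, and both functions assume the druid-datasketches extension is loaded. The sketch only builds the request; it does not send it.

```python
import json

# Hypothetical names for illustration; replace with your own datasource and column.
DATASOURCE = "events"
COLUMN = "user_id"


def distinct_count_sql(datasource: str, column: str, use_theta: bool = True) -> str:
    """Build a count-distinct query that names the sketch function explicitly."""
    func = (
        "APPROX_COUNT_DISTINCT_DS_THETA"
        if use_theta
        else "APPROX_COUNT_DISTINCT_DS_HLL"
    )
    # Identifiers are double-quoted, as in standard SQL.
    return f'SELECT {func}("{column}") AS distinct_count FROM "{datasource}"'


def sql_request_body(sql: str) -> str:
    """Wrap the SQL text in the JSON object expected by POST /druid/v2/sql."""
    return json.dumps({"query": sql})


if __name__ == "__main__":
    # Print the request body for the Theta sketch variant.
    print(sql_request_body(distinct_count_sql(DATASOURCE, COLUMN)))
```

Passing use_theta=False gives the same query with APPROX_COUNT_DISTINCT_DS_HLL, which makes it easy to compare the two estimates on the same column.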