Window function ignore nulls not working in Databricks

Question

Window function ignore nulls not working in Databricks

366 views Asked by VarYaz At 12 October 2023 at 15:12

I am new to Databricks and was required to implement the snowflake code in Databricks.

The snowflake table, code and output look like below:

table:

id	col1	hn
ee1	null	1
ee1	null	2
ee1	test	3
ee1	test	4
ee1	test2	5

Query used:

SELECT ID, FIRST_VALUE(col1) ignore nulls OVER (PARTITION BY ID ORDER BY hn) AS first_value, LAST_VALUE(col1) ignore nulls OVER (PARTITION BY ID ORDER BY hn) AS last_value FROM table

Output:

id	first_value	last_value
ee1	test	test2
ee1	test	test2
ee1	test	test2
ee1	test	test2
ee1	test	test2

When I tried the same query in Databricks using Spark SQL, ignore nulls did not work properly.

Can anyone provide the equivalent query for this in Databricks?

Original Q&A

There are 1 answers

**Lukasz Szozda** · Accepted Answer · 2023-10-12T15:20:24+00:00

The key point is the window frame specification:

SELECT ID, 
  FIRST_VALUE(col1) ignore nulls OVER (PARTITION BY ID ORDER BY hn) AS first_value, 
  LAST_VALUE(col1) ignore nulls OVER (PARTITION BY ID ORDER BY hn 
            ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS last_value 
FROM table;

If not defined explicitly the default is RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW

TechQA.

Window function ignore nulls not working in Databricks

There are 1 answers

Related Questions in PYSPARK

Related Questions in DATABRICKS

Related Questions in SPARK-WINDOW-FUNCTION

Popular Questions

Trending Questions