List Question
20 TechQA 2024-02-11T04:07:10.160000
Inconsistent results in Spark-shell and Spark-submit
48 views
Asked by Janani
Avoid Broadcast Nested Loop Join in Pyspark when the joining condition has a OR clause
109 views
Asked by Ayan Biswas
pyspark - How to split the string inside an array column and make it into json?
201 views
Asked by fresh
Missing methods in PySpark 2.4's pyspark.sql.functions but still works in local environment
225 views
Asked by Simon Mau
Extension of compressed parquet file in Spark
376 views
Asked by Marwan02
Pyspark split the file while writing with specific limit
1k views
Asked by Vikas T
In pyspark 2.4, how to handle columns with the same name resulting of a self join?
278 views
Asked by Itération 122442
Hive beeline and spark load count doesn't match for hive tables
938 views
Asked by VimalK
Specific Spark write operation gradually increases with time in streaming application
125 views
Asked by ponthu
How to set Spark timeout (application killing itself)
1.5k views
Asked by Ilya Brodezki
Convert Spark2.2's UDAF to 3.0 Aggregator
249 views
Asked by Girish Rawat
Can we set up both Spark2.4 and Spark3.0 in single system?
765 views
Asked by HimanshuSPaul
Spark2.4 Unable to overwrite table from same table
2.9k views
Asked by Ratan
spark not downloading hive_metastore jars
884 views
Asked by Arran Duff
Reading HDFS small size partitions?
409 views
Asked by developforacause
Change spark _temporary directory path to avoid deletion of parquets
1.7k views
Asked by moez skanjii
Output Spark application name in driver log
1.4k views
Asked by Valentina
UDFs with Dictionaries on Spark 2.4
710 views
Asked by morfara
Error on Spark 2.4.4 metrics properties in BinaryClassificationMetrics
261 views
Asked by Joe Taras