Unable to write dataframe as xslx format in spark scala

123 views Asked by At
df
 .coalesce(1)
 .write
 .format("com.crealytics.spark.excel")
 .option("useHeader", "true")
 .option("header", "true")
 .mode(SaveMode.Append)
 .save(s"s3://$bucket/$etlFolderPrefix/a.xlsx")

ERROR [main] glue.ProcessLauncher (Logging.scala:logError(94)): Exception in User Class java.lang.NoSuchMethodError: org.apache.commons.io.IOUtils.byteArray(I)[B at org.apache.commons.io.output.AbstractByteArrayOutputStream.needNewBuffer(AbstractByteArrayOutputStream.java:104) at org.apache.commons.io.output.UnsynchronizedByteArrayOutputStream.(UnsynchronizedByteArrayOutputStream.java:51) at org.apache.commons.io.output.UnsynchronizedByteArrayOutputStream.(UnsynchronizedByteArrayOutputStream.java:38) at shadeio.poi.xssf.usermodel.XSSFWorkbook.newPackage(XSSFWorkbook.java:528) at shadeio.poi.xssf.usermodel.XSSFWorkbook.(XSSFWorkbook.java:245) at shadeio.poi.xssf.usermodel.XSSFWorkbook.(XSSFWorkbook.java:241) at shadeio.poi.xssf.usermodel.XSSFWorkbook.(XSSFWorkbook.java:229) at com.crealytics.spark.excel.ExcelFileSaver.save(ExcelFileSaver.scala:45) at com.crealytics.spark.excel.DefaultSource.createRelation Exception in User Class: java.lang.NoClassDefFoundError : org/apache/commons/io/output/UnsynchronizedByteArrayOutputStream

Tried with different versions, currently using spark version 2.12

0

There are 0 answers