Copy Files from AWS S3 to HDFS (Hadoop Distributed File System)

618 views Asked by Swathi At 16 September 2019 at 05:49

I'm trying to copy AVRO files from AWS S3 bucket to HDFS using the following Scala code:

val avroDF  = sparkSession.read.format("com.databricks.spark.avro").load("s3a://"+s3Location+"/")
avroDF.write.format("com.databricks.spark.avro").mode(SaveMode.Append).save(filePath)

The files when being copied to HDFS, part files are getting saved like (part-0001.avro), how to save the file with the same file name as it exists in AWS S3 bucket?

Original Q&A

TechQA.

Copy Files from AWS S3 to HDFS (Hadoop Distributed File System)

There are 0 answers

Related Questions in AMAZON-WEB-SERVICES

Related Questions in AMAZON-S3

Related Questions in HDFS

Related Questions in AVRO

Related Questions in SPARK2

Popular Questions

Popular Tags

Trending Questions