Java Spark how to save a JavaPairRDD<HashSet<String>, HashMap<String, Double>> to file?

Question


Asked by daydayup on 27 April 2018 at 04:47 · 582 views

After some complicated aggregations I ended up with a JavaPairRDD<HashSet<String>, HashMap<String, Double>> and want to save the result to a file. I believe saveAsHadoopFile is a good API for this, but I am having trouble filling in its parameters: saveAsHadoopFile(path, keyClass, valueClass, outputFormatClass, CompressionCodec). Can anyone help?


There is 1 answer

**Devendra Singh** · Answer 1 · 2018-04-27T06:12:21+00:00

You can save the RDD as text with saveAsTextFile and later parse the output back into the desired structure:

rdd.saveAsTextFile("hdfs:///complete_path_to_hdfs_file/");

If you want to use the saveAsHadoopFile API instead, you can call it like this:

rdd.saveAsHadoopFile(complete_path_to_file, HashSet.class, HashMap.class, TextOutputFormat.class);

With TextOutputFormat, each key and value is written using its toString(), separated by a tab. You can also pass a different Hadoop OutputFormat class as the last parameter.

For more information, see the linked page (HadoopFile).
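Since saveAsTextFile writes each element's toString(), the "parse it later" step is easier if every record is first turned into an explicit, predictable line. The following is only a minimal sketch of that idea using plain Java collections: the RecordCodec class, its format and parse methods, and the tab/comma/semicolon layout are all hypothetical choices made for illustration, not part of Spark, and the sketch assumes the strings themselves contain none of those separator characters.

```java
import java.util.AbstractMap;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical helper (not part of Spark): one tab-separated line per record.
// Assumes the strings contain no tab, comma, or semicolon.
final class RecordCodec {

    // Renders a record as "k1,k2<TAB>name1=1.0;name2=2.5", with sorted entries
    // so that equal records always produce the same line.
    static String format(Set<String> key, Map<String, Double> value) {
        String keyPart = String.join(",", new TreeSet<>(key));
        StringBuilder valuePart = new StringBuilder();
        for (Map.Entry<String, Double> e : new TreeMap<>(value).entrySet()) {
            if (valuePart.length() > 0) {
                valuePart.append(';');
            }
            valuePart.append(e.getKey()).append('=').append(e.getValue());
        }
        return keyPart + "\t" + valuePart;
    }

    // Rebuilds the original key and value from a line produced by format().
    static Map.Entry<HashSet<String>, HashMap<String, Double>> parse(String line) {
        String[] parts = line.split("\t", -1); // -1 keeps an empty value part
        HashSet<String> key = new HashSet<>();
        if (!parts[0].isEmpty()) {
            for (String k : parts[0].split(",")) {
                key.add(k);
            }
        }
        HashMap<String, Double> value = new HashMap<>();
        if (!parts[1].isEmpty()) {
            for (String entry : parts[1].split(";")) {
                int eq = entry.lastIndexOf('='); // the number never contains '='
                value.put(entry.substring(0, eq),
                          Double.parseDouble(entry.substring(eq + 1)));
            }
        }
        return new AbstractMap.SimpleEntry<>(key, value);
    }

    public static void main(String[] args) {
        HashSet<String> key = new HashSet<>();
        key.add("b");
        key.add("a");
        HashMap<String, Double> value = new HashMap<>();
        value.put("y", 2.5);
        value.put("x", 1.0);

        String line = format(key, value);
        Map.Entry<HashSet<String>, HashMap<String, Double>> back = parse(line);
        // Round trip: the parsed record equals the original one.
        System.out.println(back.getKey().equals(key) && back.getValue().equals(value));
    }
}
```

In a Spark job, a helper like this could presumably be applied before saving, for example rdd.map(t -> RecordCodec.format(t._1(), t._2())).saveAsTextFile(path), and parse could be called on each line when the file is read back. Sorting the entries is a design choice that keeps the output stable across runs, since HashSet and HashMap do not guarantee iteration order.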
