How to save a file in Hadoop with Python


I am trying to save a file to Hadoop with Python 2.7. I searched the internet and found code that saves to Hadoop, but it uploads an entire folder at once (every file in the folder gets saved to Hadoop). I need to save one specific file.

Here is the link that shows how to put a folder on HDFS: http://www.hadoopy.com/en/latest/tutorial.html#putting-data-on-hdfs

What I need now is to save a particular file to Hadoop, e.g. abc.txt.

Here is my code:

import hadoopy
hdfs_path = 'hdfs://192.168.x.xxx:xxxx/video/py5'
def main():
    local_path = open('abc.txt').read()
    hadoopy.writetb(hdfs_path, local_path)


if __name__ == '__main__':
    main()

Here I am getting the error: need more than one value to unpack

Any help would be appreciated.


There are 2 answers

supakeen On BEST ANSWER

hadoopy.writetb seems to expect an iterable of (key, value) pairs as its second argument. Try:

hadoopy.writetb(hdfs_path, [("abc.txt", open("abc.txt").read())])
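For reference, here is a minimal sketch of the shape writetb wants. The pair-building helper below is plain Python; the actual writetb call is left commented out since it assumes hadoopy is installed and the HDFS host in the question is reachable:

```python
def file_kvs(paths):
    """Yield (filename, contents) pairs for the given local files --
    the iterable-of-two-values shape that hadoopy.writetb expects."""
    for path in paths:
        with open(path, 'rb') as f:
            yield (path, f.read())

# import hadoopy
# hdfs_path = 'hdfs://192.168.x.xxx:xxxx/video/py5'
# hadoopy.writetb(hdfs_path, file_kvs(['abc.txt']))
```

Passing the bare string returned by open('abc.txt').read() fails because writetb tries to unpack each element of the second argument into a key and a value, hence the "need more than one value to unpack" error.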
GodMan On

http://www.hadoopy.com/en/latest/api.html?highlight=hadoopy.writetb#hadoopy.writetb

writetb requires its second argument, kvs, to be an iterator of (key, value) pairs.

As per the link you have given, you forgot to copy the read_local_dir function into your code.