I need to store documents such as .pdf, .doc and .txt files in MaprDB. I saw an example in HBase where files are stored as binary and retrieved as files in Hue, but I am not sure how it could be implemented. Any idea how a document can be stored in MaprDB?
Store documents (.pdf, .doc and .txt files) in MaprDB
280 views Asked by Amu At
First of all, I am not familiar with MaprDB, since I use Cloudera. I do, however, have experience storing many types of objects in HBase as byte arrays.
The most basic way to store anything in HBase, or in any other database, is as a byte array.
You can do this with the Apache Commons Lang API. This is probably the best option, since it applies to all kinds of objects, including images, audio and video.
Please test this method with one of your own file types.
`SerializationUtils.serialize` takes a `Serializable` object and returns a byte array, which you can then insert. Note: the Apache Commons Lang jar is always available in a Hadoop cluster, so it is not an external dependency.
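As a rough illustration of what serializing to bytes means here, the sketch below round-trips an object through a byte array using only the JDK's `ObjectOutputStream` and `ObjectInputStream`. It is a stand-in for `SerializationUtils`, not that library's code, and the class and method names are hypothetical. One caveat worth checking in your own test: serializing a `java.io.File` object captures only its path, so the example serializes the file's contents (a `byte[]`) instead.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper class: a JDK-only stand-in for serialize/deserialize.
class SerializeSketch {

    // Turn any Serializable object into a byte array.
    public static byte[] serialize(Serializable obj) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(buffer)) {
            out.writeObject(obj);
        }
        return buffer.toByteArray();
    }

    // Rebuild the object from the bytes produced by serialize().
    public static Object deserialize(byte[] bytes) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(bytes))) {
            return in.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for a document's contents; in practice these bytes come from the file.
        byte[] fileContents = "sample document text".getBytes(StandardCharsets.UTF_8);

        byte[] stored = serialize(fileContents);          // what would go into the database
        byte[] restored = (byte[]) deserialize(stored);   // what comes back out

        System.out.println("Round trip intact: " + Arrays.equals(fileContents, restored));
    }
}
```

The serialized form carries a small stream header on top of the raw contents, so bytes written this way must also be read back with the matching deserialize step.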
Another example:
If for any reason you don't want to use the `SerializationUtils` class provided by Apache Commons Lang, you can serialize and deserialize a PDF by hand, which may also give you a better understanding of what is going on. That code is lengthy, though; with `SerializationUtils` it is much shorter.
Once you have the byte array, you can prepare a put request to upload it to the database, i.e. HBase or any other database.
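The hand-written route could look something like the sketch below, which reads a file into a byte array with plain Java streams and writes a byte array back out to a file. The class and method names are hypothetical, and `main` uses a temporary file with a few placeholder bytes instead of a real PDF so that it runs anywhere.

```java
import java.io.ByteArrayOutputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.Arrays;

// Hypothetical helper class: file <-> byte[] with plain streams.
class PdfBytesSketch {

    // Read the whole file into memory, 8 KB at a time.
    public static byte[] fileToBytes(File file) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (InputStream in = new FileInputStream(file)) {
            byte[] chunk = new byte[8192];
            int read;
            while ((read = in.read(chunk)) != -1) {
                buffer.write(chunk, 0, read);
            }
        }
        return buffer.toByteArray();
    }

    // Write the bytes back out, recreating the original file contents.
    public static void bytesToFile(byte[] bytes, File target) throws IOException {
        try (OutputStream out = new FileOutputStream(target)) {
            out.write(bytes);
        }
    }

    public static void main(String[] args) throws IOException {
        // Placeholder input: a temp file holding the ASCII bytes "%PDF-".
        File source = File.createTempFile("sample", ".pdf");
        bytesToFile(new byte[] {37, 80, 68, 70, 45}, source);

        byte[] pdfBytes = fileToBytes(source);   // this array is what you would store

        File copy = File.createTempFile("restored", ".pdf");
        bytesToFile(pdfBytes, copy);             // this is how you rebuild the file

        System.out.println("Same contents: " + Arrays.equals(pdfBytes, fileToBytes(copy)));
    }
}
```

Because the whole document is held in memory as one array, this approach suits small and medium-sized files; very large documents may need a different storage layout.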
Once the data is persisted, you can read it back with an HBase get or scan. You get your PDF bytes, which you can then write out again as the same file, i.e. someFile.pdf in this case.
EDIT: Since you asked for HBase examples, I'm adding this. In an HBase put, `yourcolumnasBytearray` is your document file, for instance a PDF, converted to a byte array (using `SerializationUtils.serialize`) as described above.
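A minimal sketch of the put and get steps with the standard HBase Java client might look like this. It assumes the `hbase-client` library and a cluster configuration are on the classpath. The table name `documents`, the column family `doc`, the qualifier `content` and the class name are all hypothetical, and whether the same code runs unchanged against MaprDB is not confirmed here. For simplicity it stores the raw file bytes read with `Files.readAllBytes`.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Table;
import org.apache.hadoop.hbase.util.Bytes;

// Hypothetical class: stores one document per row in a single binary column.
class HBaseDocumentStoreSketch {

    // Assumed schema: column family "doc", qualifier "content".
    private static final byte[] FAMILY = Bytes.toBytes("doc");
    private static final byte[] QUALIFIER = Bytes.toBytes("content");

    // Store the document bytes under the given row key.
    public static void putDocument(Table table, String rowKey, byte[] yourcolumnasBytearray)
            throws IOException {
        Put put = new Put(Bytes.toBytes(rowKey));
        put.addColumn(FAMILY, QUALIFIER, yourcolumnasBytearray);
        table.put(put);
    }

    // Fetch the document bytes back; returns null if the row or column is absent.
    public static byte[] getDocument(Table table, String rowKey) throws IOException {
        Result result = table.get(new Get(Bytes.toBytes(rowKey)));
        return result.getValue(FAMILY, QUALIFIER);
    }

    public static void main(String[] args) throws IOException {
        Configuration conf = HBaseConfiguration.create();
        try (Connection connection = ConnectionFactory.createConnection(conf);
             Table table = connection.getTable(TableName.valueOf("documents"))) {

            byte[] pdfBytes = Files.readAllBytes(Paths.get("someFile.pdf"));
            putDocument(table, "someFile.pdf", pdfBytes);

            // Read the bytes back and recreate the file under a new name.
            byte[] restored = getDocument(table, "someFile.pdf");
            Files.write(Paths.get("someFile-restored.pdf"), restored);
        }
    }
}
```

Using the file name as the row key is only one possible design; any unique identifier works, and extra columns can hold metadata such as the content type.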