I'm trying to plan a SolrCloud implementation, and given current index sizes from testing, my estimated physical index size for 1 billion documents is roughly 20 terabytes. So far, I've been unable to find a cloud host that can support a single volume of this size. I was hoping somebody could provide some guidance with regard to managing an index this large. Is a 20TB index absurd? Is there something I'm missing with regard to SolrCloud architecture? Most of the guidelines I've seen indicate that the entire index, regardless of shard count, should be replicated on every machine to guarantee redundancy, so every node would require a 20TB storage device. If there's anyone out there who can shed some light, I would greatly appreciate it.
Related Questions in SOLR
- Android NFCA issues
- Access protection for NFC Tags
- Android makeReadOnly() throws IOException
- NFC Bluetooth handover - WITHOUT user confirmation
- Using external ACR1222L to get NDEF Messages w/ nfctools OR nxpnfclib
- node-hid not showing all USB devices
- How to convert a Chinese string to english for windows phone store apps[Javascript]
- How can I prevent (overwriting or format or cloning anything else) my NFC TAG?
- Android HCE issue
- onNewIntent() being called in onResume() multiple times
Related Questions in BIGDATA
- Android NFCA issues
- Access protection for NFC Tags
- Android makeReadOnly() throws IOException
- NFC Bluetooth handover - WITHOUT user confirmation
- Using external ACR1222L to get NDEF Messages w/ nfctools OR nxpnfclib
- node-hid not showing all USB devices
- How to convert a Chinese string to english for windows phone store apps[Javascript]
- How can I prevent (overwriting or format or cloning anything else) my NFC TAG?
- Android HCE issue
- onNewIntent() being called in onResume() multiple times
Related Questions in SOLRCLOUD
- Android NFCA issues
- Access protection for NFC Tags
- Android makeReadOnly() throws IOException
- NFC Bluetooth handover - WITHOUT user confirmation
- Using external ACR1222L to get NDEF Messages w/ nfctools OR nxpnfclib
- node-hid not showing all USB devices
- How to convert a Chinese string to english for windows phone store apps[Javascript]
- How can I prevent (overwriting or format or cloning anything else) my NFC TAG?
- Android HCE issue
- onNewIntent() being called in onResume() multiple times
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Popular Tags
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Not sure where you read such guidelines?
It is totally normal to keep only a portion of the index in each shard (each shard having one master and a number of replicas).
You would need to study how to shard your index, using built in routing based on a hash or provide your own.
Edit: so if I understand correctly, you are assuming that every node in the cluster must have either a master or a replica of EVERY shard, correct? If so, the answer is no. In order to provide resilience, you need to have master/replicas of every shard somewhere in the cluster, but you can have a node N that does not contain anything from shard S, as long as S has a master and a replica (at least) in other nodes.