Good evening,
I am trying to analyse the forementioned data(edgelist or pajek format). First thought was R-project with igraph package. But memory limitations(6GB) wont do the trick. Will a 128GB PC be able to handle the data? Are there any alternatives that don't require whole graph in RAM?
Thanks in advance.
P.S: I have found several programs but I would like to hear some pro(yeah, that's you) opinions on the matter.
If you only want degree distributions, you likely don't need a graph package at all. I recommend the bigtablulate package so that
foreachCheck out their website for more details. To give a quick example of this approach, let's first create an example with an edgelist involving 1 million edges among 1 million nodes.
I next concatenate this file 10 times to make the example a bit bigger.
Next we load the
bigtabulatepackage and read in the text file with our edgelist. The commandread.big.matrix()creates a file-backed object in R.We can compute the outdegrees by using
bigtable()on the first column.Quick sanity check to make sure table is working as expected:
To get indegree, just do
bigtable(x,2).