I am going over the DC3 algorithm, the linear time algorithm for construction of suffix arrays. I am unable to understand a technique in the paper which can be found here.
I am unable to understand how the renaming, mentioned on page 6 of the paper, is done. How is the renaming done as per Step 1. The relevant section of code from appendix is:
for (int i = 0; i < n02; i++)
{
if (T[SA12[i]] != c0 || T[SA12[i]+1] != c1 || T[SA12[i]+2] != c2)
{
name++; c0 = T[SA12[i]]; c1 = T[SA12[i]+1]; c2 = T[SA12[i]+2];
}
if (SA12[i] % 3 == 1)
{
R[SA12[i]/3] = name;
} // write to R1
else
{
R[SA12[i]/3 + n0] = name;
} // write to R2
}
Please help me understand this portion. (This code is from page 20 of the pdf)
After radix sort, adjacent element in SA12[] may equal, so there is a if in the loop, as for R1 and R2, give an example:
original array is [y a b b a d a b b a d o], n = 12, index range is [0,11] R1 = [1,4,7,10] R2=[2,5,8,11], "if (SA12[i] % 3 == 1) " indicates SA12[i] belong to R1, else belong to R2, R is concentration of R1 and R2.
hope this helps.