Initial Matrix contains 1
for known links (0
for unknown and missed entries) in a link prediction system. Outputs of Matrix Factorization are predicted values for missed entries to calculate AUC
:
- Hide
20%
of known links (set20%
of entries with1
to0
in the matrix ) - Sort output of Factorization and discard indexes used for train (
80%
of1
) - Set
N
to number of hided values - Get
N
top predicted values and check if they are hided values (class label set to1
) or not (class label set to0
) - Compute
AUC
usingN
top predictions
I know perfcurve
in matlab computes AUC
, but I need to be sure about the above process to provide labelled data for perfcurve
.
Any comment is really appreciated.