Initial Matrix contains 1 for known links (0 for unknown and missed entries) in a link prediction system. Outputs of Matrix Factorization are predicted values for missed entries to calculate AUC:
- Hide
20%of known links (set20%of entries with1to0in the matrix ) - Sort output of Factorization and discard indexes used for train (
80%of1) - Set
Nto number of hided values - Get
Ntop predicted values and check if they are hided values (class label set to1) or not (class label set to0) - Compute
AUCusingNtop predictions
I know perfcurve in matlab computes AUC, but I need to be sure about the above process to provide labelled data for perfcurve.
Any comment is really appreciated.