Using kNN with weighted dataset

Question

55 views Asked by MC Jong At 30 October 2023 at 13:03

I have a dataset df:

	category	var 1	var 32	weighting	country
1	blue	1.0	54.2	3.0	US
2	pink	0.0	101.0	1.0	other
3	blue	1.0	49.9	3.0	US
4	green	1.0	72.2	9.0	US

I'm using the kNN classifier (to predict the country variable) but need it to take into account the weights I already have in the dataset. Looking at the sklearn package, I can see that KNeighborsClassifier() has a weights argument. Can I set this argument as 'weights=df.weighting', or do I have to go about this another way?

Original Q&A

There is 1 answer

**Anna Andreeva Rogotulka** · Answer 1 · 2023-11-01T08:29:21+00:00

You can explode the samples by weight, i.e. repeat each row as many times as its weight, or you can consider writing a custom weighted distance function. For example, to explode the samples:

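weighted_X, weighted_y = [], []
# Repeat each sample int(weight) times (fractional weights are truncated)
for weight, x_sample, y_sample in zip(sample_weights, X, y):
    weighted_X.extend([x_sample] * int(weight))
    weighted_y.extend([y_sample] * int(weight))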
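
As far as I know, the `weights` argument of `KNeighborsClassifier` controls how the neighbours' votes are weighted (`'uniform'`, `'distance'`, or a callable applied to the distances), so passing a per-row column like `df.weighting` there would not do what the question wants. Below is a minimal sketch of the exploding approach applied to a frame shaped like the one in the question. The feature columns chosen, `n_neighbors=3`, and the query point are assumptions for illustration only, and the weights are assumed to be whole numbers.

```python
import numpy as np
import pandas as pd
from sklearn.neighbors import KNeighborsClassifier

# Toy frame with the rows shown in the question.
df = pd.DataFrame({
    "category": ["blue", "pink", "blue", "green"],
    "var 1": [1.0, 0.0, 1.0, 1.0],
    "var 32": [54.2, 101.0, 49.9, 72.2],
    "weighting": [3.0, 1.0, 3.0, 9.0],
    "country": ["US", "other", "US", "US"],
})

# Repeat each row as many times as its weight.
# Assumption: weights are whole numbers; astype(int) truncates fractions.
repeats = df["weighting"].astype(int).to_numpy()
exploded = df.loc[df.index.repeat(repeats)].reset_index(drop=True)

# Assumption: only the two numeric columns are used as features.
X = exploded[["var 1", "var 32"]].to_numpy()
y = exploded["country"].to_numpy()

# n_neighbors=3 is an arbitrary choice for this sketch.
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X, y)

# Hypothetical query point, just to show the call.
prediction = knn.predict(np.array([[1.0, 60.0]]))
print(len(exploded), prediction[0])
```

Note that exploding multiplies the number of rows, so with large weights the training set can grow a lot, and the duplicated rows can fill all k neighbour slots on their own; you may want a larger `n_neighbors` than you would use on the unweighted data.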
