Invalid classes inferred from unique values of `y`. Expected: [0 1 2 3 4 5], got [1 2 3 4 5 6]


I trained a dataset using XGBClassifier, but I get this error locally. The same code works on Colab, and my friends don't have any problem with it either. I don't know what this error means...

Invalid classes inferred from unique values of y. Expected: [0 1 2 3 4 5], got [1 2 3 4 5 6]

This is my code, though I don't think it's the cause:

import time
from xgboost import XGBClassifier

start_time = time.time()
xgb = XGBClassifier(n_estimators = 400, learning_rate = 0.1, max_depth = 3)
xgb.fit(X_train.values, y_train)
print('Fit time : ', time.time() - start_time)

There are 9 answers

Hessah On

It happens because of the version of your xgboost, so try this:

y_train_xgb = y_train.map({1: 0, 2: 1, 3: 2, 4: 3, 5: 4, 6: 5})
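If you prefer not to hard-code the mapping, you can build it from the sorted unique labels instead; a minimal sketch, assuming y_train is a pandas Series:

import pandas as pd

# Map each original class label to 0..n_classes-1,
# e.g. {1: 0, 2: 1, 3: 2, 4: 3, 5: 4, 6: 5} for the labels in this question.
mapping = {label: idx for idx, label in enumerate(sorted(pd.unique(y_train)))}
y_train_xgb = y_train.map(mapping)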

Yassin El Jakani On

The error comes with the new version of xgboost. Uninstall the current xgboost and install xgboost 0.90:

pip uninstall xgboost 

pip install xgboost==0.90

Jefferson Santos On

That happens because the class column has to start from 0 (as required since version 1.3.2). An easy way to solve that is to use LabelEncoder from the sklearn.preprocessing library.

Solution (works for version 1.6):

from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
y_train = le.fit_transform(y_train)  # maps the original labels (1..6) to 0..5

Then run your code again:

start_time = time.time()
xgb = XGBClassifier(n_estimators = 400, learning_rate = 0.1, max_depth = 3)
xgb.fit(X_train.values, y_train)
print('Fit time : ', time.time() - start_time)

Javier Moreno On

Try adding stratify to the train_test_split call:

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(data, labels, test_size=test_size, stratify=labels)

norrey nordine On

Use Python version 3.7, as used in Colab.

Craig Rodrigues On

I verified in the source code of xgboost that LabelEncoder() was deprecated in version 1.3 with this PR:

https://github.com/dmlc/xgboost/pull/6269/files

And then LabelEncoder() was removed in version 1.6.0 with this PR: https://github.com/dmlc/xgboost/pull/7357

which was then merged here: https://github.com/dmlc/xgboost/commit/3c4aa9b2ead21d11ef1589059db2ea50208c55ea

The approach mentioned by @jefferson-santos to explicitly use LabelEncoder() is correct, and worked for me.
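For reference, a quick way to check which xgboost version you have locally versus on Colab (a minimal check):

import xgboost
print(xgboost.__version__)  # 1.6+ no longer label-encodes the target for you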

SOMDEB SAR On

It's because, in newer XGBoost versions, y_train must be encoded before training, i.e., you must apply a categorical transformation such as a label encoder:

from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
y_train = le.fit_transform(y_train)

Then fit the XGBoost model on the encoded labels:

from xgboost import XGBClassifier
classifier = XGBClassifier()
classifier.fit(X=X_train, y=y_train)

After training, to compute the confusion matrix you must inverse-transform the predicted y values back to the original labels, as shown:

from sklearn.metrics import confusion_matrix, accuracy_score
y_pred = classifier.predict(X_test)
y_pred = le.inverse_transform(y_pred)
cm = confusion_matrix(y_test, y_pred)
print(cm)
accuracy_score(y_test, y_pred)

Jatin Kishore Patel On

Downgrading to 1.5.0 worked for me.

I also got this warning message during execution:

UserWarning: The use of label encoder in XGBClassifier is deprecated and will be removed in a future release.

Using the label encoder in 1.6 returns this error for me:

MultiClassEvaluation: label must be in [0, num_class), num_class=6 but found 6 in label
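For reference, the downgrade itself is the same pip approach as in the earlier answer, just pinned to a pre-1.6 release:

pip install xgboost==1.5.0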

aps_s On

If it helps, I just rolled back to version 1.2.1.