Python logit regression matrix shape error "ValueError: endog and exog matrices are different sizes"

Question

Python logit regression matrix shape error "ValueError: endog and exog matrices are different sizes"

202 views Asked by David Olson At 10 March 2022 at 20:00

Basic setup: I'm trying to run a logit regression in python on the probability of founding a business (founder variable) the exogenous variables are year, age, edu_cat (education category), and sex.

The X matrix is (4, 650), and the y matrix(1, 650). All of the variables within the x matrix have 650 non-NaN observations.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import statsmodels.api as sm
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report, confusion_matrix

x=np.array ([ df_all['Year'], df_all['Age'], df_all['Edu_cat'], df_all['sex']])
y= np.array([df_all['founder']])

logit_model = sm.Logit(y, x)
result = logit_model.fit()
print(result)

So I'm tracking that the shape is good, but python is telling me otherwise. Am I missing something basic?

Original Q&A

There are 1 answers

**David Olson** · Answer 1 · 2022-03-11T20:55:25+00:00

David Olson On 11 March 2022 at 20:55

I believe the issue is with the Y array, being [650,1], when it should be [650,], which it defaults to. Additionally I needed to make the x array [650,4] through a transpose.

TechQA.

Python logit regression matrix shape error "ValueError: endog and exog matrices are different sizes"

There are 1 answers

Related Questions in PYTHON

Related Questions in MATRIX

Related Questions in SCIKIT-LEARN

Related Questions in REGRESSION

Related Questions in LOGITS

Popular Questions

Popular Tags

Trending Questions