Why does a series or dataframe column of integers containing NaN values have "float64" as data type?

Question

26 views Asked by Gin Al At 16 October 2023 at 11:29

Whether I load the data with pd.read_csv or build a series of integers directly, as soon as it contains a NaN value the data type of that series or column becomes "float64", and every numeric value is shown with a trailing ".0".

The data type of each column read from a CSV file is one of the characteristics I use in my analysis. When a column contains only integers and NaN values, its dtype attribute reports "float64" once the table is loaded with pandas.read_csv, even though every non-missing value is an integer.
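A minimal reproduction of the behaviour described above might look like the following sketch. The inline CSV text and its column names are made up for illustration and stand in for a real file.

```python
import io

import numpy as np
import pandas as pd

# A series of whole numbers with one missing value is upcast to float64.
s = pd.Series([1, 2, np.nan])
print(s.dtype)  # float64

# The same happens when read_csv meets an empty cell in an integer column.
# csv_text is hypothetical sample data, not the asker's file.
csv_text = "col1,col_with_nan\na,1\nb,\nc,3\n"
df = pd.read_csv(io.StringIO(csv_text))
print(df["col_with_nan"].dtype)  # float64
```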

Original Q&A

There is 1 answer

**wotb** · Answer 1 · 2023-10-16T11:52:51+00:00

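Pure integers cannot be NaN: NaN is a floating-point value, and NumPy's integer dtypes have no way to represent a missing value, so pandas upcasts the column to float64. What you want is the nullable integer type.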

In code this might look something like:

import pandas as pd
df = pd.read_csv("file.csv", dtype={"col1": str, "col_with_nan": "Int64"})

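Note the capital "I" in Int64: it selects the pandas nullable integer dtype, whereas lowercase int64 is the plain NumPy dtype, which cannot hold missing values.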
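As a rough sketch of the nullable integer dtype in practice, the example below passes the dtype as the string alias "Int64" and reads from an in-memory CSV. The sample data is hypothetical, and the column names simply reuse the ones from the snippet above. It also shows one way to convert a column that was already loaded as float64, which works as long as every non-missing value is a whole number.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for "file.csv".
csv_text = "col1,col_with_nan\na,1\nb,\nc,3\n"

# Request the nullable integer dtype for the column with missing values.
df = pd.read_csv(
    io.StringIO(csv_text),
    dtype={"col1": str, "col_with_nan": "Int64"},
)
print(df["col_with_nan"].dtype)  # Int64
print(df["col_with_nan"])        # missing entry is shown as <NA>

# A column already loaded as float64 can be converted afterwards.
floats = pd.Series([1.0, None, 3.0])
ints = floats.astype("Int64")
print(ints.dtype)  # Int64
```

With this dtype the values keep their integer form (no trailing ".0") and missing entries are stored as pd.NA rather than NaN.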
