I have three columns in my dataset that I'm trying to save as a STATA dta file. These are the last three lines I run after I clean the data.
macro1=macro1.rename(columns={'index':'year', 'Price Index, PCE':'pce','Unemployment Rate':'urate'})
macro1.convert_objects(convert_numeric=True).dtypes
macro1[['year','pce', 'urate]].to_stata('file path\file name.dta', write_index=False)
these are the data types of these variables
year float64
pce float64
urate float64
dtype: object
The problem is, when i try to convert these columns to .dta
I get an error message
--------------------------------------------------------------------------- TypeError Traceback (most recent call last) <ipython-input-69-a2069ee823e7> in <module>()
36 macro1=macro1.rename(columns={'index':'year', 'Price Index, PCE':'pce','Unemployment Rate':'urate'})
37 macro1.convert_objects(convert_numeric=True).dtypes
---> 38 macro1[['pce']].to_stata('file path\file name.dta', write_index=False)
39 #macro1
C:\Users\chungk\AppData\Local\Continuum\Anaconda\lib\site-packages\pandas\core\frame.pyc in to_stata(self, fname, convert_dates, write_index, encoding, byteorder, time_stamp, data_label) 1262 time_stamp=time_stamp, data_label=data_label, 1263 write_index=write_index)
-> 1264 writer.write_file() 1265 1266 @Appender(fmt.docstring_to_string, indents=1)
C:\Users\chungk\AppData\Local\Continuum\Anaconda\lib\site-packages\pandas\io\stata.pyc in write_file(self) 1245 self._write(_pad_bytes("", 5)) 1246 if self._convert_dates is None:
-> 1247 self._write_data_nodates() 1248 else: 1249 self._write_data_dates()
C:\Users\chungk\AppData\Local\Continuum\Anaconda\lib\site-packages\pandas\io\stata.pyc in _write_data_nodates(self) 1327 if var is None or var == np.nan: 1328 var =
_pad_bytes('', typ)
-> 1329 if len(var) < typ: 1330 var = _pad_bytes(var, typ) 1331 if compat.PY3:
TypeError: object of type 'float' has no len()
the problem is with both urate
and pce
because when I try saving only year
, it works.
I'm not sure where the problem lies. Any help would be much appreciated.
convert_objects
does not convert the dtypes inplace so you needed to assign the operation:see the docs