I'm trying to write a list of data bytes to a CSV file. Since it's a list of byte strings, I used the below code:
with open(r"E:\Avinash\Python\extracting-drug-data\out.csv", "wb") as w:
writer = csv.writer(w)
writer.writerows(bytes(datas, 'UTF-8'))
But it results in the following error:
TypeError: encoding or errors without a string argument
datas
is a list of byte strings.
print(datas)
yields
[b'DB08873', b' MOLSDFPDBSMILESInChIView Structure \xc3\x97Structure for DB08873 (Boceprevir) Close', b'394730-60-0', b'LHHCSNFAOIFYRV-DOVBMPENSA-N', b'Organic acids and derivatives ', b'Food increases exposure of boceprevir by up to 65% relative to fasting state. However, type of food and time of meal does not affect bioavailability of boceprevir and thus can be taken without regards to food. \r\nTmax = 2 hours;\r\nTime to steady state, three times a day dosing = 1 day;\r\nCmax]
I want the above list to be printed as first row in a CSV file with the decoding of Unicode chars. That is, \xc3\x97
should be converted to it's corresponding character.
It seems your
datas
is already in bytes format, so to turn it into UTF-8 strings, you have to usestr
, notbytes
! Also, you have to convert each element fromdatas
individually, not the entire list at once. Finally, if you want to adddatas
as one row toout.csv
, you have to usewriterow
, whereaswriterows
would write all the rows at once, and accordinly would expect a list of lists.Depending on your OS, you might also have to specify the
encoding
when opening the file. Otherwise it will use the OS' default encoding, which might be something entirely different.This seems to do what you want. The result is a CSV file with one row1 of data in UTF-8 format, and the
\xc3\x97
is decoded to×
.1) Note that the last item in
datas
contains some line breaks, and thus will be split onto several lines. This is probably not what you want. Or is this a glitch in yourdatas
list?