I am a second year EE student. I just started learning python for my project.
I intend to parse a csv file with a format like
3520005,"Toronto (Ont.)",C ,F,2503281,2481494,F,F,0.9,1040597,979330,630.1763,3972.4,1
2466023,"Montréal (Que.)",V ,F,1620693,1583590,T,F,2.3,787060,743204,365.1303,4438.7,2
5915022,"Vancouver (B.C.)",CY ,F,578041,545671,F,F,5.9,273804,253212,114.7133,5039.0,8
3519038,"Richmond Hill (Ont.)",T ,F,162704,132030,F,F,23.2,53028,51000,100.8917,1612.7,28
into a text file like the following
Toronto 2503281 Montreal 1620693 Vancouver 578041
I am extracting the 1st and 5th column and save it into a text file.
This is what i have so far.
import csv
file = open('raw.csv')
reader = csv.reader(file)
f = open('NicelyDone.text','w')
for line in reader:
f.write("%s %s"%line[1],%line[5])
This is not working for me, I was able to extract the data from the csv file as line[1],line[5]. (I am able to print it out) But I dont know how to write it to a .text file in the format i wanted.
Also, I have to process the first column eg, "Toronto (Ont.)" into "Toronto". I am familiar with the function find(), I assume that i could extract Toronto out of Toronto(Ont.) using "(" as the stopping character, but based on my research , I have no idea how to use it and ask it to return me the string(Toronto).
Here is my question:
- What is the data format for
line[1]
?- If it is string how come
f.write()
does not work? - If it is not string, how do i convert it to a string?
- If it is string how come
- How do i extract the word
Toronto
out ofToronto(Ont)
into a string form usingfind()
or other methods.
My thinking is that I could add those 2 string together like c = a+ ' ' + b
, that would give me the format i wanted.
So i can use f.write()
to write into a file :)
Sorry if my questions sounds too easy or stupid.
Thanks ahead
Zhen
I don't recall
csv
that well, so I don't know if it's a string or not. What error are you getting? In any case, assuming it is a string, your line should be:In other words, you need a set of parentheses. Also, you should have a trailing space in your string.
A somewhat hackish but concise way to do this is:
line[1].split("(")[0]
This will create a list that splits on the
(
symbol, and then you extract the first element.