Continuing from my previous question I see that to get the error code of a process I spawned via Popen in python I have to call either wait() or communicate() (which can be used to access the Popen stdout and stderr attributes):
app7z = '/path/to/7z.exe'
command = [app7z, 'a', dstFile.temp, "-y", "-r", os.path.join(src.Dir, '*')]
process = Popen(command, stdout=PIPE, startupinfo=startupinfo)
out = process.stdout
regCompressMatch = re.compile('Compressing\s+(.+)').match
regErrMatch = re.compile('Error: (.*)').match
errorLine = []
for line in out:
if len(errorLine) or regErrMatch(line):
errorLine.append(line)
if regCompressMatch(line):
# update a progress bar
result = process.wait() # HERE
if result: # in the hopes that 7z returns 0 for correct execution
dstFile.temp.remove()
raise StateError(_("%s: Compression failed:\n%s") % (dstFile.s,
"\n".join(errorLine)))
However the docs warn that wait()
may deadlock (when stdout=PIPE, which is the case here) while communicate()
might overflow. So:
- what is the proper thing to use here ? Note that I do use the output
how exactly should I use communicate ? Would it be:
process = Popen(command, stdout=PIPE, startupinfo=startupinfo) out = process.communicate()[0] # same as before... result = process.returncode if result: # ...
not sure about blocking and the memory errors
- Any better/more pythonic way of handling the problem ? I do not think that
subprocess.CalledProcessError
or thesubprocess.check_call/check_output
apply in my case - or do they ?
DISCLAIMER: I did not write the code, I am the current maintainer, hence question 3.
Related:
- Python popen command. Wait until the command is finished
- Check a command's return code when subprocess raises a CalledProcessError exception
- wait process until all subprocess finish?
I am on windows if this makes a difference - python 2.7.8
There should be one-- and preferably only one --obvious way to do it
stdout=PIPE
andwait()
together iff you read from the pipe..communicate()
does the reading and callswait()
for you.communicate()
that accumulates all output in memory.To start subprocess, read its output line by line and to wait for it to exit:
This code does not deadlock due to a finite OS pipe buffer. Also, the code supports commands with unlimited output (if an individual line fits in memory).
iter()
is used to read a line as soon as the subprocess' stdout buffer is flushed, to workaround the read-ahead bug in Python 2. You could use a simplefor line in process.stdout
if you don't need to read lines as soon as they are written without waiting for the buffer to fill or the child process to end. See Python: read streaming input from subprocess.communicate().If you know that the command output can fit in memory in all cases then you could get the output all at once:
It raises
CalledProcessError
if the command returns with a non-zero exit status. Internally,check_output()
usesPopen()
and.communicate()
subprocess.Popen()
is the main API that works in many many cases. There are convenience functions/methods such asPopen.communicate()
,check_output()
,check_call()
for common use-cases.There are multiple methods, functions because there are multiple different use-cases.