3. Launching and Handling Child Processes

Understanding Regular Expressions
Hands-On: Simlutaneous-String-Matching and Extraction
Checking operating system at run-time
Exercise
The subprocess module
Specifying Variable Number of (Positional) Arguments in a Function Call
Hand-On: Get Directory Listing and Print the Output
Exercise: Find Process ID(s) Based On Process Name Regex Pattern
- Tips and Inputs

Understanding Regular Expressions

Regular Expressions provide you with a vast grammar for string searching, matching and extraction.

For most common scenarios, knowing the following part of grammar is sufficient:

[chars] - Allowed characters
\w - [a-zA-Z0-9_]    \W  Opposite
\d - [0-9]           \D  Opposite
\s - space           \S  Opposite
.  - anything

Ocurrences
{min, max} - min to max times
*  0 or more times
+  1 or more times
?  0 or 1 time

For extraction:
Mark a group using parenthesis: ()

Hands-On: Simlutaneous String Matching and Extraction

Python's built-in re module helps you in doing all stuff related to regular expressions.

For the purpose of this exercise, we will extract parts of a string based on pattern matching.

re.match(pattern, target_string)

Tip As regex patterns often contain \x nature of constructs and \ is used to mark escape sequences in strings e.g. \n, we need to handle this. One option is to escape the backslash \\. This is what you do in most languages. Python gives a much cleaner way with its raw-string type. Prefix a string with the letter r to make it a raw-string: r"sample-string".

Let's implement the function extract_email_parts in data_utils.py file to extract the email id and domain when an email address is provided.

We'll call the function in ex13.py and validate output.

Checking operating system at run-time

At times, you need to determine the OS name at run time to take OS-specific actions.

Using platform module's system() call, you can check the current operating system name.

import platform
platform.system()

Exercise

Implement is_windows_os function in os_utils.py.
Call the function in ex14.py to validate output.

The `subprocess` module

Python has a built-in subprocess module using which you can launch child processes.

from subprocess import Popen, PIPE

Let's say there is a command proc_name -a -b 2 that you want to launch. Also, you want to capture the output of pipes (STDOUT/STDERR) as bytes.

On Mac/Linux, you can launch it as follows:

proc = Popen(['proc_name`, '-a', '-b', '2'], stdout=PIPE, stderr=PIPE)

On Windows, you can launch it as follows (/C is provided so that the cmd process exists once command has completed its execution.):

proc = Popen(['cmd', '/C', 'proc_name`, '-a', '-b', '2'], stdout=PIPE, stderr=PIPE)

To capture the output of pipes, you can use the communicate method:

stdout, stderr = proc.communicate()

Specifying Variable Number of (Positional) Arguments in a Function Call

You can specify variable number of args for a function using the following syntax:

def some_function(a, b, *args):
    function body
    In the body args is available as a tuple.

Hand-On: Get Directory listing and Print the Output

On Mac/Linux, use:

ls -l

On Windows, use:

dir

Let's implement the execute_command function in process_utils.py.

We'll call the function in ex15.py and validate output.

Exercise: Find Process ID(s) Based On Process Name Regex Pattern

Implement the find_process_ids function in process_utils.py file. It takes the process name pattern, runs the appropriate command as per a platform and then extracts the process ids.
Call the function in ex16.py and validate output.

Tips and Inputs

You can use re.findall(pattern, target_string) to find all matches.
top -l 1 command (one iteration) can be used on Mac to find the running processes. The first column contains the process ids and the second contains the process names.
top -b -n 1 command (batch mode, 1 iteration) can be used on Linux. The first column contains the process ids and the last column contains the process name.
tasklist is the command that you can use on Windows. The first column is the process name and the second column is the process id.
communicate() call returns output as byte-string. Before you can do text-operations/regex matching on the output, you need to decode it: stdout.decode('utf-8')

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Section3.md

Section3.md

3. Launching and Handling Child Processes

Understanding Regular Expressions

Hands-On: Simlutaneous String Matching and Extraction

Checking operating system at run-time

Exercise

The `subprocess` module

Specifying Variable Number of (Positional) Arguments in a Function Call

Hand-On: Get Directory listing and Print the Output

Exercise: Find Process ID(s) Based On Process Name Regex Pattern

Tips and Inputs

Files

Section3.md

Latest commit

History

Section3.md

File metadata and controls

3. Launching and Handling Child Processes

Understanding Regular Expressions

Hands-On: Simlutaneous String Matching and Extraction

Checking operating system at run-time

Exercise

The subprocess module

Specifying Variable Number of (Positional) Arguments in a Function Call

Hand-On: Get Directory listing and Print the Output

Exercise: Find Process ID(s) Based On Process Name Regex Pattern

Tips and Inputs

The `subprocess` module