This system uses Amazon Web Services (AWS) to coordinate running large parameter studies with Itasca software. The individual cases of the parameter study run on local or remote computers; the AWS cloud coordinates the cases and gathers the data in a central location.
On the Itasca software side, a single line of Python is used to attach an instance of FLAC3D, 3DEC, or PFC to the automatic network. Once on the network, the Itasca software checks the cloud for a case to run, downloads the data file and parameters, runs the case, and uploads the results to the cloud -- all automatically. A web page shows the progress of the parameter study and shows any errors that have occurred.
This repository contains the required source code and a worked example.
An AWS CloudFormation template is used to set up the cloud resources: an S3 bucket and an SQS queue are created to manage the parameter study cases. The Itasca software contains an embedded Python interpreter which acts as a client, running parameter study cases and uploading the results. Many computers can join a parameter study, and the cases are processed in a specific order.
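In outline, each client does something like the following. This is a minimal sketch only: the queue URL, bucket name, message layout, and file names are all illustrative assumptions, not the repository's actual client code.

```python
import json
import boto3

# All names below (queue URL, bucket, message keys, file names) are
# illustrative assumptions; the repository's client code may differ.
QUEUE_URL = "https://sqs.us-east-2.amazonaws.com/123456789012/my-study-name"
BUCKET = "my-study-name-databucket-17c4yhty0nr5a"

sqs = boto3.client("sqs")
s3 = boto3.client("s3")

while True:
    # Long-poll the queue for one case.
    resp = sqs.receive_message(QueueUrl=QUEUE_URL, MaxNumberOfMessages=1,
                               WaitTimeSeconds=20)
    if "Messages" not in resp:
        break  # no cases left in the study
    msg = resp["Messages"][0]
    case = json.loads(msg["Body"])

    # Download the generated data file and run it in this interpreter
    # (inside FLAC3D/3DEC/PFC this is the embedded interpreter).
    s3.download_file(BUCKET, case["input_key"], "case.py")
    exec(open("case.py").read())

    # Upload the results the case wrote, then delete the message so
    # no other client re-runs this case.
    s3.upload_file("result.json", BUCKET, case["output_key"])
    sqs.delete_message(QueueUrl=QUEUE_URL,
                       ReceiptHandle=msg["ReceiptHandle"])
```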
This guide assumes you have the following:
- FLAC3D, 3DEC, or PFC version 7 or later
- An AWS account
- Python 3.6 or later
- The Python `pyDOE` library
- The Python `boto3` library installed and configured with sufficient permissions to:
  - Launch CloudFormation stacks
  - Work with an SQS queue
  - Work with S3 buckets
- The `awscli` package installed and configured
- Access to the AWS web console for debugging and checking the individual resources
Launch the CloudFormation stack with this command:
```
python launch.py my-study-name
```
This creates the SQS queue and the S3 bucket and sets up the permissions. Replace `my-study-name` with a name for your parameter study.
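For reference, launching a CloudFormation stack with boto3 looks roughly like this. This is a sketch, not the actual contents of launch.py; the template file name and the required capabilities are assumptions.

```python
import boto3

# Sketch of a CloudFormation launch; the template file name and the
# capabilities are assumptions, launch.py may differ.
cf = boto3.client("cloudformation")
with open("template.json") as f:
    cf.create_stack(
        StackName="my-study-name",
        TemplateBody=f.read(),
        Capabilities=["CAPABILITY_IAM"],  # needed if the template creates IAM resources
    )
```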
One manual step is required here: enable ACLs on the newly created S3 bucket via the AWS web console.
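If you prefer to script this step, the same setting can likely be applied with boto3; switching the bucket's object ownership to "ObjectWriter" re-enables ACLs. The bucket name below is illustrative.

```python
import boto3

# Scripted alternative to the web-console step (bucket name is
# illustrative -- use the name created by your stack).
s3 = boto3.client("s3")
s3.put_bucket_ownership_controls(
    Bucket="my-study-name-databucket-17c4yhty0nr5a",
    OwnershipControls={"Rules": [{"ObjectOwnership": "ObjectWriter"}]},
)
```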
Next, run this command to configure the website and support files:
```
python configure.py my-study-name.json
```
Upon success, this command will give the website for the parameter study and the S3 bucket name. It will look something like this (your URL will be different):
```
Website: http://my-study-name-databucket-17c4yhty0nr5a.s3-website.us-east-2.amazonaws.com
S3 Bucket: my-study-name-databucket-17c4yhty0nr5a
```
In this step you create a Python program to run the cases in your parameter study.
The example provided here is based on the Prandtl's wedge example from the FLAC3D manual. In the FLAC3D example the cohesion is constant; here, the cohesion of the top five layers of zones is varied individually and the force-displacement curve is measured.
See the example prandtls_wedge.py. For parameters you want to vary, wrap the variables like this:

```
cohesion_array = np.array({{cohesion_array}})
```
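The double braces suggest simple placeholder substitution: when a case is generated, each `{{name}}` is replaced with a concrete literal for that case. A minimal sketch of such a substitution (a hypothetical helper; the repository's own code may differ):

```python
import json

def render_case(template_text, params):
    # Replace each {{name}} placeholder with the JSON literal for that
    # parameter (hypothetical helper, for illustration only).
    for name, value in params.items():
        template_text = template_text.replace("{{%s}}" % name,
                                              json.dumps(value))
    return template_text

# {{cohesion_array}} becomes a concrete list literal, so the wrapped
# line reads: cohesion_array = np.array([50000.0, ...])
print(render_case("cohesion_array = np.array({{cohesion_array}})",
                  {"cohesion_array": [5e4, 1e5, 2e5, 3e5, 5e5]}))
```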
Any results you want to save should be added to the `result` dictionary at the end of the file.
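For example (hypothetical keys and placeholder values; save whatever outputs your model produces):

```python
# Hypothetical keys and placeholder values -- in prandtls_wedge.py the
# entries would come from the model's recorded histories.
result = {}
result["displacement"] = [0.0, 0.001, 0.002]
result["force"] = [0.0, 1.5e4, 2.9e4]
```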
The file create_cases.py defines the range of values that the unknowns should take. In this example, a hierarchical Latin hypercube is used to vary the cohesion of five layers from 0.5e5 Pa to 5e5 Pa. Generate the cases with:
```
python create_cases.py
```
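A plain Latin hypercube with pyDOE illustrates the idea; the sample count below is an assumption, and the hierarchical variant used by the example adds structure on top of this.

```python
import numpy as np
from pyDOE import lhs

n_layers = 5
n_cases = 50  # assumed; create_cases.py defines the actual count

# Latin hypercube design in [0, 1], one column per layer.
design = lhs(n_layers, samples=n_cases)

# Scale to the cohesion range 0.5e5 Pa to 5e5 Pa.
low, high = 0.5e5, 5e5
cohesion_values = low + design * (high - low)
```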
- Open the website given in the configure step. The top of this website contains a one-line Python program. Copy this line.
- Open FLAC3D, 3DEC, or PFC3D
- Open the IPython console via Menu->Python->IPython console
- Paste the one-line program into the IPython console window and press Enter.
View the website; it gives a summary of the number of jobs remaining and any errors that have occurred.
The results are all in the "data/" subfolder of the S3 bucket given in the configuration step. The results are in JSON format and contain all the inputs and outputs.
A command like this (the S3 address will be different in your case) copies all the parameter study results to your local computer. Make a folder called results first.
```
aws s3 sync s3://my-queue-test-databucket-m19wz8wl341z/data/my_cases/my_folder/ ./results --exclude "*" --include "done*"
```
Running this script:

```
python process_results.py
```

reads all the cases into a feature array X and a target array Y; these arrays can be used for machine learning model training.
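In outline, the processing amounts to something like the following; the file pattern and JSON keys are assumptions, see process_results.py for the actual code.

```python
import glob
import json

import numpy as np

X_rows, Y_rows = [], []
for path in sorted(glob.glob("results/done*.json")):  # assumed file pattern
    with open(path) as f:
        case = json.load(f)
    X_rows.append(case["cohesion_array"])  # inputs (assumed key)
    Y_rows.append(case["force"])           # outputs (assumed key)

X = np.array(X_rows)  # feature array: one row per case
Y = np.array(Y_rows)  # target array: one row per case
```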
- Delete all the files in the S3 bucket (a scripted approach is sketched after this list).
- Run the command `python clean_up.py my-study-name` to delete the cloud resources.
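CloudFormation cannot delete a non-empty bucket, which is why the files must be removed first. One way to script that step with boto3 (the bucket name is illustrative):

```python
import boto3

# Empty the data bucket so clean_up.py can delete the stack; the
# bucket name is illustrative -- use the one from the configure step.
bucket = boto3.resource("s3").Bucket("my-study-name-databucket-17c4yhty0nr5a")
bucket.objects.all().delete()
```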
- Messages that are in the queue for more than 14 days get deleted automatically.
- Not all errors are caught; some network timeout errors are not handled correctly, and the client (the Itasca software) leaves the network.
- If there is an error in the outer wrapper, it can be difficult to debug.
A publicly readable AWS key set is created to allow the necessary operations. Permissions are restricted to only the needed operations, but a malicious person could still interfere with the system. The data file that runs the cases is publicly readable; the results are not. A more restrictive security model, in which the client computer needs to know the keys, could be implemented.