Commands¶

Warning

The DoE-Suite can easily start many instances in a remote cloud. If there is an error in the execution, or the suite finishes before all jobs are complete, then these remote resources are not terminated and can generate high costs. Always check that resources are terminated. We also provide the following command to ensure that the previously started instances are terminated:

make clean

The interface of the DoE-Suite is defined in a Makefile. In the following, we focus on the most frequently used commands.

Show all commands¶

make help

Show Output

$ make help
Euler Progress Update
  make peek suite=<SUITE> id=<ID>     - peek at a file on euler
            [file=<FILE>]                 select file (default: stdout.log)
            [exp=<EXP>]                   select experiment (default: first)
            [run=<RUN>]                   select run (default: 0)
Running Experiments
  make run suite=<SUITE> id=new                       - run the experiments in the suite
  make run suite=<SUITE> id=<ID>                      - continue with the experiments in the suite with <ID> (often id=last)
  make run suite=<SUITE> id=<ID> cloud=<CLOUD>        - run suite on non-default cloud ([aws], euler)
  make run suite=<SUITE> id=<ID> expfilter=<REGEX>    - run only subset of experiments in suite where name matches the <REGEX> (suite must be valid)
  make run-keep suite=<SUITE> id=new                  - does not terminate instances at the end, otherwise works the same as run target
Clean
  make clean                                          - terminate running cloud instances belonging to the project and local cleanup
  make clean-result                                   - delete all inclomplete results in doe-suite-results
Running ETL Locally
  make etl suite=<SUITE> id=<ID>                      - run the etl pipeline of the suite (locally) to process results (often id=last)
  make etl-design suite=<SUITE> id=<ID>               - same as `make etl ...` but uses the pipeline from the suite design instead of results
  make etl-all                                        - run etl pipelines of all results
  make etl-super config=<CONFIG> out=<PATH>           - run the super etl to combine results of multiple suites  (for <CONFIG> e.g., demo_plots)
  make etl-super ... pipelines="<P1> <P2>"            - run only a subset of pipelines in the super etl
Clean ETL
  make etl-clean suite=<SUITE> id=<ID>                - delete etl results from specific suite (can be regenerated with make etl ...)
  make etl-clean-all                                  - delete etl results from all suites (can be regenerated with make etl-all)
Gather Information
  make info                                           - list available suite designs
  make status suite=<SUITE> id=<ID>                   - show the status of a specific suite run (often id=last)
Design of Experiment Suites
  make design suite=<SUITE>                           - list all the run commands defined by the suite
  make design-validate suite=<SUITE>                  - validate suite design and show with default values
Setting up a Suite
  make new                                            - initialize doe-suite-config from a template
Running Tests
  make test                                           - running all suites (seq) and comparing results to expected (on aws)
  make euler-test cloud=euler                         - running all single instance suites on euler and compare results to expected
  make etl-test-all                                   - re-run all etl pipelines and compare results to current state (useful after update of etl step)

Running an Experiment Suite¶

Here we focus on the commands that are used to start and continue an experiment suite. For more information on the experiment suite design, see Suite Design and on the execution, see Running Experiments.

Start a new experiment suite¶

make run suite=example01-minimal id=new

Continue with the last experiment suite¶

make run suite=example01-minimal id=last

Continue the experiment suite with a specific ID¶

make run suite=example01-minimal id=<ID>

Start an experiment suite on non-default cloud¶

make run suite=example01-minimal id=new cloud=euler

Start suite with the explicit choice run-keep: Keep instances running after suite is complete¶

make run-keep suite=example01-minimal id=new

Warning

If you use run-keep, be sure to check that instances are terminated when you are done.

Cleaning up Cloud¶

By default, after an experiment suite is complete, all experiment resources created on the cloud are terminated.

However, if something goes wrong, i.e. an error occurs, the suite times out, or the suite is stopped manually, the created resources on the cloud remain running.

Further, creating resources on a cloud and setting up the environment takes a considerable amount of time. So, for debugging and short experiments, it can make sense not to terminate the instances. If you use run experiments with run-keep, be sure to check that instances are terminated when you are done.

Terminate all remote resources, e.g., terminate all EC2 instances, and local cleanup, e.g., pycache¶

make clean

Tip

Double check on the cloud that all resources are terminated, and setup budget alerts.

ETL Results Processing¶

The ETL pipeline is used to process the results of an experiment suite. The results processing runs on your local machine and is triggered automatically when the new results are available locally, i.e., an experiment job is complete.

However, often it is also useful to trigger a run of the ETL pipeline manually, e.g., for styling a plot.

Manually trigger a run of the ETL results pipeline (runs locally)¶

# can replace `id=last` with actual id, e.g., `id=1655831553`
make etl suite=example01-minimal id=last

Super ETL pipelines can be used to process the results of multiple experiment suites together.

Run Super-ETL results pipeline¶

 # can set `out` for example to a figures folder of a paper
make etl-super config=demo_plots out=.

Status and Info¶

Get information about available suites and experiments¶

make info

Get progress information about the last suite run¶

# w/o suite filter (all suites)
make status id=last

# w/ suite filter
make status suite=example01-minimal id=last

Developing Suite Designs¶

Tip

Ensure that the environment variable DOES_PROJECT_DIR points to the project directory.

Configure Project: Initialize doe-suite-config from a template¶

make new

List all commands that a suite design defines (+ Visualize ETL pipelines)¶

make design suite=example01-minimal

Validate a design and show the design with default values assigned¶

make design-validate suite=example01-minimal

Developing ETL pipeline by using the pipeline from the design¶

# can replace `id=last` with actual id, e.g., `id=1655831553`
make etl-design suite=example01-minimal id=last

# The same as: `make etl suite=example01-minimal id=last`
#   but uses the etl pipeline defined in `doe-suite-config/designs`
#   compared to the etl pipeline in `doe-suite-results/example01-single_<ID>/suite_design.yml`

Custom Commands¶

Project-specific commands (targets) can be defined in doe-suite-config/Makefile. These commands are integrated into the main doe-suite/Makefile and can be executed like any other target using make <command>.

For example, in demo_project, a custom command make peek is defined to peek at results when running in the Euler cloud:

doe-suite-config/Makefile¶

run?=0
rep?=0
file?=stdout.log
exp?=$$(ls -d */ | head -n 1)


help-custom:
	@echo 'Euler Progress Update'
	@echo '  make peek suite=<SUITE> id=<ID>     - peek at a file on euler'
	@echo '            [file=<FILE>]                 select file (default: stdout.log)'
	@echo '            [exp=<EXP>]                   select experiment (default: first)'
	@echo '            [run=<RUN>]                   select run (default: 0)'

# Example: for the euler cloud peek at results.
peek:
	@ssh euler.ethz.ch 'cd ~/doe-suite-results/$(suite)_$(suite_id) && cd $(exp) && less run_$(run)/rep_$(rep)/results/$(file)'

DoE-Suite

Navigation

Related Topics