Multi-Processing Executor v23.0.1 2022-02-02
Multi-Processing Executor v23.0.0 2021-12-10
New Features
- Several improvements in pipetask execution options:
  - New option --skip-existing-in takes one or more collection names; if output datasets already exist in those collections, the corresponding quanta are skipped.
  - The --skip-existing option is now equivalent to appending the output run collection to the --skip-existing-in list.
  - The --extend-run option implicitly enables the --skip-existing option.
  - The --prune-replaced=unstore option only removes regular output datasets; InitOutputs, task configs, and package versions are not removed. (DM-27492)
- GraphViz dot files generated by pipetask now include more information (RUN collection for datasets, dimensions for tasks, data IDs for quanta). (DM-28111)
- pipetask qgraph can now generate a standalone “execution butler”, which is a SQLite registry with all the expected outputs pre-filled. Using this registry allows pipetask run to execute without touching the main registry whilst still writing file artifacts to the standard location. It is not yet configured to allow completely detached processing using a local datastore, but this can be changed manually after creation to use a chained datastore. (DM-28649)
- Log messages issued during quantum execution are now collected and stored in butler as tasklabel_log dataset types.
- New command line options for logging have been added to pipetask. These include --log-file to write log messages to a file and --no-log-tty to disable log output to the terminal. (DM-30977)
- Add the output run to the log record.
- Add --log-label option to the pipetask command to allow extra information to be injected into the log record. (DM-31884)
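The rules described for --skip-existing-in, --skip-existing, and --extend-run can be read as a small decision procedure. The sketch below is not pipetask code: the `Quantum` class, the two functions, and the dictionary standing in for butler collections are hypothetical names chosen for illustration. It also assumes a quantum is skipped only when all of its expected outputs already exist, which the note above does not spell out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Quantum:
    """Hypothetical stand-in for one unit of work and its expected outputs."""

    label: str
    outputs: frozenset


def effective_skip_collections(skip_existing_in, output_run,
                               skip_existing=False, extend_run=False):
    """Return the collections searched for existing outputs.

    Mirrors the release note: --extend-run implies --skip-existing, and
    --skip-existing appends the output run to the --skip-existing-in list.
    """
    collections = list(skip_existing_in)
    if (skip_existing or extend_run) and output_run not in collections:
        collections.append(output_run)
    return collections


def should_skip(quantum, collections, contents):
    """Assumption: skip only if every expected output already exists."""
    existing = set()
    for name in collections:
        existing |= contents.get(name, set())
    return bool(quantum.outputs) and quantum.outputs <= existing


# Toy data: collection name -> dataset names it holds (all hypothetical).
contents = {"u/me/run1": {"calexp_1", "src_1"}, "u/me/run2": {"calexp_2"}}
quanta = [
    Quantum("isr_1", frozenset({"calexp_1", "src_1"})),
    Quantum("isr_2", frozenset({"calexp_2", "src_2"})),
]

collections = effective_skip_collections(["u/me/run1"], "u/me/run2", extend_run=True)
to_run = [q.label for q in quanta if not should_skip(q, collections, contents)]
```

In this toy setup only the second quantum remains in `to_run`, because one of its expected outputs is missing from both collections.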
Bug Fixes
Multi-Processing Executor v22.0 2021-04-01
New Features
- pipetask run can now execute a subset of a graph. This allows a single graph file to be created with an entire workflow and then only part of it to be executed. This is very important for large-scale workflow execution. [DM-27667]
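One way to picture executing part of a graph is to keep a single dependency graph for the whole workflow and run only a chosen set of nodes, in dependency order. The sketch below uses Python's standard graphlib module. The node names, the `run_subset` function, and the rule that dependencies outside the subset count as already satisfied are illustrative assumptions, not pipetask's actual selection mechanism.

```python
from graphlib import TopologicalSorter


def run_subset(graph, selected, execute):
    """Run only `selected` nodes of `graph` in dependency order.

    `graph` maps each node to the set of nodes it depends on.
    Assumption: dependencies outside `selected` were produced earlier.
    """
    selected = set(selected)
    # Restrict the graph to the chosen nodes and the edges between them.
    subgraph = {node: graph.get(node, set()) & selected for node in selected}
    order = list(TopologicalSorter(subgraph).static_order())
    for node in order:
        execute(node)
    return order


# Hypothetical four-step workflow: isr -> calibrate -> coadd -> detect.
workflow = {
    "isr": set(),
    "calibrate": {"isr"},
    "coadd": {"calibrate"},
    "detect": {"coadd"},
}

executed = []
order = run_subset(workflow, {"coadd", "detect"}, executed.append)
```

Here the full workflow is described once, and a later invocation can pick up only the last two steps without rebuilding the graph.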
Performance Enhancement
- Multi-processing execution performance has been significantly improved for large graphs. [DM-28418]
Other
- Ignore --input instead of rejecting it if it hasn’t changed. [DM-28101]
- The graph file format has been changed from a pickle file to a form that can be accessed efficiently from an object store. This new format has a .qgraph file extension. [DM-27784]
- A full URI can now be used to specify the location of the quantum graph. [DM-27682]