lsst-ctrl-mpexec v24.0.0 (2022-08-26)¶
New Features¶
- Added
--dataset-query-constraintoption topipetask qgraphcommand (and thus downstream commands) that allows a user to control howQuantumGraphcreation is constrained by dataset existence. (DM-31769) - Builds using
setuptoolsnow calculate versions from the Git repository, including the use of alpha releases for those associated with weekly tags. (DM-32408) - Added
--summaryoption topipetask runcommand, it produces JSON report for execution status of the whole process and individual quanta. (DM-33481) - Added
pipetaskCLI commandspurgeandcleanup. (DM-33634) - Removed dependency on the
obs_baseandafwpackages. Now only depends onpipe_baseanddaf_butler(along withpex_configandutils). (DM-34105) - Replaced the unused
--do-raiseoption with--pdb, which drops the user into the debugger (pdbby default, but--pdb=ipdbalso works if you haveipdbinstalled) on an exception. (DM-34215)
Bug Fixes¶
Other Changes and Additions¶
- Added
lsst.ctrl.mpexec.SimplePipelineExecutor, a minimal high-level Python interface forPipelineTaskexecution intended primarily for unit testing. (DM-31966)
lsst-ctrl-mpexec v23.0.1 (2022-02-02)¶
lsst-ctrl-mpexec v23.0.0 (2021-12-10)¶
New Features¶
Several improvements in
pipetaskexecution options:- New option
--skip-existing-inwhich takes collection names(s), if output datasets already exist in those collections corresponding quanta is skipped. - A
--skip-existingoption is now equivalent to appending output run collection to the--skip-existing-inlist. - An
--extend-runoption implicitly enables--skip-existingoption. - A
--prune-replaced=unstoreoption only removes regular output datasets; InitOutputs, task configs, and package versions are not removed. (DM-27492)
- New option
GraphViz dot files generated by pipetask now include more information (
RUNcollection for datasets, dimensions for tasks, data IDs for quanta). (DM-28111)pipetask qgraphcan now generate a standalone “execution butler” which is a SQLite registry with all the expected outputs pre-filled in registry. Using this registry allowpipetask runto execute without touching the main registry whilst still writing file artifacts to the standard location. It is not yet configured to allow a completely detached processing using a local datastore but this can be changed manually after creation to use a chained datastore. (DM-28649)- Log messages issued during quantum execution are now collected and stored in butler as
tasklabel_logdataset types. - New command line options for logging have been added to
pipetask. These include--log-fileto write log messages to a file and--no-log-ttyto disable log output to the terminal. (DM-30977)
- Log messages issued during quantum execution are now collected and stored in butler as
- Add the output run to the log record.
- Add
--log-labeloption topipetaskcommand to allow extra information to be injected into the log record. (DM-31884)
Bug Fixes¶
lsst-ctrl-mpexec v22.0 (2021-04-01)¶
New Features¶
pipetask runcan now execute a subset of a graph. This allows a single graph file to be created with an entire workflow and then only part of it to be executed. This is very important for large scale workflow execution. [DM-27667]
Performance Enhancement¶
- Multi-processing execution performance has been significantly improved for large graphs. [DM-28418]
Other¶
- Ignore
--inputinstead of rejecting it if it hasn’t changed. [DM-28101] - The graph file format has been changed from a pickle file to a form that can efficiently be accessed from an object store. This new format has a
.qgraphfile extension. [DM-27784] - A full URI can now be used to specify the location of the quantum graph. [DM-27682]