lsst-ctrl-mpexec v25.0.0 (2023-02-28)¶
New Features¶
Added support for transferring files into execution butler. (DM-35494)
Added documentation on how to use
--showand--config.A pipeline will now never execute if
--showoption is used withpipetask run.The
--configoption can now accept list configuration values (with or without square brackets), for example--config task:listItem=a,bor--config "task:listItem=[a,b]".The
--config-fileoption can now take comma-separated file names for multiple config files. (DM-35917)
added additional quanta information to be displayed by the logger, showing number of quanta per task (DM-36145)
If
pipetaskis run with multiple processes and if a butler datastore cache is configured, all subprocesses will now share the same cache. For large numbers of simultaneous processes it may be necessary to significantly increase the number of datasets in the cache to make the cache usable. This can be done by using the$DAF_BUTLER_CACHE_EXPIRATION_MODEenvironment variable.Previously each subprocess would get its own cache and if
forkstart method was used these cache directories would not be cleaned up. (DM-36412)Always disable implicit threading (e.g. in OpenBLAS) by default in
pipetask run, even when not using-j.The new
--enable-implicit-threadingcan be used to turn it back on. (DM-36831)
API Changes¶
SimplePipelineExecutorfactory methods addbindparameter for bind values to use with the user expression. (DM-36487)
lsst-ctrl-mpexec v24.0.0 (2022-08-26)¶
New Features¶
Added
--dataset-query-constraintoption topipetask qgraphcommand (and thus downstream commands) that allows a user to control howQuantumGraphcreation is constrained by dataset existence. (DM-31769)Builds using
setuptoolsnow calculate versions from the Git repository, including the use of alpha releases for those associated with weekly tags. (DM-32408)Added
--summaryoption topipetask runcommand, it produces JSON report for execution status of the whole process and individual quanta. (DM-33481)Added
pipetaskCLI commandspurgeandcleanup. (DM-33634)Removed dependency on the
obs_baseandafwpackages. Now only depends onpipe_baseanddaf_butler(along withpex_configandutils). (DM-34105)Replaced the unused
--do-raiseoption with--pdb, which drops the user into the debugger (pdbby default, but--pdb=ipdbalso works if you haveipdbinstalled) on an exception. (DM-34215)
Bug Fixes¶
Other Changes and Additions¶
Added
lsst.ctrl.mpexec.SimplePipelineExecutor, a minimal high-level Python interface forPipelineTaskexecution intended primarily for unit testing. (DM-31966)
lsst-ctrl-mpexec v23.0.1 (2022-02-02)¶
Miscellaneous Changes of Minor Interest¶
Allow
pipetask runexecution to continue in single-process mode after failure of one or more tasks. Previously execution stopped on an exception from any task. (DM-33339)
lsst-ctrl-mpexec v23.0.0 (2021-12-10)¶
New Features¶
Several improvements in
pipetaskexecution options:New option
--skip-existing-inwhich takes collection names(s), if output datasets already exist in those collections corresponding quanta is skipped.A
--skip-existingoption is now equivalent to appending output run collection to the--skip-existing-inlist.An
--extend-runoption implicitly enables--skip-existingoption.A
--prune-replaced=unstoreoption only removes regular output datasets; InitOutputs, task configs, and package versions are not removed. (DM-27492)
GraphViz dot files generated by pipetask now include more information (
RUNcollection for datasets, dimensions for tasks, data IDs for quanta). (DM-28111)pipetask qgraphcan now generate a standalone “execution butler” which is a SQLite registry with all the expected outputs pre-filled in registry. Using this registry allowpipetask runto execute without touching the main registry whilst still writing file artifacts to the standard location. It is not yet configured to allow a completely detached processing using a local datastore but this can be changed manually after creation to use a chained datastore. (DM-28649)Log messages issued during quantum execution are now collected and stored in butler as
tasklabel_logdataset types.New command line options for logging have been added to
pipetask. These include--log-fileto write log messages to a file and--no-log-ttyto disable log output to the terminal. (DM-30977)
Add the output run to the log record.
Add
--log-labeloption topipetaskcommand to allow extra information to be injected into the log record. (DM-31884)
Bug Fixes¶
Miscellaneous Changes of Minor Interest¶
Add some of the pipetask command line options to QuantumGraph metadata (DM-30702)
lsst-ctrl-mpexec v22.0 (2021-04-01)¶
New Features¶
pipetask runcan now execute a subset of a graph. This allows a single graph file to be created with an entire workflow and then only part of it to be executed. This is very important for large scale workflow execution. [DM-27667]
Performance Enhancement¶
Multi-processing execution performance has been significantly improved for large graphs. [DM-28418]
Other¶
Ignore
--inputinstead of rejecting it if it hasn’t changed. [DM-28101]The graph file format has been changed from a pickle file to a form that can efficiently be accessed from an object store. This new format has a
.qgraphfile extension. [DM-27784]A full URI can now be used to specify the location of the quantum graph. [DM-27682]