Pipeline
- class lsst.pipe.base.Pipeline(description: str)
Bases: object
A Pipeline is a representation of a series of tasks to run, and the configuration for those tasks.
- Parameters:
- description : str
A description of what this pipeline does.
Attributes Summary
subsets: Returns a MappingProxyType where the keys are the labels of labeled subsets in the Pipeline and the values are the set of task labels contained within that subset.
Methods Summary
addConfigFile(label, filename): Add overrides from a specified file.
addConfigOverride(label, key, value): Apply a single config override.
addConfigPython(label, pythonString): Add overrides by running a snippet of Python code against a config.
addInstrument(instrument): Add an instrument to the pipeline, or replace an instrument that is already defined.
addLabelToSubset(subset, label): Add a task label to the specified subset.
addLabeledSubset(label, description, taskLabels): Add a new labeled subset to the Pipeline.
addTask(task, label): Add a new task to the pipeline, or replace a task that is already associated with the supplied label.
findSubsetsWithLabel(label): Find any subsets which may contain the specified label.
fromFile(filename): Load a pipeline defined in a pipeline YAML file.
fromIR(deserialized_pipeline): Create a pipeline from an already created PipelineIR object.
fromPipeline(pipeline): Create a new pipeline by copying an already existing Pipeline.
fromString(pipeline_string): Create a pipeline from a string formatted as a pipeline document.
from_uri(uri): Load a pipeline defined in a pipeline YAML file at a location specified by a URI.
getInstrument(): Get the instrument from the pipeline.
get_data_id(universe): Return a data ID with all dimension constraints embedded in the pipeline.
mergePipeline(pipeline): Merge another in-memory Pipeline object into this one.
removeLabelFromSubset(subset, label): Remove a task label from the specified subset.
removeLabeledSubset(label): Remove a labeled subset from the Pipeline.
removeTask(label): Remove a task from the pipeline.
subsetFromLabels(labelSpecifier[, subsetCtrl]): Subset a pipeline to contain only labels specified in labelSpecifier.
toExpandedPipeline(): Return a generator of TaskDefs which can be used to create quantum graphs.
to_graph([registry]): Construct a pipeline graph from this pipeline.
write_to_uri(uri): Write the pipeline to a file or directory.
Attributes Documentation
- subsets
Returns a MappingProxyType where the keys are the labels of labeled subsets in the Pipeline and the values are the set of task labels contained within that subset.
Methods Documentation
- addConfigPython(label: str, pythonString: str) -> None
Add overrides by running a snippet of Python code against a config.
- addInstrument(instrument: Instrument | str) -> None
Add an instrument to the pipeline, or replace an instrument that is already defined.
- Parameters:
- instrument : Instrument or str
Either a derived class object of a lsst.daf.butler.instrument or a string corresponding to a fully qualified lsst.daf.butler.instrument name.
- addLabeledSubset(label: str, description: str, taskLabels: set[str]) -> None
Add a new labeled subset to the Pipeline.
- Parameters:
- Raises:
- addTask(task: type[lsst.pipe.base.pipelineTask.PipelineTask] | str, label: str) -> None
Add a new task to the pipeline, or replace a task that is already associated with the supplied label.
- Parameters:
- task : PipelineTask or str
Either a derived class object of a PipelineTask or a string corresponding to a fully qualified PipelineTask name.
- label : str
A label that is used to identify the PipelineTask being added.
- findSubsetsWithLabel(label: str) -> set[str]
Find any subsets which may contain the specified label.
This function returns the names of the subsets which contain the specified label. It may return an empty set if there are no subsets, or if no subset contains the specified label.
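The lookup described above can be illustrated without the library. The following is a minimal sketch, not the library's implementation: it assumes only that the subsets mapping has the shape described for the subsets attribute (subset label to set of task labels). The function name find_subsets_with_label and the example labels are hypothetical.

```python
from types import MappingProxyType


def find_subsets_with_label(subsets, label):
    """Return the names of all subsets whose task labels include ``label``."""
    return {name for name, task_labels in subsets.items() if label in task_labels}


# Hypothetical mapping shaped like the ``subsets`` attribute:
# subset label -> set of task labels.
example_subsets = MappingProxyType(
    {
        "step1": {"isr", "characterizeImage"},
        "step2": {"calibrate"},
        "singleFrame": {"isr", "characterizeImage", "calibrate"},
    }
)

print(sorted(find_subsets_with_label(example_subsets, "isr")))  # ['singleFrame', 'step1']
print(find_subsets_with_label(example_subsets, "makeWarp"))  # set()
```

A label that appears in no subset yields an empty set rather than an error in this sketch, matching the behavior stated above.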
- classmethod fromFile(filename: str) -> Pipeline
Load a pipeline defined in a pipeline YAML file.
- Parameters:
- filename : str
A path that points to a pipeline defined in YAML format. This filename may also supply additional labels to be used in subsetting the loaded Pipeline. These labels are separated from the path by a #, and may be specified as a comma-separated list, or as a range denoted beginning..end. Beginning or end may be empty, in which case the range will be a half-open interval. Unlike Python iteration bounds, end bounds are INCLUDED. Note that range-based selection is not well defined for pipelines that are not linear in nature; correct behavior is not guaranteed and may vary from run to run.
- Returns:
- pipeline : Pipeline
The pipeline loaded from the specified location with appropriate (if any) subsetting.
Notes
This method attempts to prune any contracts that contain labels which are not in the declared subset of labels. This pruning is done using string-based matching due to the nature of contracts and may prune more than it should.
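The label syntax described for filename (a # separator, then either a comma-separated list or a beginning..end range with an INCLUDED end bound) can be sketched in plain Python. This is an illustration under stated assumptions, not the library's parser: the helper names split_label_specifier and select_range are hypothetical, and the range selection assumes a strictly linear ordering of task labels.

```python
def split_label_specifier(filename):
    """Split ``path#labels`` into the path and a parsed specifier.

    Returns ``(path, None)`` when no labels are given,
    ``(path, ("labels", set_of_labels))`` for a comma-separated list, and
    ``(path, ("range", begin, end))`` for ``begin..end``, where an empty
    bound is returned as None.
    """
    path, sep, fragment = filename.partition("#")
    if not sep or not fragment:
        return path, None
    if ".." in fragment:
        begin, _, end = fragment.partition("..")
        return path, ("range", begin or None, end or None)
    return path, ("labels", {label.strip() for label in fragment.split(",")})


def select_range(ordered_labels, begin, end):
    """Select labels from a linear ordering; the end bound is INCLUDED."""
    start = ordered_labels.index(begin) if begin is not None else 0
    stop = ordered_labels.index(end) + 1 if end is not None else len(ordered_labels)
    return ordered_labels[start:stop]


# Hypothetical linear pipeline ordering.
labels = ["isr", "characterizeImage", "calibrate", "makeWarp"]

path, spec = split_label_specifier("pipeline.yaml#isr..calibrate")
print(path, spec)  # pipeline.yaml ('range', 'isr', 'calibrate')
print(select_range(labels, spec[1], spec[2]))  # ['isr', 'characterizeImage', 'calibrate']
```

Note that the selected range includes calibrate, unlike a Python slice, and that an empty bound such as characterizeImage.. runs to the end of the ordering.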
- classmethod fromIR(deserialized_pipeline: PipelineIR) -> Pipeline
Create a pipeline from an already created PipelineIR object.
- Parameters:
- deserialized_pipeline : PipelineIR
An already created pipeline intermediate representation object.
- Returns:
- pipeline : Pipeline
The new pipeline.
- classmethod fromPipeline(pipeline: Pipeline) -> Pipeline
Create a new pipeline by copying an already existing Pipeline.
- classmethod fromString(pipeline_string: str) -> Pipeline
Create a pipeline from a string formatted as a pipeline document.
- classmethod from_uri(uri: str | ParseResult | ResourcePath | Path) -> Pipeline
Load a pipeline defined in a pipeline YAML file at a location specified by a URI.
- Parameters:
- uri : convertible to ResourcePath
If a string is supplied, this should be a URI path that points to a pipeline defined in YAML format, either as a direct path to the YAML file, or as a directory containing a pipeline.yaml file (the form used by write_to_uri with expand=True). This URI may also supply additional labels to be used in subsetting the loaded Pipeline. These labels are separated from the path by a #, and may be specified as a comma-separated list, or as a range denoted beginning..end. Beginning or end may be empty, in which case the range will be a half-open interval. Unlike Python iteration bounds, end bounds are INCLUDED. Note that range-based selection is not well defined for pipelines that are not linear in nature; correct behavior is not guaranteed and may vary from run to run. The same specifiers can be used with a ResourcePath object, by being the sole contents in the fragments attribute.
- Returns:
- pipeline : Pipeline
The pipeline loaded from the specified location with appropriate (if any) subsetting.
Notes
This method attempts to prune any contracts that contain labels which are not in the declared subset of labels. This pruning is done using string-based matching due to the nature of contracts and may prune more than it should.
- getInstrument() -> str | None
Get the instrument from the pipeline.
- Returns:
- instrument : str or None
The fully qualified name of a lsst.obs.base.Instrument subclass, or None if the pipeline does not have an instrument.
- get_data_id(universe: DimensionUniverse) -> DataCoordinate
Return a data ID with all dimension constraints embedded in the pipeline.
- Parameters:
- universe : lsst.daf.butler.DimensionUniverse
Object that defines all dimensions.
- Returns:
- data_id : lsst.daf.butler.DataCoordinate
Data ID with all dimension constraints embedded in the pipeline.
- mergePipeline(pipeline: Pipeline) -> None
Merge another in-memory Pipeline object into this one.
This merges another pipeline into this object, as if it were declared in the import block of the YAML definition of this pipeline. This modifies this pipeline in place.
- removeTask(label: str) -> None
Remove a task from the pipeline.
- Parameters:
- label : str
The label used to identify the task that is to be removed.
- Raises:
- KeyError
If no task with that label exists in the pipeline.
- subsetFromLabels(labelSpecifier: LabelSpecifier, subsetCtrl: PipelineSubsetCtrl = PipelineSubsetCtrl.DROP) -> Pipeline
Subset a pipeline to contain only labels specified in labelSpecifier.
- Parameters:
- labelSpecifier : LabelSpecifier
Object containing labels that describe how to subset a pipeline.
- subsetCtrl : PipelineSubsetCtrl
Control object which decides how subsets with missing labels are handled. Setting to PipelineSubsetCtrl.DROP (the default) will cause any subsets that have labels which are not in the set of all task labels to be dropped. Setting to PipelineSubsetCtrl.EDIT will cause the subset to instead be edited to remove the nonexistent label.
- Returns:
- pipeline : Pipeline
A new pipeline object that is a subset of the old pipeline.
- Raises:
- ValueError
Raised if there is an issue with the specified labels.
Notes
This method attempts to prune any contracts that contain labels which are not in the declared subset of labels. This pruning is done using string-based matching due to the nature of contracts and may prune more than it should.
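The difference between the DROP and EDIT settings of subsetCtrl can be sketched on plain dictionaries. This is a minimal illustration of the semantics described above, not the library's code: SubsetCtrl is a hypothetical stand-in for PipelineSubsetCtrl, filter_subsets is a hypothetical helper, and keeping a subset that becomes empty under EDIT is an assumption of this sketch.

```python
from enum import Enum


class SubsetCtrl(Enum):
    """Hypothetical stand-in for PipelineSubsetCtrl."""

    DROP = "drop"
    EDIT = "edit"


def filter_subsets(subsets, kept_task_labels, ctrl=SubsetCtrl.DROP):
    """Return the labeled subsets that survive subsetting to ``kept_task_labels``."""
    result = {}
    for name, task_labels in subsets.items():
        missing = task_labels - kept_task_labels
        if not missing:
            # Every label in the subset still exists; keep it unchanged.
            result[name] = set(task_labels)
        elif ctrl is SubsetCtrl.EDIT:
            # EDIT: remove only the labels that no longer exist.
            result[name] = task_labels & kept_task_labels
        # DROP: a subset with any missing label is omitted entirely.
    return result


# Hypothetical labeled subsets and the task labels kept after subsetting.
subsets = {"step1": {"isr", "characterizeImage"}, "step2": {"calibrate"}}
kept = {"isr", "calibrate"}

print(filter_subsets(subsets, kept, SubsetCtrl.DROP))  # {'step2': {'calibrate'}}
print(filter_subsets(subsets, kept, SubsetCtrl.EDIT))  # {'step1': {'isr'}, 'step2': {'calibrate'}}
```

Under DROP, step1 disappears because characterizeImage is no longer present; under EDIT, step1 survives with only isr.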
- toExpandedPipeline() -> Generator[TaskDef, None, None]
Return a generator of TaskDefs which can be used to create quantum graphs.
- Returns:
- generator : generator of TaskDef
The generator returned will be the sorted iterator of tasks which are to be used in constructing a quantum graph.
- Raises:
- NotImplementedError
If a dataId is supplied in a config block. This is in place for future use.
- to_graph(registry: Registry | None = None) -> PipelineGraph
Construct a pipeline graph from this pipeline.
Constructing a graph applies all configuration overrides, freezes all configuration, checks all contracts, and checks for dataset type consistency between tasks (as much as possible without access to a data repository). It cannot be reversed.
- Parameters:
- registry : lsst.daf.butler.Registry, optional
Data repository client. If provided, the graph's dataset types and dimensions will be resolved (see PipelineGraph.resolve).
- Returns:
- graph : pipeline_graph.PipelineGraph
Representation of the pipeline as a graph.
- write_to_uri(uri: str | ParseResult | ResourcePath | Path) -> None
Write the pipeline to a file or directory.
- Parameters:
- uri : convertible to ResourcePath
URI to write to; may have any scheme with ResourcePath write support, or no scheme for a local file/directory. Should have a .yaml extension.