lsst.pipe.base

The lsst.pipe.base module provides base classes for the task framework. Tasks package the algorithmic units of the LSST Science Pipelines. You can create, configure, and run tasks with their Python APIs. Some tasks, called command-line tasks, are also packaged into data processing pipelines that you can run from the command line.

Using lsst.pipe.base

Overview

Using command-line tasks

Developing tasks and command-line tasks

Contributing

lsst.pipe.base is developed at https://github.com/lsst/pipe_base. You can find Jira issues for this module under the pipe_base component.

Python API reference

lsst.pipe.base Package

Functions

iterConnections(connections, connectionType)

Create an iterator over the selected connection type that yields all the defined connections of that type.

logInfo(obj, prefix[, logLevel])

Log timer information to obj.metadata and obj.log.

timeMethod(func)

Decorator to measure duration of a task method.
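To illustrate the idea behind a timing decorator such as timeMethod, here is a minimal sketch. It is not the lsst.pipe.base implementation: the names time_method_sketch and DemoTask are hypothetical, and the sketch assumes only that the decorated object has a dict-like metadata attribute and a standard logging logger named log.

```python
import functools
import logging
import time


def time_method_sketch(func):
    """Record how long a method call takes (illustrative stand-in only)."""

    @functools.wraps(func)
    def wrapper(self, *args, **kwargs):
        start = time.perf_counter()
        try:
            return func(self, *args, **kwargs)
        finally:
            elapsed = time.perf_counter() - start
            # Hypothetical storage: a plain dict keyed by method name.
            self.metadata[func.__name__ + "Duration"] = elapsed
            self.log.info("%s took %.6f s", func.__name__, elapsed)

    return wrapper


class DemoTask:
    """Hypothetical object with the two attributes the sketch relies on."""

    def __init__(self):
        self.metadata = {}
        self.log = logging.getLogger("DemoTask")

    @time_method_sketch
    def run(self, values):
        return sum(values)


task = DemoTask()
total = task.run([1, 2, 3])
```

The try/finally form records the duration even when the wrapped method raises, which is usually what you want for diagnostics.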

Classes

ArgumentParser(name[, usage])

Argument parser for command-line tasks that is based on argparse.ArgumentParser.

ButlerInitializedTaskRunner(TaskClass, parsedCmd)

A TaskRunner for CmdLineTasks that require a butler keyword argument to be passed to their constructor.

ButlerQuantumContext(butler, quantum)

A Butler-like class specialized for a single quantum.

CmdLineTask([config, name, parentTask, log])

Base class for command-line tasks: tasks that may be executed from the command-line.

ConfigDatasetType(name)

Dataset type specified by a config parameter.

ConfigFileAction(option_strings, dest[, …])

argparse action to load config overrides from one or more files.

ConfigValueAction(option_strings, dest[, …])

argparse action callback to override config parameters using name=value pairs from the command-line.

DataIdContainer([level])

Container for data IDs and associated data references.

DatasetArgument([name, help, default])

Dataset type specified by a command-line argument.

DeferredDatasetRef(datasetRef)

Class that denotes that a datasetRef should be treated as deferred when interacting with the butler.

GraphBuilder(registry[, skipExisting])

The GraphBuilder class is responsible for building a task execution graph from a Pipeline.

IncompatibleGraphError

Exception class to indicate that a lookup by NodeId is impossible due to incompatibilities.

InputOnlyArgumentParser(name[, usage])

ArgumentParser for command-line tasks that don’t write any output.

InputQuantizedConnection(**kwargs)

LabelSpecifier(labels, begin, end)

A structure to specify a subset of labels to load.

LegacyTaskRunner(TaskClass, parsedCmd[, …])

A TaskRunner for CmdLineTasks that calls the Task's run method on a dataRef rather than the runDataRef method.

NodeId(number, buildId)

This represents a unique identifier of a node within an individual construction of a QuantumGraph.

OutputQuantizedConnection(**kwargs)

Pipeline(description)

A Pipeline is a representation of a series of tasks to run, and the configuration for those tasks.

PipelineDatasetTypes(initInputs, …)

An immutable struct that classifies the dataset types used in a Pipeline.

PipelineTask(*[, config, log, initInputs])

Base class for all pipeline tasks.

PipelineTaskConfig(*args, **kw)

Configuration class for PipelineTask.

PipelineTaskConnections(*, config)

PipelineTaskConnections is a class used to declare the desired I/O when a PipelineTask is run by an activator.

QuantumGraph(quanta, …)

QuantumGraph is a directed acyclic graph of QuantumNode objects.

QuantumNode(quantum, taskDef, nodeId)

This class represents a node in the quantum graph.

ResourceConfig(*args, **kw)

Configuration for resource requirements.

Struct(**keyArgs)

A container to which you can add fields as attributes.

Task([config, name, parentTask, log])

Base class for data processing tasks.

TaskDatasetTypes(initInputs, initOutputs, …)

An immutable struct that extracts and classifies the dataset types used by a PipelineTask.

TaskDef(taskName, config[, taskClass, label])

TaskDef is a collection of information about a task needed by a Pipeline.

TaskError

Use to report errors for which a traceback is not useful.

TaskFactory()

Abstract base class for task factory.

TaskRunner(TaskClass, parsedCmd[, …])

Run a command-line task, using multiprocessing if requested.
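As an illustration of the Struct entry in the list above ("a container to which you can add fields as attributes"), the following is a minimal sketch of that pattern. It deliberately uses a hypothetical stand-in class, SimpleStruct, so that it runs without the LSST stack installed. The stand-in and its as_dict helper are assumptions made for this example and do not reproduce the actual lsst.pipe.base.Struct API.

```python
class SimpleStruct:
    """Hypothetical stand-in for an attribute container (not the LSST class)."""

    def __init__(self, **keyArgs):
        # Each keyword argument becomes an attribute of the same name.
        for name, value in keyArgs.items():
            setattr(self, name, value)

    def as_dict(self):
        """Return a shallow copy of the stored fields (illustrative helper)."""
        return dict(vars(self))


def run(values):
    """Return several named results in one object, as a task method might."""
    return SimpleStruct(mean=sum(values) / len(values), count=len(values))


result = run([2.0, 4.0, 6.0])
# Fields can also be added after construction.
result.note = "added later"
```

Returning named fields rather than a bare tuple lets callers pick out the results they need by name and lets new fields be added later without breaking existing callers.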

Class Inheritance Diagram

Inheritance diagram of lsst.pipe.base.argumentParser.ArgumentParser, lsst.pipe.base.cmdLineTask.ButlerInitializedTaskRunner, lsst.pipe.base.butlerQuantumContext.ButlerQuantumContext, lsst.pipe.base.cmdLineTask.CmdLineTask, lsst.pipe.base.argumentParser.ConfigDatasetType, lsst.pipe.base.argumentParser.ConfigFileAction, lsst.pipe.base.argumentParser.ConfigValueAction, lsst.pipe.base.argumentParser.DataIdContainer, lsst.pipe.base.argumentParser.DatasetArgument, lsst.pipe.base.connections.DeferredDatasetRef, lsst.pipe.base.graphBuilder.GraphBuilder, lsst.pipe.base.graph.graph.IncompatibleGraphError, lsst.pipe.base.argumentParser.InputOnlyArgumentParser, lsst.pipe.base.connections.InputQuantizedConnection, lsst.pipe.base.pipeline.LabelSpecifier, lsst.pipe.base.cmdLineTask.LegacyTaskRunner, lsst.pipe.base.graph.quantumNode.NodeId, lsst.pipe.base.connections.OutputQuantizedConnection, lsst.pipe.base.pipeline.Pipeline, lsst.pipe.base.pipeline.PipelineDatasetTypes, lsst.pipe.base.pipelineTask.PipelineTask, lsst.pipe.base.config.PipelineTaskConfig, lsst.pipe.base.connections.PipelineTaskConnections, lsst.pipe.base.graph.graph.QuantumGraph, lsst.pipe.base.graph.quantumNode.QuantumNode, lsst.pipe.base.config.ResourceConfig, lsst.pipe.base.struct.Struct, lsst.pipe.base.task.Task, lsst.pipe.base.pipeline.TaskDatasetTypes, lsst.pipe.base.pipeline.TaskDef, lsst.pipe.base.task.TaskError, lsst.pipe.base.taskFactory.TaskFactory, lsst.pipe.base.cmdLineTask.TaskRunner

lsst.pipe.base.testUtils Module

Functions

assertValidInitOutput(task)

Test that a constructed task conforms to its own init-connections.

assertValidOutput(task, result)

Test that the output of a call to run conforms to its own connections.

getInitInputs(butler, config)

Return the initInputs object that would have been passed to a PipelineTask constructor.

makeQuantum(task, butler, dataId, ioDataIds)

Create a Quantum for one or more particular data IDs.

runTestQuantum(task, butler, quantum[, mockRun])

Run a PipelineTask on a Quantum.

lsst.pipe.base.connectionTypes Module

Classes

InitInput(name, storageClass, doc, multiple)

InitOutput(name, storageClass, doc, multiple)

Input(name, storageClass, doc, multiple, …)

PrerequisiteInput(name, storageClass, doc, …)

Class used for declaring PipelineTask prerequisite connections.

Output(name, storageClass, doc, multiple, …)

BaseConnection(name, storageClass, doc, multiple)

Base class used for declaring PipelineTask connections.

Class Inheritance Diagram

Inheritance diagram of lsst.pipe.base.connectionTypes.InitInput, lsst.pipe.base.connectionTypes.InitOutput, lsst.pipe.base.connectionTypes.Input, lsst.pipe.base.connectionTypes.PrerequisiteInput, lsst.pipe.base.connectionTypes.Output, lsst.pipe.base.connectionTypes.BaseConnection

lsst.pipe.base.pipelineIR Module

Classes

ConfigIR(python, dataId, file, rest)

Intermediate representation of configurations read from a pipeline yaml file.

ContractError

An exception that is raised when a pipeline contract is not satisfied.

ContractIR(contract, msg)

Intermediate representation of contracts read from a pipeline yaml file.

ImportIR(location, include, exclude, …)

An intermediate representation of imported pipelines.

PipelineIR(loaded_yaml)

Intermediate representation of a pipeline definition.

TaskIR(label, klass, config)

Intermediate representation of tasks read from a pipeline yaml file.

LabeledSubset(label, subset, description)

Intermediate representation of a named subset of task labels read from a pipeline yaml file.

Class Inheritance Diagram

Inheritance diagram of lsst.pipe.base.pipelineIR.ConfigIR, lsst.pipe.base.pipelineIR.ContractError, lsst.pipe.base.pipelineIR.ContractIR, lsst.pipe.base.pipelineIR.ImportIR, lsst.pipe.base.pipelineIR.PipelineIR, lsst.pipe.base.pipelineIR.TaskIR, lsst.pipe.base.pipelineIR.LabeledSubset