Kueue Configuration API
Resource Types
AdmissionFairSharing
Appears in:
| Field | Description |
|---|---|
usageHalfLifeTime [Required]k8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
usageHalfLifeTime indicates the time after which the current usage will decay by a half If set to 0, usage will be reset to 0 immediately. |
usageSamplingInterval [Required]k8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
usageSamplingInterval indicates how often Kueue updates consumedResources in FairSharingStatus Defaults to 5min. |
resourceWeights [Required]map[ResourceName]float64
|
resourceWeights assigns weights to resources which then are used to calculate LocalQueue's resource usage and order Workloads. Defaults to 1. |
ClientConnection
Appears in:
| Field | Description |
|---|---|
qps [Required]float32
|
QPS controls the number of queries per second allowed for K8S api server connection. Setting this to a negative value will disable client-side ratelimiting. |
burst [Required]int32
|
Burst allows extra queries to accumulate when a client is exceeding its rate. |
ClusterQueueVisibility
Appears in:
| Field | Description |
|---|---|
maxCount [Required]int32
|
MaxCount indicates the maximal number of pending workloads exposed in the cluster queue status. When the value is set to 0, then ClusterQueue visibility updates are disabled. The maximal value is 4000. Defaults to 10. |
ControllerConfigurationSpec
Appears in:
ControllerConfigurationSpec defines the global configuration for controllers registered with the manager.
| Field | Description |
|---|---|
groupKindConcurrencymap[string]int
|
GroupKindConcurrency is a map from a Kind to the number of concurrent reconciliation allowed for that controller. When a controller is registered within this manager using the builder utilities, users have to specify the type the controller reconciles in the For(...) call. If the object's kind passed matches one of the keys in this map, the concurrency for that controller is set to the number specified. The key is expected to be consistent in form with GroupKind.String(),
e.g. ReplicaSet in apps group (regardless of version) would be |
cacheSyncTimeouttime.Duration
|
CacheSyncTimeout refers to the time limit set to wait for syncing caches. Defaults to 2 minutes if not set. |
ControllerHealth
Appears in:
ControllerHealth defines the health configs.
| Field | Description |
|---|---|
healthProbeBindAddressstring
|
HealthProbeBindAddress is the TCP address that the controller should bind to for serving health probes It can be set to "0" or "" to disable serving the health probe. |
readinessEndpointNamestring
|
ReadinessEndpointName, defaults to "readyz" |
livenessEndpointNamestring
|
LivenessEndpointName, defaults to "healthz" |
ControllerManager
Appears in:
| Field | Description |
|---|---|
webhookControllerWebhook
|
Webhook contains the controllers webhook configuration |
leaderElectionk8s.io/component-base/config/v1alpha1.LeaderElectionConfiguration
|
LeaderElection is the LeaderElection config to be used when configuring the manager.Manager leader election |
metricsControllerMetrics
|
Metrics contains the controller metrics configuration |
healthControllerHealth
|
Health contains the controller health configuration |
pprofBindAddressstring
|
PprofBindAddress is the TCP address that the controller should bind to for serving pprof. It can be set to "" or "0" to disable the pprof serving. Since pprof may contain sensitive information, make sure to protect it before exposing it to public. |
controllerControllerConfigurationSpec
|
Controller contains global configuration options for controllers registered within this manager. |
ControllerMetrics
Appears in:
ControllerMetrics defines the metrics configs.
| Field | Description |
|---|---|
bindAddressstring
|
BindAddress is the TCP address that the controller should bind to for serving prometheus metrics. It can be set to "0" to disable the metrics serving. |
enableClusterQueueResourcesbool
|
EnableClusterQueueResources, if true the cluster queue resource usage and quotas metrics will be reported. |
ControllerWebhook
Appears in:
ControllerWebhook defines the webhook server for the controller.
| Field | Description |
|---|---|
portint
|
Port is the port that the webhook server serves at. It is used to set webhook.Server.Port. |
hoststring
|
Host is the hostname that the webhook server binds to. It is used to set webhook.Server.Host. |
certDirstring
|
CertDir is the directory that contains the server key and certificate. if not set, webhook server would look up the server key and certificate in {TempDir}/k8s-webhook-server/serving-certs. The server key and certificate must be named tls.key and tls.crt, respectively. |
DeviceClassMapping
Appears in:
DeviceClassMapping holds device class to logical resource mappings for Dynamic Resource Allocation support.
| Field | Description |
|---|---|
name [Required]k8s.io/api/core/v1.ResourceName
|
Name is referenced in ClusterQueue.nominalQuota and Workload status. Must be a valid fully qualified name consisting of an optional DNS subdomain prefix followed by a slash and a DNS label, or just a DNS label. DNS labels consist of lower-case alphanumeric characters or hyphens, and must start and end with an alphanumeric character. DNS subdomain prefixes follow the same rules as DNS labels but can contain periods. The total length must not exceed 253 characters. |
deviceClassNames [Required][]k8s.io/api/core/v1.ResourceName
|
DeviceClassNames enumerates the DeviceClasses represented by this resource name. Each device class name must be a valid qualified name consisting of an optional DNS subdomain prefix followed by a slash and a DNS label, or just a DNS label. DNS labels consist of lower-case alphanumeric characters or hyphens, and must start and end with an alphanumeric character. DNS subdomain prefixes follow the same rules as DNS labels but can contain periods. The total length of each name must not exceed 253 characters. |
FairSharing
Appears in:
| Field | Description |
|---|---|
enable [Required]bool
|
enable indicates whether to enable Fair Sharing for all cohorts. Defaults to false. |
preemptionStrategies [Required][]PreemptionStrategy
|
preemptionStrategies indicates which constraints should a preemption satisfy. The preemption algorithm will only use the next strategy in the list if the incoming workload (preemptor) doesn't fit after using the previous strategies. Possible values are:
|
Integrations
Appears in:
| Field | Description |
|---|---|
frameworks [Required][]string
|
List of framework names to be enabled. Possible options:
|
externalFrameworks [Required][]string
|
List of GroupVersionKinds that are managed for Kueue by external controllers;
the expected format is |
podOptions [Required]PodIntegrationOptions
|
PodOptions defines kueue controller behaviour for pod objects Deprecated: This field will be removed on v1beta2, use ManagedJobsNamespaceSelector (https://kueue.sigs.k8s.io/docs/tasks/run/plain_pods/) instead. |
labelKeysToCopy [Required][]string
|
labelKeysToCopy is a list of label keys that should be copied from the job into the workload object. It is not required for the job to have all the labels from this list. If a job does not have some label with the given key from this list, the constructed workload object will be created without this label. In the case of creating a workload from a composable job (pod group), if multiple objects have labels with some key from the list, the values of these labels must match or otherwise the workload creation would fail. The labels are copied only during the workload creation and are not updated even if the labels of the underlying job are changed. |
InternalCertManagement
Appears in:
| Field | Description |
|---|---|
enable [Required]bool
|
Enable controls the use of internal cert management for the webhook,
metrics and visibility endpoints.
When enabled Kueue is using libraries to generate and
self-sign the certificates.
When disabled, you need to provide the certificates for
the webhooks, metrics and visibility through a third party certificate
This secret is mounted to the kueue controller manager pod. The mount
path for webhooks is /tmp/k8s-webhook-server/serving-certs, for
metrics endpoint the expected path is |
webhookServiceName [Required]string
|
WebhookServiceName is the name of the Service used as part of the DNSName. Defaults to kueue-webhook-service. |
webhookSecretName [Required]string
|
WebhookSecretName is the name of the Secret used to store CA and server certs. Defaults to kueue-webhook-server-cert. |
MultiKueue
Appears in:
| Field | Description |
|---|---|
gcIntervalk8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
GCInterval defines the time interval between two consecutive garbage collection runs. Defaults to 1min. If 0, the garbage collection is disabled. |
originstring
|
Origin defines a label value used to track the creator of workloads in the worker clusters. This is used by multikueue in components like its garbage collector to identify remote objects that ware created by this multikueue manager cluster and delete them if their local counterpart no longer exists. |
workerLostTimeoutk8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
WorkerLostTimeout defines the time a local workload's multikueue admission check state is kept Ready if the connection with its reserving worker cluster is lost. Defaults to 15 minutes. |
dispatcherNamestring
|
DispatcherName defines the dispatcher responsible for selecting worker clusters to handle the workload.
|
externalFrameworks[]MultiKueueExternalFramework
|
ExternalFrameworks defines a list of external frameworks that should be supported by the generic MultiKueue adapter. Each entry defines how to handle a specific GroupVersionKind (GVK) for MultiKueue operations. |
MultiKueueExternalFramework
Appears in:
MultiKueueExternalFramework defines a framework that is not built-in.
| Field | Description |
|---|---|
name [Required]string
|
Name is the GVK of the resource that are
managed by external controllers
the expected format is |
ObjectRetentionPolicies
Appears in:
ObjectRetentionPolicies holds retention settings for different object types.
| Field | Description |
|---|---|
workloadsWorkloadRetentionPolicy
|
Workloads configures retention for Workloads. A nil value disables automatic deletion of Workloads. |
PodIntegrationOptions
Appears in:
| Field | Description |
|---|---|
namespaceSelector [Required]k8s.io/apimachinery/pkg/apis/meta/v1.LabelSelector
|
NamespaceSelector can be used to omit some namespaces from pod reconciliation |
podSelector [Required]k8s.io/apimachinery/pkg/apis/meta/v1.LabelSelector
|
PodSelector can be used to choose what pods to reconcile |
PreemptionStrategy
(Alias of string)
Appears in:
QueueVisibility
Appears in:
| Field | Description |
|---|---|
clusterQueues [Required]ClusterQueueVisibility
|
ClusterQueues is configuration to expose the information about the top pending workloads in the cluster queue. |
updateIntervalSeconds [Required]int32
|
UpdateIntervalSeconds specifies the time interval for updates to the structure of the top pending workloads in the queues. The minimum value is 1. Defaults to 5. |
RequeuingStrategy
Appears in:
| Field | Description |
|---|---|
timestampRequeuingTimestamp
|
Timestamp defines the timestamp used for re-queuing a Workload that was evicted due to Pod readiness. The possible values are:
|
backoffLimitCountint32
|
BackoffLimitCount defines the maximum number of re-queuing retries.
Once the number is reached, the workload is deactivated ( Every backoff duration is about "b*2^(n-1)+Rand" where:
Defaults to null. |
backoffBaseSecondsint32
|
BackoffBaseSeconds defines the base for the exponential backoff for re-queuing an evicted workload. Defaults to 60. |
backoffMaxSecondsint32
|
BackoffMaxSeconds defines the maximum backoff time to re-queue an evicted workload. Defaults to 3600. |
RequeuingTimestamp
(Alias of string)
Appears in:
ResourceTransformation
Appears in:
| Field | Description |
|---|---|
input [Required]k8s.io/api/core/v1.ResourceName
|
Input is the name of the input resource. |
strategy [Required]ResourceTransformationStrategy
|
Strategy specifies if the input resource should be replaced or retained. Defaults to Retain |
outputs [Required]k8s.io/api/core/v1.ResourceList
|
Outputs specifies the output resources and quantities per unit of input resource.
An empty Outputs combined with a |
ResourceTransformationStrategy
(Alias of string)
Appears in:
Resources
Appears in:
| Field | Description |
|---|---|
excludeResourcePrefixes [Required][]string
|
ExcludedResourcePrefixes defines which resources should be ignored by Kueue |
transformations [Required][]ResourceTransformation
|
Transformations defines how to transform PodSpec resources into Workload resource requests. This is intended to be a map with Input as the key (enforced by validation code) |
deviceClassMappings[]DeviceClassMapping
|
DeviceClassMappings defines mappings from device classes to logical resources for Dynamic Resource Allocation support. |
WaitForPodsReady
Appears in:
WaitForPodsReady defines configuration for the Wait For Pods Ready feature, which is used to ensure that all Pods are ready within the specified time.
| Field | Description |
|---|---|
enable [Required]bool
|
Enable indicates whether to enable wait for pods ready feature. Defaults to false. |
timeoutk8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
Timeout defines the time for an admitted workload to reach the PodsReady=true condition. When the timeout is exceeded, the workload evicted and requeued in the same cluster queue. Defaults to 5min. |
blockAdmission [Required]bool
|
BlockAdmission when true, cluster queue will block admissions for all
subsequent jobs until the jobs reach the PodsReady=true condition.
This setting is only honored when |
requeuingStrategyRequeuingStrategy
|
RequeuingStrategy defines the strategy for requeuing a Workload. |
recoveryTimeoutk8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
RecoveryTimeout defines an opt-in timeout, measured since the last transition to the PodsReady=false condition after a Workload is Admitted and running. Such a transition may happen when a Pod failed and the replacement Pod is awaited to be scheduled. After exceeding the timeout the corresponding job gets suspended again and requeued after the backoff delay. The timeout is enforced only if waitForPodsReady.enable=true. If not set, there is no timeout. |
WorkloadRetentionPolicy
Appears in:
WorkloadRetentionPolicy defines the policies for when Workloads should be deleted.
| Field | Description |
|---|---|
afterFinishedk8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
AfterFinished is the duration to wait after a Workload finishes before deleting it. A duration of 0 will delete immediately. A nil value disables automatic deletion. Represented using metav1.Duration (e.g. "10m", "1h30m"). |
afterDeactivatedByKueuek8s.io/apimachinery/pkg/apis/meta/v1.Duration
|
AfterDeactivatedByKueue is the duration to wait after any Kueue-managed Workload (such as a Job, JobSet, or other custom workload types) has been marked as deactivated by Kueue before automatically deleting it. Deletion of deactivated workloads may cascade to objects not created by Kueue, since deleting the parent Workload owner (e.g. JobSet) can trigger garbage-collection of dependent resources. A duration of 0 will delete immediately. A nil value disables automatic deletion. Represented using metav1.Duration (e.g. "10m", "1h30m"). |
Feedback
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.