Configuring the requests per second (RPS) target

This setting specifies the target number of requests per second (RPS) that each replica of an application should handle.

  • Global key: requests-per-second-target-default
  • Per-revision annotation key: autoscaling.knative.dev/target (the revision must also set the autoscaling.knative.dev/metric annotation to "rps")
  • Possible values: An integer.
  • Default: "200"
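
To make the meaning of a per-replica target concrete, the following is a minimal sketch of the arithmetic, not the autoscaler's actual implementation. The function name estimated_replicas is hypothetical, and the calculation deliberately ignores target utilization, panic mode, scale bounds, and metric averaging windows, all of which can change the real result.

```python
import math


def estimated_replicas(observed_rps: float, target_rps_per_replica: float = 200.0) -> int:
    """Rough estimate: total observed RPS divided by the per-replica target.

    Assumption: this is a simplification for illustration only. It ignores
    target utilization, panic mode, scale bounds, and averaging windows.
    The default of 200 mirrors the documented default target.
    """
    if target_rps_per_replica <= 0:
        raise ValueError("target_rps_per_replica must be positive")
    if observed_rps <= 0:
        return 0
    # Round up so that no replica is asked to exceed the target.
    return math.ceil(observed_rps / target_rps_per_replica)


print(estimated_replicas(600, 150))  # 4
print(estimated_replicas(600))       # 3
```

Under these assumptions, lowering the target from 200 to 150 at a steady 600 requests per second raises the estimate from 3 replicas to 4.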

Per-revision example:

apiVersion: serving.knative.dev/v1
kind: Service
metadata:
  name: helloworld-go
  namespace: default
spec:
  template:
    metadata:
      annotations:
        autoscaling.knative.dev/target: "150"
        autoscaling.knative.dev/metric: "rps"
    spec:
      containers:
        - image: gcr.io/knative-samples/helloworld-go

Global (ConfigMap) example:

apiVersion: v1
kind: ConfigMap
metadata:
  name: config-autoscaler
  namespace: knative-serving
data:
  requests-per-second-target-default: "150"

Global (Operator) example:

apiVersion: operator.knative.dev/v1alpha1
kind: KnativeServing
metadata:
  name: knative-serving
spec:
  config:
    autoscaler:
      requests-per-second-target-default: "150"