diff --git a/charts/deepgram-self-hosted/CHANGELOG.md b/charts/deepgram-self-hosted/CHANGELOG.md index 2862e78..7a20cbe 100644 --- a/charts/deepgram-self-hosted/CHANGELOG.md +++ b/charts/deepgram-self-hosted/CHANGELOG.md @@ -6,6 +6,10 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/), ## [Unreleased] +### Added + +- Allow specifying custom annotations for deployments. + ## [0.2.3] - 2024-07-15 ### Added diff --git a/charts/deepgram-self-hosted/README.md b/charts/deepgram-self-hosted/README.md index fcf5f78..5ae1768 100644 --- a/charts/deepgram-self-hosted/README.md +++ b/charts/deepgram-self-hosted/README.md @@ -177,6 +177,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin | Key | Type | Default | Description | |-----|------|---------|-------------| +| api.additionalAnnotations | object | `nil` | Additional annotations to add to the API deployment | | api.additionalLabels | object | `{}` | Additional labels to add to API resources | | api.affinity | object | `{}` | [Affinity and anti-affinity](https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#affinity-and-anti-affinity) to apply for API pods. | | api.driverPool | object | `` | driverPool configures the backend pool of speech engines (generically referred to as "drivers" here). The API will load-balance among drivers in the standard pool; if one standard driver fails, the next one will be tried. | @@ -217,6 +218,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin | cluster-autoscaler.enabled | bool | `false` | Set to `true` to enable node autoscaling with AWS EKS. Note needed for GKE, as autoscaling is enabled by a [cli option on cluster creation](https://cloud.google.com/kubernetes-engine/docs/how-to/cluster-autoscaler#creating_a_cluster_with_autoscaling). | | cluster-autoscaler.rbac.serviceAccount.annotations."eks.amazonaws.com/role-arn" | string | `nil` | Replace with the AWS Role ARN configured for the Cluster Autoscaler. See the [Deepgram AWS EKS guide](https://developers.deepgram.com/docs/aws-k8s#creating-a-cluster) or [Cluster Autoscaler AWS documentation](https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/cloudprovider/aws/README.md#permissions) for details. | | cluster-autoscaler.rbac.serviceAccount.name | string | `"cluster-autoscaler-sa"` | Name of the IAM Service Account with the [necessary permissions](https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/cloudprovider/aws/README.md#permissions) | +| engine.additionalAnnotations | object | `nil` | Additional annotations to add to the Engine deployment | | engine.additionalLabels | object | `{}` | Additional labels to add to Engine resources | | engine.affinity | object | `{}` | [Affinity and anti-affinity](https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#affinity-and-anti-affinity) to apply for Engine pods. | | engine.chunking | object | `` | chunking defines the size of audio chunks to process in seconds. Adjusting these values will affect both inference performance and accuracy of results. Please contact your Deepgram Account Representative if you want to adjust any of these values. | @@ -281,6 +283,7 @@ If you encounter issues while deploying or using Deepgram, consider the followin | kube-prometheus-stack | object | `` | Passthrough values for [Prometheus k8s stack Helm chart](https://github.com/prometheus-community/helm-charts/tree/main/charts/kube-prometheus-stack). Prometheus (and its adapter) should be configured when scaling.auto is enabled. You may choose to use the installation/configuration bundled in this Helm chart, or you may configure an existing Prometheus installation in your cluster to expose the needed values. See source Helm chart for explanation of available values. Default values provided in this chart are used to provide pod autoscaling for Deepgram pods. | | kube-prometheus-stack.includeDependency | bool | `nil` | Normally, this chart will be installed if `scaling.auto.enabled` is true. However, if you wish to manage the Prometheus adapter in your cluster on your own and not as part of the Deepgram Helm chart, you can force it to not be installed by setting this to `false`. | | licenseProxy | object | `` | Configuration options for the optional [Deepgram License Proxy](https://developers.deepgram.com/docs/license-proxy). | +| licenseProxy.additionalAnnotations | object | `nil` | Additional annotations to add to the LicenseProxy deployment | | licenseProxy.additionalLabels | object | `{}` | Additional labels to add to License Proxy resources | | licenseProxy.affinity | object | `{}` | [Affinity and anti-affinity](https://kubernetes.io/docs/concepts/scheduling-eviction/assign-pod-node/#affinity-and-anti-affinity) to apply for License Proxy pods. | | licenseProxy.deploySecondReplica | bool | `false` | If the License Proxy is deployed, one replica should be sufficient to support many API/Engine pods. Highly available environments may wish to deploy a second replica to ensure uptime, which can be toggled with this option. | diff --git a/charts/deepgram-self-hosted/templates/api/api.deployment.yaml b/charts/deepgram-self-hosted/templates/api/api.deployment.yaml index e535579..a1edcf6 100644 --- a/charts/deepgram-self-hosted/templates/api/api.deployment.yaml +++ b/charts/deepgram-self-hosted/templates/api/api.deployment.yaml @@ -8,6 +8,10 @@ metadata: {{- range $key, $val := .Values.engine.additionalLabels }} {{ $key }}: {{ $val | quote }} {{- end}} + {{- with .Values.api.additionalAnnotations }} + annotations: + {{- toYaml . | nindent 4 }} + {{- end }} spec: selector: matchLabels: diff --git a/charts/deepgram-self-hosted/templates/engine/engine.deployment.yaml b/charts/deepgram-self-hosted/templates/engine/engine.deployment.yaml index 18eafe8..894c36d 100644 --- a/charts/deepgram-self-hosted/templates/engine/engine.deployment.yaml +++ b/charts/deepgram-self-hosted/templates/engine/engine.deployment.yaml @@ -8,6 +8,10 @@ metadata: {{- range $key, $val := .Values.engine.additionalLabels }} {{ $key }}: {{ $val | quote }} {{- end}} + {{- with .Values.engine.additionalAnnotations }} + annotations: + {{- toYaml . | nindent 4 }} + {{- end }} spec: selector: matchLabels: diff --git a/charts/deepgram-self-hosted/templates/license-proxy/license-proxy.deployment.yaml b/charts/deepgram-self-hosted/templates/license-proxy/license-proxy.deployment.yaml index e8a9dd9..0c14c17 100644 --- a/charts/deepgram-self-hosted/templates/license-proxy/license-proxy.deployment.yaml +++ b/charts/deepgram-self-hosted/templates/license-proxy/license-proxy.deployment.yaml @@ -8,7 +8,11 @@ metadata: app: deepgram-license-proxy {{- range $key, $val := .Values.licenseProxy.additionalLabels }} {{ $key }}: {{ $val | quote }} - {{- end}} + {{- end }} + {{- with .Values.licenseProxy.additionalAnnotations }} + annotations: + {{- toYaml . | nindent 4 }} + {{- end }} spec: selector: matchLabels: diff --git a/charts/deepgram-self-hosted/values.yaml b/charts/deepgram-self-hosted/values.yaml index 868e337..be170d7 100644 --- a/charts/deepgram-self-hosted/values.yaml +++ b/charts/deepgram-self-hosted/values.yaml @@ -117,6 +117,9 @@ api: # -- Additional labels to add to API resources additionalLabels: {} + # -- (object) Additional annotations to add to the API deployment + additionalAnnotations: + updateStrategy: rollingUpdate: # -- The maximum number of API pods, relative to the number of replicas, @@ -276,6 +279,9 @@ engine: # -- Additional labels to add to Engine resources additionalLabels: {} + # -- (object) Additional annotations to add to the Engine deployment + additionalAnnotations: + updateStrategy: rollingUpdate: # -- The maximum number of Engine pods, relative to the number of replicas, @@ -527,6 +533,9 @@ licenseProxy: # -- Additional labels to add to License Proxy resources additionalLabels: {} + # -- (object) Additional annotations to add to the LicenseProxy deployment + additionalAnnotations: + updateStrategy: # -- For the LicenseProxy, we only expose maxSurge and not maxUnavailable. # This is to avoid accidentally having all LicenseProxy nodes go offline during upgrades,