SaladTechnologies · mgorkii-nlplogix · May 20, 2026 · May 20, 2026 · May 20, 2026
@@ -4,13 +4,12 @@ sidebarTitle: 'Performance and Reliability'
 description: 'To develop high-performance, reliable applications on SaladCloud'
 ---
 
-_Last Updated: Oct 2, 2025_
+_Last Updated: May 20, 2026_
 
 SaladCloud consists of tens of thousands of globally distributed nodes, primarily high-performance desktop computers and
-servers running the SaladCloud agent. Each node is equipped with either consumer-grade or data center GPUs, along with
-varying CPU and memory configurations. Node distribution is uneven across regions and countries: consumer GPU nodes in
-the US and Canada account for 50–60% of the total, while nearly all data center GPU nodes are currently located in the
-US.
+servers running the SaladCloud agent. Each node is equipped with consumer-grade GPUs, along with varying CPU and memory
+configurations. Node distribution is uneven across regions and countries: nodes in the US and Canada account for 50–60%
+of the total.
 
 When these devices are idle, SaladCloud leverages them to run workloads by dynamically pulling and executing container
 images. Once a container group is stopped, the image and any associated runtime data are removed from the allocated
@@ -43,17 +42,14 @@ Key observations are:
 - The count of online instances then briefly dropped by one, indicating one instance was just reallocated.
 - By around `80 minutes`, nearly all 100 instances were online, with minor fluctuations afterward due to reallocations.
 
-**SaladCloud’s data center nodes are typically deployed near the Internet backbone and offer higher bandwidth and
-processing capacity, enabling faster startup.**
-
 ## Interruptions and Reallocations
 
 An instance may go offline after coming online for several reasons. In such cases, a new instance is allocated to
 continue processing:
 
-- **Voluntary Interruptions**: Node owners (individuals or data center providers) may temporarily reclaim their
-  resources for their own use, pausing sharing. However, high-priority workloads that run reliably over long periods
-  generate higher earnings, giving owners less incentive to interrupt.
+- **Voluntary Interruptions**: Node owners may temporarily reclaim their resources for their own use, pausing sharing.
+  However, high-priority workloads that run reliably over long periods generate higher earnings, giving owners less
+  incentive to interrupt.
 
 - **External Interruptions**: Factors such as power outages, network issues, or hardware failures can also take nodes
   offline.
@@ -77,9 +73,6 @@ Key observations are:
   fluctuations along the way. **This trend shows that as applications run stably for longer periods on nodes, the
   likelihood of interruption by node owners decreases.**
 
-**SaladCloud’s data center nodes are generally more stable when running workloads at high priority and are less likely
-to be interrupted by their owners.**
-
 ## Uptimes
 
 Additionally, the 2025 test measured the uptime distributions of instances over the same period, which are primarily
@@ -97,16 +90,13 @@ Key observations are:
 - The average uptime across all instance runs (interrupted and uninterrupted) was `60 hours`.
 - The average uptime of interrupted instance runs was `35 hours`.
 
-**High-priority applications on SaladCloud’s data center nodes generally run uninterrupted for extended periods, though
-this cannot be fully guaranteed.**
-
 ## Run-to-Request Ratio
 
 The instance run-to-request ratio measures the actual compute capacity available compared to what is requested. For
 example, if 100 instances are requested and 99 are running, the run-to-request ratio is 99%. When a node goes offline
 and a replacement is allocated, additional time is required to download and decompress the image before the new instance
 becomes operational. Because of variations in startup times and uptimes, a 100% run-to-request ratio cannot be
-consistently guaranteed on SaladCloud’s consumer GPU nodes.
+consistently guaranteed on SaladCloud’s GPU nodes.
 
 Large image sizes can increase startup times, which in turn lowers the run-to-request ratio. To mitigate this, it is
 often necessary to provision additional instances (5~10%) beyond the initial plan, particularly for real-time inference
@@ -125,10 +115,6 @@ Key observations are:
   fully shut down. During this overlap, as new nodes are allocated and start running, the number of active instances can
   temporarily exceed the original request.
 
-**SaladCloud’s data center nodes provide 8 GPUs per node. While these nodes are generally more stable and less likely to
-be interrupted, any node going offline removes all 8 GPUs from the resource pool, so you should consider this impact
-when deploying workloads.**
-
 ## Processing Performance
 
 Nodes with the same consumer GPU type can exhibit different performance due to factors such as system configuration
@@ -151,8 +137,6 @@ initial check and real-time performance monitoring to select suitable nodes and
 for application execution. For more details, please refer to
 [this guide](/container-engine/tutorials/performance/high-performance-apps#build-high-performance-applications).
 
-**Based on our tests, SaladCloud’s data center nodes can always deliver stable and consistent performance.**
-
 ## Network Performance
 
 Salad nodes with consumer GPUs often exhibit asymmetric bandwidth, as many operate on residential networks with high
@@ -164,16 +148,13 @@ and strong overall performance.
 
 <img src="/container-engine/images/sp4.png" />
 
-**Most SaladCloud’s data center nodes offer symmetric bandwidth, delivering several gigabytes per second in both
-directions.**
-
 Round-trip time (RTT) is primarily determined by the geographical distance and underlying network latency between two
 endpoints, and it plays a critical role in data transfer throughput. Since Salad nodes are globally distributed, nodes
 with identical network speeds in different regions can exhibit varying throughput to a specific endpoint, such as a
 cloud storage bucket in a particular location. Transfer tools and algorithms also matter—using chunked and parallel data
 transfers can better utilize the available end-to-end bandwidth.
 
-If your applications require higher throughput with lower latency, it is recommended to perform initial checks and apply
-custom filters to select nodes that meet your specific network requirements and adopt advanced tools and algorithms .
-Please check [this guide](/container-engine/tutorials/performance/high-performance-storage-solutions) for more
-information.
+If your applications require higher throughput with lower latency, perform startup checks from inside the container and
+request reallocation only when a node does not meet a real workload requirement. SaladCloud does not provide a container
+group setting or node filter for minimum network bandwidth. Please check
+[this guide](/container-engine/tutorials/performance/network-bandwidth-checks) for more information.
@@ -3,14 +3,18 @@ title: 'Using IMDS to reallocate a replica'
 sidebarTitle: 'Reallocate Replica'
 ---
 
-_Last Updated: June 3, 2025_
+_Last Updated: May 20, 2026_
 
 Using the IMDS, you can reallocate a replica from within the running container. This is often done in combination with
 [performance monitoring](/container-engine/tutorials/performance/performance-monitoring) to allow applications to
 self-monitor and reject under-performing nodes. You can find code samples and the full API reference in the
 [IMDS API documentation](/reference/imds/reallocate). This endpoint requires you to provide a reason for reallocation,
 which we use to continuously improve our service.
 
+Use reallocation carefully. When a replica reallocates away from a node, that node is temporarily excluded from your
+allocation pool. If your application rejects nodes too aggressively, it can cycle through available capacity and fail to
+allocate until exclusions expire or more capacity becomes available.
+
 ## Example Usage
 
 ```bash

@@ -21,10 +21,9 @@ upload completed artifacts to cloud storage.
 
 We will be using [Google Cloud Pub/Sub](https://cloud.google.com/pubsub) as our job queue, and
 [Cloudflare R2](https://www.cloudflare.com/developer-platform/products/r2/), an S3-compatible object storage service, as
-our cloud storage. We prefer R2 to AWS S3 for many SaladCloud workloads, because R2 does not charge for egress data, and
-SaladCloud's distributed nodes are not in datacenters, and therefore may incur egress fees from other providers.
-Instrumenting your code to use S3-compatible storage will make it easier to switch storage providers in the future if
-you choose to do so.
+our cloud storage. We prefer R2 to AWS S3 for many SaladCloud workloads, because R2 does not charge for egress data,
+which helps reduce costs when distributed workers fetch inputs and upload results. Instrumenting your code to use
+S3-compatible storage will make it easier to switch storage providers in the future if you choose to do so.
 
 For this guide, we will build an application that slowly calculates a sum for _n_ steps, sleeping for 30 seconds between
 steps to simulate work. We will set up a job queue and related resources, a storage bucket, a checkpoint saving system,
@@ -115,8 +114,8 @@ repository.
 ## Cloud Storage: R2
 
 R2 is a cloud storage service from Cloudflare that is compatible with the S3 API. It is a great choice for SaladCloud
-workloads because it does not charge egress fees, and SaladCloud's distributed nodes are mostly not in datacenters, and
-therefore may incur egress fees from other providers.
+workloads because it does not charge egress fees, which helps reduce costs when distributed workers fetch inputs and
+upload results.
 
 From the [R2 console](https://dash.cloudflare.com/), navigate to "R2 Object Storage", and click "Create Bucket".
 

@@ -20,10 +20,10 @@ upload completed artifacts to cloud storage.
 
 We will use [🐕 Kelpie](https://github.com/SaladTechnologies/kelpie) as our job queue and
 [Cloudflare R2](https://www.cloudflare.com/developer-platform/products/r2/), an S3-compatible object storage service, as
-our cloud storage. We prefer R2 to AWS S3 for many SaladCloud workloads, because R2 does not charge for egress data, and
-SaladCloud's distributed nodes are not in datacenters, and therefore may incur egress fees from other providers. Kelpie
-handles all interactions with the storage service, so your job will only need to write to the local file system, and
-Kelpie will take care of uploading the files to R2.
+our cloud storage. We prefer R2 to AWS S3 for many SaladCloud workloads, because R2 does not charge for egress data,
+which helps reduce costs when distributed workers fetch inputs and upload results. Kelpie handles all interactions with
+the storage service, so your job will only need to write to the local file system, and Kelpie will take care of
+uploading the files to R2.
 
 For this guide, we will build an application that slowly calculates a sum for _n_ steps, sleeping for 30 seconds between
 steps to simulate work. We will set up a storage bucket and a checkpoint saving system, and enable Kelpie's autoscaling
@@ -50,8 +50,8 @@ You can explore the full API with the [Swagger UI](https://kelpie.saladexamples.
 ## Cloud Storage: R2
 
 R2 is a cloud storage service from Cloudflare that is compatible with the S3 API. It is a great choice for SaladCloud
-workloads because it does not charge egress fees, and SaladCloud's distributed nodes are mostly not in datacenters, and
-therefore may incur egress fees from other providers.
+workloads because it does not charge egress fees, which helps reduce costs when distributed workers fetch inputs and
+upload results.
 
 From the [R2 console](https://dash.cloudflare.com/), navigate to "R2 Object Storage", and click "Create Bucket".
 

@@ -22,9 +22,8 @@ upload completed artifacts to cloud storage.
 We will be using [RabbitMQ](https://www.rabbitmq.com/) hosted on [CloudAMQP](https://www.cloudamqp.com/) as our job
 queue, and [Cloudflare R2](https://www.cloudflare.com/developer-platform/products/r2/), an S3-compatible object storage
 service, as our cloud storage. We prefer R2 to AWS S3 for many SaladCloud workloads, because R2 does not charge for
-egress data, and SaladCloud's distributed nodes are not in datacenters, and therefore may incur egress fees from other
-providers. Instrumenting your code to use S3-compatible storage will make it easier to switch storage providers in the
-future if you choose to do so.
+egress data, which helps reduce costs when distributed workers fetch inputs and upload results. Instrumenting your code
+to use S3-compatible storage will make it easier to switch storage providers in the future if you choose to do so.
 
 For this guide, we will build an application that slowly calculates a sum for _n_ steps, sleeping for 30 seconds between
 steps to simulate work. We will set up a job queue and related resources, a storage bucket, a checkpoint saving system,
@@ -154,8 +153,8 @@ before being sent to the deadletter exchange.
 ## Cloud Storage: R2
 
 R2 is a cloud storage service from Cloudflare that is compatible with the S3 API. It is a great choice for SaladCloud
-workloads because it does not charge egress fees, and SaladCloud's distributed nodes are mostly not in datacenters, and
-therefore may incur egress fees from other providers.
+workloads because it does not charge egress fees, which helps reduce costs when distributed workers fetch inputs and
+upload results.
 
 From the [R2 console](https://dash.cloudflare.com/), navigate to "R2 Object Storage", and click "Create Bucket".
 

@@ -21,10 +21,9 @@ upload completed artifacts to cloud storage.
 
 We will be using [Amazon SQS](https://aws.amazon.com/sqs/) as our job queue, and
 [Cloudflare R2](https://www.cloudflare.com/developer-platform/products/r2/), an S3-compatible object storage service, as
-our cloud storage. We prefer R2 to AWS S3 for many SaladCloud workloads, because R2 does not charge for egress data, and
-SaladCloud's distributed nodes are not in datacenters, and therefore may incur egress fees from other providers.
-Instrumenting your code to use S3-compatible storage will make it easier to switch storage providers in the future if
-you choose to do so.
+our cloud storage. We prefer R2 to AWS S3 for many SaladCloud workloads, because R2 does not charge for egress data,
+which helps reduce costs when distributed workers fetch inputs and upload results. Instrumenting your code to use
+S3-compatible storage will make it easier to switch storage providers in the future if you choose to do so.
 
 For this guide, we will build an application that slowly calculates a sum for _n_ steps, sleeping for 30 seconds between
 steps to simulate work. We will set up a job queue and related resources, a storage bucket, a checkpoint saving system,
@@ -192,8 +191,8 @@ For our application to use these queues, we will need the Queue URL, available o
 ## Cloud Storage: R2
 
 R2 is a cloud storage service from Cloudflare that is compatible with the S3 API. It is a great choice for SaladCloud
-workloads because it does not charge egress fees, and SaladCloud's distributed nodes are mostly not in datacenters, and
-therefore may incur egress fees from other providers.
+workloads because it does not charge egress fees, which helps reduce costs when distributed workers fetch inputs and
+upload results.
 
 From the [R2 console](https://dash.cloudflare.com/), navigate to "R2 Object Storage", and click "Create Bucket".
 

@@ -90,7 +90,7 @@ RunPod offers four primary products. Each has a clear equivalent or migration st
 
 - Single-node GPU pods
 - Manual environment setup (via SSH or scripts)
-- Consumer grade and datacenter GPUs
+- Broad GPU catalog with multiple deployment models
 
 **SaladCloud GPU Containers:**