You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: _includes/about-soda.md
+1-1
Original file line number
Diff line number
Diff line change
@@ -14,7 +14,7 @@ Soda works by taking the data quality checks that you prepare and using them to
14
14
15
15
To test your data quality, you choose a flavor of Soda (choose a deployment model) which enables you to configure connections with your data sources and define data quality checks, then run scans that execute your data quality checks.
16
16
17
-
***Connect to your data source.** <br />Connect Soda to a data source such as Snowflake, Amazon Athena, or Big Query by providing access details for your data source such as host, port, and data source login credentials.
17
+
***Connect to your data source.** <br />Connect Soda to a data source such as Snowflake, Amazon Athena, or BigQuery by providing access details for your data source such as host, port, and data source login credentials.
18
18
***Define checks to surface bad-quality data.** <br />Define data quality checks using Soda Checks Language (SodaCL), a domain-specific language for data quality testing. A Soda Check is a test that Soda performs when it scans a dataset in your data source.
19
19
***Run a scan to execute your data quality checks.** <br />During a scan, Soda does not ingest your data, it only scans it for quality metrics, then uses the metadata to prepare scan results<sup>1</sup>. After a scan, each check results in one of three default states:
20
20
* pass: the values in the dataset match or fall within the thresholds you specified
1. If you have not already done so, create a Soda Cloud account at <ahref="https://cloud.soda.io/signup?utm_source=docs"target="_blank"> cloud.soda.io</a>. If you already have a Soda account, log in.
2
+
2. By default, Soda prepares a Soda-hosted agent for all newly-created accounts. However, if you are an Admin in an existing Soda Cloud account and wish to use a Soda-hosted agent, navigate to **your avatar** > **Organization Settings**. In the **Organization** tab, click the checkbox to **Enable Soda-hosted Agent**.
3
+
3. Navigate to **your avatar** > **Data Sources**, then access the **Agents** tab. Notice your out-of-the-box Soda-hosted agent that is up and running.
The <strong>agreement</strong> feature is being deprecated and is only available upon request. Contact <ahref="mailto:[email protected]">Soda Support</a> to request access.
Introducing a secure, out-of-the-box Soda-hosted Agent to manage access to data sources from within your Soda Cloud account. Quickly configure connections to your data sources in the Soda Cloud user interface, then empower all your colleagues to explore datasets, access check results, customize collections, and create their own no-code checks for data quality.
9
+
10
+
Learn how to [Set up a Soda-hosted agent]({% link soda-agent/managed-agent.md %}).
Copy file name to clipboardexpand all lines: soda-agent/basics.md
+3-3
Original file line number
Diff line number
Diff line change
@@ -10,11 +10,11 @@ redirect_from: /soda-agent/
10
10
<!--Linked to UI, access Shlink-->
11
11
*Last modified on {% last_modified_at %}*
12
12
13
-
The **Soda Agent** is a tool that empowers Soda Cloud users to securely access data sources to scan for data quality. Create a Kubernetes cluster in a cloud services provider environment, then use Helm to deploy a Soda Agent in the cluster.
13
+
The **Soda Agent** is a tool that empowers Soda Cloud users to securely access data sources to scan for data quality. For a self-hosted agent, create a Kubernetes cluster in a cloud services provider environment, then use Helm to deploy a Soda Agent in the cluster.
14
14
15
-
This setup enables Soda Cloud users to securely connect to data sources (Snowflake, Amazon Athena, etc.) from within the Soda Cloud web application. Any user in your Soda Cloud account can add a new data source via the agent, then write their own no-code checks and agreements to check for data quality in the new data source.
15
+
This setup enables Soda Cloud users to securely connect to data sources (Snowflake, Amazon Athena, etc.) from within the Soda Cloud web application. Any user in your Soda Cloud account can add a new data source via the agent, then write their own no-code checks to check for data quality in the new data source.
16
16
17
-
What follows is an extremely abridged introduction to a few basic elements involved in the deployment and setup of a Soda Agent.
17
+
What follows is an extremely abridged introduction to a few basic elements involved in the deployment and setup of a self-hosted Soda Agent.
Copy file name to clipboardexpand all lines: soda-agent/deploy.md
+59-10
Original file line number
Diff line number
Diff line change
@@ -19,9 +19,9 @@ redirect_from:
19
19
<!--Linked to UI, access Shlink-->
20
20
*Last modified on {% last_modified_at %}*
21
21
22
-
The **Soda Agent** is a tool that empowers Soda Cloud users to securely access data sources to scan for data quality. Create a Kubernetes cluster, then use Helm to deploy a Soda Agent in the cluster.
22
+
The **Soda Agent** is a tool that empowers Soda Cloud users to securely access data sources to scan for data quality. Create a Kubernetes cluster, then use Helm to deploy a self-hosted Soda Agent in the cluster.
23
23
24
-
This setup enables Soda Cloud users to securely connect to data sources (BigQuery, Snowflake, etc.) from within the Soda Cloud web application. Any user in your Soda Cloud account can add a new data source via the agent, then write their own no-code checks and agreements to check for data quality in the new data source.
24
+
This setup enables Soda Cloud users to securely connect to data sources (BigQuery, Snowflake, etc.) from within the Soda Cloud web application. Any user in your Soda Cloud account can add a new data source via the agent, then write their own no-code checks and agreements to check for data quality in the new data source. Alternatively, if you use a BigQuery, MySQL, PostgreSQL, or Snowflake data source, you can use a secure, out-of-the-box [Soda-hosted agent]({% link soda-agent/managed-agent.md %}) made available for every Soda Cloud organization.
25
25
26
26
As a step in the **Get started roadmap**, this guide offers instructions to set up, install, and configure Soda in a [self-hosted agent deployment model]({% link soda/setup-guide.md %}#self-hosted-agent).
27
27
@@ -173,7 +173,7 @@ REVISION: 1
173
173
```shell
174
174
minikube kubectl -- describe pods
175
175
```
176
-
4. In your Soda Cloud account, navigate to **your avatar** > **Data Sources** > **Agents** tab. Refresh the page to verify that you see the agent you just created in the list of Agents. <br/><br/>Be aware that this may take several minutes to appear in your list of Soda Agents. Use the `describe pods` command in step 3 to check the status of the deployment. When `State: Running` and `Ready: True`, then you can refresh and see the agent in Soda Cloud.
176
+
4. In your Soda Cloud account, navigate to **your avatar** > **Data Sources** > **Agents** tab. Refresh the page to verify that you see the agent you just created in the list of Agents. <br/><br/>Be aware that this may take several minutes to appear in your list of Soda Agents. Use the `describe pods` command in step 3 to check the status of the deployment. When `State: Running` and `Ready: True`, then you can refresh and see the agent in Soda Cloud.
If you use private key authentication with a Soda Agent, refer to [Soda Agent extras]({% link soda-agent/secrets.md %}#use-a-values-file-to-store-private-key-authentication-values).
250
259
251
260
<br />
@@ -258,7 +267,7 @@ If you use private key authentication with a Soda Agent, refer to [Soda Agent ex
5. After you have started the agent and validated that it is running, log into your Soda Cloud account, then navigate to **your avatar** > **Data Sources** > **Agents** tab. Refresh the page to verify that you see the agent you just created in the list of Agents.
8. In your Soda Cloud account, navigate to **your avatar** > **Data Sources** > **Agents** tab. Refresh the page to verify that you see the agent you just created in the list of Agents.
@@ -1273,7 +1322,7 @@ In your Soda Cloud account, navigate to **your avatar** > **Data Sources**. Clic
1273
1322
| ----------------------- | ---------- |
1274
1323
| Data Source Label | Provide a unique identifier for the data source. Soda Cloud uses the label you provide to define the immutable name of the data source against which it runs the Default Scan.|
1275
1324
| Default Scan Schedule Label | Provide a name for the default scan schedule for this data sources. The scan schedule indicates which Soda Agent to use to execute the scan, and when. |
1276
-
| Default Scan Schedule Agent | Select the name of a Soda Agent that you have previously set up in your secure environment and connected to a specific data source. This identifies the Soda Agent to which Soda Cloud must connect in order to run its scan. |
1325
+
| Default Scan Schedule Agent | Select the name of a Soda Agent that you have previously set up in your secure environment. This identifies the Soda Agent to which Soda Cloud must connect in order to run its scan. |
1277
1326
| Schedule Definition | Provide the scan frequency details Soda Cloud uses to execute scans according to your needs. If you wish, you can define the schedule as a cron expression. |
1278
1327
| Starting At | Select the time of day to run the scan. The default value is midnight. |
1279
1328
| Time Zone | Select a timezone. The default value is UTC. |
@@ -1289,7 +1338,7 @@ To more securely provide sensitive values such as usernames and passwords, use e
1289
1338
1290
1339
Access the data source-specific connection configurations listed below to copy+paste the connection syntax into the editing panel, then adjust the values to correspond with your data source's details. Access connection configuration details in [Data source reference]({% link soda/connect-athena.md %}) section of Soda documentation.
1291
1340
1292
-
See also: [Use a file reference for a Big Query data source connection](#use-a-file-reference-for-a-big-query-data-source-connection)
1341
+
See also: [Use a file reference for a BigQuery data source connection](#use-a-file-reference-for-a-bigquery-data-source-connection)
1293
1342
1294
1343
<br />
1295
1344
@@ -1352,7 +1401,7 @@ automated monitoring:
1352
1401
1353
1402
<br />
1354
1403
1355
-
### Use a file reference for a Big Query data source connection
1404
+
### Use a file reference for a BigQuery data source connection
1356
1405
1357
1406
If you already store information about your data source in a JSON file in a secure location, you can configure your BigQuery data source connection details in Soda Cloud to refer to the JSON file for service account information. To do so, you must add two elements:
1358
1407
* `volumes` and `volumeMounts` parameters in the `values.yml` file that your Soda Agent helm chart uses
0 commit comments