Provides the ability to submit PySpark batch jobs to Azure Synapse Spark pools.
Submit PySpark batch jobs to Synapse clusters by using the context menu.
Requires Azure CLI. Install Azure CLI and login with az login
This extension contributes the following settings:
synapse-spark.subscriptionId
: Set the Azure Subscription that contains your resources.synapse-spark.workspace
: Set the Synapse Workspace for job submission.synapse-spark.resourceGroupName
: Set the Azure Resource GRoup that contains your Synapse Workspace.synapse-spark.adlsTempAccount
: Set the Azure Storage Account to use to upload the file to submit.synapse-spark.adlsTempContainer
: Set the Azure Storage Account Container to use for file uploads.synapse-spark.adlsTempPath
: Set the path in the Azure Storage Account Container to use for file uploads.synapse-spark.cluster
: Set the Spark pool to use for job submissions.synapse-spark.batchJobOptions
: Set the Spark Batch Job Options to use when submitting jobs.
synapse-spark.batchJobOptions
must be configured in your workspace settings manually. Example:
"synapse-spark": {
"batchJobOptions": {
"tags": null, //{}
"artifactId": null, //str
"className": "", //str
"arguments": null, //arr
"jars": null, //arr
"pythonFiles": [],
"files": null, //arr
"archives": null, //arr
"configuration": null, //{}
"driverMemory": "4g", //str
"driverCores": 2, //int
"executorMemory": "4g", //str
"executorCores": 2, //int
"executorCount": 2 //int
}
},
The other configuration settings can be configured individually using the command palette or all at once by selecting Configure All
from the command palette.
N/A
Initial release of Synapse Spark Tools
- Submit PySpark Batch jobs via the context menu