askill
databricks-2025


This skill should be used when the user asks about "Databricks Job activity", "DatabricksJob in ADF", "ServiceNow V2 connector", "ADF managed identity", or needs guidance on Databricks serverless linked services, Snowflake connectors, or MFA enforcement.

20 stars
1.2k downloads
Updated 3/17/2026


Azure Data Factory Databricks Integration 2025

Databricks Job Activity (Recommended 2025)

CRITICAL UPDATE (2025): The Databricks Job activity is now the ONLY recommended method for orchestrating Databricks in ADF. Microsoft strongly recommends migrating from legacy Notebook, Python, and JAR activities.

Quick Reference

  • Activity type: DatabricksJob (NOT DatabricksSparkJob or DatabricksNotebook)
  • Parameter property: jobParameters (NOT parameters)
  • Linked service auth: Managed Identity ("authentication": "MSI") recommended
  • Cluster config: Do NOT specify cluster properties in linked service; the Databricks Job controls compute
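As a minimal sketch of the quick-reference points above, the Python snippet below assembles a linked service and a Databricks Job activity as plain dictionaries and prints the activity as JSON. The names `AzureDatabricksMSI` and `RunDailyJob`, the workspace URL and resource ID, the job ID, the `run_date` parameter, and the exact casing of the `jobId` key are illustrative assumptions, not values from this skill; check them against the current ADF schema before use.

```python
import json

# Hypothetical linked service: managed identity auth, no cluster properties.
linked_service = {
    "name": "AzureDatabricksMSI",  # assumed name
    "properties": {
        "type": "AzureDatabricks",
        "typeProperties": {
            "domain": "https://adb-0000000000000000.0.azuredatabricks.net",  # placeholder
            "authentication": "MSI",
            "workspaceResourceId": (
                "/subscriptions/SUB_ID/resourceGroups/RG"
                "/providers/Microsoft.Databricks/workspaces/WORKSPACE"
            ),  # placeholder
        },
    },
}

# Hypothetical activity: the job itself is defined in the Databricks workspace.
activity = {
    "name": "RunDailyJob",  # assumed name
    "type": "DatabricksJob",
    "linkedServiceName": {
        "referenceName": linked_service["name"],
        "type": "LinkedServiceReference",
    },
    "typeProperties": {
        "jobId": "123456789",  # assumed key casing and placeholder ID
        "jobParameters": {"run_date": "2025-01-01"},  # jobParameters, not parameters
    },
}

# Cluster-related linked service keys that the quick reference says to leave out.
CLUSTER_KEYS = {
    "existingClusterId",
    "instancePoolId",
    "newClusterVersion",
    "newClusterNodeType",
    "newClusterNumOfWorker",
}


def cluster_keys_in(service):
    """Return any cluster properties present in a linked service definition."""
    return sorted(CLUSTER_KEYS & set(service["properties"]["typeProperties"]))


print(json.dumps(activity, indent=2))
print("cluster keys found:", cluster_keys_in(linked_service))
```

A check like `cluster_keys_in` can run in a review step to flag linked services that still carry cluster settings after migration.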

Why Databricks Job Activity?

| Feature | Notebook Activity (Legacy) | Job Activity (2025) |
| Compute | Must configure cluster in linked service | Serverless by default |
| Workflow tasks | Single notebook | Multi-task DAGs (notebook, Python, SQL, DLT) |
| Retry | ADF-level only | Job-level + task-level |
| Repair runs | Not supported | Rerun failed tasks only |
| Git integration | Limited | Full Databricks Git support + DABs |
| Lineage | None | Built-in data lineage |
| If/Else logic | Must use ADF control flow | Native If/Else task types |

Benefits Summary

  1. Serverless Execution -- No cluster specification needed; automatic serverless compute with faster startup and lower costs
  2. Advanced Workflow Features -- Run As, Task Values, Conditional Execution, AI/BI Tasks, Repair Runs, Notifications, Queuing
  3. Centralized Job Management -- Jobs defined once in Databricks workspace; single source of truth with Git-backed versioning
  4. Cost Optimization -- Serverless compute (pay only for execution), job clusters (auto-terminating), spot instance support

For complete JSON examples of Job activity, linked service, and pipeline configurations, see references/databricks-job-examples.md.

Connectors and Enhancements (2025+)

ServiceNow V2 Connector (RECOMMENDED - V1 End of Support)

ServiceNow V1 connector is at End of Support. Migrate to V2 immediately.

| Feature | V1 | V2 |
| Linked service type | ServiceNow | ServiceNowV2 |
| Source type | ServiceNowSource | ServiceNowV2Source |
| Query builder | Custom | Aligns with ServiceNow condition builder |
| Performance | Standard | Enhanced extraction |
| OData support | No | Yes |

Migration steps: Update linked service type to ServiceNowV2, update source type to ServiceNowV2Source, test queries in ServiceNow UI condition builder, adjust timeouts.
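The first two migration steps are mechanical renames, so they can be sketched as a small Python helper that walks an exported definition and swaps the V1 type names for their V2 equivalents. The sample definitions, the name `ServiceNowLS`, and the placeholder endpoint and query are hypothetical. The helper only renames types; queries still have to be rewritten and tested by hand, as the remaining steps describe.

```python
# V1 -> V2 type names, taken from the comparison table above.
TYPE_MAP = {
    "ServiceNow": "ServiceNowV2",
    "ServiceNowSource": "ServiceNowV2Source",
}


def migrate_types(node):
    """Return a new structure with every V1 "type" value replaced by its V2 name."""
    if isinstance(node, dict):
        migrated = {}
        for key, value in node.items():
            if key == "type" and isinstance(value, str):
                migrated[key] = TYPE_MAP.get(value, value)
            else:
                migrated[key] = migrate_types(value)
        return migrated
    if isinstance(node, list):
        return [migrate_types(item) for item in node]
    return node


# Hypothetical V1 definitions as they might appear in exported JSON.
v1_linked_service = {
    "name": "ServiceNowLS",
    "properties": {
        "type": "ServiceNow",
        "typeProperties": {"endpoint": "https://INSTANCE.service-now.com"},
    },
}
v1_copy_source = {"source": {"type": "ServiceNowSource", "query": "EXISTING_V1_QUERY"}}

v2_linked_service = migrate_types(v1_linked_service)
v2_copy_source = migrate_types(v1_copy_source)

print(v2_linked_service["properties"]["type"])
print(v2_copy_source["source"]["type"])
```

The helper returns new objects rather than editing in place, which keeps the V1 definitions available for comparison while the V2 versions are tested.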

Enhanced PostgreSQL Connector

The 2025 connector improves performance and adds SSL settings such as enableSsl: true and sslMode: "Require".

Enhanced Snowflake Connector

Improved performance with KeyPair authentication support and Key Vault secret integration.
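One way to combine KeyPair authentication with a Key Vault secret is sketched below in Python. The property names (`accountIdentifier`, `authenticationType`, `privateKey`, `privateKeyPassphrase`) reflect a reading of the Snowflake V2 linked service schema and should be treated as assumptions; every name and placeholder value (`SnowflakeKeyPairLS`, `KeyVaultLS`, the secret names) is hypothetical.

```python
import json


def key_vault_secret(store_name, secret_name):
    """Build a Key Vault secret reference so no key material sits in the definition."""
    return {
        "type": "AzureKeyVaultSecret",
        "store": {"referenceName": store_name, "type": "LinkedServiceReference"},
        "secretName": secret_name,
    }


# Hypothetical Snowflake linked service using KeyPair authentication.
snowflake_linked_service = {
    "name": "SnowflakeKeyPairLS",
    "properties": {
        "type": "SnowflakeV2",
        "typeProperties": {
            "accountIdentifier": "ACCOUNT_IDENTIFIER",  # placeholder
            "database": "DATABASE",                     # placeholder
            "warehouse": "WAREHOUSE",                   # placeholder
            "authenticationType": "KeyPair",
            "user": "SERVICE_USER",                     # placeholder
            "privateKey": key_vault_secret("KeyVaultLS", "snowflake-private-key"),
            "privateKeyPassphrase": key_vault_secret("KeyVaultLS", "snowflake-passphrase"),
        },
    },
}

print(json.dumps(snowflake_linked_service, indent=2))
```

Keeping both the private key and its passphrase behind Key Vault references means rotating the key pair does not require editing the linked service.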

Managed Identity for Azure Storage

New managed identity support for Azure Table Storage and Azure Files connectors (system-assigned and user-assigned).

Mapping Data Flows - Spark 3.3

Spark 3.3 now powers Mapping Data Flows with 30% faster processing, Adaptive Query Execution (AQE), dynamic partition pruning, improved caching, and better column statistics.

Azure DevOps Server 2022 Support

Git integration now supports on-premises Azure DevOps Server 2022 via the hostName property.

For complete JSON examples of all connectors, see references/connector-examples.md.

Managed Identity 2025 Best Practices

User-Assigned vs System-Assigned

| Scenario | Recommendation |
| Single ADF, simple setup | System-assigned |
| Multiple data factories | User-assigned (shared identity) |
| Complex multi-environment | User-assigned |
| Granular permission control | User-assigned |
| Identity lifecycle independence | User-assigned |

Use ADF's centralized Credentials feature to consolidate Microsoft Entra ID-based credentials across multiple linked services.
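To illustrate the consolidation idea, the Python sketch below defines one credential for a user-assigned managed identity and references it from two linked services. The credential and linked service names, the resource ID, and the endpoints are hypothetical placeholders, and the `credential` reference shape is an assumption to verify against the schema of each connector you use.

```python
CREDENTIAL_NAME = "shared-uami-credential"  # hypothetical name

# One credential definition for a user-assigned managed identity.
credential = {
    "name": CREDENTIAL_NAME,
    "properties": {
        "type": "ManagedIdentity",
        "typeProperties": {
            "resourceId": (
                "/subscriptions/SUB_ID/resourceGroups/RG/providers"
                "/Microsoft.ManagedIdentity/userAssignedIdentities/IDENTITY"
            ),  # placeholder
        },
    },
}


def linked_service(name, service_type, type_properties):
    """Build a linked service that authenticates through the shared credential."""
    props = dict(type_properties)
    props["credential"] = {
        "referenceName": CREDENTIAL_NAME,
        "type": "CredentialReference",
    }
    return {"name": name, "properties": {"type": service_type, "typeProperties": props}}


services = [
    linked_service(
        "BlobLS",
        "AzureBlobStorage",
        {"serviceEndpoint": "https://ACCOUNT.blob.core.windows.net/"},
    ),
    linked_service(
        "KeyVaultLS",
        "AzureKeyVault",
        {"baseUrl": "https://VAULT.vault.azure.net/"},
    ),
]

# Every linked service should point at the same credential.
referenced = {
    s["properties"]["typeProperties"]["credential"]["referenceName"] for s in services
}
print(referenced)
```

Because every linked service points at the same credential, swapping the underlying identity is a single change to `credential` rather than an edit per linked service.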

MFA Enforcement (Enforced Since October 2025)

Azure MFA is mandatory for all interactive user logins. Impact on ADF:

  • Managed identities are UNAFFECTED -- they sign in non-interactively, so no MFA prompt applies
  • Service principals with certificate auth are the recommended alternative to client secrets
  • Users who sign in interactively, for example to author or monitor pipelines, must complete MFA

Principle of Least Privilege

| Resource | Source Role | Sink Role |
| Storage Blob | Storage Blob Data Reader | Storage Blob Data Contributor |
| SQL Database | db_datareader | db_datareader + db_datawriter |
| Key Vault | Get secrets only | Get secrets only |
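The role table above can be turned into a simple check. The Python sketch below encodes the table as a lookup and reports any roles granted beyond what a source or sink needs. The helper names `required_roles` and `excess_roles` are hypothetical, and the role strings are copied from the table rather than resolved against Azure.

```python
# Least-privilege roles per resource and direction, copied from the table above.
ROLES = {
    "Storage Blob": {
        "source": ["Storage Blob Data Reader"],
        "sink": ["Storage Blob Data Contributor"],
    },
    "SQL Database": {
        "source": ["db_datareader"],
        "sink": ["db_datareader", "db_datawriter"],
    },
    "Key Vault": {
        "source": ["Get secrets only"],
        "sink": ["Get secrets only"],
    },
}


def required_roles(resource, direction):
    """Return the roles the table recommends for a resource used as source or sink."""
    return ROLES[resource][direction]


def excess_roles(resource, direction, granted):
    """Return granted roles that go beyond the recommended set."""
    return sorted(set(granted) - set(required_roles(resource, direction)))


granted = ["Storage Blob Data Reader", "Storage Blob Data Contributor"]
print(required_roles("SQL Database", "sink"))
print(excess_roles("Storage Blob", "source", granted))
```

An empty result from `excess_roles` means the identity holds nothing beyond the recommended set for that resource and direction.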

For complete managed identity JSON examples, see references/connector-examples.md.

Best Practices (2025)

  1. Use Databricks Job Activity (MANDATORY) -- Stop using Notebook, Python, JAR activities. Define workflows in Databricks workspace with serverless compute.

  2. Managed Identity Authentication (MANDATORY) -- Use managed identities for ALL Azure resources. Leverage Credentials feature for consolidation. MFA-compliant since October 2025.

  3. Monitor Job Execution -- Track Databricks Job run IDs from ADF output, log parameters for auditability, set up alerts for failures, leverage built-in lineage.

  4. Optimize Spark 3.3 (Data Flows) -- Enable AQE, use 4-8 partitions per core, broadcast joins for small dimensions, dynamic partition pruning.
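The "4-8 partitions per core" guideline in item 4 is simple arithmetic, sketched below in Python. The function name `partition_range` and the 16-core example are hypothetical; the multipliers come straight from the guideline.

```python
def partition_range(total_cores, low=4, high=8):
    """Return the (minimum, maximum) partition count for a given core count."""
    if total_cores <= 0:
        raise ValueError("total_cores must be positive")
    return total_cores * low, total_cores * high


# Example: a data flow running on 16 cores.
minimum, maximum = partition_range(16)
print(f"Target between {minimum} and {maximum} partitions")
```

For 16 cores this gives a target of 64 to 128 partitions.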

Resources

Progressive Disclosure References

  • Databricks Job Examples: references/databricks-job-examples.md - Complete JSON for Job activity, linked services, pipeline, and Databricks workspace job definition
  • Connector Examples: references/connector-examples.md - Complete JSON for ServiceNow V2, PostgreSQL, Snowflake, Azure Storage MI, Mapping Data Flows, and Azure DevOps Server

Install

Requires askill CLI v1.0+

AI Quality Score

78/100 (analyzed 3/22/2026)

Comprehensive technical reference for Azure Data Factory Databricks integration in 2025. Well-structured with comparison tables, clear recommendations, and best practices. Strong on technical accuracy and completeness for reference use, but actionability is moderate due to missing step-by-step instructions and external file dependencies. Clear use case triggers defined. Low internal_only signal as it appears marketplace-ready.


Metadata

License: unknown
Version: -
Updated: 3/17/2026
Publisher: JosiahSiegel

Tags

ci-cd, database, github-actions, security, testing