This guide describes how to configure a secure connection from Abacus.AI to data hosted on the Azure Databricks platform using the Databricks JDBC driver and Azure Entra ID.
Before you begin, gather the following:
- Databricks personal access token
- In your Databricks workspace, go to User Settings and generate a personal access token.
- JDBC connection details for your Databricks cluster
- Navigate to Compute, select your cluster, open Advanced options, and then the JDBC tab to view connection info.
- Note: Do not include
https://
in the Database Server URL/IP. Use only the server hostname or address.
- Azure Entra ID application (Client ID and Client Secret)
- Register an app in Azure Entra ID and obtain its Client ID and Client Secret.
- Microsoft quickstart: https://learn.microsoft.com/en-us/entra/identity-platform/quickstart-register-app
- Open the Abacus.AI Connected Services Dashboard: https://abacus.ai/app/profile/connected_services
- Click Add New Connector, choose ODBC/JDBC, and select Databricks JDBC driver from the Driver dropdown.
- Turn on the Import RBAC toggle.
- In Azure Entra ID, create/register an application if you have not already done so.
- Enter the app’s Client ID and Client Secret in the corresponding fields.
- Fill in all Databricks configuration details gathered above (server hostname/address, JDBC info, token, etc.) and click Save.
- Click Verify and wait for the connector to be validated. If an error appears, follow the message to troubleshoot and correct your configuration.
- From the Abacus.AI logo, go to the Projects page.
- Create a new project.
- Select ChatLLM – Custom LLM Chat.
- Enter a name for your project.
- Choose Skip to project dashboard.
- In the left toolbar, open Model and click Train Model (top right).
- For Structured data source, select External service from the dropdown.
- Choose the connector you created in the steps above and add the tables to use for testing.
- Click Train Model.
- Once training completes, open Models and select your model.
- Click Create a new deployment.
- Select Offline Batch + Realtime, then click Next.
- Enter an end‑user friendly deployment name and click Deploy.
- Return to your model (Model > your model name). When the deployment is Active, click the deployment name.
- Under Deployment, open the Prediction Dash.
- Click Go to Abacus.AI chat to test your bot.
- Enter your prompt/question. Each user will be asked to sign in once via Azure Entra ID. Log in using your Entra ID user credentials.
If you need to update this document, edit /home/ubuntu/Uploads/databricks_user_connector.md
and re‑publish as needed.