Skip to content

Databricks hackathon/databricks common errors#266

Open
matthewberobinson wants to merge 4 commits into
mainfrom
databricks-hackathon/databricks-common-errors
Open

Databricks hackathon/databricks common errors#266
matthewberobinson wants to merge 4 commits into
mainfrom
databricks-hackathon/databricks-common-errors

Conversation

@matthewberobinson

Copy link
Copy Markdown
Contributor

Overview of changes

Have added a Databricks common errors page with folders in the image folder.

Why are these changes being made?

To keep track of issues people have with Databricks for easy self-diagnosis.

Detailed description of changes

Have added a key issue drop down plus 5 dropdowns partitioning the issues into their most common categories:

  • Installing
  • Navigating
  • Connecting
  • Using
  • SQL migration

I've then sourced any questions from the team's channel and our own knowledge of issues and added them to the page with their respective solutions.

Issue ticket number/s and link

Checklist before requesting a review

  • I have checked the contributing guidelines
  • [] I have checked for and linked any relevant issues that this may resolve
  • I have checked that these changes build locally
  • I understand that if merged into main, these changes will be publicly available

@Lsnaathorst1 Lsnaathorst1 left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi,

Thanks for this - good to have so many errors on there already! I have just done a general review of the page itself so far, but will need to individually look at the errors against our OneNote etc as in the PR description when time :)

Comment thread ADA/databricks_common_questions.qmd Outdated
title: "Databricks Common Questions"
format: html
---

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it would be nice to have an intro paragraph to explain the page here, just stating that this page brings together, known issues and fixes for working with Databricks. Then maybe something on if you encounter an issue not covered, considering emailing our team so the guidance can continue to improve

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe also something to explain the organisation as you have in your PR:

_Have added a key issue drop down plus 5 dropdowns partitioning the issues into their most common categories:

Installing
Navigating
Connecting
Using
SQL migration_

Comment thread ADA/databricks_common_questions.qmd Outdated
------------------------------------------------------------------------

::: {.callout-important collapse = "true"}
Current Issues:

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a nice idea! Can we have something like
'No known current issues' in this box when that is the case?

Comment thread ADA/databricks_common_questions.qmd Outdated
@@ -0,0 +1,252 @@
---
title: "Databricks Common Questions"

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think for consistency the title (and title in the sidebar) should be in sentence case

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we want it to be common questions, or common errors? Or maybe common issues?

Comment thread ADA/databricks_common_questions.qmd Outdated

When you download Simba Spark it doesn't necessarily download the data driver, to do this raise an I.T. ticket explaining that you have installed "Simba Spark" but the driver is missing, they will download it from the internet.

#### Spark

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are we missing level 3 headers? I can see you are using level 2 for each of the drop down box headings but then think only level 4 and 5 headers are used within the boxes?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have updated the H3 and H4 headers as part of my PR, so once merged in, they should look more different to H3. I think for accessibility reasons we shouldn't skip header levels so be good to get no levels missing and then once merged in it will look different, or you could also add the same code to the styles.css page to see how it looks

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Level 1 and 3 were skipped on purpose, curious as to how it impacts accessibility, have changed now so that no levels are skipped.

@Lsnaathorst1 Lsnaathorst1 left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hey Matt,

Thank you for this! Having a look at some of the common issues, I had a piece of general feedback for across all of the common issues on the first one, I think we could lay them out to have the first paragraph being the explanation of what the resolution is for each and then just tighten up some of the explanations and resolutions. I will add an example for a question too where it may be feels less obvious. Will leave this back with you and let me know when ready for a review! :)

Comment thread ADA/databricks_common_questions.qmd Outdated

#### No Simba Spark ODBC Driver

When you download Simba Spark it doesn't necessarily download the data driver, to do this raise an I.T. ticket explaining that you have installed "Simba Spark" but the driver is missing, they will download it from the internet.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it might be helpful to lay these out with the resolution clearly labelled. Maybe the first paragraph for each should be an explanation of the issue, plus when people might come across it, and then the resolution header with step by step instructions?

Also, does this one come from an issue someone raised on Databricks support on teams? If so, maybe we can add a bit more context as to what people may see. I can't see anywhere we are telling people to create a new data source as in that teams issue, but if we are then maybe we can link to that here.

No Simba Spark ODBC Driver

After installing Simba Spark, the ODBC driver by not be included by default. You may see installation errors and will not be able to add the 'Simba Spark ODBC Driver' when creating a new data source.

Resolution

Raise an I.T. ticket explaining that you have installed "Simba Spark" but the driver is missing so it can be installed.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added in issue and resolution for each.

I think some of these were an amalgamation of several teams issues. I'm not sure I follow what you are talking about with a new data source?

Comment thread ADA/databricks_common_questions.qmd Outdated
#### Temporary tables
Temporary tables do not work for Databricks runtimes before 18.1, Databricks [temporary table guidance](https://docs.databricks.com/gcp/en/tables/temporary-tables)

#### Tables vs Volumes

@Lsnaathorst1 Lsnaathorst1 Jun 15, 2026

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Example suggestion for how we could give a little more info, add in where people might come across this and lay out with the resolution header:

Tables vs Volumes

It is common to confuse volumes and tables when working in Databricks. If you try to refer to a volume in the same way you would a table, you may see SQL queries fail, or you may find you are unable to query files in a volume in the way you expect.

Resolution

Tables store structured data and can be queried directly through SQL. Volumes, however, are designed for file-based (unstuctured or semi-structured) storage and are not directly accessible via standard SQL queries.

To access data in a volume, you must use the appropriate file path rather than SQL table references. You can find guidance on the differences on the Databricks’ guidance: volumes and Databricks’ guidance: volume paths pages.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm struggling to see what more info to give

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants