A production cluster has 3 executor nodes and uses the same virtual machine type for the driver and executor.When evaluating the Ganglia Metrics for this cluster, which indicator would signal a bottleneck caused by code executing on the driver?
Where in the Spark UI can one diagnose a performance problem induced by not leveraging predicate push-down?
Review the following error traceback:Which statement describes the error being raised?
Which distribution does Databricks support for installing custom Python code packages?
Which Python variable contains a list of directories to be searched when trying to locate required modules?
The Databricks workspace administrator has configured interactive clusters for each of the data engineering groups. To control costs, clusters are set to terminate after 30 minutes of inactivity. Each user should be able to execute workloads against their assigned clusters at any time of the day.Assuming users have been added to a workspace but not granted any permissions, which of the following describes the minimal permissions a user would need to start and attach to an already configured cluster.