A data analysis team is working with the table_bronze SQL table as a source for one of its most complex projects. A stakeholder of the project notices that some of the downstream data is duplicative. The analysis team identifies table_bronze as the source of the duplication. Which of the following queries can be used to deduplicate the data from table_bronze and write it to a new table table_silver?
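A minimal sketch of the deduplicate-and-write pattern this question is about. It uses Python's built-in sqlite3 module as a stand-in for Databricks SQL, and the columns id and value are hypothetical because the schema of table_bronze is not given. The CREATE TABLE ... AS SELECT DISTINCT statement is the part that should carry over.

```python
import sqlite3

# In-memory SQLite database, used only as a stand-in for Databricks SQL.
conn = sqlite3.connect(":memory:")

# Hypothetical columns; the real schema of table_bronze is not given.
conn.execute("CREATE TABLE table_bronze (id INTEGER, value TEXT)")
conn.executemany(
    "INSERT INTO table_bronze VALUES (?, ?)",
    [(1, "a"), (1, "a"), (2, "b"), (3, "c"), (3, "c")],
)

# SELECT DISTINCT * keeps one copy of each fully identical row;
# CREATE TABLE ... AS writes the query result to a new table.
conn.execute("CREATE TABLE table_silver AS SELECT DISTINCT * FROM table_bronze")

silver_rows = conn.execute("SELECT * FROM table_silver ORDER BY id").fetchall()
print(silver_rows)
```

Note that DISTINCT only removes rows that match in every column. If duplicates differ in some column, such as a load timestamp, a different approach would be needed.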
A business analyst has been asked to create a data entity/object called sales_by_employee. It should always stay up-to-date when new data are added to the sales table. The new entity should have the columns sales_person, which will be the name of the employee from the employees table, and sales, which will be all sales for that particular sales person. Both the sales table and the employees table have an employee_id column that is used to identify the sales person. Which of the following code blocks will accomplish this task?
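The "always stay up-to-date" requirement points toward a view, since a view is re-evaluated each time it is queried. The sketch below illustrates that idea with Python's sqlite3 module as a stand-in for Databricks SQL. The column names employee_name and sales, and the sample rows, are assumptions because the question does not show the table schemas.

```python
import sqlite3

# In-memory SQLite database, used only as a stand-in for Databricks SQL.
conn = sqlite3.connect(":memory:")

# Hypothetical schemas: only employee_id is named in the question.
conn.execute("CREATE TABLE employees (employee_id INTEGER, employee_name TEXT)")
conn.execute("CREATE TABLE sales (employee_id INTEGER, sales REAL)")
conn.executemany("INSERT INTO employees VALUES (?, ?)", [(1, "Ana"), (2, "Ben")])
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)", [(1, 100.0), (1, 50.0), (2, 70.0)]
)

# A view stores the query, not the data, so it reflects new rows in sales.
conn.execute(
    """
    CREATE VIEW sales_by_employee AS
    SELECT e.employee_name AS sales_person, s.sales
    FROM sales AS s
    JOIN employees AS e ON e.employee_id = s.employee_id
    """
)

query = "SELECT * FROM sales_by_employee ORDER BY sales_person, sales"
before = conn.execute(query).fetchall()

# Add a new sale and query the view again without recreating it.
conn.execute("INSERT INTO sales VALUES (2, 30.0)")
after = conn.execute(query).fetchall()
print(before)
print(after)
```

If one total per sales person is wanted instead of one row per sale, the SELECT could be changed to use SUM(s.sales) with GROUP BY e.employee_name.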
A data analyst has been asked to use the below table sales_table to get the percentage rank of products within each region by sales: The result of the query should look like this: Which of the following queries will accomplish this task?
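A percentage rank within a group is typically computed with the PERCENT_RANK window function partitioned by the grouping column. The sketch below runs such a query through Python's sqlite3 module as a stand-in for Databricks SQL, which assumes the bundled SQLite is version 3.25 or later. The columns region, product and sales, the sample rows, and the descending sort order are all assumptions, since the original table and expected result are not shown.

```python
import sqlite3

# In-memory SQLite database, used only as a stand-in for Databricks SQL.
conn = sqlite3.connect(":memory:")

# Hypothetical schema and data; the original sales_table is not shown.
conn.execute("CREATE TABLE sales_table (region TEXT, product TEXT, sales REAL)")
conn.executemany(
    "INSERT INTO sales_table VALUES (?, ?, ?)",
    [
        ("East", "A", 300.0),
        ("East", "B", 200.0),
        ("East", "C", 100.0),
        ("West", "D", 50.0),
        ("West", "E", 40.0),
    ],
)

# PERCENT_RANK is (rank - 1) / (rows in partition - 1), computed per region.
ranked = conn.execute(
    """
    SELECT region,
           product,
           PERCENT_RANK() OVER (PARTITION BY region ORDER BY sales DESC) AS pct_rank
    FROM sales_table
    ORDER BY region, pct_rank
    """
).fetchall()

for row in ranked:
    print(row)
```

With ORDER BY sales DESC the best-selling product in each region gets 0.0; sorting ascending would reverse that.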
In which of the following situations should a data analyst use higher-order functions?
Consider the following two statements: Statement 1: Statement 2: Which of the following describes how the result sets will differ for each statement when they are run in Databricks SQL?
A data analyst has created a user-defined function using the following line of code: CREATE FUNCTION price(spend DOUBLE, units DOUBLE) RETURNS DOUBLE RETURN spend / units; Which of the following code blocks can be used to apply this function to the customer_spend and customer_units columns of the table customer_summary to create column customer_price?
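A scalar user-defined function is normally applied by calling it in the SELECT list and aliasing the result. The sketch below shows that call shape using Python's sqlite3 module as a stand-in for Databricks SQL. SQLite has no SQL-bodied CREATE FUNCTION, so a Python function registered under the name price plays the role of the UDF here, and the sample rows are made up.

```python
import sqlite3

# In-memory SQLite database, used only as a stand-in for Databricks SQL.
conn = sqlite3.connect(":memory:")

# Stand-in for: CREATE FUNCTION price(spend DOUBLE, units DOUBLE) ...
# Registers a two-argument scalar function callable from SQL as price().
conn.create_function("price", 2, lambda spend, units: spend / units)

# Made-up sample data for the customer_summary table.
conn.execute(
    "CREATE TABLE customer_summary (customer_spend REAL, customer_units REAL)"
)
conn.executemany(
    "INSERT INTO customer_summary VALUES (?, ?)", [(10.0, 4.0), (9.0, 3.0)]
)

# The function is called like a built-in and the result is aliased.
priced = conn.execute(
    """
    SELECT price(customer_spend, customer_units) AS customer_price
    FROM customer_summary
    ORDER BY customer_price
    """
).fetchall()
print(priced)
```

The SELECT statement itself is the part intended to carry over; how a zero in customer_units is handled differs between engines and is not covered by this sketch.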