Which command can be used to write data into a Delta table while avoiding the writing of duplicate records?
A data analyst has created a Delta table sales that is used by the entire data analysis team. They want help from the data engineering team to implement a series of tests to ensure the data is clean. However, the data engineering team uses Python for its tests rather than SQL.Which command could the data engineering team use to access sales in PySpark?
A data engineer has created a new database using the following command:CREATE DATABASE IF NOT EXISTS customer360;In which location will the customer360 database be located?
A data engineer is attempting to drop a Spark SQL table my_table and runs the following command:DROP TABLE IF EXISTS my_table;After running this command, the engineer notices that the data files and metadata files have been deleted from the file system.What is the reason behind the deletion of all these files?
What is a benefit of creating an external table from Parquet rather than CSV when using a CREATE TABLE AS SELECT statement?
Which SQL keyword can be used to convert a table from a long format to a wide format?