DRAG DROP -You have data stored in thousands of CSV files in Azure Data Lake Storage Gen2. Each file has a header row followed by a properly formatted carriage return (/ r) and line feed (/n).You are implementing a pattern that batch loads the files daily into a dedicated SQL pool in Azure Synapse Analytics by using PolyBase.You need to skip the header row when you import the files into the data warehouse. Before building the loading pattern, you need to prepare the required database objects in Azure Synapse Analytics.Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.NOTE: Each correct selection is worth one pointSelect and Place:
HOTSPOT -You are building an Azure Synapse Analytics dedicated SQL pool that will contain a fact table for transactions from the first half of the year 2020.You need to ensure that the table meets the following requirements:β Minimizes the processing time to delete data that is older than 10 yearsβ Minimizes the I/O for queries that use year-to-date valuesHow should you complete the Transact-SQL statement? To answer, select the appropriate options in the answer area.NOTE: Each correct selection is worth one point.Hot Area:
You are performing exploratory analysis of the bus fare data in an Azure Data Lake Storage Gen2 account by using an Azure Synapse Analytics serverless SQL pool.You execute the Transact-SQL query shown in the following exhibit.What do the query results include?
DRAG DROP -You use PySpark in Azure Databricks to parse the following JSON input.You need to output the data in the following tabular format.How should you complete the PySpark code? To answer, drag the appropriate values to the correct targets. Each value may be used once, more than once, or not at all. You may need to drag the spit bar between panes or scroll to view content.NOTE: Each correct selection is worth one point.Select and Place:
HOTSPOT -You are designing an application that will store petabytes of medical imaging data.When the data is first created, the data will be accessed frequently during the first week. After one month, the data must be accessible within 30 seconds, but files will be accessed infrequently. After one year, the data will be accessed infrequently but must be accessible within five minutes.You need to select a storage strategy for the data. The solution must minimize costs.Which storage tier should you use for each time frame? To answer, select the appropriate options in the answer area.NOTE: Each correct selection is worth one point.Hot Area:
You have a table in an Azure Synapse Analytics dedicated SQL pool. The table was created by using the following Transact-SQL statement.You need to alter the table to meet the following requirements:β Ensure that users can identify the current manager of employees.β Support creating an employee reporting hierarchy for your entire company.β Provide fast lookup of the managers' attributes such as name and job title.Which column should you add to the table?