You are designing a solution for the ad hoc analysis of data in Azure Databricks notebooks. The data will be stored in Azure Blob storage.You need to ensure that Blob storage will support the recovery of the data if the data is overwritten accidentally.What should you recommend?
You are planning a solution that combines log data from multiple systems. The log data will be downloaded from an API and stored in a data store.You plan to keep a copy of the raw data as well as some transformed versions of the data. You expect that there will be at least 2 TB of log files. The data will be used by data scientists and applications.You need to recommend a solution to store the data in Azure. The solution must minimize costs.What storage solution should you recommend?
You are designing a serving layer for data. The design must meet the following requirements:✑ Authenticate users by using Azure Active Directory (Azure AD).✑ Serve as a hot path for data.✑ Support query scale out.✑ Support SQL queries.What should you include in the design?
You are designing a storage solution for streaming data that is processed by Azure Databricks. The solution must meet the following requirements:✑ The data schema must be fluid.✑ The source data must have a high throughput.✑ The data must be available in multiple Azure regions as quickly as possible.What should you include in the solution to meet the requirements?
You are designing a log storage solution that will use Azure Blob storage containers.CSV log files will be generated by a multi-tenant application. The log files will be generated for each customer at five-minute intervals. There will be more than5,000 customers. Typically, the customers will query data generated on the day the data was created.You need to recommend a naming convention for the virtual directories and files. The solution must minimize the time it takes for the customers to query the log files.What naming convention should you recommend?
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.You are designing an HDInsight/Hadoop cluster solution that uses Azure Data Lake Gen1 Storage.The solution requires POSIX permissions and enables diagnostics logging for auditing.You need to recommend solutions that optimize storage.Proposed Solution: Ensure that files stored are larger than 250MB.Does the solution meet the goal?