Create Next App

databricks CERTIFIED_MACHINE_LEARNING_ASSOCIATE

Exam contains 73 questions

Page 7 of 13

Question 37 🔥

A data scientist is developing a machine learning pipeline using AutoML on Databricks Machine Learning.Which of the following steps will the data scientist need to perform outside of their AutoML experiment?

Which database solution meets these requirements?

A. Model tuning

Highly voted

B. Model evaluation

Highly voted

C. Model deployment

Highly voted

D. Exploratory data analysis

Highly voted

Discussion of the question

Question 38 🔥

A data scientist is utilizing MLflow Autologging to automatically track their machine learning experiments. After completing a series of runs for the experiment experiment_id, the data scientist wants to identify the run_id of the run with the best root-mean-square error (RMSE).Which of the following lines of code can be used to identify the run_id of the run with the best RMSE in experiment_id?

Which database solution meets these requirements?

Highly voted

Discussion of the question

Question 39 🔥

A machine learning engineer has been notified that a new Staging version of a model registered to the MLflow Model Registry has passed all tests. As a result, the machine learning engineer wants to put this model into production by transitioning it to the Production stage in the Model Registry.From which of the following pages in Databricks Machine Learning can the machine learning engineer accomplish this task?

Which database solution meets these requirements?

A. The home page of the MLflow Model Registry

Highly voted

B. The experiment page in the Experiments observatory

Highly voted

C. The model version page in the MLflow Model Registry

Highly voted

D. The model page in the MLflow Model Registry

Highly voted

Discussion of the question

Question 40 🔥

A machine learning engineer has identified the best run from an MLflow Experiment. They have stored the run ID in the run_id variable and identified the logged model name as "model". They now want to register that model in the MLflow Model Registry with the name "best_model".Which lines of code can they use to register the model associated with run_id to the MLflow Model Registry?

Which database solution meets these requirements?

A. mlflow.register_model(run_id, "best_model")

Highly voted

B. mlflow.register_model(f"runs:/{run_id}/model”, "best_model”)

Highly voted

C. millow.register_model(f"runs:/{run_id)/model")

Highly voted

D. mlflow.register_model(f"runs:/{run_id}/best_model", "model")

Highly voted

Discussion of the question

Question 41 🔥

A new data scientist has started working on an existing machine learning project. The project is a scheduled Job that retrains every day. The project currently exists in a Repo in Databricks. The data scientist has been tasked with improving the feature engineering of the pipeline’s preprocessing stage. The data scientist wants to make necessary updates to the code that can be easily adopted into the project without changing what is being run each day.Which approach should the data scientist take to complete this task?

Which database solution meets these requirements?

A. They can create a new branch in Databricks, commit their changes, and push those changes to the Git provider.

Highly voted

B. They can clone the notebooks in the repository into a Databricks Workspace folder and make the necessary changes.

Highly voted

C. They can create a new Git repository, import it into Databricks, and copy and paste the existing code from the original repository before making changes.

Highly voted

D. They can clone the notebooks in the repository into a new Databricks Repo and make the necessary changes.

Highly voted

Discussion of the question

Question 42 🔥

A machine learning engineering team has a Job with three successive tasks. Each task runs a single notebook. The team has been alerted that the Job has failed in its latest run.Which of the following approaches can the team use to identify which task is the cause of the failure?

Which database solution meets these requirements?

A. Run each notebook interactively

Highly voted

B. Review the matrix view in the Job’s runs

Highly voted

C. Migrate the Job to a Delta Live Tables pipeline

Highly voted

D. Change each Task’s setting to use a dedicated cluster

Highly voted

Discussion of the question

Ready to Pass Your Certification Test

databricks CERTIFIED_MACHINE_LEARNING_ASSOCIATE

Exam contains 73 questions

Lorem ipsum dolor sit amet consectetur. Eget sed turpis aenean sit aenean. Integer at nam ullamcorper a.

Company

Product

Resources

Follow us