Easter Sale Special 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: exams65

Databricks Certified Machine Learning Professional Question and Answers

Databricks Certified Machine Learning Professional

Last Update May 15, 2024
Total Questions : 60

We are offering FREE Databricks-Machine-Learning-Professional Databricks exam questions. All you do is to just go and sign up. Give your details, prepare Databricks-Machine-Learning-Professional free exam questions and then go for complete pool of Databricks Certified Machine Learning Professional test questions that will help you more.

Databricks-Machine-Learning-Professional pdf

Databricks-Machine-Learning-Professional PDF

$35  $99.99
Databricks-Machine-Learning-Professional Engine

Databricks-Machine-Learning-Professional Testing Engine

$42  $119.99
Databricks-Machine-Learning-Professional PDF + Engine

Databricks-Machine-Learning-Professional PDF + Testing Engine

$56  $159.99
Questions 1

Which of the following is an advantage of using thepython_function(pyfunc)model flavor over the built-in library-specific model flavors?

Options:

A.  

python_function provides no benefits over the built-in library-specific model flavors

B.  

python_function can be used to deploy models in a parallelizable fashion

C.  

python_function can be used to deploy models without worrying about which library was used to create the model

D.  

python_function can be used to store models in an MLmodel file

E.  

python_function can be used to deploy models without worrying about whether they are deployed in batch, streaming, or real-time environments

Discussion 0
Questions 2

A machine learning engineer needs to select a deployment strategy for a new machine learning application. The feature values are not available until the time of delivery, and results are needed exceedingly fast for one record at a time.

Which of the following deployment strategies can be used to meet these requirements?

Options:

A.  

Edge/on-device

B.  

Streaming

C.  

None of these strategies will meet the requirements.

D.  

Batch

E.  

Real-time

Discussion 0
Questions 3

A data scientist has developed and logged a scikit-learn random forest model model, and then they ended their Spark session and terminated their cluster. After starting a new cluster, they want to review the feature_importances_ of the original model object.

Which of the following lines of code can be used to restore the model object so that feature_importances_ is available?

Options:

A.  

mlflow.load_model(model_uri)

B.  

client.list_artifacts(run_id)["feature-importances.csv"]

C.  

mlflow.sklearn.load_model(model_uri)

D.  

This can only be viewed in the MLflow Experiments UI

E.  

client.pyfunc.load_model(model_uri)

Discussion 0
Questions 4

A data scientist would like to enable MLflow Autologging for all machine learning libraries used in a notebook. They want to ensure that MLflow Autologging is used no matter what version of the Databricks Runtime for Machine Learning is used to run the notebook and no matter what workspace-wide configurations are selected in the Admin Console.

Which of the following lines of code can they use to accomplish this task?

Options:

A.  

mlflow.sklearn.autolog()

B.  

mlflow.spark.autolog()

C.  

spark.conf.set(“autologging”, True)

D.  

It is not possible to automatically log MLflow runs.

E.  

mlflow.autolog()

Discussion 0
Questions 5

A machine learning engineer is attempting to create a webhook that will trigger a Databricks Jobjob_idwhen a model version for modelmodeltransitions into any MLflow Model Registry stage.

They have the following incomplete code block:

Which of the following lines of code can be used to fill in the blank so that the code block accomplishes the task?

Options:

A.  

"MODEL_VERSION_CREATED"

B.  

"MODEL_VERSION_TRANSITIONED_TO_PRODUCTION"

C.  

"MODEL_VERSION_TRANSITIONED_TO_STAGING"

D.  

"MODEL_VERSION_TRANSITIONED_STAGE"

E.  

"MODEL_VERSION_TRANSITIONED_TO_STAGING", "MODEL_VERSION_TRANSITIONED_TO_PRODUCTION"

Discussion 0
Questions 6

A machine learning engineer needs to deliver predictions of a machine learning model in real-time. However, the feature values needed for computing the predictions are available one week before the query time.

Which of the following is a benefit of using a batch serving deployment in this scenario rather than a real-time serving deployment where predictions are computed at query time?

Options:

A.  

Batch servinghas built-in capabilities in Databricks Machine Learning

B.  

There is no advantage to using batch serving deployments over real-time serving deployments

C.  

Computing predictions in real-time provides more up-to-date results

D.  

Testing is not possible in real-time serving deployments

E.  

Querying stored predictions can be faster than computing predictions in real-time

Discussion 0
Questions 7

A data scientist is using MLflow to track their machine learning experiment. As a part of each MLflow run, they are performing hyperparameter tuning. The data scientist would like to have one parent run for the tuning process with a child run for each unique combination of hyperparameter values.

They are using the following code block:

The code block is not nesting the runs in MLflow as they expected.

Which of the following changes does the data scientist need to make to the above code block so that it successfully nests the child runs under the parent run in MLflow?

Options:

A.  

Indent the child run blocks within the parent run block

B.  

Add the nested=True argument to the parent run

C.  

Remove the nested=True argument from the child runs

D.  

Provide the same name to the run name parameter for all three run blocks

E.  

Add the nested=True argument to the parent run and remove the nested=True arguments from the child runs

Discussion 0
Questions 8

A data scientist has developed a scikit-learn modelsklearn_modeland they want to log the model using MLflow.

They write the following incomplete code block:

Which of the following lines of code can be used to fill in the blank so the code block can successfully complete the task?

Options:

A.  

mlflow.spark.track_model(sklearn_model, "model")

B.  

mlflow.sklearn.log_model(sklearn_model, "model")

C.  

mlflow.spark.log_model(sklearn_model, "model")

D.  

mlflow.sklearn.load_model("model")

E.  

mlflow.sklearn.track_model(sklearn_model, "model")

Discussion 0
Questions 9

Which of the following is a simple, low-cost method of monitoring numeric feature drift?

Options:

A.  

Jensen-Shannon test

B.  

Summary statistics trends

C.  

Chi-squared test

D.  

None of these can be used to monitor feature drift

E.  

Kolmogorov-Smirnov (KS) test

Discussion 0
Questions 10

Which of the following describes label drift?

Options:

A.  

Label drift is when there is a change in the distribution of the predicted target given by the model

B.  

None of these describe label drift

C.  

Label drift is when there is a change in the distribution of an input variable

D.  

Label drift is when there is a change in the relationship between input variables and target variables

E.  

Label drift is when there is a change in the distribution of a target variable

Discussion 0
Questions 11

Which of the following lists all of the model stages are available in the MLflow Model Registry?

Options:

A.  

Development. Staging. Production

B.  

None. Staging. Production

C.  

Staging. Production. Archived

D.  

None. Staging. Production. Archived

E.  

Development. Staging. Production. Archived

Discussion 0
Questions 12

Which of the following describes the purpose of the context parameter in the predict method of Python models for MLflow?

Options:

A.  

The context parameter allows the user to specify which version of the registered MLflowModel should be used based on the given application's current scenario

B.  

The context parameter allows the user to document the performance of a model after it has been deployed

C.  

The context parameter allows the user to include relevant details of the business case to allow downstream users to understand the purpose of the model

D.  

The context parameter allows the user to provide the model with completely custom if-else logic for the given application's current scenario

E.  

The context parameter allows the user to provide the model access to objects like preprocessing models or custom configuration files

Discussion 0
Questions 13

Which of the following machine learning model deployment paradigms is the most common for machine learning projects?

Options:

A.  

On-device

B.  

Streaming

C.  

Real-time

D.  

Batch

E.  

None of these deployments

Discussion 0
Questions 14

A machine learning engineer has developed a model and registered it using the FeatureStoreClient fs. The model has model URI model_uri. The engineer now needs to perform batch inference on customer-level Spark DataFrame spark_df, but it is missing a few of the static features that were used when training the model. The customer_id column is the primary key of spark_df and the training set used when training and logging the model.

Which of the following code blocks can be used to compute predictions for spark_df when the missing feature values can be found in the Feature Store by searching for features by customer_id?

Options:

A.  

df = fs.get_missing_features(spark_df, model_uri)

fs.score_model(model_uri, df)

B.  

fs.score_model(model_uri, spark_df)

C.  

df = fs.get_missing_features(spark_df, model_uri)

fs.score_batch(model_uri, df)

df = fs.get_missing_features(spark_df)

D.  

fs.score_batch(model_uri, df)

E.  

fs.score_batch(model_uri, spark_df)

Discussion 0
Questions 15

A data scientist has developed a scikit-learn random forest model model, but they have not yet logged model with MLflow. They want to obtain the input schema and the output schema of the model so they can document what type of data is expected as input.

Which of the following MLflow operations can be used to perform this task?

Options:

A.  

mlflow.models.schema.infer_schema

B.  

mlflow.models.signature.infer_signature

C.  

mlflow.models.Model.get_input_schema

D.  

mlflow.models.Model.signature

E.  

There is no way to obtain the input schema and the output schema of an unlogged model.

Discussion 0
Questions 16

A data scientist set up a machine learning pipeline to automatically log a data visualization with each run. They now want to view the visualizations in Databricks.

Which of the following locations in Databricks will show these data visualizations?

Options:

A.  

The MLflow Model RegistryModel paqe

B.  

The Artifacts section of the MLflow Experiment page

C.  

Logged data visualizations cannot be viewed in Databricks

D.  

The Artifacts section of the MLflow Run page

E.  

The Figures section of the MLflow Run page

Discussion 0
Questions 17

A machine learning engineer has registered a sklearn model in the MLflow Model Registry using the sklearn model flavor with UI model_uri.

Which of the following operations can be used to load the model as an sklearn object for batch deployment?

Options:

A.  

mlflow.spark.load_model(model_uri)

B.  

mlflow.pyfunc.read_model(model_uri)

C.  

mlflow.sklearn.read_model(model_uri)

D.  

mlflow.pyfunc.load_model(model_uri)

E.  

mlflow.sklearn.load_model(model_uri)

Discussion 0
Questions 18

Which of the following describes the concept of MLflow Model flavors?

Options:

A.  

A convention that deployment tools can use to wrap preprocessing logic into a Model

B.  

A convention that MLflow Model Registry can use to version models

C.  

A convention that MLflow Experiments can use to organize their Runs by project

D.  

A convention that deployment tools can use to understand the model

E.  

A convention that MLflow Model Registrycan use to organize its Models by project

Discussion 0