Which of the following is a simple statistic to monitor for categorical feature drift?
Which of the following is a simple statistic to monitor for categorical feature drift?A . ModeB . None of theseC . Mode, number of unique values, and percentage of missing valuesD . Percentage of missing valuesE . Number of unique valuesView AnswerAnswer: C
Which of the following SQL commands can be used to accomplish this task?
After a data scientist noticed that a column was missing from a production feature set stored as a Delta table, the machine learning engineering team has been tasked with determining when the column was dropped from the feature set. Which of the following SQL commands can be used to accomplish...
Which of the following types of drift is present in the above scenario?
A data scientist has developed a model to predict ice cream sales using the expected temperature and expected number of hours of sun in the day. However, the expected temperature is dropping beneath the range of the input variable on which the model was trained. Which of the following types...
Which of the following should be completed as Step #3?
Run a statistical test to determine if there are changes over time Which of the following should be completed as Step #3?A . Obtain the observed values (actual) feature valuesB . Measure the latency of the prediction timeC . Retrain the modelD . None of these should be completed as...
Which of the following operations in Feature Store Client fs can be used to return a Spark DataFrame of a data set associated with a Feature Store table?
Which of the following operations in Feature Store Client fs can be used to return a Spark DataFrame of a data set associated with a Feature Store table?A . fs.create_tableB . fs.write_tableC . fs.get_tableD . There is no way to accomplish this task with fsE . fs.read_tableView AnswerAnswer: A
Which of the following lines of code can be used to obtain run-level results for exp_id in a Spark DataFrame?
A data scientist is utilizing MLflow to track their machine learning experiments. After completing a series of runs for the experiment with experiment ID exp_id, the data scientist wants to programmatically work with the experiment run data in a Spark DataFrame. They have an active MLflow Client client and an...
Which of the following machine learning model deployment paradigms is the most common for machine learning projects?
Which of the following machine learning model deployment paradigms is the most common for machine learning projects?A . On-deviceB . StreamingC . Real-timeD . BatchE . None of these deploymentsView AnswerAnswer: B
Which of the following is a reason for using Jensen-Shannon (JS) distance over a Kolmogorov-Smirnov (KS) test for numeric feature drift detection?
Which of the following is a reason for using Jensen-Shannon (JS) distance over a Kolmogorov-Smirnov (KS) test for numeric feature drift detection?A . All of these reasonsB . JS is not normalized or smoothedC . None of these reasonsD . JS is more robust when working with large datasetsE ....
Which of the following code blocks can they use to perform this task using the Feature Store Client fs?
A data scientist has computed updated feature values for all primary key values stored in the Feature Store table features. In addition, feature values for some new primary key values have also been computed. The updated feature values are stored in the DataFrame features_df. They want to replace all data...
Which of the following lines of code can be used to restore the model object so that feature_importances_ is available?
A data scientist has developed and logged a scikit-learn random forest model model, and then they ended their Spark session and terminated their cluster. After starting a new cluster, they want to review the feature_importances_ of the original model object. Which of the following lines of code can be used...