SAS Institute A00-406 SAS Viya Supervised Machine Learning Pipelines Online Training

SAS Institute A00-406 Online Training

Question #21

What is the primary purpose of model assessment in the context of data science and machine learning?

Question #22

What is the main advantage of ensemble learning methods, such as Random Forest, in a machine learning pipeline?

Question #23

When deploying a machine learning model, what is meant by "model latency"?

Question #24

What is the purpose of an ROC curve (Receiver Operating Characteristic) in model assessment?

Question #25

Which data source typically provides access to real-time financial market data?

Question #26

In model assessment, what does "cross-validation" aim to address?

Question #27

Which metric is commonly used to evaluate the performance of a regression model?

Question #28

What is "model reevaluation" in the model deployment phase?

A . The process of data preprocessing
B . The process of selecting features
C . The periodic assessment of a deployed model’s performance and potential retraining
D . The evaluation of data distribution

Question #29

What is overfitting in machine learning, and how can it be addressed in a pipeline?

A . Overfitting occurs when the model is too simple and underperforms.
B . Overfitting occurs when the model fits the training data too closely and may not generalize well. It can be addressed by regularization techniques.
C . Overfitting occurs when the model is too complex and overperforms.
D . Overfitting is not a concern in machine learning pipelines.

Question #30

What is a data lake?

A . A data storage solution designed for high-speed data retrieval
B . A centralized repository for storing all structured and unstructured data at any scale
C . A specialized database for time-series data
D . A backup system for relational databases