Which two packages can be used to customize the software configuration of a Jupyter notebook environment in Cloud Pak for Data?
Which two packages can be used to customize the software configuration of a Jupyter notebook environment in Cloud Pak for Data?A . vimB . pipC . sudoD . bashE . condaView AnswerAnswer: BE
Which statistical method reduces the number of attributes by lumping highly correlated attributes together?
Which statistical method reduces the number of attributes by lumping highly correlated attributes together?A . BinningB . Principal Component Analysis (PCA)C . Long Short Term Memory Network (LSTM)D . Synthetic Minority Over-sampling Technique (SMOTE)View AnswerAnswer: B
Which of the following is a critical first step in understanding a business problem for data science projects?
Which of the following is a critical first step in understanding a business problem for data science projects?A . Selecting the machine learning algorithmB . Defining the project scopeC . Choosing the visualization toolsD . Deploying the modelView AnswerAnswer: B
When helping businesses articulate and define problems, what is an essential first step?
When helping businesses articulate and define problems, what is an essential first step?A . Identifying potential data sourcesB . Defining key performance indicators (KPIs)C . Establishing a clear problem statementD . Selecting the analytical techniquesView AnswerAnswer: C
Cloud Pak for Data's integration with Spark allows users to:
Cloud Pak for Data's integration with Spark allows users to:A . Perform complex computations on small datasets onlyB . Leverage distributed computing for processing large datasets efficientlyC . Avoid using any form of data processing or analysisD . Use Spark exclusively for data visualization purposesView AnswerAnswer: B
In classification models, which of the following metrics is NOT directly derived from the confusion matrix?
In classification models, which of the following metrics is NOT directly derived from the confusion matrix?A . PrecisionB . RecallC . Mean Absolute Error (MAE)D . F1-scoreView AnswerAnswer: C
What is data leakage in the context of model training?
What is data leakage in the context of model training?A . When data from outside the training dataset is accidentally included in the training processB . A situation where the test data is not availableC . Leakage of sensitive information due to poor data handling practicesD . Loss of data...
What is the primary purpose of partitioning data into training and test sets?
What is the primary purpose of partitioning data into training and test sets?A . To ensure that the model gets exposed to all possible data scenarios during trainingB . To maximize the accuracy of the model by using all data for trainingC . To evaluate the model's performance on unseen...
Which metric is a good indicator of the performance of the virtual assistant?
A virtual assistant has been developed and deployed based on the Watson Assistant service. The assistant will support customers by answering FAQs (Frequent Answered Questions). Which metric is a good indicator of the performance of the virtual assistant?A . The Area Under the Curve (AUC)B . Measure escalated calls using...
The first step in performing exploratory data analysis (EDA) typically involves:
The first step in performing exploratory data analysis (EDA) typically involves:A . Choosing a color palette for data visualizationB . Determining the hypothesis for the analysisC . Connecting to as many data sources as possibleD . Selecting a random sample of data to analyzeView AnswerAnswer: B