What is a consideration when building decision trees?

A. Cannot handle variables that affect the outcome in a discontinuous way
B. Short decision trees are likely subject to overfit
C. Correlated variables can cause double-counting
D. Tree structure is sensitive to small changes in the training data
Answer: D

September 21, 2024

What is the output of the K-means clustering algorithm?

A. Centroid positioning and entropy of each record in each cluster
B. Center of each discovered cluster and mapping of each record to a cluster
C. Two dimensional representation of the data and the clusters
D. Intercept and coefficients for each...
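For context, here is a minimal pure-Python sketch of what a K-means run hands back. The function name `kmeans`, the sample points, and the starting centroids are all illustrative assumptions, and real libraries add smarter initialization and convergence checks that are left out here.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, centroids, iterations=10):
    """Hypothetical helper: run a fixed number of K-means iterations."""
    assignments = []
    for _ in range(iterations):
        # Assignment step: each record goes to its nearest centroid.
        assignments = [
            min(range(len(centroids)), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        updated = []
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                updated.append(tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                updated.append(centroids[c])  # keep an empty cluster's centroid in place
        centroids = updated
    return centroids, assignments

# Made-up sample data: two visually separate groups of 2-D points.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.0), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, assignments = kmeans(points, [(0.0, 0.0), (10.0, 10.0)])
print(centroids)
print(assignments)
```

The two returned values are a list of cluster centers and a list that maps each input record to a cluster index.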

September 21, 2024

In a decision tree, what is an example of a pure node?

A. 25 positives; 75 negatives
B. 50 positives; 50 negatives
C. 75 positives; 25 negatives
D. 100 positives; 0 negatives
Answer: D
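To see why a single-class node counts as pure, the sketch below scores the four options with two common impurity measures, Gini impurity and entropy. The helper names `gini` and `entropy` are illustrative, and the question itself does not say which measure the exam has in mind.

```python
import math

def gini(pos, neg):
    """Gini impurity of a two-class node: 1 - p^2 - (1 - p)^2."""
    p = pos / (pos + neg)
    return 1.0 - p ** 2 - (1.0 - p) ** 2

def entropy(pos, neg):
    """Entropy in bits of a two-class node; an empty class contributes 0."""
    total = pos + neg
    result = 0.0
    for count in (pos, neg):
        if count:
            p = count / total
            result -= p * math.log2(p)
    return result

# The four options from the question as (positives, negatives).
options = {"A": (25, 75), "B": (50, 50), "C": (75, 25), "D": (100, 0)}
for label, (pos, neg) in options.items():
    print(label, round(gini(pos, neg), 3), round(entropy(pos, neg), 3))
```

Both measures drop to zero only when every record in the node belongs to one class, and both peak at the 50/50 split.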

September 21, 2024

In which phase of the data analytics lifecycle do Data Scientists spend the most time in a project?

A. Discovery
B. Data Preparation
C. Model Building
D. Communicate Results
Answer: B

September 21, 2024

What is the next step you should take?

The Marketing department of your company wishes to track opinion on a new product that was recently introduced. Marketing would like to know how many positive and negative reviews are appearing over a given period and potentially retrieve each review for more in-depth insight. They have identified several popular...

September 20, 2024

What does the Receiver Operating Characteristic (ROC) curve show?

A. Relationship between p-value and true positive rate
B. Relationship between p-value and true negative rate
C. Relationship between true positive rate and false positive rate
D. Relationship between true positive rate and true negative rate
Answer: C
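As a rough illustration of that relationship, the sketch below sweeps a decision threshold over a handful of classifier scores and records the false positive rate and true positive rate at each step. The function `roc_points`, the labels, and the scores are made up for this example, and the code assumes both classes are present.

```python
def roc_points(labels, scores):
    """Return (FPR, TPR) pairs, sweeping the threshold from strict to lenient."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]  # a threshold above every score predicts no positives
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points

# Hypothetical data: 1 = positive class, scores are predicted probabilities.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
points = roc_points(labels, scores)
for fpr, tpr in points:
    print(round(fpr, 2), round(tpr, 2))
```

Plotting the first value of each pair on the x-axis and the second on the y-axis gives the ROC curve, which runs from (0, 0) to (1, 1).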

September 20, 2024

Which assumption makes the Naïve Bayesian classifier different from the general Bayesian model?

A. Number of features cannot be greater than the number of records
B. Features of a class are conditionally independent of one another
C. All variables need to be numeric
D. Fewer features can be used with...

September 20, 2024

Which function would best address this need?

You have created a scatterplot of two continuous variables for 2000 records. You want to add a line to the scatterplot to check linearity of the data. Which function would best address this need?
A. abline()
B. glm()
C. hist()
D. lm()
Answer: A

September 20, 2024

What is the output format from the Map function of MapReduce?

A. Key-value pairs
B. Binary representation of keys concatenated with structured data
C. Compressed index
D. Unique key record and separate records of all possible values
Answer: A
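The key-value shape is easiest to see in the classic word-count example. The sketch below imitates the map, shuffle, and reduce stages in plain Python. It is not Hadoop code, and the function names `map_words`, `shuffle`, and `reduce_counts` are hypothetical.

```python
def map_words(line):
    """Map step: emit one (key, value) pair per word, here (word, 1)."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    grouped = {}
    for key, value in pairs:
        grouped.setdefault(key, []).append(value)
    return grouped

def reduce_counts(key, values):
    """Reduce step: collapse all values for one key into a single pair."""
    return (key, sum(values))

# Made-up input lines standing in for records of a larger file.
lines = ["big data big insight", "data beats opinion"]
mapped = [pair for line in lines for pair in map_words(line)]
counts = dict(reduce_counts(k, v) for k, v in shuffle(mapped).items())
print(mapped)
print(counts)
```

Every element the map step emits is a two-item pair, which is what lets the framework group records by key before reducing.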

September 20, 2024

What is your next step?

You have been assigned to study the daily revenue effect of a pricing model for online transactions. All the data currently available to you has been loaded into your analytics database: revenue data, pricing data, and online transaction data. You find that all the data comes in...

September 20, 2024