What is a consideration when building decision trees?
A. Cannot handle variables that affect the outcome in a discontinuous way
B. Short decision trees are likely subject to overfitting
C. Correlated variables can cause double-counting
D. Tree structure is sensitive to small changes in the training data
Answer: D
What is the output of the K-means clustering algorithm?
A. Centroid positioning and entropy of each record in each cluster
B. Center of each discovered cluster and mapping of each record to a cluster
C. Two dimensional representation of the data and the clusters
D. Intercept and coefficients for each...
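To make the question concrete, here is a minimal one-dimensional K-means sketch in plain Python. The function name `kmeans_1d`, the sample values, and the starting centroids are all hypothetical and chosen only for illustration; a real project would more likely use a library implementation.

```python
def kmeans_1d(values, initial_centroids, iterations=10):
    """Toy 1-D K-means: returns (centroids, assignments)."""
    centroids = list(initial_centroids)
    assignments = []
    for _ in range(iterations):
        # Assignment step: attach each value to its nearest centroid.
        assignments = [
            min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
            for v in values
        ]
        # Update step: move each centroid to the mean of its members.
        for i in range(len(centroids)):
            members = [v for v, a in zip(values, assignments) if a == i]
            if members:
                centroids[i] = sum(members) / len(members)
    return centroids, assignments


# Hypothetical data: two well-separated groups.
values = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
centroids, assignments = kmeans_1d(values, initial_centroids=[1.0, 12.0])
print(centroids)    # one center per cluster
print(assignments)  # one cluster index per input record
```

The function returns two things: a list of cluster centers and a cluster index for every input record.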
In a decision tree, what is an example of a pure node?
A. 25 positives; 75 negatives
B. 50 positives; 50 negatives
C. 75 positives; 25 negatives
D. 100 positives; 0 negatives
Answer: D
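Purity can be checked numerically. The sketch below computes two common impurity measures, Gini impurity and entropy, for the four class splits in the question. The helper names `gini` and `entropy` are illustrative, not taken from any particular library.

```python
import math


def gini(pos, neg):
    """Gini impurity of a two-class node; 0.0 means the node is pure."""
    total = pos + neg
    p = pos / total
    return 1.0 - p ** 2 - (1.0 - p) ** 2


def entropy(pos, neg):
    """Entropy in bits of a two-class node; 0.0 means the node is pure."""
    total = pos + neg
    result = 0.0
    for count in (pos, neg):
        if count:  # skip empty classes, since 0 * log2(0) is treated as 0
            p = count / total
            result -= p * math.log2(p)
    return result


options = {"A": (25, 75), "B": (50, 50), "C": (75, 25), "D": (100, 0)}
for name, (pos, neg) in options.items():
    print(name, round(gini(pos, neg), 3), round(entropy(pos, neg), 3))
```

Only the 100/0 split scores zero on both measures, and the 50/50 split scores highest on both.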
In which phase of the data analytics lifecycle do Data Scientists spend the most time in a project?
A. Discovery
B. Data Preparation
C. Model Building
D. Communicate Results
Answer: B
What is the next step you should take?
The Marketing department of your company wishes to track opinion on a new product that was recently introduced. Marketing would like to know how many positive and negative reviews are appearing over a given period and potentially retrieve each review for more in-depth insight. They have identified several popular...
What does the Receiver Operating Characteristic (ROC) curve show?
A. Relationship between p-value and true positive rate
B. Relationship between p-value and true negative rate
C. Relationship between true positive rate and false positive rate
D. Relationship between true positive rate and true negative rate
Answer: C
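As a rough illustration of how the points on a ROC curve are obtained, the sketch below sweeps a decision threshold over a set of classifier scores and records the false positive rate and true positive rate at each step. The function name `roc_points`, the labels, and the scores are made up for this example.

```python
def roc_points(labels, scores):
    """Return (false positive rate, true positive rate) pairs, one per threshold."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for threshold in sorted(set(scores), reverse=True):
        predicted = [s >= threshold for s in scores]
        tp = sum(1 for y, p in zip(labels, predicted) if p and y == 1)
        fp = sum(1 for y, p in zip(labels, predicted) if p and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


# Hypothetical ground-truth labels (1 = positive) and classifier scores.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
points = roc_points(labels, scores)
for fpr, tpr in points:
    print(f"FPR={fpr:.2f}  TPR={tpr:.2f}")
```

Plotting the first value of each pair on the x-axis and the second on the y-axis gives the curve, which runs from (0, 0) to (1, 1).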
Which assumption makes the Naïve Bayesian classifier different from the general Bayesian model?
A. Number of features cannot be greater than the number of records
B. Features of a class are conditionally independent of one another
C. All variables need to be numeric
D. Fewer features can be used with...
Which function would best address this need?
You have created a scatterplot of two continuous variables for 2000 records. You want to add a line to the scatterplot to check linearity of the data. Which function would best address this need?
A. abline()
B. glm()
C. hist()
D. lm()
Answer: A
What is the output format from the Map function of MapReduce?
A. Key-value pairs
B. Binary representation of keys concatenated with structured data
C. Compressed index
D. Unique key record and separate records of all possible values
Answer: A
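The classic word-count example shows what this looks like in practice. The sketch below is plain Python that imitates the map and reduce steps in memory. It is not the Hadoop API, and the function names `map_words` and `reduce_counts` are hypothetical.

```python
from collections import defaultdict


def map_words(line):
    """Map step: emit one (key, value) pair per word in the input line."""
    for word in line.lower().split():
        yield (word, 1)


def reduce_counts(pairs):
    """Reduce step: sum the values emitted for each key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


pairs = list(map_words("to be or not to be"))
print(pairs)                 # list of (word, 1) tuples
print(reduce_counts(pairs))  # word -> total count
```

In a real MapReduce job the framework groups the emitted pairs by key between the two steps. Here the grouping is folded into `reduce_counts` for brevity.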
What is your next step?
You have been assigned to do a study of the daily revenue effect of a pricing model of online transactions. All the data currently available to you has been loaded into your analytics database: revenue data, pricing data, and online transaction data. You find that all the data comes in...