Which tool/library would you use to produce these models with the least effort?
You have been assigned to run a logistic regression model for each of 100 countries, and all the data is currently stored in a PostgreSQL database. Which tool/library would you use to produce these models with the least effort?
A. MADlib
B. Mahout
C. RStudio
D. HBase
Answer: A
What is the most appropriate visual to reinforce this statement?
You have created a Logistic Regression model to predict customer churn for your company. The company’s Marketing department wants to use your model to identify at-risk customers and offer incentives to keep them from leaving. Using two different thresholds for the model provides the two confusion matrices shown in the...
What can you determine from the lift calculation?
You are using the Apriori algorithm to determine the likelihood that a person who owns a home has a good credit score. You have determined that the confidence for the rules used in the algorithm is > 75%. You calculate lift = 1.011 for the rule, "People with good credit...
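As a rough illustration of how lift is computed for an association rule A -> B: lift is the rule's confidence, P(B | A), divided by the support of B, P(B). The sketch below is not taken from the question; the function name `lift` and all counts are hypothetical, chosen so that the confidence exceeds 75% and the lift comes out close to 1.011.

```python
def lift(n_total, n_a, n_b, n_both):
    """Lift of the rule A -> B computed from raw transaction counts."""
    support_b = n_b / n_total    # P(B): share of all records that contain B
    confidence = n_both / n_a    # P(B | A): share of A-records that also contain B
    return confidence / support_b


# Hypothetical counts, for illustration only.
value = lift(n_total=1000, n_a=400, n_b=791, n_both=320)
print(round(value, 3))
```

A lift close to 1 means that B occurs about as often among records containing A as it does overall, so the two items are approximately independent even when the confidence is high.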
Which activity is performed in the Operationalize phase of the Data Analytics Lifecycle?
Which activity is performed in the Operationalize phase of the Data Analytics Lifecycle?
A. Define the process to maintain the model
B. Try different analytical techniques
C. Try different variables
D. Transform existing variables
Answer: A
Which technique could you use to address the three correlated variables?
During the data preparation phase, you notice a high correlation between average spend on video games, age of players, and number of science fiction shows watched. Which technique could you use to address the three correlated variables?
A. Square the three variables to remove the correlation
B. Combine the three...
Which technique should you use?
You need to run a hypothesis test across three normally distributed populations. Which technique should you use?
A. Z-test
B. Welch's t-test
C. ANOVA
D. Wilcoxon rank sum test
Answer: C
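To make the ANOVA answer concrete, here is a minimal sketch of the one-way ANOVA F-statistic: the ratio of between-group variance to within-group variance. The function name `one_way_anova_f` and the three small samples are hypothetical and only illustrate the arithmetic; in practice a statistics package would also supply the p-value.

```python
from statistics import mean


def one_way_anova_f(*groups):
    """F-statistic for a one-way ANOVA over two or more groups."""
    k = len(groups)                        # number of groups
    n = sum(len(g) for g in groups)        # total number of observations
    grand_mean = mean([x for g in groups for x in g])

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    return ms_between / ms_within


# Hypothetical samples from three populations.
f_stat = one_way_anova_f([1, 2, 3], [2, 3, 4], [3, 4, 5])
print(f_stat)
```

A large F-statistic suggests that at least one group mean differs from the others; a single test across all three groups avoids running several pairwise t-tests.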
In a t-test with unknown variance, what values are used to calculate the t-statistic?
In a t-test with unknown variance, what values are used to calculate the t-statistic?
A. Sample mean, sample standard deviation, and sample size
B. Mean, sample standard deviation, and population size
C. Sample mean, standard deviation, and sample size
D. Mean, standard deviation, and population size
Answer: A
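The three quantities in the answer combine as t = (sample mean - mu0) / (s / sqrt(n)), where mu0 is the hypothesized mean being tested. A minimal one-sample sketch follows; the function name `t_statistic`, the data, and `mu0 = 4` are assumptions made for illustration.

```python
from math import sqrt
from statistics import mean, stdev


def t_statistic(sample, mu0):
    """One-sample t-statistic against a hypothesized mean mu0."""
    n = len(sample)            # sample size
    x_bar = mean(sample)       # sample mean
    s = stdev(sample)          # sample standard deviation (n - 1 denominator)
    return (x_bar - mu0) / (s / sqrt(n))


# Hypothetical data, for illustration only.
t = t_statistic([2, 4, 4, 4, 5, 5, 7, 9], mu0=4)
print(t)
```

Because the population variance is unknown, the sample standard deviation stands in for it, which is why the statistic follows a t-distribution with n - 1 degrees of freedom rather than a normal distribution.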
What is your next step?
You have been assigned to do a study of the daily revenue effect of a pricing model of online transactions. You have tested all the theoretical models in the previous model planning stage, and all tests have yielded statistically insignificant results. What is your next step?
A. Report that the...
What is another component?
Trend, seasonal, and cyclical are components of a time series. What is another component?
A. Irregular
B. Linear
C. Quadratic
D. Exponential
Answer: A
What is the mandatory clause that must be included when using window functions?
What is the mandatory clause that must be included when using window functions?
A. OVER
B. RANK
C. PARTITION BY
D. RANK BY
Answer: A
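A small sketch of why OVER is the required part: the example below runs two window queries through Python's built-in sqlite3 module (this assumes the bundled SQLite is version 3.25 or later, which added window functions). The `sales` table and its rows are hypothetical. The first query uses PARTITION BY inside OVER; the second uses an empty OVER (), showing that PARTITION BY is optional while OVER is not.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (country TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("US", 10), ("US", 30), ("FR", 20), ("FR", 5)],  # hypothetical rows
)

# Rank amounts within each country: OVER with PARTITION BY and ORDER BY.
ranked = conn.execute(
    """
    SELECT country, amount,
           RANK() OVER (PARTITION BY country ORDER BY amount DESC) AS rnk
    FROM sales
    ORDER BY country, rnk
    """
).fetchall()

# Attach the grand total to every row: OVER () with no PARTITION BY.
totals = conn.execute(
    """
    SELECT amount, SUM(amount) OVER () AS total
    FROM sales
    ORDER BY amount
    """
).fetchall()

print(ranked)
print(totals)
conn.close()
```

RANK is one of many functions that can be used with a window, and PARTITION BY only narrows the window; without OVER, the same function call is not a window function at all.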