What should you do?
You have a Google Cloud Dataflow streaming pipeline running with a Google Cloud Pub/Sub subscription as the source. You need to make an update to the code that will make the new Cloud Dataflow pipeline incompatible with the current version. You do not want to lose any data when making this...
Which software libraries are supported by Cloud Machine Learning Engine?
A. Theano and TensorFlow
B. Theano and Torch
C. TensorFlow
D. TensorFlow and Torch
Answer: C
Explanation: Cloud ML Engine mainly does two things: Enables you to train machine learning models at scale by running TensorFlow training applications in the...
Which of these are examples of a value in a sparse vector? (Select 2 answers.)
A. [0, 5, 0, 0, 0, 0]
B. [0, 0, 0, 1, 0, 0, 1]
C. [0, 1]
D. [1, 0, 0, 0, 0, 0, 0]
Answer: C, D
Explanation: Categorical features in linear models are...
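The explanation above is cut off, so the following is only a sketch under an assumption: that "a value in a sparse vector" here means a one-hot encoding of a single categorical value, where exactly one position is 1 and every other position is 0. The helper names `one_hot` and `is_one_hot` are hypothetical and chosen for illustration. Under that assumption, the check below agrees with the listed answer (C and D).

```python
def one_hot(index, size):
    """Encode a category id as a one-hot vector of the given size."""
    if not 0 <= index < size:
        raise ValueError("index must be in the range [0, size)")
    return [1 if i == index else 0 for i in range(size)]


def is_one_hot(vector):
    """Return True if the vector holds only 0s and 1s with exactly one 1.

    Assumption: this is the property the question is testing.
    """
    return all(v in (0, 1) for v in vector) and sum(vector) == 1


# The four options from the question, keyed by letter.
options = {
    "A": [0, 5, 0, 0, 0, 0],        # contains a 5, not a 0/1 indicator
    "B": [0, 0, 0, 1, 0, 0, 1],     # two positions set
    "C": [0, 1],                    # one position set
    "D": [1, 0, 0, 0, 0, 0, 0],     # one position set
}

matching = [letter for letter, vec in options.items() if is_one_hot(vec)]
print(matching)
```

For example, `one_hot(0, 7)` produces the vector shown in option D.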
Which two characteristics support this method?
You want to use a database of information about tissue samples to classify future tissue samples as either normal or mutated. You are evaluating an unsupervised anomaly detection method for classifying the tissue samples. Which two characteristics support this method? (Choose two.)
A. There are very few occurrences of mutations...
What should you do?
You want to use Google Stackdriver Logging to monitor Google BigQuery usage. You need an instant notification to be sent to your monitoring tool when new data is appended to a certain table using an insert job, but you do not want to receive notifications for other tables. What should...
Which schema should you use?
MJTelco needs you to create a schema in Google Bigtable that will allow for the historical analysis of the last 2 years of records. Each record that comes in is sent every 15 minutes, and contains a unique identifier of the device and a data record. The most common query...
How should the row key be redesigned to improve Bigtable performance on queries that populate real-time dashboards?
Your company is streaming real-time sensor data from their factory floor into Bigtable and they have noticed extremely poor performance. How should the row key be redesigned to improve Bigtable performance on queries that populate real-time dashboards?
A. Use a row key of the form <timestamp>.
B. Use a row...
How should you design the pipeline?
You are designing a basket abandonment system for an ecommerce company. The system will send a message to a user based on these rules:
- No interaction by the user on the site for 1 hour
- Has added more than $30 worth of products to the basket
- Has...
What is the most likely cause of these duplicate messages?
You work for a car manufacturer and have set up a data pipeline using Google Cloud Pub/Sub to capture anomalous sensor events. You are using a push subscription in Cloud Pub/Sub that calls a custom HTTPS endpoint that you have created to take action on these anomalous events as they...
Which of these is not a supported method of putting data into a partitioned table?
A. If you have existing data in a separate file for each day, then create a partitioned table and upload each file into the appropriate partition.
B. Run a query to get the records for...