How should the row key be redesigned to improve Bigtable performance on queries that populate real-time dashboards?
Your company is streaming real-time sensor data from their factory floor into Bigtable and they have noticed extremely poor performance. How should the row key be redesigned to improve Bigtable performance on queries that populate real-time dashboards? A. Use a row key of the form <timestamp>. B. Use a row...
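To illustrate why row key layout matters here, the following is a minimal pure-Python sketch, not Bigtable client code. It relies on the fact that Bigtable stores rows sorted lexicographically by row key and splits that sorted key space into contiguous ranges (tablets). The sensor names, the tablet boundaries, and both key-building helpers are hypothetical and chosen only for the demonstration.

```python
import bisect
from collections import Counter

def timestamp_first_key(sensor_id: str, ts_ms: int) -> str:
    # Hypothetical layout: the key starts with the timestamp.
    return f"{ts_ms:013d}#{sensor_id}"

def sensor_first_key(sensor_id: str, ts_ms: int) -> str:
    # Hypothetical layout: the key starts with the sensor ID.
    return f"{sensor_id}#{ts_ms:013d}"

def writes_per_tablet(keys, boundaries):
    # Assign each key to a contiguous key range, imitating how a sorted
    # key space is split into tablets at the given boundary keys.
    counts = Counter()
    for key in keys:
        counts[bisect.bisect_right(boundaries, key)] += 1
    return counts

sensors = ["s00", "s01", "s02", "s03"]           # assumed sensor IDs
start_ms = 1_700_000_000_000                     # assumed burst of 100 ms
burst = [(s, start_ms + i) for s in sensors for i in range(100)]

# Timestamp-first keys: every new write sorts after all older data,
# so the whole burst lands in the last key range.
hot = writes_per_tablet(
    [timestamp_first_key(s, t) for s, t in burst],
    ["1600000000000", "1650000000000", "1690000000000"],
)

# Sensor-first keys: the same burst is spread across the key ranges.
spread = writes_per_tablet(
    [sensor_first_key(s, t) for s, t in burst],
    ["s01", "s02", "s03"],
)

print(dict(hot))
print(dict(spread))
```

In this toy model the timestamp-first layout sends all 400 writes to a single range, while the sensor-first layout splits them evenly. A sensor-first key also keeps each sensor's readings contiguous, which suits a dashboard that scans recent rows for one sensor at a time.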
What should you do?
Your company has hired a new data scientist who wants to perform complicated analyses across very large datasets stored in Google Cloud Storage and in a Cassandra cluster on Google Compute Engine. The scientist primarily wants to create labelled data sets for machine learning projects, along with some visualization tasks....
What should they do?
Your company is running their first dynamic campaign, serving different offers by analyzing real-time data during the holiday season. The data scientists are collecting terabytes of data that rapidly grows every hour during their 30-day campaign. They are using Google Cloud Dataflow to preprocess the data and collect the feature...
Which methods can be used to reduce the number of rows processed by BigQuery?
Which methods can be used to reduce the number of rows processed by BigQuery? A. Splitting tables into multiple tables; putting data in partitions B. Splitting tables into multiple tables; putting data in partitions; using the LIMIT clause C. Putting data in partitions; using the LIMIT clause D. Splitting tables...
What is the most likely cause of these duplicate messages?
You work for a car manufacturer and have set up a data pipeline using Google Cloud Pub/Sub to capture anomalous sensor events. You are using a push subscription in Cloud Pub/Sub that calls a custom HTTPS endpoint that you have created to take action on these anomalous events as they...
What method can you employ to address this?
Your company built a TensorFlow neural-network model with a large number of neurons and layers. The model fits well for the training data. However, when tested against new data, it performs poorly. What method can you employ to address this? A. Threading B. Serialization C...
How should you design the pipeline?
You are designing a basket abandonment system for an ecommerce company. The system will send a message to a user based on these rules: - No interaction by the user on the site for 1 hour - Has added more than $30 worth of products to the basket - Has...
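The first two rules above can be sketched in plain Python as gap-based sessionization, the idea behind session windows in stream processors. This is only an illustration under stated assumptions: the `Event` type and its fields are hypothetical, the basket value is taken to be what accumulated within the latest session, and the third rule is truncated in the excerpt, so it is deliberately not modeled.

```python
from dataclasses import dataclass
from typing import List

GAP_SECONDS = 3600       # rule 1: no interaction for 1 hour
MIN_BASKET_USD = 30.0    # rule 2: more than $30 worth of products

@dataclass
class Event:
    ts: int              # event time in seconds (hypothetical field)
    basket_delta: float  # change in basket value in USD (hypothetical field)

def sessions(events: List[Event]) -> List[List[Event]]:
    # Group one user's events into sessions; a gap of at least
    # GAP_SECONDS between consecutive events starts a new session.
    grouped: List[List[Event]] = []
    for event in sorted(events, key=lambda e: e.ts):
        if grouped and event.ts - grouped[-1][-1].ts < GAP_SECONDS:
            grouped[-1].append(event)
        else:
            grouped.append([event])
    return grouped

def is_abandoned(events: List[Event], now: int) -> bool:
    # True when the latest session has been idle for at least an hour
    # and the basket value accumulated in it exceeds the threshold.
    grouped = sessions(events)
    if not grouped:
        return False
    last = grouped[-1]
    idle_long_enough = now - last[-1].ts >= GAP_SECONDS
    basket_value = sum(e.basket_delta for e in last)
    return idle_long_enough and basket_value > MIN_BASKET_USD

history = [Event(ts=0, basket_delta=20.0), Event(ts=600, basket_delta=15.0)]
print(is_abandoned(history, now=600 + 3600))
print(is_abandoned(history, now=600 + 1800))
```

In a streaming pipeline the same grouping would normally be expressed with the framework's own session-window primitive and a one-hour gap rather than hand-written loops.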
What should you do?
Flowlogistic’s CEO wants to gain rapid insight into their customer base so his sales team can be better informed in the field. This team is not very technical, so they’ve purchased a visualization tool to simplify the creation of BigQuery reports. However, they’ve been overwhelmed by all the data in...
Which of the following is NOT true about Dataflow pipelines?
Which of the following is NOT true about Dataflow pipelines? A. Dataflow pipelines are tied to Dataflow, and cannot be run on any other runner B. Dataflow pipelines can consume data from other Google Cloud services C. Dataflow pipelines can be programmed in Java D. Dataflow pipelines use a unified...
Which approach should you take?
Flowlogistic is rolling out their real-time inventory tracking system. The tracking devices will all send package-tracking messages, which will now go to a single Google Cloud Pub/Sub topic instead of the Apache Kafka cluster. A subscriber application will then process the messages for real-time reporting and store them in Google...