How should you design the pipeline?

You are designing a basket abandonment system for an ecommerce company. The system will send a message to a user based on these rules: - No interaction by the user on the site for 1 hour - Has added more than $30 worth of products to the basket - Has...

September 24, 2021 No Comments READ MORE +

What is the most likely cause of these duplicate messages?

You work for a car manufacturer and have set up a data pipeline using Google Cloud Pub/Sub to capture anomalous sensor events. You are using a push subscription in Cloud Pub/Sub that calls a custom HTTPS endpoint that you have created to take action of these anomalous events as they...

September 23, 2021 No Comments READ MORE +

Which of these is not a supported method of putting data into a partitioned table?

Which of these is not a supported method of putting data into a partitioned table?A . If you have existing data in a separate file for each day, then create a partitioned table and upload each file into the appropriate partition.B . Run a query to get the records for...

September 23, 2021 No Comments READ MORE +

Why do you need to split a machine learning dataset into training data and test data?

Why do you need to split a machine learning dataset into training data and test data?A . So you can try two different sets of featuresB . To make sure your model is generalized for more than just the training dataC . To allow you to create unit tests in...

September 23, 2021 No Comments READ MORE +

Which Google Cloud Platform product should you use?

You are designing the database schema for a machine learning-based food ordering service that will predict what users want to eat. Here is some of the information you need to store: - The user profile: What the user likes and doesn’t like to eat - The user account information: Name,...

September 23, 2021 No Comments READ MORE +

Which three machine learning applications can you use?

Business owners at your company have given you a database of bank transactions. Each row contains the user ID, transaction type, transaction location, and transaction amount. They ask you to investigate what type of machine learning can be applied to the data. Which three machine learning applications can you use?...

September 23, 2021 No Comments READ MORE +

What should you do?

Your company is migrating their 30-node Apache Hadoop cluster to the cloud. They want to re-use Hadoop jobs they have already created and minimize the management of the cluster as much as possible. They also want to be able to persist data beyond the life of the cluster. What should...

September 23, 2021 No Comments READ MORE +

Which Cloud Dataflow pipeline configuration setting should you update?

Topic 3, MJTelco Case Study Company Overview MJTelco is a startup that plans to build networks in rapidly growing, underserved markets around the world. The company has patents for innovative optical communications hardware. Based on these patents, they can create many reliable, high-speed backbone links with inexpensive hardware. Company Background...

September 22, 2021 No Comments READ MORE +

Which of these sources can you not load data into BigQuery from?

Which of these sources can you not load data into BigQuery from?A . File uploadB . Google DriveC . Google Cloud StorageD . Google Cloud SQLView AnswerAnswer: D Explanation: You can load data into BigQuery from a file upload, Google Cloud Storage, Google Drive, or Google Cloud Bigtable. It is...

September 22, 2021 No Comments READ MORE +

Which of these statements about exporting data from BigQuery is false?

Which of these statements about exporting data from BigQuery is false?A . To export more than 1 GB of data, you need to put a wildcard in the destination filename.B . The only supported export destination is Google Cloud Storage.C . Data can only be exported in JSON or Avro...

September 22, 2021 No Comments READ MORE +