Which of the following commands can you use on the Dataflow monitoring console to stop the pipeline job?
You have a job that you want to cancel. It is a streaming pipeline, and you want to ensure that any data that is in-flight is processed and written to the output. Which of the following commands can you use on the Dataflow monitoring console to stop the pipeline job?
A...
What should you do?
Your company’s on-premises Apache Hadoop servers are approaching end-of-life, and IT has decided to migrate the cluster to Google Cloud Dataproc. A like-for-like migration of the cluster would require 50 TB of Google Persistent Disk per node. The CIO is concerned about the cost of using that much block storage....
Which Cloud Dataflow / Beam feature should you use to aggregate data in an unbounded data source every hour based on the time when the data entered the pipeline?
Which Cloud Dataflow / Beam feature should you use to aggregate data in an unbounded data source every hour based on the time when the data entered the pipeline?
A. An hourly watermark
B. An event time trigger
C. The withAllowedLateness method
D. A processing time trigger
Answer:...
Which three machine learning applications can you use?
Business owners at your company have given you a database of bank transactions. Each row contains the user ID, transaction type, transaction location, and transaction amount. They ask you to investigate what type of machine learning can be applied to the data. Which three machine learning applications can you use?...
What could be the reason for that?
When running a pipeline that has a BigQuery source on your local machine, you continue to get permission-denied errors. What could be the reason for that?
A. Your gcloud does not have access to the BigQuery resources
B. BigQuery cannot be accessed from local machines
C. You are missing...
Which combination of Google Cloud Platform products should you recommend?
MJTelco is building a custom interface to share data. They have these requirements: They need to do aggregations over their petabyte-scale datasets. They need to scan specific time range rows with a very fast response time (milliseconds). Which combination of Google Cloud Platform products should you recommend?
A. Cloud Datastore...
Does Dataflow process batch data pipelines or streaming data pipelines?
Does Dataflow process batch data pipelines or streaming data pipelines?
A. Only Batch Data Pipelines
B. Both Batch and Streaming Data Pipelines
C. Only Streaming Data Pipelines
D. None of the above
Answer: B
Explanation: Dataflow is a unified processing model, and can execute both streaming and batch data pipelines...
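To make the "unified processing model" idea concrete, here is a minimal plain-Python sketch. It is not the Dataflow or Apache Beam API; the names `count_events`, `bounded_source`, and `unbounded_source` are hypothetical and exist only for illustration. It shows one transformation applied unchanged to a bounded (batch) input and to a generator that stands in for an unbounded (streaming) input.

```python
from collections import Counter
from typing import Iterable, Iterator, List, Tuple


def count_events(records: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Yield a running (key, count) pair for every record seen."""
    counts: Counter = Counter()
    for record in records:
        counts[record] += 1
        yield record, counts[record]


def bounded_source() -> List[str]:
    # Batch stand-in (hypothetical): the whole input is known up front.
    return ["login", "click", "login"]


def unbounded_source() -> Iterator[str]:
    # Streaming stand-in (hypothetical): records arrive one at a time.
    # A real stream would not end; this one is finite so the sketch terminates.
    for record in ["click", "click", "login"]:
        yield record


# The same transformation runs over both kinds of input without changes.
batch_result = list(count_events(bounded_source()))
stream_result = list(count_events(unbounded_source()))
```

The point of the sketch is only that the processing logic does not need to know whether its input is bounded; a real pipeline would express this with the runner's own sources and transforms.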
What should you do?
You work for a manufacturing plant that batches application log files together into a single log file once a day at 2:00 AM. You have written a Google Cloud Dataflow job to process that log file. You need to make sure the log file is processed once per day as...
Which three steps should you take?
Your company handles data processing for a number of different clients. Each client prefers to use their own suite of analytics tools, with some allowing direct query access via Google BigQuery. You need to secure the data so that clients cannot see each other’s data. You want to ensure appropriate...
How should you maintain users’ privacy?
You are working on a sensitive project involving private user data. You have set up a project on Google Cloud Platform to house your work internally. An external consultant is going to assist with coding a complex transformation in a Google Cloud Dataflow pipeline for your project. How should you...