What should you recommend they do?
Topic 4, Main Questions Set B. Your company has recently grown rapidly and is now ingesting data at a significantly higher rate than before. You manage the daily batch MapReduce analytics jobs in Apache Hadoop. However, the recent increase in data has meant the batch jobs are falling behind....
Which table name will make the SQL statement work correctly?
Your company is using wildcard tables to query data across multiple tables with similar names. The SQL statement is currently failing with the following error: # Syntax error: Expected end of statement but got "-" at [4:11] SELECT age FROM bigquery-public-data.noaa_gsod.gsod WHERE age != 99 AND _TABLE_SUFFIX = '1929' ORDER BY...
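The error position [4:11] points at a hyphen, which suggests the hyphenated project name is being parsed as an expression because the table reference is unquoted. The following is a minimal sketch, assuming the fix is to wrap the wildcard table name (prefix plus `*`) in backticks. The helper name `wildcard_query` is hypothetical, the column names are copied from the excerpt rather than checked against the dataset, and the truncated ORDER BY clause is left out.

```python
def wildcard_query(table_prefix: str, suffix: str) -> str:
    """Build a standard SQL query over a BigQuery wildcard table.

    Hypothetical helper for illustration only. The wildcard table name
    (prefix plus '*') is wrapped in backticks so the hyphens in the
    project ID are not parsed as minus signs.
    """
    table = f"`{table_prefix}*`"
    # Plain string formatting is used here only to show the quoting;
    # real code should use query parameters for user-supplied values.
    return (
        "SELECT age\n"
        f"FROM {table}\n"
        "WHERE age != 99\n"
        f"  AND _TABLE_SUFFIX = '{suffix}'"
    )


query = wildcard_query("bigquery-public-data.noaa_gsod.gsod", "1929")
print(query)
```

The sketch only builds the query text; it does not contact BigQuery.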
Which combination of GCP products should you choose?
Flowlogistic’s management has determined that the current Apache Kafka servers cannot handle the data volume for their real-time inventory tracking system. You need to build a new system on Google Cloud Platform (GCP) that will feed the proprietary tracking software. The system must be able to ingest data from a...
What should you do?
You have spent a few days loading data from comma-separated values (CSV) files into the Google BigQuery table CLICK_STREAM. The column DT stores the epoch time of click events. For convenience, you chose a simple schema where every field is treated as the STRING type. Now, you want to compute...
Which three databases meet your requirements?
You are choosing a NoSQL database to handle telemetry data submitted from millions of Internet-of-Things (IoT) devices. The volume of data is growing at 100 TB per year, and each data entry has about 100 attributes. The data processing pipeline does not require atomicity, consistency, isolation, and durability (ACID). However,...
By default, which of the following windowing behaviors does Dataflow apply to unbounded data sets?
A. Windows at every 100 MB of data
B. Single, Global Window
C. Windows at every 1 minute
D. Windows at every 10 minutes
Answer: B
Explanation: Dataflow's default windowing behavior is to assign all...
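To make the difference between a single global window and time-based windows concrete, here is a plain-Python simulation. It is a sketch only and does not use the Dataflow SDK; the function names `assign_global` and `assign_fixed` and the sample events are hypothetical.

```python
from collections import defaultdict

GLOBAL_WINDOW = "global"


def assign_global(events):
    """Put every (timestamp, value) pair into one global window."""
    windows = defaultdict(list)
    for _ts, value in events:
        windows[GLOBAL_WINDOW].append(value)
    return dict(windows)


def assign_fixed(events, size_seconds):
    """Group events into fixed windows keyed by window start time."""
    windows = defaultdict(list)
    for ts, value in events:
        start = ts - (ts % size_seconds)
        windows[start].append(value)
    return dict(windows)


# (timestamp in seconds, payload) pairs; made-up sample data.
events = [(5, "a"), (42, "b"), (61, "c"), (130, "d")]
global_result = assign_global(events)
fixed_result = assign_fixed(events, 60)
print(global_result)
print(fixed_result)
```

With the global assignment every element lands in the same window regardless of timestamp, whereas the fixed 60-second assignment splits the same events into three groups.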
What should you do first?
Your startup has never implemented a formal security policy. Currently, everyone in the company has access to the datasets stored in Google BigQuery. Teams have freedom to use the service as they see fit, and they have not documented their use cases. You have been asked to secure the data...
What should you do?
You create an important report for your large team in Google Data Studio 360. The report uses Google BigQuery as its data source. You notice that visualizations are not showing data that is less than 1 hour old. What should you do?
A. Disable caching by editing the report settings.
B...
Which three approaches can you take?
Your company is in a highly regulated industry. One of your requirements is to ensure individual users have access only to the minimum amount of information required to do their jobs. You want to enforce this requirement with Google BigQuery. Which three approaches can you take? (Choose three.)
A. Disable...
Which Java SDK class can you use to run your Dataflow programs locally?
A. LocalRunner
B. DirectPipelineRunner
C. MachineRunner
D. LocalPipelineRunner
Answer: B
Explanation: DirectPipelineRunner allows you to execute operations in the pipeline directly, without any optimization. It is useful for small local executions and tests.
Reference: https://cloud.google.com/dataflow/java-sdk/JavaDoc/com/google/cloud/dataflow/sdk/runners/DirectPipelineRunner