What should you do?
Flowlogistic’s CEO wants to gain rapid insight into their customer base so his sales team can be better informed in the field. This team is not very technical, so they’ve purchased a visualization tool to simplify the creation of BigQuery reports. However, they’ve been overwhelmed by all the data in...
How should you build this pipeline?
An external customer provides you with a daily dump of data from their database. The data flows into Google Cloud Storage (GCS) as comma-separated values (CSV) files. You want to analyze this data in Google BigQuery, but the data could have rows that are formatted incorrectly or corrupted. How should...
What should you do?
Your company is performing data preprocessing for a learning algorithm in Google Cloud Dataflow. Numerous data logs are being generated during this step, and the team wants to analyze them. Due to the dynamic nature of the campaign, the data is growing exponentially every hour. The data scientists...
Which of the following are feature engineering techniques? (Select 2 answers)
A. Hidden feature layers
B. Feature prioritization
C. Crossed feature columns
D. Bucketization of a continuous feature
Answer: C, D
Explanation: Selecting and crafting the right set of feature columns is key to learning an effective model. Bucketization is a...
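The two techniques named in the answer can be illustrated with a minimal, library-free sketch. This is not the TensorFlow feature-column API; the helper names `bucketize` and `cross`, the age boundaries, and the hash bucket size are all assumptions chosen for illustration.

```python
from bisect import bisect_right
import zlib

# Hypothetical boundaries for an "age" feature (illustrative values only).
AGE_BOUNDARIES = [18, 25, 30, 35, 40, 45, 50, 55, 60, 65]


def bucketize(value, boundaries):
    """Map a continuous value to a bucket index.

    Bucket 0 holds values below boundaries[0]; bucket i holds values in
    [boundaries[i-1], boundaries[i]); the last bucket holds everything
    at or above the final boundary.
    """
    return bisect_right(boundaries, value)


def cross(values, hash_bucket_size):
    """Combine several categorical values into one crossed feature id.

    The values are joined into a single key and hashed into a fixed
    number of buckets, so each combination gets its own (possibly
    colliding) id.
    """
    key = "_X_".join(str(v) for v in values)
    return zlib.crc32(key.encode("utf-8")) % hash_bucket_size


age_bucket = bucketize(33, AGE_BOUNDARIES)            # falls in [30, 35)
crossed_id = cross(["Bachelors", "Engineer"], 1000)   # one id per combination
print(age_bucket, crossed_id)
```

Bucketization lets a linear model learn a separate weight per range instead of a single slope, and a cross lets it learn a weight for a specific combination of values rather than for each value alone.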
How can you adjust your application design?
You need to store and analyze social media postings in Google BigQuery at a rate of 10,000 messages per minute in near real-time. You initially design the application to use streaming inserts for individual postings. Your application also performs data aggregations right after the streaming inserts. You discover that the queries...
What should you do?
You create an important report for your large team in Google Data Studio 360. The report uses Google BigQuery as its data source. You notice that visualizations are not showing data that is less than 1 hour old. What should you do?
A. Disable caching by editing the report settings.
B...
Which Cloud Dataflow pipeline configuration setting should you update?
Topic 3, MJTelco Case Study
Company Overview
MJTelco is a startup that plans to build networks in rapidly growing, underserved markets around the world. The company has patents for innovative optical communications hardware. Based on these patents, they can create many reliable, high-speed backbone links with inexpensive hardware.
Company Background...
What should you do?
Your company has hired a new data scientist who wants to perform complicated analyses across very large datasets stored in Google Cloud Storage and in a Cassandra cluster on Google Compute Engine. The scientist primarily wants to create labelled data sets for machine learning projects, along with some visualization tasks....
To run a TensorFlow training job on your own computer using Cloud Machine Learning Engine, what would your command start with?
A. gcloud ml-engine local train
B. gcloud ml-engine jobs submit training
C. gcloud ml-engine jobs submit training local
D. You can't run a TensorFlow program on your own computer...
The _________ for Cloud Bigtable makes it possible to use Cloud Bigtable in a Cloud Dataflow pipeline.
A. Cloud Dataflow connector
B. Dataflow SDK
C. BigQuery API
D. BigQuery Data Transfer Service
Answer: A
Explanation: The Cloud Dataflow connector for Cloud Bigtable makes it possible to use Cloud Bigtable in...