Google Professional Data Engineer Google Certified Professional – Data Engineer Online Training
Google Professional Data Engineer Online Training
The questions for Professional Data Engineer were last updated at Nov 26,2024.
- Exam Code: Professional Data Engineer
- Exam Name: Google Certified Professional – Data Engineer
- Certification Provider: Google
- Latest update: Nov 26,2024
If a dataset contains rows with individual people and columns for year of birth, country, and income, how many of the columns are continuous and how many are categorical?
- A . 1 continuous and 2 categorical
- B . 3 categorical
- C . 3 continuous
- D . 2 continuous and 1 categorical
Which of the following are examples of hyperparameters? (Select 2 answers.)
- A . Number of hidden layers
- B . Number of nodes in each hidden layer
- C . Biases
- D . Weights
Which of the following are feature engineering techniques? (Select 2 answers)
- A . Hidden feature layers
- B . Feature prioritization
- C . Crossed feature columns
- D . Bucketization of a continuous feature
You want to use a BigQuery table as a data sink. In which writing mode(s) can you use BigQuery as a sink?
- A . Both batch and streaming
- B . BigQuery cannot be used as a sink
- C . Only batch
- D . Only streaming
You have a job that you want to cancel. It is a streaming pipeline, and you want to ensure that any data that is in-flight is processed and written to the output.
Which of the following commands can you use on the Dataflow monitoring console to stop the pipeline job?
- A . Cancel
- B . Drain
- C . Stop
- D . Finish
When running a pipeline that has a BigQuery source, on your local machine, you continue to get permission denied errors.
What could be the reason for that?
- A . Your gcloud does not have access to the BigQuery resources
- B . BigQuery cannot be accessed from local machines
- C . You are missing gcloud on your machine
- D . Pipelines cannot be run locally
What Dataflow concept determines when a Window’s contents should be output based on certain criteria being met?
- A . Sessions
- B . OutputCriteria
- C . Windows
- D . Triggers
Which of the following is NOT one of the three main types of triggers that Dataflow supports?
- A . Trigger based on element size in bytes
- B . Trigger that is a combination of other triggers
- C . Trigger based on element count
- D . Trigger based on time
Which Java SDK class can you use to run your Dataflow programs locally?
- A . LocalRunner
- B . DirectPipelineRunner
- C . MachineRunner
- D . LocalPipelineRunner
The Dataflow SDKs have been recently transitioned into which Apache service?
- A . Apache Spark
- B . Apache Hadoop
- C . Apache Kafka
- D . Apache Beam