Google Professional Data Engineer Google Certified Professional – Data Engineer Online Training
Google Professional Data Engineer Online Training
The questions for Professional Data Engineer were last updated at Nov 26,2024.
- Exam Code: Professional Data Engineer
- Exam Name: Google Certified Professional – Data Engineer
- Certification Provider: Google
- Latest update: Nov 26,2024
Which of the following statements about Legacy SQL and Standard SQL is not true?
- A . Standard SQL is the preferred query language for BigQuery.
- B . If you write a query in Legacy SQL, it might generate an error if you try to run it with Standard SQL.
- C . One difference between the two query languages is how you specify fully-qualified table names (i.e. table names that include their associated project name).
- D . You need to set a query language for each dataset and the default is Standard SQL.
How would you query specific partitions in a BigQuery table?
- A . Use the DAY column in the WHERE clause
- B . Use the EXTRACT(DAY) clause
- C . Use the __PARTITIONTIME pseudo-column in the WHERE clause
- D . Use DATE BETWEEN in the WHERE clause
Which SQL keyword can be used to reduce the number of columns processed by BigQuery?
- A . BETWEEN
- B . WHERE
- C . SELECT
- D . LIMIT
To give a user read permission for only the first three columns of a table, which access control method would you use?
- A . Primitive role
- B . Predefined role
- C . Authorized view
- D . It’s not possible to give access to only the first three columns of a table.
What are two methods that can be used to denormalize tables in BigQuery?
- A . 1) Split table into multiple tables; 2) Use a partitioned table
- B . 1) Join tables into one table; 2) Use nested repeated fields
- C . 1) Use a partitioned table; 2) Join tables into one table
- D . 1) Use nested repeated fields; 2) Use a partitioned table
Which of these is not a supported method of putting data into a partitioned table?
- A . If you have existing data in a separate file for each day, then create a partitioned table and upload each file into the appropriate partition.
- B . Run a query to get the records for a specific day from an existing table and for the destination table, specify a partitioned table ending with the day in the format "$YYYYMMDD".
- C . Create a partitioned table and stream new records to it every day.
- D . Use ORDER BY to put a table’s rows into chronological order and then change the table’s type to "Partitioned".
Which of these operations can you perform from the BigQuery Web UI?
- A . Upload a file in SQL format.
- B . Load data with nested and repeated fields.
- C . Upload a 20 MB file.
- D . Upload multiple files using a wildcard.
Which methods can be used to reduce the number of rows processed by BigQuery?
- A . Splitting tables into multiple tables; putting data in partitions
- B . Splitting tables into multiple tables; putting data in partitions; using the LIMIT clause
- C . Putting data in partitions; using the LIMIT clause
- D . Splitting tables into multiple tables; using the LIMIT clause
Why do you need to split a machine learning dataset into training data and test data?
- A . So you can try two different sets of features
- B . To make sure your model is generalized for more than just the training data
- C . To allow you to create unit tests in your code
- D . So you can use one dataset for a wide model and one for a deep model
Which of these numbers are adjusted by a neural network as it learns from a training dataset (select
2 answers)?
- A . Weights
- B . Biases
- C . Continuous features
- D . Input values