Google Professional Machine Learning Engineer Google Professional Machine Learning Engineer Online Training
Google Professional Machine Learning Engineer Online Training
The questions for Professional Machine Learning Engineer were last updated at Feb 19,2025.
- Exam Code: Professional Machine Learning Engineer
- Exam Name: Google Professional Machine Learning Engineer
- Certification Provider: Google
- Latest update: Feb 19,2025
You are creating a deep neural network classification model using a dataset with categorical input values. Certain columns have a cardinality greater than 10,000 unique values.
How should you encode these categorical values as input into the model?
- A . Convert each categorical value into an integer value.
- B . Convert the categorical string data to one-hot hash buckets.
- C . Map the categorical variables into a vector of boolean values.
- D . Convert each categorical value into a run-length encoded string.
You need to train a natural language model to perform text classification on product descriptions that contain millions of examples and 100,000 unique words. You want to preprocess the words individually so that they can be fed into a recurrent neural network.
What should you do?
- A . Create a hot-encoding of words, and feed the encodings into your model.
- B . Identify word embeddings from a pre-trained model, and use the embeddings in your model.
- C . Sort the words by frequency of occurrence, and use the frequencies as the encodings in your model.
- D . Assign a numerical value to each word from 1 to 100,000 and feed the values as inputs in your model.
Your data science team has requested a system that supports scheduled model retraining, Docker containers, and a service that supports autoscaling and monitoring for online prediction requests.
Which platform components should you choose for this system?
- A . Vertex AI Pipelines and App Engine
- B . Vertex AI Pipelines, Vertex AI Prediction, and Vertex AI Model Monitoring
- C . Cloud Composer, BigQuery ML, and Vertex AI Prediction
- D . Cloud Composer, Vertex AI Training with custom containers, and App Engine
You are profiling the performance of your TensorFlow model training time and notice a performance issue caused by inefficiencies in the input data pipeline for a single 5 terabyte CSV file dataset on Cloud Storage. You need to optimize the input pipeline performance.
Which action should you try first to increase the efficiency of your pipeline?
- A . Preprocess the input CSV file into a TFRecord file.
- B . Randomly select a 10 gigabyte subset of the data to train your model.
- C . Split into multiple CSV files and use a parallel interleave transformation.
- D . Set the reshuffle_each_iteration parameter to true in the tf.data.Dataset.shuffle method.
You are profiling the performance of your TensorFlow model training time and notice a performance issue caused by inefficiencies in the input data pipeline for a single 5 terabyte CSV file dataset on Cloud Storage. You need to optimize the input pipeline performance.
Which action should you try first to increase the efficiency of your pipeline?
- A . Preprocess the input CSV file into a TFRecord file.
- B . Randomly select a 10 gigabyte subset of the data to train your model.
- C . Split into multiple CSV files and use a parallel interleave transformation.
- D . Set the reshuffle_each_iteration parameter to true in the tf.data.Dataset.shuffle method.
You are profiling the performance of your TensorFlow model training time and notice a performance issue caused by inefficiencies in the input data pipeline for a single 5 terabyte CSV file dataset on Cloud Storage. You need to optimize the input pipeline performance.
Which action should you try first to increase the efficiency of your pipeline?
- A . Preprocess the input CSV file into a TFRecord file.
- B . Randomly select a 10 gigabyte subset of the data to train your model.
- C . Split into multiple CSV files and use a parallel interleave transformation.
- D . Set the reshuffle_each_iteration parameter to true in the tf.data.Dataset.shuffle method.
You are profiling the performance of your TensorFlow model training time and notice a performance issue caused by inefficiencies in the input data pipeline for a single 5 terabyte CSV file dataset on Cloud Storage. You need to optimize the input pipeline performance.
Which action should you try first to increase the efficiency of your pipeline?
- A . Preprocess the input CSV file into a TFRecord file.
- B . Randomly select a 10 gigabyte subset of the data to train your model.
- C . Split into multiple CSV files and use a parallel interleave transformation.
- D . Set the reshuffle_each_iteration parameter to true in the tf.data.Dataset.shuffle method.
You are profiling the performance of your TensorFlow model training time and notice a performance issue caused by inefficiencies in the input data pipeline for a single 5 terabyte CSV file dataset on Cloud Storage. You need to optimize the input pipeline performance.
Which action should you try first to increase the efficiency of your pipeline?
- A . Preprocess the input CSV file into a TFRecord file.
- B . Randomly select a 10 gigabyte subset of the data to train your model.
- C . Split into multiple CSV files and use a parallel interleave transformation.
- D . Set the reshuffle_each_iteration parameter to true in the tf.data.Dataset.shuffle method.
You are profiling the performance of your TensorFlow model training time and notice a performance issue caused by inefficiencies in the input data pipeline for a single 5 terabyte CSV file dataset on Cloud Storage. You need to optimize the input pipeline performance.
Which action should you try first to increase the efficiency of your pipeline?
- A . Preprocess the input CSV file into a TFRecord file.
- B . Randomly select a 10 gigabyte subset of the data to train your model.
- C . Split into multiple CSV files and use a parallel interleave transformation.
- D . Set the reshuffle_each_iteration parameter to true in the tf.data.Dataset.shuffle method.
You are profiling the performance of your TensorFlow model training time and notice a performance issue caused by inefficiencies in the input data pipeline for a single 5 terabyte CSV file dataset on Cloud Storage. You need to optimize the input pipeline performance.
Which action should you try first to increase the efficiency of your pipeline?
- A . Preprocess the input CSV file into a TFRecord file.
- B . Randomly select a 10 gigabyte subset of the data to train your model.
- C . Split into multiple CSV files and use a parallel interleave transformation.
- D . Set the reshuffle_each_iteration parameter to true in the tf.data.Dataset.shuffle method.