Which loss function should you use?

Your team needs to build a model that predicts whether images contain a driver's license, passport, or credit card. The data engineering team already built the pipeline and generated a dataset composed of 10,000 images with driver's licenses, 1,000 images with passports, and 1,000 images with credit cards. You now...

September 27, 2024 No Comments READ MORE +

How should you create a dataset following Google-recommended best practices?

You have been asked to develop an input pipeline for an ML training model that processes images from disparate sources at a low latency. You discover that your input data does not fit in memory. How should you create a dataset following Google-recommended best practices?A . Create a tf.data.Dataset.prefetch transformationB...

September 27, 2024 No Comments READ MORE +

What should you do?

You work for an advertising company and want to understand the effectiveness of your company's latest advertising campaign. You have streamed 500 MB of campaign data into BigQuery. You want to query the table, and then manipulate the results of that query with a pandas dataframe in an Al Platform...

September 27, 2024 No Comments READ MORE +

What should you do?

As the lead ML Engineer for your company, you are responsible for building ML models to digitize scanned customer forms. You have developed a TensorFlow model that converts the scanned images into text and stores them in Cloud Storage. You need to use your ML model on the aggregated data...

September 27, 2024 No Comments READ MORE +

What should you do?

You are working on a Neural Network-based project. The dataset provided to you has columns with different ranges. While preparing the data for model training, you discover that gradient optimization is having difficulty moving weights to a good solution. What should you do?A . Use feature construction to combine the...

September 26, 2024 No Comments READ MORE +

What should you do first?

You work on an operations team at an international company that manages a large fleet of on-premises servers located in few data centers around the world. Your team collects monitoring data from the servers, including CPU/memory consumption. When an incident occurs on a server, your team is responsible for fixing...

September 26, 2024 No Comments READ MORE +

Which learning strategy should you recommend to train and deploy this ML model?

You are an ML engineer at a bank that has a mobile application. Management has asked you to build an ML-based biometric authentication for the app that verifies a customer's identity based on their fingerprint. Fingerprints are considered highly sensitive personal information and cannot be downloaded and stored into the...

September 25, 2024 No Comments READ MORE +

How should you build the pipeline on Google Cloud while meeting the speed and processing requirements?

You want to rebuild your ML pipeline for structured data on Google Cloud. You are using PySpark to conduct data transformations at scale, but your pipelines are taking over 12 hours to run. To speed up development and pipeline run time, you want to use a serverless tool and SQL...

September 25, 2024 No Comments READ MORE +

What should you do?

You have been asked to build a model using a dataset that is stored in a medium-sized (~10 GB) BigQuery table. You need to quickly determine whether this data is suitable for model development. You want to create a one-time report that includes both informative visualizations of data distributions and...

September 25, 2024 No Comments READ MORE +

Which data transformation strategy would likely improve the performance of your classifier?

You work for a bank and are building a random forest model for fraud detection. You have a dataset that includes transactions, of which 1% are identified as fraudulent. Which data transformation strategy would likely improve the performance of your classifier?A . Write your data in TFRecords.B . Z-normalize all...

September 24, 2024 No Comments READ MORE +