Google Exam Professional Machine Learning Engineer Topic 6 Question 90 Discussion

Actual exam question for Google's Professional Machine Learning Engineer exam

Question #: 90
Topic #: 6

[All Professional Machine Learning Engineer Questions]

You work at a gaming startup that has several terabytes of structured data in Cloud Storage. This data includes gameplay time data, user metadata, and game metadat

a. You want to build a model that recommends new games to users that requires the least amount of coding. What should you do?

ALoad the data in BigQuery. Use BigQuery ML to train an Autoencoder model.

BLoad the data in BigQuery. Use BigQuery ML to train a matrix factorization model.

CRead data to a Vertex Al Workbench notebook. Use TensorFlow to train a two-tower model.

DRead data to a Vertex Al Workbench notebook. Use TensorFlow to train a matrix factorization model.

Show Suggested Answer

Suggested Answer: B

The best option to build a game recommendation model with the least amount of coding is to use BigQuery ML, which allows you to create and execute machine learning models using standard SQL queries. BigQuery ML supports several types of models, including matrix factorization, which is a common technique for collaborative filtering-based recommendation systems. Matrix factorization models learn latent factors for users and items from the observed ratings, and then use them to predict the ratings for new user-item pairs. BigQuery ML provides a built-in function calledML.RECOMMENDthat can generate recommendations for a given user based on a trained matrix factorization model. To use BigQuery ML, you need to load the data in BigQuery, which is a serverless, scalable, and cost-effective data warehouse. You can use thebqcommand-line tool, the BigQuery API, or the Cloud Console to load data from Cloud Storage to BigQuery. Alternatively, you can use federated queries to query data directly from Cloud Storage without loading it to BigQuery, but this may incur additional costs and performance overhead. Option A is incorrect because BigQuery ML does not support Autoencoder models, which are a type of neural network that can learn compressed representations of the input data. Autoencoder models are not suitable for recommendation systems, as they do not capture the interactions between users and items. Option C is incorrect because using TensorFlow to train a two-tower model requires more coding than using BigQuery ML. A two-tower model is a type of neural network that learns embeddings for users and items separately, and then combines them with a dot product or a cosine similarity to compute the rating. TensorFlow is a low-level framework that requires you to define the model architecture, the loss function, the optimizer, the training loop, and the evaluation metrics. Moreover, you need to read the data from Cloud Storage to a Vertex AI Workbench notebook, which is an instance of JupyterLab that runs on a Google Cloud virtual machine. This may involve additional steps such as authentication, authorization, and data preprocessing. Option D is incorrect because using TensorFlow to train a matrix factorization model also requires more coding than using BigQuery ML. Although TensorFlow provides some high-level APIs such as Keras and TensorFlow Recommenders that can simplify the model development, you still need to handle the data loading and the model training and evaluation yourself. Furthermore, you need to read the data from Cloud Storage to a Vertex AI Workbench notebook, which may incur additional complexity and costs.Reference:

BigQuery ML documentation

Using matrix factorization with BigQuery ML

Recommendations AI documentation

Loading data into BigQuery

Querying data in Cloud Storage from BigQuery

Vertex AI Workbench documentation

TensorFlow documentation

TensorFlow Recommenders documentation

by Josphine at Sep 18, 2024, 04:37 PM

Limited Time Offer

25%

Off

Get Premium Professional Machine Learning Engineer Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Amira

2 months ago

I'm not sure, but I think option B could also work well since matrix factorization models are commonly used for recommendation systems.

upvoted 0 times

...

3 months ago

I'm surprised option A isn't even considered. Autoencoder models are great for extracting hidden features from complex data. But I guess BigQuery ML limits the model choices.

upvoted 0 times

Margot

2 months ago

C) Read data to a Vertex AI Workbench notebook. Use TensorFlow to train a two-tower model.

upvoted 0 times

...

Gaston

2 months ago

B) Load the data in BigQuery. Use BigQuery ML to train a matrix factorization model.

upvoted 0 times

...

Quentin

2 months ago

A) Load the data in BigQuery. Use BigQuery ML to train an Autoencoder model.

upvoted 0 times

...

Kristine

3 months ago

I disagree, I believe option C is better as using TensorFlow to train a two-tower model can provide more accurate recommendations.

upvoted 0 times

...

3 months ago

I think option A is the best choice because BigQuery ML can train an Autoencoder model with minimal coding.

upvoted 0 times

...