Latest Google Professional-Machine-Learning-Engineer Exam questions and answers [Q65-Q82]

Rate this post

Latest Google Professional-Machine-Learning-Engineer Exam questions and answers

Test4Engine Professional-Machine-Learning-Engineer Exam Practice Test Questions (Updated 142 Questions)

How much Professional Machine Learning Engineer – Google Cost

The cost of the Professional Machine Learning Engineer – Google is $200. For more information related to exam price, please visit the official website Google Website as the cost of exams may be subjected to vary county-wise.

Understanding functional and technical aspects of Professional Machine Learning Engineer – Google Data Preparation and Processing

The following will be discussed in Google Professional-Machine-Learning-Engineer exam dumps:

  • Encoding structured data types
  • Data exploration (EDA)
  • Class imbalance
  • Feature selection
  • Feature crosses
  • Managing large samples (TFRecords)
  • Data ingestion
  • Data validation
  • Database migration
  • Evaluation of data quality and feasibility
  • Monitoring/changing deployed pipelines
  • Design data pipelines
  • Handling missing data
  • Ingestion of various file types (e.g. Csv, json, img, parquet or databases, Hadoop/Spark)
  • Streaming data (e.g. from IoT devices)
  • Visualization
  • Feature engineering
  • Handling outliers
  • Statistical fundamentals at scale
  • Data leakage and augmentation

 

Q65. A gaming company has launched an online game where people can start playing for free, but they need to pay if they choose to use certain features. The company needs to build an automated system to predict whether or not a new user will become a paid user within 1 year. The company has gathered a labeled dataset from 1 million users.
The training dataset consists of 1,000 positive samples (from users who ended up paying within 1 year) and
999,000 negative samples (from users who did not use any paid features). Each data sample consists of 200 features including user age, device, location, and play patterns.
Using this dataset for training, the Data Science team trained a random forest model that converged with over
99% accuracy on the training set. However, the prediction results on a test dataset were not satisfactory Which of the following approaches should the Data Science team take to mitigate this issue? (Choose two.)

 
 
 
 
 

Q66. You are an ML engineer responsible for designing and implementing training pipelines for ML models. You need to create an end-to-end training pipeline for a TensorFlow model. The TensorFlow model will be trained on several terabytes of structured dat a. You need the pipeline to include data quality checks before training and model quality checks after training but prior to deployment. You want to minimize development time and the need for infrastructure maintenance. How should you build and orchestrate your training pipeline?

 
 
 
 

Q67. You recently developed a deep learning model using Keras, and now you are experimenting with different training strategies. First, you trained the model using a single GPU, but the training process was too slow. Next, you distributed the training across 4 GPUs using tf.distribute.MirroredStrategy (with no other changes), but you did not observe a decrease in training time. What should you do?

 
 
 
 

Q68. A data scientist wants to use Amazon Forecast to build a forecasting model for inventory demand for a retail company. The company has provided a dataset of historic inventory demand for its products as a .csv file stored in an Amazon S3 bucket. The table below shows a sample of the dataset.

How should the data scientist transform the data?

 
 
 
 

Q69. You manage a team of data scientists who use a cloud-based backend system to submit training jobs. This system has become very difficult to administer, and you want to use a managed service instead. The data scientists you work with use many different frameworks, including Keras, PyTorch, theano, scikit-learn, and custom libraries. What should you do?

 
 
 
 

Q70. Your team is working on an NLP research project to predict political affiliation of authors based on articles they have written. You have a large training dataset that is structured like this:

A)

B)

C)

D)

 
 
 
 

Q71. An online reseller has a large, multi-column dataset with one column missing 30% of its data. A Machine Learning Specialist believes that certain columns in the dataset could be used to reconstruct the missing data.
Which reconstruction approach should the Specialist use to preserve the integrity of the dataset?

 
 
 
 

Q72. You are building a real-time prediction engine that streams files which may contain Personally Identifiable Information (Pll) to Google Cloud. You want to use the Cloud Data Loss Prevention (DLP) API to scan the files. How should you ensure that the Pll is not accessible by unauthorized individuals?

 
 
 
 
 

Q73. You are developing ML models with Al Platform for image segmentation on CT scans. You frequently update your model architectures based on the newest available research papers, and have to rerun training on the same dataset to benchmark their performance. You want to minimize computation costs and manual intervention while having version control for your code. What should you do?

 
 
 
 

Q74. You work for a global footwear retailer and need to predict when an item will be out of stock based on historical inventory dat a. Customer behavior is highly dynamic since footwear demand is influenced by many different factors. You want to serve models that are trained on all available data, but track your performance on specific subsets of data before pushing to production. What is the most streamlined and reliable way to perform this validation?

 
 
 
 

Q75. A Machine Learning Specialist working for an online fashion company wants to build a data ingestion solution for the company’s Amazon S3-based data lake.
The Specialist wants to create a set of ingestion mechanisms that will enable future capabilities comprised of:
* Real-time analytics
* Interactive analytics of historical data
* Clickstream analytics
* Product recommendations
Which services should the Specialist use?

 
 
 
 

Q76. You need to design a customized deep neural network in Keras that will predict customer purchases based on their purchase history. You want to explore model performance using multiple model architectures, store training data, and be able to compare the evaluation metrics in the same dashboard. What should you do?

 
 
 
 

Q77. A Machine Learning team runs its own training algorithm on Amazon SageMaker. The training algorithm requires external assets. The team needs to submit both its own algorithm code and algorithm-specific parameters to Amazon SageMaker.
What combination of services should the team use to build a custom algorithm in Amazon SageMaker?
(Choose two.)

 
 
 
 
 

Q78. A Machine Learning Specialist is developing a custom video recommendation model for an application. The dataset used to train this model is very large with millions of data points and is hosted in an Amazon S3 bucket.
The Specialist wants to avoid loading all of this data onto an Amazon SageMaker notebook instance because it would take hours to move and will exceed the attached 5 GB Amazon EBS volume on the notebook instance.
Which approach allows the Specialist to use all the data to train the model?

 
 
 
 

Q79. You work for a magazine publisher and have been tasked with predicting whether customers will cancel their annual subscription. In your exploratory data analysis, you find that 90% of individuals renew their subscription every year, and only 10% of individuals cancel their subscription. After training a NN Classifier, your model predicts those who cancel their subscription with 99% accuracy and predicts those who renew their subscription with 82% accuracy. How should you interpret these results?

 
 
 
 

Q80. You work for a large hotel chain and have been asked to assist the marketing team in gathering predictions for a targeted marketing strategy. You need to make predictions about user lifetime value (LTV) over the next 30 days so that marketing can be adjusted accordingly. The customer dataset is in BigQuery, and you are preparing the tabular data for training with AutoML Tables. This data has a time signal that is spread across multiple columns. How should you ensure that AutoML fits the best model to your data?

 
 
 
 

Q81. A logistics company needs a forecast model to predict next month’s inventory requirements for a single item in
10 warehouses. A machine learning specialist uses Amazon Forecast to develop a forecast model from 3 years of monthly data. There is no missing data. The specialist selects the DeepAR+ algorithm to train a predictor. The predictor means absolute percentage error (MAPE) is much larger than the MAPE produced by the current human forecasters.
Which changes to the CreatePredictor API call could improve the MAPE? (Choose two.)

 
 
 
 
 

Q82. During batch training of a neural network, you notice that there is an oscillation in the loss. How should you adjust your model to ensure that it converges?

 
 
 
 

Pass Your Google Exam with Professional-Machine-Learning-Engineer Exam Dumps: https://www.test4engine.com/Professional-Machine-Learning-Engineer_exam-latest-braindumps.html

         

Leave a Reply

Your email address will not be published. Required fields are marked *

Enter the text from the image below