QUESTION
A data scientist is working on a machine learning model to improve relevance in search results. As a benchmark, the scientist is using SKLearn’s TF-IDF implementation.
However, at first glance, the tf-idf values don't look as expected. Which of the following best explains why the values might differ from the scientist's initial calculations? And, given the following corpus, what are the dimensions of the tf-idf matrix if only bigrams are selected?
Document 1: product managers know about machine learning
Document 2: machine learning, a product owner essential
(Choose One)
Sklearn's TfidfVectorizer does not use the textbook idf(t): both the default smoothed idf and the unsmoothed idf add 1 to the logarithm, and L2 normalisation is applied to each row by default. The code example below shows the defaults; if you expect idf to be the usual ln(number of docs / docs containing the term), you would be expecting some zeros, but sklearn produces none.
There are 2 documents (sentences) and 8 possible bigrams (listed below), with 'a' excluded because it is shorter than the default 2-character token minimum (which is why 'learning product' appears as a bigram instead of 'learning a' and 'a product'). So the resulting tf-idf matrix is 2 × 8.
https://scikit-learn.org/stable/modules/feature_extraction.html#text-feature-extraction
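For reference, the two idf variants sklearn actually uses (documented at the link above) can be written as small helpers. This is a minimal sketch where n is the number of documents and df is the number of documents containing the term; the function names are purely illustrative:
import math

def idf_smoothed(n, df):    # default behaviour, smooth_idf=True
    return math.log((1 + n) / (1 + df)) + 1

def idf_unsmoothed(n, df):  # smooth_idf=False
    return math.log(n / df) + 1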
Example Code –
from sklearn.feature_extraction.text import TfidfVectorizer

corpus = ["product managers know about machine learning",
          "machine learning, a product owner essential"]

vectorizer_A = TfidfVectorizer(ngram_range=(2, 2))
vectorizer_B = TfidfVectorizer(ngram_range=(2, 2), smooth_idf=False, norm=None)

matrix_A = vectorizer_A.fit_transform(corpus)
matrix_B = vectorizer_B.fit_transform(corpus)

print("**list of bigrams**")
for i, feature in enumerate(vectorizer_A.get_feature_names_out()):  # get_feature_names() in older sklearn
    print(i, feature)

print("**bigram tf-idf with default values**")
print(matrix_A)
print("**bigram tf-idf with no smoothing or normalisation**")
print(matrix_B)
**list of bigrams**
0 about machine
1 know about
2 learning product
3 machine learning
4 managers know
5 owner essential
6 product managers
7 product owner
**bigram tf-idf with default values**
(0, 3) 0.33517574332792605
(0, 0) 0.47107781233161794
(0, 1) 0.47107781233161794
(0, 4) 0.47107781233161794
(0, 6) 0.47107781233161794
(1, 5) 0.534046329052269
(1, 7) 0.534046329052269
(1, 2) 0.534046329052269
(1, 3) 0.37997836159100784
**bigram tf-idf with no smoothing or normalisation**
(0, 3) 1.0
(0, 0) 1.6931471805599454
(0, 1) 1.6931471805599454
(0, 4) 1.6931471805599454
(0, 6) 1.6931471805599454
(1, 5) 1.6931471805599454
(1, 7) 1.6931471805599454
(1, 2) 1.6931471805599454
(1, 3) 1.0
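As a quick sanity check (a minimal sketch using only the standard library), the unsmoothed values above are ln(2/1) + 1 ≈ 1.6931 for bigrams that appear in one document and ln(2/2) + 1 = 1.0 for machine learning, which appears in both; the default output applies the smoothed idf and then L2-normalises each row:
import math

n = 2  # number of documents in the corpus

# smoothed idf values (the TfidfVectorizer defaults)
idf_rare = math.log((1 + n) / (1 + 1)) + 1    # bigram in one document, ~1.4055
idf_common = math.log((1 + n) / (1 + 2)) + 1  # 'machine learning' in both, = 1.0

# document 0 has 5 bigrams: 4 unique to it plus 'machine learning'; L2-normalise the row
norm = math.sqrt(4 * idf_rare ** 2 + idf_common ** 2)
print(idf_rare / norm)    # ~0.4711, matches the default output above
print(idf_common / norm)  # ~0.3352

# unsmoothed, unnormalised values (vectorizer_B)
print(math.log(n / 1) + 1)  # ~1.6931
print(math.log(n / 2) + 1)  # = 1.0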
3 Responses
How can you use mean for imputation if the variable is categorical?
For this question…
A data scientist is working with a small dataset of 1000 rows and 6 features (labelled f1, f2 … f6)
Feature f2 is categorical and has about 80 entries with no value set. The data is going to be used to create a simple regression model.
How should the missing values be handled?
Drop rows with missing data
Use the mean for this feature for missing values
Use KNN to determine average values to replace missing values
Use deep learning to impute the missing values
Hello Sanket,
Many thanks for taking the time to provide feedback. Yes, you’re right and I’ve corrected the question.
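For completeness: since f2 is categorical, the mean is undefined, and the usual simple fix is most-frequent (mode) imputation. A minimal sketch with sklearn's SimpleImputer, using made-up column values purely for illustration:
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer

# toy frame standing in for the real data; f2 is categorical with missing entries
df = pd.DataFrame({
    "f1": [1.0, 2.0, 3.0, 4.0],
    "f2": ["red", np.nan, "blue", "red"],
})

# most-frequent (mode) imputation is valid for categorical columns,
# unlike the mean, which is undefined for categories
imputer = SimpleImputer(strategy="most_frequent")
df["f2"] = imputer.fit_transform(df[["f2"]]).ravel()
print(df)  # the missing entry becomes "red"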