AWS Data Analytics Quiz - Redshift

Why not prepare for Amazon’s AWS Data Analytics – Specialty certification with these questions & test your knowledge of Amazon Redshift?

Find questions on the following topic(s) –

0 votes, 0 avg

Created on August 18, 2020 By

admin

AWS Certification

AWS Data Analytics – Quiz 3: Redshift

1 / 6

QUESTION

Which of the following are true regarding loading of data into Redshift from DynamoDB?

(Select Two)

A) You cannot use COPY to load data from a DynamoDB table to Redshift

B) The DynamoDB table must be created in the same AWS region as the Redshift Cluster

C) DynamoDB STRING, BINARY and SET data types are supported.

D) COPY maximises throughput by loading data from DynamoDB in parallel across the compute nodes in the Redshift cluster

2 / 6

QUESTION

The data analysis team wish to create new insights on data currently located in S3, Amazon Aurora and Redshift, to be published in a new Business Intelligence application. Which of the following is the most straight-forward solution?

A) Use ‘Federated Queries’ – Redshift will use it’s parallel processing capacity to run queries as needed and distribute part of the required computation into the remote databases.

B) Use ‘Redshift Spectrum’ & Glue – Queries will employ massive parallelism to execute against S3 and Aurora data sets

C) Use ‘DBLink’ – Adds ability to utilise PL/pgSQL user defined functions to Redshift Queries

D) Use DMS to sync data between S3, Aurora and Redshift and Quicksight to produce the dashboard.

3 / 6

QUESTION

Which of the following are best practice when loading data to Redshift?

A) Aggregate data into as few large files as possible prior to COPY for efficient transfer

B)Use a manifest file with COPY to enforce strong consistency

C) Use Multiple concurrent COPY commands to optimise parallelism

D) Load data in sort key order to avoid the need to run VACUUM following COPY

4 / 6

QUESTION

You have created two tables on Redshift which will be used to support data analysis and will be frequently accessed, holding a 1-2GB data set. Often, analytics queries will require joins on these tables. Which style would ensure the appropriate data distribution for these tables?

A) AUTO

B) EVEN

C) KEY

D) ALL

5 / 6

QUESTION

Which of the following are true regarding data merge (‘upsert’) operations and Amazon Redshift?

(Select Two)

A) If a subset of columns are to be updated, but most rows will not be included then i) create a staging table ii) UPDATE the target table explicitly listing columns to be updated

B) If the majority of rows are to be updated, use RMERGE to left join source data

C) If all of the target table columns are to be overwritten, i) use a staging table ii) use a single INSERT command

D) If the majority of rows are to be updated i) create a staging table ii) use UPSERT to populate the target table

6 / 6

QUESTION

At the end of the month, data analyst teams run end of month reporting and ad hoc analysis consisting long, complex queries, creating a spike in read usage. Queries are running slowly. Automatic Workload Management (WLM) and Short Queue Acceleration (SQA) are in place but have not fixed the problem.

Which of the following are the most cost effective and least disruptive means of scaling to meet demand?

(Select Two)

A) Use Elastic Resizing to add compute nodes at the end of the month, remove them at the start of the next month

B) Activate concurrency scaling

C) Use Classic Resize, using snapshot restore to keep the cluster available

D) Use Redshift Spectrum to query data in place on S3

Your score is

The average score is 25%

AWS Data Analytics Quiz – Redshift

No responses yet

Leave a Reply Cancel reply