Amazon Exam DAS-C01 Topic 2 Question 95 Discussion

Actual exam question for Amazon's DAS-C01 exam

Question #: 95
Topic #: 2

A company uses Amazon EC2 instances to receive files from external vendors throughout each day. At the end of each day, the EC2 instances combine the files into a single file, perform gzip compression, and upload the single file to an Amazon S3 bucket. The total size of all the files is approximately 100 GB each day.

When the files are uploaded to Amazon S3, an AWS Batch job runs a COPY command to load the files into an Amazon Redshift cluster.

Which solution will MOST accelerate the COPY process?

AUpload the individual files to Amazon S3. Run the COPY command as soon as the files become available.

BSplit the files so that the number of files is equal to a multiple of the number of slices in the Redshift cluster. Compress and upload the files to Amazon S3. Run the COPY command on the files.

CSplit the files so that each file uses 50% of the free storage on each compute node in the Redshift cluster. Compress and upload the files to Amazon S3. Run the COPY command on the files.

Dpply sharding by breaking up the files so that the DISTKEY columns with the same values go to the same file. Compress and upload the sharded files to Amazon S3. Run the COPY command on the files.

Show Suggested Answer

Suggested Answer: B

by James at Jun 18, 2024, 10:30 PM

Limited Time Offer

25%

Off

Get Premium DAS-C01 Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Farrah

5 months ago

That makes sense. I agree with you.

upvoted 0 times

...

Phung

5 months ago

Wait, are we sure these are all the right answers? I thought there was supposed to be a 'none of the above' option for these tricky AWS questions.

upvoted 0 times

...

Argelia

5 months ago

Because splitting the files to match the number of slices in the Redshift cluster will optimize the COPY process.

upvoted 0 times

...

6 months ago

I was thinking the same thing. Compressing and uploading the files to S3 in a way that aligns with the Redshift architecture is a smart move.

upvoted 0 times

Darrin

5 months ago

B: D) Apply sharding by breaking up the files so that the DISTKEY columns with the same values go to the same file. Compress and upload the sharded files to Amazon S3. Run the COPY command on the files.

upvoted 0 times

...

Myra

5 months ago

A: B) Split the files so that the number of files is equal to a multiple of the number of slices in the Redshift cluster. Compress and upload the files to Amazon S3. Run the COPY command on the files.

upvoted 0 times

...

5 months ago

I agree, option B seems like the most efficient way to accelerate the COPY process.

upvoted 0 times

...