A data engineer is optimizing query performance in Amazon Athena notebooks that use Apache Spark...

Amazon Web Services Data-Engineer-Associate Question Answer

A data engineer is optimizing query performance in Amazon Athena notebooks that use Apache Spark to analyze large datasets that are stored in Amazon S3. The data is partitioned. An AWS Glue crawler updates the partitions.

The data engineer wants to minimize the amount of data that is scanned to improve the efficiency of Athena queries.

Which solution will meet these requirements?

Apply partition filters in the queries.

Increase the frequency of AWS Glue crawler invocations to update the AWS Glue Data Catalog more often.

Organize the data that is in Amazon S3 by using a nested directory structure.

Configure Spark to use in-memory caching for frequently accessed data.

The Answer Is: Apply partition filters in the queries.
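
Filtering on partition columns lets the query engine skip every partition that cannot match, so only the objects under the matching S3 prefixes are read. The sketch below is a simplified, pure-Python model of that pruning step, not how Athena or Spark implement it internally. The `sales/year=.../month=...` layout, the object sizes, and the helper names `partition_values` and `bytes_scanned` are all assumptions made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class S3Object:
    key: str         # hypothetical Hive-style key, e.g. "sales/year=2024/month=01/part-0.parquet"
    size_bytes: int  # made-up object size used only for the comparison


def partition_values(key):
    """Extract Hive-style partition columns (name=value path segments) from a key."""
    values = {}
    for segment in key.split("/")[:-1]:  # skip the file name itself
        if "=" in segment:
            name, value = segment.split("=", 1)
            values[name] = value
    return values


def bytes_scanned(objects, partition_filter=None):
    """Sum the sizes of objects whose partition values match every filter entry.

    With no filter, every object is read (a full scan).
    """
    partition_filter = partition_filter or {}
    total = 0
    for obj in objects:
        values = partition_values(obj.key)
        if all(values.get(col) == val for col, val in partition_filter.items()):
            total += obj.size_bytes
    return total


# Illustrative data only: four objects spread over three partitions.
objects = [
    S3Object("sales/year=2023/month=12/part-0.parquet", 400),
    S3Object("sales/year=2024/month=01/part-0.parquet", 300),
    S3Object("sales/year=2024/month=02/part-0.parquet", 200),
    S3Object("sales/year=2024/month=02/part-1.parquet", 100),
]

full_scan = bytes_scanned(objects)
pruned_scan = bytes_scanned(objects, {"year": "2024", "month": "02"})
print(full_scan, pruned_scan)
```

In a Spark notebook, the same idea would typically be a `WHERE` clause on the partition columns, for example `WHERE year = '2024' AND month = '02'` against a hypothetical `sales` table. The other options change how often the catalog is refreshed, how the files are laid out, or where data is cached, but none of them restricts which partitions a given query reads.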
