Microsoft Exam DP-500 Topic 8 Question 19 Discussion

Actual exam question for Microsoft's DP-500 exam

Question #: 19
Topic #: 8

You are using a Python notebook in an Apache Spark pool in Azure Synapse Analytics.

You need to present the data distribution statistics from a DataFrame in a tabular view.

Which method should you invoke on the DataFrame?

A. freqItems

B. corr

C. summary

D. rollup

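Suggested Answer: C

pyspark.sql.DataFrame.summary computes the specified statistics for numeric and string columns and returns them as a new DataFrame. With no arguments it reports count, mean, stddev, min, the approximate 25%, 50%, and 75% percentiles, and max, which is the data distribution in tabular form.

Incorrect:

* freqItems

pyspark.sql.DataFrame.freqItems

Finds frequent items for columns, possibly with false positives, using the frequent element count algorithm described in https://doi.org/10.1145/762471.762473, proposed by Karp, Schenker, and Papadimitriou.

* corr

pyspark.sql.DataFrame.corr calculates the correlation of two columns and returns a single double value, not a table of distribution statistics.

* rollup

pyspark.sql.DataFrame.rollup creates a multi-dimensional rollup over the specified columns so that aggregations can be run on them. It groups data for aggregation rather than describing its distribution.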
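
As a rough illustration of what the summary method returns, the sketch below builds a small DataFrame and prints its statistics table. The helper names distribution_table and demo, the sample data, and the column names are hypothetical. The sketch also assumes that pyspark is installed; in a Synapse notebook the existing Spark session would normally be reused.

```python
def distribution_table(df, *statistics):
    """Return df.summary(...) as a new DataFrame of statistics.

    With no statistics given, PySpark reports count, mean, stddev, min,
    the approximate 25%/50%/75% percentiles, and max.
    """
    return df.summary(*statistics)


def demo():
    # Imported here so the helper above can be read without Spark installed.
    from pyspark.sql import SparkSession

    # getOrCreate() reuses the notebook's session if one already exists.
    spark = SparkSession.builder.getOrCreate()

    # Hypothetical sample data, for illustration only.
    sales = spark.createDataFrame(
        [("north", 120.0), ("south", 95.5), ("east", 143.25), ("west", 88.0)],
        ["region", "amount"],
    )

    # Default statistics: one row per statistic, one column per input column.
    distribution_table(sales).show()

    # A narrower table, restricted to the statistics named explicitly.
    distribution_table(sales, "count", "min", "50%", "max").show()
```

Calling demo() in a notebook cell prints both tables. Each statistic becomes a row and each input column becomes a column, which is the tabular view the question asks for.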

by Harrison at Mar 01, 2023, 10:07 PM
