score:-1

I dont think there is any additional benefit besides developers have more control over data with high level API (Dataframe/ Dataset) than low level (RDD), and they dont need to worry about performance as it is well optimized/ managed by high level API by its own.

Reference - https://spark.apache.org/docs/3.0.0-preview/sql-data-sources-binaryFile.html

P.S. - I do think my answer does not qualify as a formal answer. I earlier wanted to add it as comment only but unable to do so because I am yet to earn privilege of commenting.. :)


Related Query

More Query from same tag