Case of Column Names in Spark DataFrames

You can use Scala testing frameworks ScalaTest (recommended) and Specs, or you can use frameworks/tools developed based on them for Spark specifically. Various discussions suggests that Spark Testing Base is a good one.

https://www.slideshare.net/SparkSummit/beyond-parallelize-and-collect-by-holden-karau

Spark Unit Testing¶

Spark vs Redshift

Oct 29, 2019

Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement! Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!

https://www.quora.com/Spark-vs-Redshift-Should-I-be-using-both-for-big-data-Which-is-better

Performance

https://dbseer.com/benchmark-comparison-spark-sql-redshift-cluster/

Redshift vs …

← Older Newer →

Ben Chuanlong Du's Blog

It is never too late to learn.

Case of Column Names in Spark DataFrames

Comments¶

Read/Write Parquet Files in Spark

Read/Write TSV in Spark

Read/Write CSV in Spark

Comments¶

Unit Testing for Spark

Static Analyzer¶

Spark Testing Frameworks/Tools¶

Spark Unit Testing¶

Spark vs Redshift

Performance