At work recently, a question came up about whether Spark or Tez is better. Here’s an article with some interesting perspectives.

On paper, Spark and Tez have a lot in common: both have in-memory capabilities, can run on top of Hadoop YARN, and support all data types from any data source. So, what’s the difference?

