Jose Quesada - A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and cons PyData Berlin 2016 The