Spark and Cassandra For Machine Learning: Data Pre-processing