4-week course starting June 1, free via edX:
https://www.edx.org/course/introduction-big-data-apache-spark-uc-berkeleyx-cs100-1x
This course covers advanced undergraduate-level material. It requires a programming background and experience with Python (or the ability to learn it quickly). All exercises use PySpark (the Python API of Apache Spark), but previous experience with Spark or distributed computing is NOT required. Before the course starts, students should take the Python mini-quiz and, if they need to learn Python or refresh their knowledge, the Python mini-course.