Getting started with Spark

Date
Feb 22, 2018
Abstract:
 
Data analytics and machine learning have become mainstream in recent years. With the amount of data available, distributed computing has become a necessity. Apache Spark is one of the forerunners in the distributed computing domain. In this session, you’ll learn about the background and basic concepts of Apache Spark. You’ll also see how to build a reference implementation in an IDE. The minimal-slide session, designed to be interactive, is recommended for developers who want to start experimenting with Spark.