Data analytics and machine learning have become mainstream in recent years. With the amount of data available, distributed computing has become a necessity. Apache Spark is one of the forerunners in distributed computing domain. In this session, the audience will learn about the background and basic concepts of Apache Spark. The speaker will explain basic concepts to the audience in detail with a live demo and build up the use case piece by piece live. This session is designed to be interactive, and recommended for developers who want to start experimenting with Spark. There will be no slides in the session.
The main motivation of this session is to provide participants a handy explanation for basic Spark concepts.