Event

Webinar: What is Spark?

19 April 2016
Online, 15.00 - 16.00

Spark might be considered as a one-stop tool for big data processing, providing data manipulation facilities to slice and dice datasets as well as statistical functionality and visualisation capabilities to present your results.  This webinar is intended as an overview of the Spark system and what you can use it for.

This webinar will provide:

  • an overview of the Spark system and what it can be used for
  • how Spark can be used both as a standalone product and as a means to accessing large datasets on a Hadoop cluster
  • demonstrations of how Spark can be used to access and manipulate datasets in Hadoop and to present the results of analysis

This webinar is intended for researchers with no in-depth knowledge of programming with data. However, attendees are more likely to find this webinar of interest if they already have some experience of doing simple data manipulations and analyses (e.g. obtaining summary statistics and graphs in SPSS, Stata or R)

The webinar will consist of a 30 minute presentation followed by 20 minutes for questions.

Resources

DATA CATALOGUE