Analytics with Cassandra and Spark SQL
Date: 18 MARCH 2017, 9:00 – 14:30
Trainers: Felix Crisan, Valentina Crisan
Location: eSolutions Academy, Budişteanu Office Building, strada General Constantin Budişteanu Nr. 28C, etaj 1, Sector 1, Bucureşti.
Number of places:
15 no more places left
For those that learned about Apache Cassandra, you have realized so far that Cassandra it’s a storage and pre-aggregation layer, thus a computational layer should exist in order to complete the queries we would like to run on our data. In this workshop we will look at the analytics that can be done on top of Cassandra with Spark SQL, we will start with similar examples in CQL and Spark SQL and we will evolve into examples that can only be run with Spark SQL. The aim is to understand the capabilities that Spark SQL brings on top of a Cassandra environment and the analytics that can be done on top of these two connected solutions. The exercises will be run in Scala – meaning the examples will be provided in Scala.
Cassandra and Spark in a Data Architecture
Very Brief Cassandra and Spark intro (only key points will be listed)
– Cassandra main points: data partitioning, distribution, replication, consistency
– Cassandra indexing
– Spark basics: RDD’s, Dataframes, transformations, actions
Connecting Spark to the C* environment:
– reading, processing, converting and saving data
– count, group by key, joins
Spark SQL & examples
Working on real data – end to end example
The price for the workshop is 125 RON (including VAT).
There are no more seats left for this session. If you want to be announced if places become available , register here:
1. Complete registration form: