datatomix blog

data-focused blog of Christian Johannsen

Tag

spark

Spark vs. MapReduce Series – Part II – MapReduce

Yesterday I had the time to run my favourite MapReduce example on my Twitter data. And guess what, it's WordCount 🙂 To be honest, this one is available in Hadoop, Spark and Solr, and that's the main reason for… Read more →

Spark is fine, but SparkR is…

Last week a potential customer asked about using SparkR on DataStax Enterprise, and I had no clue whether this could work. I decided to test the general feasibility in my lab environment. After starting up my Cassandra and DSE… Read more →

Spark vs. MapReduce Series – Part I – Get some data

With this series I will try to show how to use Cassandra for storing data and how to use MapReduce or Spark to analyse it. First I have to store data in an Apache Cassandra database and my decision was… Read more →

© 2018 datatomix blog. This website runs on WordPress.
