datatomix blog

Data-focused blog of Christian Johannsen

Tag

apache cassandra

Spark vs. MapReduce Series – Part II – MapReduce

Yesterday I had time to run my favourite MapReduce example on my Twitter data. And guess what, it's WordCount 🙂 To be honest, this one is available in Hadoop, Spark and Solr, and that's the main reason for… Read more →

Spark is fine, but SparkR is…

Last week a potential customer asked about using SparkR on DataStax Enterprise, and I had no clue whether this could work. I decided to test the general possibility in my lab environment. After starting up my Cassandra and DSE… Read more →

Spark vs. MapReduce Series – Part I – Get some data

In this series I will try to show how to use Cassandra for storing data and how to use MapReduce or Spark to analyse it. First I have to store data in an Apache Cassandra database, and my decision was… Read more →

Deploying DataStax Enterprise with Vagrant and bash

After configuring my shiny new MacBook, I started to think about how to set up a DataStax Enterprise environment on my notebook. First, I wanted it to be free of charge, and second, automated, which led me to Vagrant and VirtualBox. I… Read more →

I can see clearly (data) now

This is my first post! As many of you know, I was responsible for automation and integration projects at mightycare and VMware over the last few years. After nearly 4 years at VMware and some really cool customers, projects and colleagues… Read more →

© 2018 datatomix blog
