Analytics · Big Data · General advice and suggestions

Hive: the fresh honey source for your (big data) dishes

When people deal with big data, they also have to deal with a NoSQL data store such as Couchbase, Cassandra, or ArangoDB. Just to refresh: a NoSQL (originally referring to “non SQL”, “non relational” or “not only SQL”) database provides a mechanism for storage and retrieval of data which is modeled in means other…

General advice and suggestions

Scala-la-la-la in the morning ♪ ♪ ♬

Which freaking big data programming language should I use? That’s probably the first cry of every new data engineer, well, sort of. When it comes to picking a language for writing big data applications, developers have an embarrassment of riches at their disposal. Python and R have proven popular among data scientists, while Java has been…

Frameworks · General advice and suggestions

Add a little spark* to your dishes

By now, you must have heard of big data. Who hasn’t, really? If you’ve been actively involved in and reading about BIG DATA, and I really hope you are (there is no getting around it), you’ve probably been messing around with MapReduce. Let’s just refresh our memory quickly: MapReduce is the heart of Hadoop®. It…

Supervised Learning Algorithms

Why use (artificial neural) nets to store veggies?

Confused as to exactly what the activation function in a neural network does? Read this overview, and check out the handy cheat sheet at the end. Sorry if this is too trivial, but let me start at the “very beginning”: linear regression. The goal of (ordinary least-squares) linear regression is to find the optimal weights that…