Google spreadsheets are rapidly replacing excel for some types of data analysis. Here are some useful Google spreadsheet add ons for data analysis.
Hello everyone! In this post, I will show you how you can use rbokeh to build interactive graphs and maps in R.
In this second post in a series of posts about a content recommendation system for The Marketing Technologist (TMT) website we are going to elaborate on the concept of content-based recommendation systems. In the first post we described the benefits of recommendation systems and we roughly divided them in two different types of recommenders: content-based and collaborative filtering. The first post also described the prerequisites in order to set-up both types of recommenders. If you haven’t read this first post yet, it is recommended to do this first before you continue. In this article we take our first steps in content-based recommendation systems by describing a quantified approach to express the similarity of articles.
Welcome to the second chapter in a five-part series about machine learning. In this chapter, we will briefly introduce model performance concepts, and then focus on the following parts of the machine learning process: data selection, preprocessing, feature selection, model selection, and model tradeoff considerations.
Major conferences are often the occasion for key vendor announcements, and SAP didn’t disappoint. At the 2016 SAP Insider event on BI/Hana in Las Vegas, SAP announced the acquisition of independent mobile BI specialist Roambi’s solution portfolio and key assets. With this acquisition, SAP underlines its commitment not only to mobile and cloud but also to getting the right data into the hands of the right people at the right time. With this acquisition, SAP underlines its commitment not only to mobile and cloud but also to getting the right data into the hands of the right people at the right time.
RapidMiner is thrilled to be recognized as a Leader in the Gartner Magic Quadrant for Advanced Analytics Platforms for the third consecutive year. Download the Gartner report.
Never mind driverless cars! Big Data is already hard at work in every aspect of the automotive industry, including safety, design, marketing and more. We look at where Big Data is having an impact on the cars that we are driving.
During the discussion that followed the ggplot2 posts from David and I last week we started talking about tidy data and the man himself noted that matrices are often useful instead of ‘tidy data’ and I mentioned there might be other data that are usefully ‘non tidy’. Here I will be using tidy/non-tidy according to Hadley’s definition.
At the end of each month I pull together a collection of links to some of the most relevant, interesting or thought-provoking web content I’ve come across during the previous month. Here’s the latest collection from December 2015.