4 Crucial Big Data Profit Maximization Hacks for Business Owners

I attended a business conference earlier this month, where the topic of big data came up. I was surprised to see how many people are still sceptical about the value of big data in the world of business.
You probably know that big data is being used by business owners all over the world. The market for big data is growing faster than anybody ever predicted.

Read Full Story

What is the difference between Segmentation and Personalization?

What is the difference between segmentation and personalization? This question came up during one of Optimizely's webinars on personalization. This blog post is for those who have the same question. The basic definition of segmentation is: division into separate parts or sections.

Read Full Story

Railscamp

22 July 2009
I just came back from railscamp (http://railscamps.com), New England edition. It was one of the best programming events I have been to. It was a mix of a hackfest, tech talks, binge drinking, and LAN parties. One of the things that made it great was that there was no internet.

Read Full Story

Examining the arc of 100,000 stories: a tidy analysis

I recently came across a great natural language dataset from Mark Riedel: 112,000 plots of stories downloaded from English-language Wikipedia. This includes books, movies, TV episodes, and video games: anything that has a Plot section on a Wikipedia page.
This offers a great opportunity to analyze story structure quantitatively.

Read Full Story

Machine learning APIs: which performs best?

Amazon ML (Machine Learning) made a lot of noise when it came out last month. Shortly afterwards, someone posted a link to Google Prediction API on Hacker News and it quickly became one of the most popular posts. Google’s product is quite similar to Amazon’s, but it’s actually much older, since it was introduced in 2011.

Read Full Story

Fab failure

So, I was browsing exp.lore.com and came across these nifty little usb-sticks a couple of days ago. Huh, that’s a pretty decent just-in-time gift I thought – might be an idea to buy a couple of them for those occasions where you don’t really have time to buy a gift for someone. So I click the link, and end up on the fine site fab.com.

Read Full Story

The Social Media Analysis Business in 2014

One of the nicest compliments I’ve received over the years came from a company founder who read one of my reports and said I’d summarized his company’s work better than they did. It’s just one of the things I do—take a pile of information and figure out what it’s about. I summarize. So if you need to tease out the short version of something complicated, call me.

Read Full Story

MDM Matching – Are You Asking the Right Question?

An odd request came in last week when a prospective customer asked us about a benchmark on the percentage of duplicates we can find for them using MDM.
In this blog, I want to touch on a few key reasons why this request is odd in many ways. I would also like to take this chance to explain the right questions you should be asking your vendor when it comes to MDM matching.

Read Full Story

Book review: Radical Candor

The book Radical Candor by Kim Scott just came out. It’s a good read on managing, with a focus on people. I’d recommend it if you are a manager or help others manage people. I’d summarize it by saying it takes a teaching and mentoring approach to management, very much of the school that managers primarily exist to help the people on their team.

Read Full Story

Understanding The Role Of Data In Recruiting GDPR Experts

Two major trends in the big data landscape came to our attention that we wanted to address. We wanted to discuss data recruitment and what it means for the GDPR professionals out there. As you may know, all organizations – within the EU – that collect personal data must comply with the GDPR. The consequences for failing to meet GDPR standards are huge.

Read Full Story

Interpretational Challenges with Ideal Point Models

As a graduate student, I came to love working with the roll call voting data sets that have been compiled for the United States Congress by political scientists like Keith Poole and Howard Rosenthal. These datasets can be represented in simplified form as matrices in which the rows correspond to legislators and the columns correspond to bills that the legislators vote for or against.

Read Full Story

Interview of Jerome Berthier, Head of BI and Big Data at ELCA

Data Mining Research (DMR): Can you tell us who you are and how you came to the field of Data Science?
Jerome Berthier (JB): My name is Jerome Berthier, I am an engineer in Computer Science and I have an MBA in management. After 10 years working in different roles for an IT provider (developer, sales representative, managing director), I joined ELCA in 2012 to head the BI division.

Read Full Story

Machine Learning is not BS in Monitoring

Recently I came across the provocatively titled “Machine Learning in Monitoring is BS” and decided to reply, but the response came out longer than a typical comment, so I posted it separately.
Ambiguity of the data – true point. It’s impossible to build a single universal model that eats any data and alerts when something is wrong.

Read Full Story

Hello GDPR, It’s Been a Month. How Are You Doing?

It has been about a month since the General Data Protection Regulation (GDPR) came into effect across the European Union. It’s the most critical data privacy law thus far, an 88-page monster translated into 26 different languages. Summarized, the GDPR requires companies to:
Clearly state how they’re collecting and storing data about EU citizens.

Read Full Story