Best of 2019: Betting on Data Eating the World

IDC predicts that 175 trillion gigabytes of new data will be created worldwide in 2025
[July 23, 2019]
Data is eating the world. All businesses, non-profits, and governments around the world are now in full digital transformation mode, figuring out what data can do to the quality of their decisions and the effectiveness of their actions.
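
For a sense of scale, the headline figure can be restated in zettabytes. The sketch below is only a unit conversion and assumes decimal (SI) units, where a gigabyte is 10^9 bytes and a zettabyte is 10^21 bytes; the function name is illustrative and not taken from the article.

```python
# Unit conversion for the headline figure, assuming decimal (SI) units.
GIGABYTE = 10**9     # bytes in one gigabyte (SI assumption)
ZETTABYTE = 10**21   # bytes in one zettabyte (SI assumption)


def gigabytes_to_zettabytes(gigabytes: int) -> float:
    """Convert a count of gigabytes to zettabytes."""
    return gigabytes * GIGABYTE / ZETTABYTE


# "175 trillion gigabytes" from the headline
headline_gigabytes = 175 * 10**12
print(gigabytes_to_zettabytes(headline_gigabytes), "ZB")
```

Under that assumption, 175 trillion gigabytes is the same quantity as 175 zettabytes.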

Read Full Story

Crossmedia Names Former Cadreon SVP Its First Head of Integrated Media Sourcing

Newly Created Role Advances the Independent Media Agency’s Data-First Approach to Buying
In a move that underscores its commitment to channel-neutral, data-driven buying, independent media agency Crossmedia Inc. (XM) has named Matt Bayer to the newly created position of head of integrated media sourcing.
Bayer will be the centralized leader across all the agency’s media activations.

Read Full Story

Influencing Consumer Choice in a Time of Amazon

Has Amazon Created a Culture of Shoppers on Autopilot?
How do you shop? I go to Amazon. There is a huge selection, prices are reasonable, there is a 5-star rating system, I can read reviews and I am a Prime member. I know that almost always, I will get what I order in two days or less, sometimes even within two hours, for no extra charge. Shopping is convenient, always-on, fast, and risk-free.

Read Full Story

Importing, Inspecting, and Scoring With MOJO Models Inside H2O

Machine-learning models created with H2O may be exported in two basic ways:
Binary format,
Model Object, Optimized (MOJO).
An H2O model can be saved in a binary format, which is tied to the exact version of H2O it was created with. There are multiple reasons for this restriction. An important one is that model-building algorithms may evolve over time.

Read Full Story

How to update your scikit-learn code for 2018

In 2015, I created a 4-hour video series called Introduction to machine learning in Python with scikit-learn. In the years since, hundreds of thousands of students have watched these videos, and thousands continue to do so every month.
At the time of the recording, I was using Python 2.7 and scikit-learn 0.16.

Read Full Story

What's new on arXiv – Complete List

• Not All Claims are Created Equal: Choosing the Right Approach to Assess Your Hypotheses
• Constructing a Data Visualization Recommender System
• Rethinking Self-Attention: An Interpretable Self-Attentive Encoder-Decoder Parser
• A Re-evaluation of Knowledge Graph Completion Methods
• Improving Node Classification by Co-training Node Pair Classification: A Novel Training Framework for Gener

Read Full Story

Role Of Predictive Analytics In The Shifting Email Threat Landscape

Predictive analytics is essential in modern email threat prevention. The IEEE created a report titled Identifying Email Threats Using Predictive Analytics, which shed a lot of light on this complicated issue.
How is Predictive Analytics Revamping Email Security?
Securing your email and other communication channels against unauthorized access and threats is more of a priority than ever.

Read Full Story

Neural Nets with Caffe Utilizing the GPU

Caffe is an open-source deep learning framework, originally created by Yangqing Jia, that lets you use your GPU to train neural networks. Unlike other deep learning frameworks such as Theano or Torch, you don’t have to program the algorithms yourself; instead, you specify your network by means of configuration files.

Read Full Story

Creating a Shiny app with last.fm data

I’ve created a dashboard to display the current most popular artists and tracks on last.fm using the Shiny tool developed by RStudio. If you just want to play with the dashboard, click here. If you want to know more about how I did it, please read on.
Shiny is a tool developed by RStudio. It’s great for creating visual, interactive dashboards for your data.

Read Full Story

Accessing a million songs with Hive and Hadoop on AWS

The Million Song Dataset was created a few years ago to help encourage research on algorithms for analysing music-related data. There was also a Kaggle competition and a hackathon using it a couple of years ago. It’s freely available through Amazon Web Services (AWS) as a public dataset and also in an S3 bucket. I use AWS at work but access it using a nice front end.

Read Full Story
