Facebook launches new market research app after shutting down its controversial VPN service

Illustration by Alex Castro / The Verge
Facebook has a new market research app launching today called Viewpoints, just a few months after the company introduced an Android data collection app called Study designed to monitor what and for how long users are accessing other software on the Google-owned operating system.
Both apps have a controversial history.

Read Full Story

Identifying budding big data talent in your company

How to find promising candidates for upskilling within your organization.Big data is often called one of the most important skill sets in the 21st century, and it’s experiencing enormous demand in the job market. Hiring data scientists and other big data professionals is a major challenge for large enterprises, leading many to shift their efforts to training existing staff.

Read Full Story

Resampling Methods: Bootstrap vs jackknife

Resampling is a way to reuse data to generate new, hypothetical samples (called resamples) that are representative of an underlying population. It’s used when:
You don’t know the underlying distribution for the population,
Traditional formulas are difficult or impossible to apply,
As a substitute for traditional methods.
Two popular tools are the bootstrap and jackknife.

Read Full Story

How to update your scikit-learn code for 2018

In 2015, I created a 4-hour video series called Introduction to machine learning in Python with scikit-learn. In the years since, hundreds of thousands of students have watched these videos, and thousands continue to do so every month.
At the time of the recording, I was using Python 2.7 and scikit-learn 0.16.

Read Full Story

Best practices with pandas (video series)

At the PyCon 2018 conference, I presented a tutorial called “Using pandas for Better (and Worse) Data Science”. Through a series of exercises, I demonstrated best practices with pandas to help students become more fluent at using pandas to answer data science questions and avoid data science errors.
I split the tutorial into 10 videos.

Read Full Story

Truncated Bi-Level Optimization

In 2012, I wrote a paper that I probably should have called “truncated bi-level optimization”.  I vaguely remembered telling the reviewers I would release some code, so I’m finally getting around to it.
The idea of bilevel optimization is quite simple.  Imagine that you would like to minimize some function .  However, itself is defined through some optimization.

Read Full Story

Hyper Networks | 大トロ

In this post, I will talk about our recent paper called [1609.09106] HyperNetworks. I worked on this paper as a Google Brain Resident – a great research program where we can work on machine learning research for a whole year, with a salary and benefits! The Brain team is now accepting applications for the 2017 program: see g.co/brainresidency.

Read Full Story

What is an Interaction Effect?

Introduction
Yesterday, Uri Simonsohn published a blog post called “Interactions in Logit Regressions: Why Positive May Mean Negative”. I love Uri’s blog in general, but I was pretty confused by this latest post. I found it very hard to understand exactly what was being claimed in the absence of formal mathematical definitions of the terms being used.

Read Full Story