Progress

If you don’t like running through the history books, take today off … heck, take the next two weeks off and come back in early January.Now, if you noticed, I ended last week’s discussion about catalog circulation with discussions about five-year ad-investment simulations. This would be the part of the program where my comments are met with groans. But progress happens.

Read Full Story

Facebook takes down fake accounts with fake faces pushing politics

By Donie O’Sullivan | CNN
Artificially-generated faces of people who don’t exist are being used to front fake Facebook accounts in an attempt to trick users and game the company’s systems, the social media network said Friday. Experts who reviewed the accounts say it is the first time they have seen fake images like this being used at scale as part of a single social media campaign.

Read Full Story

How to Research like HubSpot’s Research Team

In college, research was always my favorite part of writing papers. I don’t know why, but finding data to back up my hypotheses always gives me a thrill.
But, even though I love research, it’s not exactly my forte — and I know I’m not alone.
Conducting market research is a necessity for both B2B or B2C companies.

Read Full Story

TechBytes with Robert McGovern, CEO at PreciseTarget

What inspired you to start PreciseTarget?
It all started with a question: why don’t all retailers personalize their e-commerce experience? The journey led me to understand that retailers have sparse data about their customers. For example, the largest non-Amazon apparel retailer averages only two items sold per customer, per year.

Read Full Story

Redash Dashboard Tutorial with Cube.js

We recently covered how to build an analytics dashboard using react, vue, or angular. But if you don’t want to code and host your own dashboard, you have some open-source alternatives. Today I’ll show you how to build an analytics dashboard using Cube.js Backend and Redash. You can find the resulting dashboard live here. Why Cube.js?You can always connect Redash directly to your database.

Read Full Story

Customer-Facing Data Solutions Need a Product Mindset

If most customer data initiatives are risky, have low adoption, and often don’t meet their goals, what steps can we take to improve success rates? What we’ve learned at Juice over 10 years and hundreds of implementations is that customer-facing data initiatives succeed when you deliver a product, not a project.

Read Full Story

Perceptual Straightening of Natural Videos

Video is an interesting domain for unsupervised, or self-supervised, representation learning. But we still don’t know what type of inductive biases will enable us to best exploit the information encoded in the temporal sequence of video frames. Slow Feature Analysis (SFA) and its more recent cousin Learning to Linearize (e.g. Goroshin et al.

Read Full Story

Teach the tidyverse to beginners

A few years ago, I wrote a post Don’t teach built-in plotting to beginners (teach ggplot2). I argued that ggplot2 was not an advanced approach meant for experts, but rather a suitable introduction to data visualization.
Many teachers suggest I’m overestimating their students: “No, see, my students are beginners…”.

Read Full Story

Recap of my first Kaggle Competition: Detecting Insults in Social Commentary [update 3]

Recently I entered my first kaggle competition – for those who don’t know it, it is a site running machine learning competitions. A data set and time frame is provided and the best submission gets a money prize, often something between 5000$ and 50000$.I found the approach quite interesting and could definitely use a new laptop, so I entered Detecting Insults in Social Commentary.

Read Full Story

I don’t want to learn your garbage query language

This is a bit of a rant but I really don’t like software that invents its own query language. There’s a trillion different ORMs out there. Another trillion databases with their own query language. Another trillion SaaS products where the only way to query is to learn some random query DSL they made up.
I just want my SQL back.

Read Full Story

Beware the bandwidth gap – speeding up optimization

Disks are slow and RAM is fast. Everyone knows that. But many optimization algorithms don’t take advantage of this. More to the point, disks currently stream at about 100-200 MB/s, solid state drives stream at over 500 MB/s with 1000x lower latency than disks, and main memory reigns supreme at about 10-100 GB/s bandwidth (depending on how many memory banks you have).

Read Full Story

Hackers beware: Bootstrap sampling may be harmful

Bootstrap sampling techniques are very appealing, as they don’t require knowing much about statistics and opaque formulas. Instead, all one needs to do is resample the given data many times, and calculate the desired statistics. Therefore, bootstrapping has been promoted as an easy way of modelling uncertainty to hackers who don’t have much statistical knowledge.

Read Full Story

Tech and tech idealism

It’s been almost 2 years since my last post! I don’t know if anyone is still reading this. If you are, thank you! Why haven’t I posted more? Partly it is the broad transition to microblogging, which everyone is using more than long form. But part also is that I have negative feelings about where tech has been going. I’m a tech idealist.

Read Full Story

Trending GitHub Repositories for October 2019

If you didn’t watch the video because you’re in a pubic setting and don’t have headphones, the video essentially says GitHub is an open source software development community for teams of and individual developers to work on projects. GitHub gives you a platform to copy projects, track changes, and so much more.

Read Full Story