4 Data Science Conferences to Attend in Asia

Previously, I’ve written posts about 2019 and 2020 data science conferences to attend. Researching for both of those posts have given me a fair amount of knowledge on conferences happening in different regions around the world. In this article, I’m going to cover 5 data science conferences in Asia that you should consider attending in 2020.

Read Full Story

5 Data Science Conferences to Attend in Asia

Previously, I’ve written posts about 2019 and 2020 data science conferences to attend. Researching for both of those posts have given me a fair amount of knowledge on conferences happening in different regions around the world. In this article, I’m going to cover 5 data science conferences in Asia that you should consider attending in 2020.

Read Full Story

CEO Series: Interview with Matt Warta, CEO of Gutcheck

I’ve known Matt Warta, CEO of Gutcheck for over ten years now, and was privileged to work with him as an advisor for a few months as the company was beginning their evolution from the “SurveyMonkey of Qualitative”. That was the hallmark of their launch to the tech-driven insights consultancy that they are today.

Read Full Story

Maiden voyage

Who
Me. I’m an associate professor of Statistics at Youngstown State University in Youngstown, Ohio, USA. I’ve been using R for about 7 years, Emacs about 3 years, git about 1 year, and Org-Mode for less than a year.
What
I want this blog to be about statistics, plain and sample. No frills, no tomfoolery, just bare-boned statistics from beginning to end.

Read Full Story

Double blind review at theory conferences: More thoughts.

I’ve had a number of discussions with people both before and after the report that Rasmus and I wrote on the double-blind experiment at ALENEX. And I think it’s helpful to lay out some of my thoughts on both the purpose of double blind review as I understand it, and the logistical challenges of implementing it.

Read Full Story

Leaving NYC for Nashville

For ten out of the last eleven years, I’ve lived in two places: New York City
and San Francisco. The last two years have been in NYC. After founding Ursa
Labs, a not-for-profit open source development group, I felt it was time to
make my home somewhere that isn’t either of those places. After some
contemplation and consulting many friends, I decided on Nashville,
Tennessee.

Read Full Story

Some Observations on Winsorization and Trimming

Over the last few months, I’ve had a lot of conversations with people about the use of winsorization to deal with heavy-tailed data that is positively skewed because of large outliers. After a conversation with my friend Chris Said this past week, it became clear to me that I needed to do some simulation studies to understand the design space of techniques for dealing with outliers.

Read Full Story

I co-founded a company! Meet Problem Forward Data Science

I have some exciting news about something I’ve been working on for the last year or so. I started a company! It’s called Problem Forward data science. I’m pumped about this new startup for a lot of reasons.
My co-founder is one of my families closest friends, Jamie McGovern, who has more than 2 decades of experience in the consulting world and who I’ve known for 15 years.

Read Full Story

Getting Started with Apache Spark and Neo4j Using Docker Compose

I’ve received a lot of interest in Neo4j Mazerunner since first announcing it a few months ago. People from around the world have reached out to me and are excited about the possibilities of using Apache Spark and Neo4j together. From authors who are writing new books about big data to PhD researchers who need it to solve the world’s most challenging problems.

Read Full Story

A Docker Image for Graph Analytics on Neo4j with Apache Spark GraphX

I’ve just released a useful new Docker image for graph analytics on a Neo4j graph database with Apache Spark GraphX. This image deploys a container with Apache Spark and uses GraphX to perform ETL graph analysis on subgraphs exported from Neo4j. This docker image is a great addition to Neo4j if you’re looking to do easy PageRank or community detection on your graph data.

Read Full Story

Business secrets from terrible people

I get bored reading management books very easily and lately I’ve been reading about a wide range of almost arbitrary topics. One of the lenses I tend to read through is to see different management styles in different environments.
It turns out that some truly f—ng horrific people have some smart management ideas. This is not maybe surprising.

Read Full Story