Data Science

The simple elegance of `.get_new_params()`

Reading through documentation for a programming function to find what arguments it likes can be tedious, especially since many of the possible options either don’t work together or aren’t documented. Something I have been including on most of my newer classes is a static method called get_new_params(method=’random’) which does pretty much exactly as it says, …

The simple elegance of `.get_new_params()` Read More »

Comparative runtimes of GPU vs CPU showing CPU competitive with GPU in most cases

PyTorch, Tensorflow, and MXNet on GPU in the same environment and GPU vs CPU performance

It has been impossible in the past to get all three of the largest neural network architectures running in the same Python environment in such a way that they don’t conflict and so that they will also train on GPU. The reason I want all the libraries running in the same environment is that AutoTS …

PyTorch, Tensorflow, and MXNet on GPU in the same environment and GPU vs CPU performance Read More »

All the Problems with the M6 Competition (and a few good things)

I was so excited to see the announcement of the M6 forecasting competition. For those who don’t know the M series of competitions (not to be confused with BMW cars) is a long running series of competitions for forecasting time series. It comes from the 1980’s, well predating the modern data science explosion. And this …

All the Problems with the M6 Competition (and a few good things) Read More »

Setting up and Optimizing Python for Data Science on Intel, AMD, and ARM (including Apple) Computers

Setting up a Python environment is often a pain. Even worse is that sometimes the environment, once it is finally built, is surprisingly slow because the underlying numeric libraries (BLAS, OneAPI, and so on) are improperly configured. This aims to be a fairly definitive guide to fixing those problems as of early 2022. How much …

Setting up and Optimizing Python for Data Science on Intel, AMD, and ARM (including Apple) Computers Read More »

Anaconda is Slow on Tiger Lake – How to Get MKL-accelerated Python working on Latest Intel CPUs

While doing some benchmarking of my AutoTS package I quickly discovered a troubling trend – my newest, fanciest, most expensive Intel i7 CPUs – an Intel 10700 and 1165G7, were performing vastly slower than their older counterparts. I wasn’t terribly surprised to see the 1165G7 having issues: Tiger Lake – a brand new architecture CPU …

Anaconda is Slow on Tiger Lake – How to Get MKL-accelerated Python working on Latest Intel CPUs Read More »