XLKitLearn

XLKitLearn was designed to teach machine learning to non-technical students without the distraction of code - it exposes the power of scikit-learn through Excel, and works on PC and Mac computers.

Are you looking to use XLKitLearn? If so, skip this page and go to xlkitlearn.com for installation instructions and a link to a user manual. This page contains background, demos, and pedagogical notes.

I wrote XLKitLearn to give students access to the full power of scikit-learn using an Excel interface. Students use it to fit random forests and boosted trees and to carry out Latent Dirichlet Allocation on large datasets, all in Excel. It has changed the way I teach data science and analytics in my Business Analytics 2 class.

First, the highlights:

Click on Enable Editing

Here are two short videos that demonstrate the basics of the predictive analytics and text analytics capabilities of XLKitLearn.

You might also be interested in this video, which I use to introduce the add-in in my classes. It covers the general mechanics of changing the add-in settings and running it, and you might find it useful as an introduction if you decide to use the add-in in your own classes.

Using the add-in

You are welcome to use the add-in for your own classes. Note, however, that when the add-in is run on a computer with internet access, every run logs the user's email address, the add-in settings, and any errors. These logs are used for debugging and to warn users who are running an old version of the add-in.

Please do reach out if you decide to use it - I'm happy to answer any questions, provide whatever support I can, and discuss potential future improvements.

The code for the add-in is here, and I of course welcome any contributions, suggestions, or comments.

Why design a brand new tool?

Before designing XLKitLearn, I did a broad search for existing approaches to teaching data science to non-technical students. I found three approaches, but none met my needs exactly, hence my decision to create something new.

I have also found that even for technical students who know how to code, using a tool that allows them to focus on the data science without worrying about the syntax can be invaluable. XLKitLearn's code output can then be used to seamlessly transition to scikit-learn.