Classification is the problem of assigning a new observation to a specific category, given a set of features. It is used in in a multitude of applications such as spam filters, sentiment analysis, and image recognition.
During this HandsOn Data Science meetup, we will introduce you to the basic concepts of classification, some useful performance measures, as well as a possible workflow for applying it on a real dataset.
There will also be a brief introduction to decision trees and ensemble learning using trees. In many Machine Learning applications these are extensively used, often being state of the art.
After the presentation there will be a practical Data Science session where we will use Kaggle’s “Ghouls, Goblins, and Ghosts” dataset. There will be plenty of exercises of varying difficulty to choose from. So, make sure you bring your laptop and that you have installed any tool for performing Machine Learning, such as R or Python.
You can download and find information about the dataset here
17:30 - 17:45 Meet and greet
17:45 - 18:10 Presentation of Classification, Trees, and Forests
18:10 - 18:30 Hands on time
18:30 - 19:00 Meet and eat
19:00 - 20:00 More hands on
About the presenters
Olof Rännbäck Garpinger holds a PhD in Automatic Control from Lund University, and has been active in the Data Science field since 2016. Today he works with data science and machine learning at Knightec.
Jonas Dürango holds a Licentiate degree in Automatic Control from Lund University and is currently at Knightec, working with data science and machine learning.