By Jen Neng Ng

Photo by Jorge Fernández Salas on Unsplash

Machine learning is a tool that helps data scientists make predictions. To understand how to analyse a dataset for a given scenario, inspecting the data distribution and drafting a comprehensive experiment plan go a long way toward delivering a better outcome.

Kaggle has become one of the most popular platforms for data scientists to learn from each other, yet many beginners get stuck on what to analyse and what to experiment with.

In this tutorial, I am going to assume you have some basic understanding of machine learning from some…


By Jen Neng Ng

Photo by Sebastian Herrmann on Unsplash

This story continues a series:

Part 1: Background Research

Part 2: Data Analysis

Part 3: Implementation

Part 4: Implementation (Continued)

Part 2: Data Analysis

This tutorial uses R because it requires less code and makes plotting quick. The drawbacks are that its terse syntax can be hard to read and that it has less support for deep learning models.

There are 39 columns in the dataset from DrivenData.org. The main objective is to predict the ordinal variable “damage_grade”. This column represents the level of damage caused by the earthquake. …


By Jen Neng Ng

Photo by Zoshua Colah on Unsplash

This story continues a series:

Part 1: Background Research

Part 2: Data Analysis

Part 3: Implementation

Part 4: Implementation (Continued)

Part 4: Implementation (Continued)

Experiment 7: No SMOTE

This experiment only requires commenting out these few lines of code:

Ng Jen Neng

MSc AI
