Implement a quality metric for recommendations with implicit preference based on the expected percentile ranking.
Map string IDs to unique contiguous numeric IDs that are required by certain machine learning libraries.
San Francisco Crime Classification using XGBoost.
Visualization of the concentration of parked bikes and empty docks around London for the London Cycle Hire scheme.
Greedy forward feature selection performing badly on the specific problem.
This is how to load the data to SQL Server for Amazon.com Employee Access Challenge on Kaggle.com.
Improving the SVM classification algorithm by running a grid search optimisation of the parameters.
First try on the Data Science London + Scikit-learn competition.
A qualitative study of the usage of the cycle hire scheme over a sunny week in London.