Rank-based Evaluator for Recommendations with Implicit Preference in PySpark

Implement a quality metric for recommendations with implicit preference based on the expected percentile ranking.

Map column values from strings to contiguous numbers in PySpark

Map string IDs to unique contiguous numeric IDs that are required by certain machine learning libraries.

Hyperparameter Grid Search across multiple models in scikit-learn

A helper class for running GridSearchCV accross multiple models in scikit-learn and comparing scores.

Flight routes graph database with Neo4j

Tutorial for populating and quering a Neo4j graph database with flight routes.

Data Science London Scikit-learn Kaggle competition part III

Greedy forward feature selection performing badly on the specific problem.

Data Science London Scikit-learn Kaggle competition part II

Improving the SVM classification algorithm by running a grid search optimisation of the parameters.

Page 1 of 1