Classification : Titanic (Ongoing): https://www.kaggle.com/c/titanicForest Cover Type Prediction (Late Submission): https://www.kaggle.com/c/forest-cover-type-predictionDon’t Overfit 2 (Late Submission): https://www.kaggle.com/c/dont-overfit-iiCareerCon 2019 (Late Submission)(Multiclass): https://www.kaggle.com/c/career-con-2019IEEE-CIS Fraud Detection (Late Submission): https://www.kaggle.com/c/ieee-fraud-detectionInstant Gratification (Late Submission): https://www.kaggle.com/c/instant-gratificationCategorical Feature Encoding Challenge (Late Submission): https://www.kaggle.com/c/cat-in-the-datUniversity of Liverpool – Ion Switching (Late Submission): https://www.kaggle.com/c/liverpool-ion-switching Binary Classification Tips and tricks Regression : House Prices (Ongoing): https://www.kaggle.com/c/house-prices-advanced-regression-techniquesBike Sharing Demand (Late Submission): https://www.kaggle.com/c/bike-sharing-demand/dataPredict Future Sales (Ongoing)(Time Series): https://www.kaggle.com/c/competitive-data-science-predict-future-salesTMDB Box Office Prediction (Late Submission): https://www.kaggle.com/c/tmdb-box-office-prediction ASHRAE – Great Energy Predictor III (Late Submission): https://www.kaggle.com/c/ashrae-energy-prediction/ Computer Vision Read more about Kaggle Competitions[…]
Methods in Biostatistics Class notes for Math 150 at Pomona College: Methods in Biostatistics. The notes are based primarily on the text (Kuiper and Sklar Practicing Statistics, 2013). You are responsible for reading your text. Your text is very good & readable, so you should use it. Your text is not, however, overly technical.
Reconstructing data from Kaplan-Meier curves Using the methodology given in Guyot et al. BMC Medical Research Methodology 2012, 12:9 (http://www.biomedcentral.com/1471-2288/12/9), we held a practical session to show how to do this in R. The algorithm in Guyot (2012) is included in the digitise() function in the survHE package. The first step is to extract the Read more about Reconstructing KM Curves[…]
R : Data.Table Tutorial (with 50 Examples) The data.table R package is considered as the fastest package for data manipulation. This tutorial includes various examples and practice questions to make you familiar with the package. Analysts generally call R programming not compatible with big datasets ( 10 GB) as it is not memory efficient and Read more about Listendata: Data.table[…]
JOINing data in R using data.table This tutorial is based on the following articles: For joining data.tables, the basics are: the ON or USING clause is defined by setting the keys on the tables with setkey() without anything else, TABLE_X[TABLE_Y] returns a right outer join; setting nomatch=0 it returns a inner join The source of Read more about Data.table Joins[…]
Data Wrangling Part 1: Basic to Advanced Ways to Select Columns I went through the entire dplyr documentation for a talk last week about pipes, which resulted in a few “aha!” moments. I discovered and re-discovered a few useful functions, which I wanted to collect in a few blog posts so I can share them Read more about Advanced Data Wrangling[…]
Learn to purrr Purrr is the tidyverse’s answer to apply functions for iteration. It’s one of those packages that you might have heard of, but seemed too complicated to sit down and learn. Starting with map functions, and taking you on a journey that will harness the power of the list, this post will have Read more about Learn to purrr[…]
Software and Programmer Efficiency Research Group Geometric objects (geoms) are the visual representations of (subsets of) observations. Click on any of the following images to see the quick reference of the corresponding geom. Coming up next: geom_bar, geom_boxplot We retrieved the list of supported parameters and their default values using , where geomname is the Read more about ggplot2 Quick Reference: geom[…]
Personalised Medicine – EDA with tidy R Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More. Flexible Data Ingestion.
R EDA for GStore + GLM + KERAS + XGB Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More. Flexible Data Ingestion.