6 Exercise Checklist
This checklist helps you keep track of your exercises. Remember that you have to hand in satisfactory solutions to at least two-thirds of the exercises. If you're in the beginner track, this means two-thirds of part A (EDA) only. If you're in the advanced track, you have to hand in at least two-thirds of part A and at least two-thirds of part B. You cannot make up for one part with the other: handing in 100 percent of the first part and only 50 percent of the second is not enough. You need at least two-thirds in each part to receive a certificate. After all, you're not that advanced if you only did half of it, right?
Part 1: Exploratory Data Analysis (Beginner + Advanced Tracks)
Visualize available apartments
Load the calendar data set and get an overview of it
Transform data in columns price, available, and date
Show the number of available Airbnb apartments in a simple line plot
Create a calendar plot to visualize the availability of Airbnb listings
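The transformation step above can be sketched as follows. This is only an illustration that uses the Python standard library. It assumes the raw calendar columns arrive as strings, with price formatted like "$1,250.00", available encoded as "t" or "f", and date in ISO format (YYYY-MM-DD). Check your own data set before relying on these assumptions. The sample rows and the helper names parse_calendar_row and available_per_day are made up for the example.

```python
from collections import Counter
from datetime import date


def parse_calendar_row(row):
    """Convert the raw string fields of one calendar row into typed values."""
    # Assumed price format: "$1,250.00"; an empty string means "no price given".
    raw_price = row["price"].replace("$", "").replace(",", "").strip()
    return {
        "date": date.fromisoformat(row["date"]),   # assumed ISO dates
        "available": row["available"] == "t",      # assumed "t"/"f" flags
        "price": float(raw_price) if raw_price else None,
    }


def available_per_day(rows):
    """Count the available listings for each date, sorted by date."""
    counts = Counter(row["date"] for row in rows if row["available"])
    return dict(sorted(counts.items()))


# Made-up sample rows, only to show the expected shape of the data.
raw_rows = [
    {"listing_id": "1", "date": "2019-01-01", "available": "t", "price": "$1,250.00"},
    {"listing_id": "2", "date": "2019-01-01", "available": "f", "price": ""},
    {"listing_id": "1", "date": "2019-01-02", "available": "t", "price": "$80.00"},
    {"listing_id": "2", "date": "2019-01-02", "available": "t", "price": "$95.50"},
]

calendar = [parse_calendar_row(r) for r in raw_rows]
daily_counts = available_per_day(calendar)
```

The resulting daily_counts mapping (date to number of available listings) is the series you would pass to your plotting library of choice for the line plot.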
Correlation between price and availability
Calculate the Correlation Coefficient between available and price
Visualize Correlation
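As a reference for the correlation step, here is a minimal sketch of the Pearson correlation coefficient written out by hand. It assumes available has already been encoded as 0/1 and that only rows with a known price are used. The sample values are invented. In practice you would probably call the correlation function of your data analysis library instead.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sums of cross-deviations and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Invented example: availability as 0/1 and the price of the same row.
available = [1, 1, 0, 0, 1, 0]
price = [80.0, 95.0, 150.0, 140.0, 70.0, 160.0]

r = pearson(available, price)
```

With these invented numbers the unavailable rows carry the higher prices, so r comes out negative. Your real data may show a different sign and strength.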
Distribution of Prices
Calculate Mean and Standard Deviation for each neighbourhood
Visually compare the price distribution for the, on average, most expensive neighbourhood with the least expensive one
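The grouping step above could look like the following sketch. It assumes each listing is a record with a neighbourhood name and a numeric price. The neighbourhood names "North" and "South" and all prices are invented. The sample standard deviation is used, so a neighbourhood with a single listing gets None instead of a number.

```python
from collections import defaultdict
from statistics import mean, stdev


def price_stats_by_neighbourhood(listings):
    """Return {neighbourhood: {"mean": ..., "std": ...}} for listing prices."""
    groups = defaultdict(list)
    for listing in listings:
        groups[listing["neighbourhood"]].append(listing["price"])
    return {
        name: {
            "mean": mean(prices),
            # The sample standard deviation needs at least two values.
            "std": stdev(prices) if len(prices) > 1 else None,
        }
        for name, prices in groups.items()
    }


# Invented sample data.
listings = [
    {"neighbourhood": "North", "price": 100.0},
    {"neighbourhood": "North", "price": 120.0},
    {"neighbourhood": "North", "price": 140.0},
    {"neighbourhood": "South", "price": 60.0},
    {"neighbourhood": "South", "price": 80.0},
]

stats = price_stats_by_neighbourhood(listings)
most_expensive = max(stats, key=lambda name: stats[name]["mean"])
least_expensive = min(stats, key=lambda name: stats[name]["mean"])
```

The two names in most_expensive and least_expensive identify the neighbourhoods whose price distributions you would then compare visually, for example with histograms or box plots.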
Analysis of Listing Reviews
Compute the average number of reviews per apartment
Merge with the listings data set to extract the location of listings
Plot the location of the 200 most reviewed apartments (e.g. with barplot)
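The review steps above can be sketched like this. The example assumes that every row of the reviews data refers to its apartment through a listing_id field and that the listings data has id, neighbourhood, latitude and longitude fields. These names are assumptions, so adjust them to your files. All sample values are invented. Note that the average below only covers apartments with at least one review.

```python
from collections import Counter


def reviews_per_listing(reviews):
    """Count how many reviews each listing received."""
    return Counter(review["listing_id"] for review in reviews)


def top_reviewed_with_location(review_counts, listings, n=200):
    """Join the n most reviewed listings with their location (inner join)."""
    by_id = {listing["id"]: listing for listing in listings}
    result = []
    for listing_id, count in review_counts.most_common():
        listing = by_id.get(listing_id)
        if listing is None:
            continue  # review refers to a listing missing from the listings data
        result.append({**listing, "number_of_reviews": count})
        if len(result) == n:
            break
    return result


# Invented sample data; listing "3" has a review but no listings entry.
reviews = [{"listing_id": i} for i in ["1", "1", "1", "2", "2", "3"]]
listings = [
    {"id": "1", "neighbourhood": "North", "latitude": 52.53, "longitude": 13.40},
    {"id": "2", "neighbourhood": "South", "latitude": 52.48, "longitude": 13.43},
]

counts = reviews_per_listing(reviews)
average_reviews = sum(counts.values()) / len(counts)
top = top_reviewed_with_location(counts, listings, n=2)
```

Counting the neighbourhood values of the records in top gives the numbers for a bar plot of where the most reviewed apartments are located.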
Maps
Map the 200 most frequently reviewed apartments
Create a new map extending its functionality by adding a pop up window
Create a heatmap
BE CREATIVE
- Surprise us with one or more plots of your choice!
Part 2: Price Prediction Using Statistical Methods (motivated Beginner + Advanced Tracks)
Visualize feature correlations in a correlation matrix/heatmap
Regression
Simple regression model using one variable
Improve your model using more features
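For the simple regression with one variable, the following sketch fits an ordinary least squares line by hand and reports R squared. It only illustrates the technique under stated assumptions: the predictor (number of guests an apartment accommodates) and all numbers are invented, and for the actual exercise you would most likely use a statistics or machine learning library.

```python
def fit_simple_regression(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0:
        raise ValueError("the predictor must not be constant")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def r_squared(xs, ys, intercept, slope):
    """Share of the variance in ys explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Invented example: price explained by the number of guests.
accommodates = [1, 2, 3, 4]
price = [42.0, 58.0, 82.0, 98.0]

intercept, slope = fit_simple_regression(accommodates, price)
r2 = r_squared(accommodates, price, intercept, slope)
predicted_price_for_5 = intercept + slope * 5
```

Keeping R squared (or another error measure) for this baseline gives you a reference point when you later add more features.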
Improve your model
- Train new models using more advanced methods