File: Airbnb-Housing-SF_DataPrep_EDA.ipynb
Names: Corinne Medeiros
Date: 1/10/21
Usage: Program imports, cleans, and organizes data, then generates exploratory charts.

Loading Airbnb Listings Data

These data come from http://insideairbnb.com/get-the-data.html. To look at listings over time, I downloaded their archival data from each December. In total, I have 6 years of listings data from 2015 to 2020. Before loading it into this project, I merged these 6 separate datasets using Tableau Prep Builder into a combined csv file.

Tableau Prep Builder Flow:

dataflow.png

In order to find the number of listings per year, I'm going to group by the original file names and calculate the unique values within the 'id' column.

Loading Housing Data

Home Values

The original source of these data is https://www.zillow.com/research/data/. Since I'm only focusing on the city of San Francisco, I removed the rest of the rows in Excel and added a header column up front before loading it into this project.

Loading Housing Data

Rental Prices

The original source of these data is https://www.zillow.com/research/data/. Since I'm only focusing on the city of San Francisco, I removed the rest of the rows in Excel and added a header column up front before loading it into this project.

Data Visualization

After loading and cleaning, I have 4 dataframes to work with for visualizations:

sf_airbnb_df
total_listings_df
sf_housing_df_t
sf_rentals_df_t

Line Chart - Airbnb listings

Bar Chart - Airbnb listings

Line Chart - Home Values

Line Chart - Rental Prices

Now that I've explored and organized my data, I'll use Tableau to generate some nicer looking graphs.