Visual Analytics Project: Proposal

Proposal for the project work for this course.

Louelle TEO Fengmin , Jason TEY Shou Heng , WONG Kian Hoong (Andy)
04-11-2021

Background

Airbnb, Inc. is one of the most well-known vacation rental platforms in the world. With almost 3 million hosts listing over 7 million accommodations worldwide,

United States, as the origin of the app, has the largest number of listings at 660,000, with the company having an estimated US$33.8B economic impact on the economy in 2018. Australia - the focus of this project - saw the company having an economic impact of US$4.4B in the same period, and ranked as the top-10 destination for both inbound and outbound guests, as well as having its cities listed as some of the most popular cities for bookings. In Australia, Airbnb is taking up more than two-thirds of the short-term rental market share.

Prior to the Covid-19 pandemic, the company was valued at US$35B but dropped by 48.6% to around $18B, and the company is estimated to have lost as much as 54% of its revenue in 2020 due to the virus. However, Airbnb’s IPO in December 2020 had its valuation topping $100B, indicating optimism in its recovery post-COVID.

Airbnb is popular with holidaymakers for the wide choice of rental accommodations catering to different needs.Given the popularity of Airbnb to holidaymakers, people are also eager to list their accommodation for vacationers to hire.

This project provides an analytics platform for interested parties (especially non-data specialists) to conduct statistical analysis on the Australia Airbnb dataset using simple and user-friendly interactive dashboards that does not require programming knowledge.

Motivation

The dataset that has be scrapped on the Airbnb web and made publicly available by Inside Airbnb provides geospatial, textual (description of house, house rules, reviews etc.), and quantitative data (per-night price, average ratings, available facilities etc.) on each of the listings listed on the web.

The abundance of Airbnb data provides great opportunity to conduct a variety of data analyses to understand the residential short-lease rental market. Exploratory data analysis allows broad overview of the short-term residential rental market captured by Airbnb, cluster and geospatial analysis provides a deeper understanding into the different types of dwelling units available on the web, as well as the geographical distribution and patterns of the rental market. Text analysis is also plausible with the rich multitude of textual data made available from the descriptive nature of dataset, to peer into the linguistic association of different attributes of the rental market. Finally, regression analysis complements the other methods to dive deep into the various factors that could crack the code to a successful rental listing and vibrant residential market.

While there are studies and reports available on these various aspects of the short-lease rental market in the context of individual countries, a prominent short-coming is the lack of publicly and readily accessible data analytic tools for non-specialist to explore unmask the plethora of knowledge beneath. This project hence aims to provide an application for Airbnb hosts, policymakers, or common man and woman alike who are keen on exploring the interesting geospatial, textual, and statistical relationship of the various interacting elements affecting the Airbnb short-term residential rental market.

While this project focuses on one particular country - Australia - the analytical tools and methods are easily transferable to incorporate datasets from other region, country, or city, subject to the availability of dataset and processing capabilities of the hardware.

Project Objectives

Timeline of Project

Our project timeline will be as follows:

Timeline


Proposed Scope and Methodology

This project will focus on Australia as the location of interest.



The methodologies that will be used are:

Application Features

The application will consist of five main sections, one for each analysis approach. Exploratory Data Analysis


Cluster Analysis


Spatial Cluster Analysis


Text Analysis


Multilinear Regression Analysis

Software Tool

Tha interactive application, together with all documentation such as this project proposal and the final report will be written in R using RStudio.

R Packages

These are the R packages used:

General

Exploratory Data Analytics

Cluster Analysis

Text Analysis

Multilinear Regression

Team Members

References