Airbnb-Sentiment-Analysis

Summary of the analysis

This analysis used two datasets listings.csv and reviews.csv.

Prliminary data exploring done on listings.csv to understand general sentiment of the listings. Data cleaning: i.e removing empty cells and invalid data value Initial word cloud generated to gain an overview of the reviews. Sentiment analysis using VADER and produce negative and positive sentiment dataframe Generate word clouds of negative and positive sentiments. Bar plots of most frequent word for negative sentiment showed that most non-english reviews are marked as negative sentiments Data cleaning again: use of langdetect library to filter out english reviews Performed sentiment analysis on CLEANED data and generate a refined dataset Logsitic regression is used to train a model using the refined labeled dataset (Vectorised the review column and sentiment are labeled target for model training) Topic Modelling using LDA and NMF

Confusion matrix

Key insights

Most of the reviews are predominantly positive

Sentiment of NSW listings are very positive

Negative and positive sentiments words share similar words

Difficult for model to identify negative words due to scarcity of negative training data

Machine learning model requires fine tuning

Some of the results

Overall listing Ratings scaled to 0-100 Majority of the listings have high ratings

Common words used: Word cloud

Histo plot of the most frequent words

Positive sentiment word frequency

Negative sentiment word frequency

How to run

I recommend creating a conda env and running. Will be posting the requirements.txt for all the dependicies for this project soon.

Notes

The dataset exceeds the 100mb Github limit, so I will be putting up an external link to the original dataset used for this analysis

To do

Add requirements.txt
Add the link to the original dataset.
Include logistic regression results in a table

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
AirBnb_Analysis.ipynb		AirBnb_Analysis.ipynb
README.md		README.md
airbnb.jpeg		airbnb.jpeg
reviews_sentiment.png		reviews_sentiment.png
wordcloud_negative.png		wordcloud_negative.png
wordcloud_reviews.png		wordcloud_reviews.png
wordcloud_reviews_hd.png		wordcloud_reviews_hd.png
wordcloudq_positive.png		wordcloudq_positive.png
your_file_name.png		your_file_name.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Airbnb-Sentiment-Analysis

Summary of the analysis

Confusion matrix

Key insights

Some of the results

How to run

Notes

To do

About

Releases

Packages

Languages

MellowPhi/Airbnb-Sentiment-Analysis

Folders and files

Latest commit

History

Repository files navigation

Airbnb-Sentiment-Analysis

Summary of the analysis

Confusion matrix

Key insights

Some of the results

How to run

Notes

To do

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages