This challenge is on the largest bike-sharing program in the United States. As a lead data analyst, we are expected to generate regular reports for city officials looking to publicize and improve the city program. Since 2013, the Citi Bike program has implemented a robust infrastructure for collecting data on the program's utilization. Each month, bike data is collected, organized, and made public on the Citi Bike DataLinks to City Bike Data.
However, while the data has been regularly updated, the team has yet to implement a dashboard or sophisticated reporting process. City officials have questions about the program, so our first task on the job is to build a set of data reports to provide the answers.
The data is downloaded from the City Bike website from Aug 2022 - July 2023 to answer the questions below. The size of all the data are more than 100 MB and can't be uploaded to the repo so, here are the names of the file.
JC-202208-citibike-tripdata.csv
JC-202307-citibike-tripdata.csv
JC-202306-citibike-tripdata.csv
JC-202305-citibike-tripdata.csv
JC-202304-citibike-tripdata.csv
JC-202303-citibike-tripdata.csv
JC-202302-citibike-tripdata.csv
JC-202301-citibike-tripdata.csv
JC-202212-citibike-tripdata.csv
JC-202211-citibike-tripdata.csv
JC-202210-citibike-tripdata.csv
JC-202209-citibike-tripdata.csv
I used to Python (jupyter notebook) to read all the csv files, concatenate them, and extract the starting time and ending time of the journey to use it in Tableau. The final merged filename is citi_bike_2022_2023.csv
- What are the top 10 stations in the city for starting a journey?
- What are the top 10 stations in the city for ending a journey?
- What are the bottom 10 stations in the city for starting a journey?
- What are the bottom 10 stations in the city for ending a journey?
- What are the peak months when bikes are used during a time period (August 2022 - July 2023)?
- What are the peak time during the day when bikers ride?
- Do bikers use annual membership to ride bikes?
- What is their preference over classic bike, electric bike and docked bike?
showing the top 10 starting stations and ending stations. The density map represent the heatmap of starting locations labeled with zip code. There are two hotspots areas which shows the starting prime locations in the 07030 and 07302 zip code.
https://public.tableau.com/app/profile/shipra1168/viz/citybike_16964763213050/Dashboard1?publish=yes
showing the bottom 10 starting stations and ending stations. The density map represent the heatmap of ending locations labeled with zip code. There are two prime hotspot locations for ending the journey in the 07030 and 07302 zip code which is similar to the above starting journey. These represent the frequent bikers location.
https://public.tableau.com/app/profile/shipra1168/viz/citybike_16964763213050/Dashboard2?publish=yes
represent the peak hours during the day. Throughout this time period (Aug 2022 - July 2023), August is the peak month for highest bike ride with starting time in the morning around 8 am and ending time in the evening around 6 pm. Bikers prefer classic bike ride way more than electric or docked bike.
https://public.tableau.com/app/profile/shipra1168/viz/citybike_16964763213050/Dashboard3?publish=yes
represent bike types, members and its popularity. Summers have more riders with August as the peak month popular with classic bike rides followed by electric bikes. There are more annual members riding the bike than casual riders.
https://public.tableau.com/app/profile/shipra1168/viz/citybike_16964763213050/Story1?publish=yes
-
Biker services are remarkably in demand during the summer months (August specially). Bikers like to ride classic bikes and use their membership.
-
The starting and ending locations are in the same neighborhood (based on zip code).
-
Main usage of the bike services are during the office hours morning 8 am and evening 6 pm which indicates that people like to use bikes to commute to offices.