SummaryStatstics
is a package made by Kiera Lee for STAT545B at The
University of British Columbia. SummaryStatstics
was made with the
devtools
package.
The goal of SummaryStatstics
is to help the user efficiently create
summary statistics for the data they are analyzing. With the use of
SummaryStastics
, the user can use the provided functions to get
necessary data.
This package contains the following functions
summary_statistics()
A functions that produces a tibble with the summary statistics columns mean, median, maximum value, minimum value, and standard deviation of a numerical variable, based on a grouping, both contained within a dataframe.
Please see ?summary_statsitics
for a more detailed explanation of the
function.
You can install the most up-to-date released version of
SummaryStatstics
from Github with:
devtools::install_github("stat545ubc-2021/FunctionsKieraLee")
Following installation, this package should be loaded into R or Rstudio with:
library(SummaryStatstics)
This is a basic example which shows you how to solve a common problem,
using a simple dataframe, fruit
:
suppressMessages(library(SummaryStatstics))
#if this is a very basic data set that you are working with
fruit <- data.frame(fruit_type = c("apple","apple","apple", "orange", "orange","orange","plum","plum","plum"),
number = c(10,7,45,3,45,67,89,0,12))
#using the function summary_statistics()
summary_statistics(fruit, number, fruit_type)
## # A tibble: 3 × 6
## fruit_type mean median standard_deviation max min
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 apple 20.7 10 21.1 45 7
## 2 orange 38.3 45 32.5 67 3
## 3 plum 33.7 12 48.3 89 0