Abstract The aim of this project is to determine if there is a statistically significant difference in quality scores between red and white wines. Two datasets are used from the UCI machine learning repository: one listing qualities of red wine and the other listing qualities of white wine. R and RStudio were used to conduct … Continue reading Final project: Which wine has the greatest quality mean?
Author: Brittany Bryce
Time Series Analysis and Exponential Smoothing
Hello world! This week in advanced statistics we covered time series analysis and exponential smoothing. The problem for this week is as follows: The table below represents charges for a student credit card. a. Construct a time series plot using R. b. Employ Exponential Smoothing Model as outlined in Avril Voghlan's notes and report the statistical outcome. c. Provide … Continue reading Time Series Analysis and Exponential Smoothing
Logistic Regression
Hello world! This week in advanced statistics we covered logistic regression. We were assigned two questions for the week, and the first is as follows: 10.1 Set up an additive model for the ashina data (see Exercise 4.6), containing additive effects of subjects, period, and treatment. Compare the results with those obtained from t tests. Here is the R Code … Continue reading Logistic Regression
Tabular Data
Hello world! This week in advanced statistics we covered tabular data. For our first question, we were to convert a given data frame into a table. This is the R code I used for my answer: # Loads data.table package library("data.table") # Creates assignment_data dataframe assignment_data <- data.frame(Country = c("France","Spain","Germany","Spain","Germany", "France","Spain","France","Germany","France"), age = c(44,27,30,38,40,35,52,48,45,37), salary … Continue reading Tabular Data
Analysis of Variance (ANOVA)
Hello, world! This week in advanced statistics we covered Analysis of Variance (ANOVA). This week we were assigned two questions. Here is the first: A researcher is interested in the effects of drug against stress reaction. She gives a reaction time test to three different groups of subjects: one group that is under a great deal of … Continue reading Analysis of Variance (ANOVA)
Regression Analysis
Hello, world! This week in Advanced Statistics and Analysis we covered regression analysis. The first question for this week is as follows: 1.1 Define the relationship model between the predictor and the response variable:1.2 Calculate the coefficients. This is the data set we will be using for this question: x <- c(16, 17, 13, 18, 12, 14, 19, 11, … Continue reading Regression Analysis
Random Variables and Probability Distributions
Hello world! This week in advanced statistics we covered random variables and probability distributions. We were given three questions to answer, the first of which reads as follows: A. Consider a population consisting of the following values, which represent the number of ice cream purchases during the academic year for each of five housemates. 8, … Continue reading Random Variables and Probability Distributions
Hypothesis Testing and Correlation Analysis
This week in advanced statistics, we covered hypothesis testing and correlation analysis. We were assigned three questions to answer, which read as follows: The director of manufacturing at a cookie factory needs to determine whether a new machine is producing a particular type of cookie according to the manufacturer's specifications, which indicate that cookies should … Continue reading Hypothesis Testing and Correlation Analysis
Probability Theory
Hello again world! This week in Advanced Statistics and Analytics we covered probability theory. We've been assigned three different problems to complete this week. The first is as follows: BB1A1020A12040Table 1 A. Based on Table 1, what is the probability of: A1. Event A?A2. Event B?A3. Event A or B?A4. Event A and B? To … Continue reading Probability Theory
Descriptive Statistics
Hello world, I hope everyone's Labor Day weekend was nice and relaxing! This week in class we explored descriptive statistics. Our assignment was to calculate both central tendency and variation values for two data sets and analyze the difference between them. The two data sets were presented as follows: set1 <- c(10, 2, 3, 2, … Continue reading Descriptive Statistics