Task: Determine the most common genres of movies in the dataset. Then, for each genre, calculate descriptive statistics (mean, median, mode, range, variance, standard deviation) of the IMDB scores.Hint: Use Excel's COUNTIF function to count the number of movies for each genre. You might need to manipulate the 'genres' column to separate multiple genres for a single movie. Use Excel's functions like AVERAGE, MEDIAN, MODE, MAX, MIN, VAR, and STDEV to calculate descriptive statistics. Compare the statistics to understand the impact of genre on movie ratings.
Question
Task: Determine the most common genres of movies in the dataset. Then, for each genre, calculate descriptive statistics (mean, median, mode, range, variance, standard deviation) of the IMDB scores.Hint: Use Excel's COUNTIF function to count the number of movies for each genre. You might need to manipulate the 'genres' column to separate multiple genres for a single movie. Use Excel's functions like AVERAGE, MEDIAN, MODE, MAX, MIN, VAR, and STDEV to calculate descriptive statistics. Compare the statistics to understand the impact of genre on movie ratings.
Solution
Sure, here are the steps to follow:
- Open your Excel dataset.
- If the 'genres' column contains multiple genres for a single movie separated by a delimiter (like a comma or a slash), you'll need to separate these into individual genres. You can do this using Excel's 'Text to Columns' function found in the 'Data' tab.
- Once you have a column for each genre, you can use the COUNTIF function to determine the number of movies in each genre. The COUNTIF function is used like this: =COUNTIF(range, criteria). In this case, the range would be your genre column, and the criteria would be the genre you're counting.
- Repeat step 3 for each genre to find out the most common genres.
- For each genre, calculate the descriptive statistics of the IMDB scores. You can use the following Excel functions:
- Mean: =AVERAGE(range)
- Median: =MEDIAN(range)
- Mode: =MODE(range)
- Range: =MAX(range) - MIN(range)
- Variance: =VAR.P(range) or =VAR.S(range) depending on whether you want the population or sample variance.
- Standard Deviation: =STDEV.P(range) or =STDEV.S(range) depending on whether you want the population or sample standard deviation.
- Compare the statistics for each genre to understand the impact of genre on movie ratings. You might want to use a pivot table or conditional formatting to help visualize the data.
Remember to replace 'range' in the formulas with the actual range of cells containing the IMDB scores for the genre you're analyzing.
Similar Questions
Randy is attending film school and one of his projects involves researching the ratingsgiven to movies by critics and viewers. He has selected a random sample of movies andhas recorded the average critic score (X1) and the average viewer score (X2) for eachmovie. The scores were obtained from a website that collates movie reviews and ratingssubmitted by critics and viewers. Randy also recorded the genre (X3) of each movie. Thedata are stored in the file AssignmentData.RData in the data frame movies.df.(a) [3 marks] Create a scatter plot of average critic score against average viewer scorefor comedies, with average viewer score along the x-axis. Make sure to give yourplot a proper descriptive title and appropriate labels for the x and y axes. Describethe relationship between these two variables for comedies
Randy is attending film school and one of his projects involves researching the ratingsgiven to movies by critics and viewers. He has selected a random sample of movies andhas recorded the average critic score (X1) and the average viewer score (X2) for eachmovie. The scores were obtained from a website that collates movie reviews and ratingssubmitted by critics and viewers. Randy also recorded the genre (X3) of each movie. Thedata are stored in the file AssignmentData.RData in the data frame movies.df.(a) [3 marks] Create a scatter plot of average critic score against average viewer scorefor comedies, with average viewer score along the x-axis. Make sure to give yourplot a proper descriptive title and appropriate labels for the x and y axes. Describethe relationship between these two variables for comedies.(b) [2 marks] Determine whether the average critic scores and the average viewerscores are more highly correlated for comedies or for thrillers. Provide a clearjustification for your answer.(c) [3 marks] For thrillers, test whether the population proportion of movies that havean average viewer score greater than 3.27 is less than 0.5. Clearly state your hypothe-ses, making sure to define any parameters, and use a significance level of α = 3%.Do not use any R functions that are designed to perform hypothesis tests.(d) [4 marks] For dramas, test whether the population mean average viewer score isgreater than the population mean average critic score by more than 0.05. Clearlystate your hypotheses, making sure to define any parameters, and use a significancelevel of α = 3%. Do not use any R functions that are designed to perform hypothesistests
Randy is attending film school and one of his projects involves researching the ratingsgiven to movies by critics and viewers. He has selected a random sample of movies andhas recorded the average critic score (X1) and the average viewer score (X2) for eachmovie. The scores were obtained from a website that collates movie reviews and ratingssubmitted by critics and viewers. Randy also recorded the genre (X3) of each movie. Thedata are stored in the file AssignmentData.RData in the data frame movies.df.(a) [3 marks] Create a scatter plot of average critic score against average viewer scorefor comedies, with average viewer score along the x-axis. Make sure to give yourplot a proper descriptive title and appropriate labels for the x and y axes. Describethe relationship between these two variables for comedies.
Directions: The given graph shows how a particular movie was rated by users of an online movie review site. Each user could vote only once. Go through the given graph and answer the questions based on it.
Each of 8 students reported the number of movies they saw in the past year. Here is what they reported.12, 5, 20, 16, 20, 6, 10, 20Send data to calculatorFind the mean number of movies that the students saw.If necessary, round your answer to the nearest tenth.movies
Upgrade your grade with Knowee
Get personalized homework help. Review tough concepts in more detail, or go deeper into your topic by exploring other relevant questions.