Post a Doubt
Movie Genre Analysis: Analyze the distribution of movie genres and their impact on the IMDB score.
Task: Determine the most common genres of movies in the dataset. Then, for each genre, calculate descriptive statistics of the IMDB scores.
My question is should i separate multiple genres for a single movie and do the analysis or is it okay to just take the combined genres for a particular movie? I have done both approaches but confused which one is right process to analyse the data. I have attached 2 images with a basic understanding of both the scenarios :
Image 1 - first i selected all rows under genres column, converted text to columns, delimiter '|' is removed and 'COUNTIF' is used to count the number of times a particular genre is used.
Image 2 - genres is picked up as it is in the data grouped together and a pivot is used to find the count of genres (combined)
Request to please guide me through which is the correct approach
You have to separate multiple genres for a single movie and do the analysis.