IMDB Movie Analysis
By: Ravi Verma
Project Description:
“What factors influence the success of a movie on IMDB?”
Here, success can be defined by high IMDB ratings.
APPROACH:
Cleaning and Handling Data:
Found and removed 2697 blank cells.
Removed rows in which data for any necessary column was missing.
127 duplicates were found in the movie_title column.
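The two cleaning steps above can also be sketched outside Excel. The snippet below is only an illustration in Python: the sample rows are invented, and every column name other than movie_title (here duration and imdb_score) is an assumption about the sheet, as is the choice to keep the first row for each repeated title.

```python
# Hypothetical sample rows; the real sheet has many more rows and columns.
rows = [
    {"movie_title": "Avatar", "duration": "178", "imdb_score": "7.9"},
    {"movie_title": "Avatar", "duration": "178", "imdb_score": "7.9"},  # duplicate title
    {"movie_title": "Spectre", "duration": "", "imdb_score": "6.8"},    # blank cell
    {"movie_title": "Up", "duration": "96", "imdb_score": "8.3"},
]

# Assumed set of "necessary" columns; the slides only name movie_title.
REQUIRED = ("movie_title", "duration", "imdb_score")


def clean(rows):
    """Drop rows with a blank required cell, then keep the first row per title."""
    seen_titles = set()
    cleaned = []
    for row in rows:
        # Skip the row if any required cell is missing or whitespace only.
        if any(not (row.get(col) or "").strip() for col in REQUIRED):
            continue
        title = row["movie_title"].strip()
        # Skip later rows that repeat an already seen title.
        if title in seen_titles:
            continue
        seen_titles.add(title)
        cleaned.append(row)
    return cleaned


cleaned = clean(rows)
print([r["movie_title"] for r in cleaned])
```

In Excel the same effect is usually reached with filters on blank cells and the Remove Duplicates command.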
Tech Stack Used:
Using MS Excel for analysing the dataset:
Microsoft Excel is a versatile tool for data analysis and visualization. It offers functions for organizing and manipulating data, pivot tables for summarizing large datasets, and various chart types for visual representation. With features like data validation and What-If Analysis, users can ensure data accuracy and explore different scenarios. Excel's sharing capabilities enable collaboration, making it a go-to choice across industries for effective data-driven decision-making.
Insights:
Task A: Movie Genre Analysis
Task B: Movie Duration Analysis
Task C: Language Analysis
Task D: Director Analysis
Task E: Budget Analysis
Task A. Movie Genre Analysis: Analyze the distribution of movie genres and their impact on the IMDB score.
Task A: Descriptive Statistics of the IMDB scores
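As a rough cross-check of the descriptive statistics, the same summary measures can be computed with Python's standard statistics module. The scores below are made-up placeholders, not values from the dataset, and the choice of measures (count, mean, median, mode, sample standard deviation, min, max) is an assumption about what the Excel summary contains.

```python
import statistics

# Hypothetical IMDB scores, for illustration only.
scores = [7.9, 6.8, 8.3, 5.4, 7.1, 6.8]

summary = {
    "count": len(scores),
    "mean": statistics.mean(scores),
    "median": statistics.median(scores),
    "mode": statistics.mode(scores),
    "stdev": statistics.stdev(scores),  # sample standard deviation, like Excel's STDEV.S
    "min": min(scores),
    "max": max(scores),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```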
Task B. Movie Duration Analysis: Analyze the distribution of movie durations and their impact on the IMDB score.
B.1: Distribution of the number of movies by duration.
Task B. Movie Duration Analysis: B.2: Duration distribution vs. IMDB score
Task B: Tables and Formulas
Formula for distributing movie durations by class intervals:
=COUNTIF($A$2:$A$3715, "<60")
Formula for calculating average IMDB Score as per duration distribution:
=AVERAGEIFS($B$2:$B$3715,$A$2:$A$3715,">=60",$A$2:$A$3715,"<90")
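The counting and averaging done by the two formulas above can be sketched in Python as a single pass over class intervals. This is only an illustration: the (duration, score) pairs are invented, and the interval edges after the first "<60" class are assumptions. Each interval includes its lower bound and excludes its upper bound, so a movie of exactly 60 or 90 minutes lands in exactly one class.

```python
# Hypothetical (duration in minutes, IMDB score) pairs.
movies = [(55, 6.0), (60, 7.0), (85, 6.5), (90, 7.5), (119, 8.0), (150, 8.5)]

# Assumed class intervals, each half-open: lower <= duration < upper.
BINS = [(0, 60), (60, 90), (90, 120), (120, float("inf"))]


def summarize(movies, bins):
    """Return (interval, movie count, average score) for each interval."""
    result = []
    for lower, upper in bins:
        scores = [score for duration, score in movies if lower <= duration < upper]
        # An empty class has no average, mirroring AVERAGEIFS returning an error.
        average = sum(scores) / len(scores) if scores else None
        result.append(((lower, upper), len(scores), average))
    return result


table = summarize(movies, BINS)
for interval, count, average in table:
    print(interval, count, average)
```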
Task C. Language Analysis: Examine the distribution of movies based on their language.
Task D. Director Analysis: Influence of directors on movie ratings.
Top 15 directors by IMDB score
Bottom 15 directors by IMDB score
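One way such a director ranking could be produced is sketched below in Python. The slides do not say how a director's score is aggregated, so the use of the mean IMDB score per director is an assumption, and the director names and scores are placeholders. The real analysis takes 15 from each end; the example takes 2 to stay short.

```python
from collections import defaultdict

# Hypothetical (director, IMDB score) records.
records = [
    ("Director A", 8.0), ("Director A", 9.0),
    ("Director B", 6.0),
    ("Director C", 7.0), ("Director C", 7.5),
    ("Director D", 4.0),
]


def rank_directors(records):
    """Return (director, mean score) pairs sorted from highest to lowest mean."""
    scores_by_director = defaultdict(list)
    for director, score in records:
        scores_by_director[director].append(score)
    averages = {d: sum(s) / len(s) for d, s in scores_by_director.items()}
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


ranked = rank_directors(records)
top = ranked[:2]              # highest mean scores first
bottom = ranked[-2:][::-1]    # lowest mean scores first
print(top)
print(bottom)
```

In Excel, a pivot table with the director as the row field and the average of the score as the value gives the same ranking once sorted.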
Task E. Budget Analysis: Explore the relationship between movie budgets and their financial success.
Task E. Budget Analysis: Table representing Correlation coefficient, gross profit margin, movie title.
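The two quantities in this table can be illustrated with a short Python sketch. The film titles and figures are invented, and the profit margin definition used here, (gross - budget) / gross, is an assumption since the slides do not spell it out. The correlation is the Pearson coefficient, which is what Excel's CORREL function computes.

```python
import math

# Hypothetical (title, budget, gross) rows in the same currency unit.
films = [
    ("Film A", 100.0, 300.0),
    ("Film B", 50.0, 100.0),
    ("Film C", 200.0, 500.0),
    ("Film D", 10.0, 5.0),
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)


def profit_margin(budget, gross):
    """Assumed definition: profit as a fraction of gross earnings."""
    return (gross - budget) / gross


budgets = [budget for _, budget, _ in films]
grosses = [gross for _, _, gross in films]

r = pearson(budgets, grosses)
margins = {title: profit_margin(budget, gross) for title, budget, gross in films}
best_title = max(margins, key=margins.get)

print(f"correlation: {r:.4f}")
print(f"highest margin: {best_title}")
```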
Links to the Presentation and Excel Sheet
Results:
THANK YOU!