Think Summer: Project 2 — 2023
Submission
Students need to submit the following file by 10:00PM EST through Gradescope inside Brightspace.
-
A Jupyter notebook (a
.ipynbfile).
We’ve provided you with a template notebook for you to use. Please carefully read this section to get started.
|
When you are finished with the project, please make sure to run every cell in the notebook prior to submitting. To do this click . Next, to export your notebook (your |
Questions
Question 1
How many crews include George Lucas in the role of director?
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)
Question 2
How many shows have more than 10000 episodes? Hint: Use the episodes table, and GROUP BY the show_title_id and use the condition HAVING COUNT(*) > 10000 at the end of the query.
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)
Question 3
What are the 5 most popular episodes of Friends? Please include the title of each episode. Please verify your answer by double-checking with IMDB.
Hint: Friends has show_title_id = tt0108778.
Another hint: When you join the episodes table and the ratings table, you might want to add the condition e.episode_title_id = r.title_id
Another hint: You might want to have ORDER BY r.rating DESC LIMIT 3 at the end of your query, so that you are ordering the results by the ratings, and putting them in descending order (with the biggest at the top).
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)
Question 4
Identify the 6 movies that have rating 9 or higher and have 50000 or more votes.
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)
Question 5
For how many movies has Sean Connery been on the crew?
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)
Question 6
Revisiting the question about George Lucas: What are the titles of the movies in which George Lucas had the role of director?
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)
Question 7
Revisiting the question about the shows that have more than 10000 episodes, please find the primary_title of these shows.
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)
Question 8
Revisiting the question about how many movies has Sean Connery been on the crew: Which movie is his most popular movie?
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)
Question 9
Again revisiting the previous question about the popularity of movies for which Sean Connery has been on the crew: Which movie is his most popular movie, if we limit our query to results with 1000 or more votes? What is the title of the most popular result?
-
SQL queries used to solve this problem. (2 pts)
-
Output from running the SQL queries. (1 pt)