Skip to content

course-dprep/team-project-spring-2025-team-1

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Title of your Project

Describe the purpose of this project

Motivation

The renewal of a TV series for a second season is a multifaceted decision shaped by several factors. Traditionally, networks and streaming platforms have relied on audience viewership, critical perception, and financial considerations to determine whether a show continues. However, with the expansion of streaming services and globalization of content, additional factors may now influence these decisions. One possible factor is the popularity of certain genres. Genres like crime dramas or reality TV have historically shown strong audience retention, making them more likely to be renewed (What’s Behind a Show Renewal, n.d.-b). Conversely, niche or experimental genres may encounter more difficulties in securing continued production. Recognizing which genres have a higher likelihood of renewal can assist producers and platforms in optimizing their content strategies. Another factor to consider is the language of the series. In the increasingly global entertainment industry, non-English series have become increasingly popular, with platforms like Netflix and Amazon Prime investing significantly in foreign-language content (What’s Behind a Show Renewal, n.d.-b). The question remains whether language influences renewal decisions, potentially due to factors such as regional demand, dubbing and subtitling costs, and international appearance (Analyzing TV Ratings Systems - Examining the Influence of Viewership Data on Show Renewals | Common Good Ventures, 2001). Finally, average ratings are frequently considered a key factor in renewal decisions. High ratings generally indicate strong audience engagement, suggesting that a show is well-received and likely to perform well in future seasons (Analyzing TV Ratings Systems - Examining the Influence of Viewership Data on Show Renewals | Common Good Ventures, 2001). However, the correlation between ratings and renewal is complex, as other factors – such as production costs, competition, and strategic objectives of the platform – can also influence the final decision. Based on these factors, the question arises to what extent genre popularity, the language of the title, and average ratings influence the likelihood of series renewal.

The findings of this study can contribute to both academic and industry discussions by shedding light on the key factors that influence TV series renewal decisions. By identifying patterns in these decisions, this research can assist content creators, streaming platforms, and production companies in making more strategic and data-driven choices regarding future productions. Understanding these trends can also help media executives allocate resources more effectively and develop content that aligns with audience preferences. Furthermore, the automated and reproducible workflow used in this study ensures that the research process remains transparent and accessible. This not only enhances the reliability of the findings but also makes the study a valuable resource for other students, researchers, and the broader scientific community. Future studies can build upon this framework to explore additional factors influencing the renewal of TV series, contributing to a deeper understanding of decision-making in the entertainment industry.

Data

The data for this study was obtained from IMDb via https://developer.imdb.com/non-commercial-datasets/. This website contains seven different datasets with information from IMDb. For this study, the following four datasets were used: 'title.akas', 'title.basics', 'title.episode' and 'title.ratings'. The following table gives an overview of all variables in the final dataset used for this study.

Variable name Variable description Scale
tconst Alphanumeric identifier of an episode Nominal
parentTconst Alphanumeric identifier of a TV series Nominal
primaryTitle The title used by the makers on promotional materials Nominal
originalTitle The original title in the original language Nominal
isAdult Wheter or not the serie is an adult show (0: non-adult, 1: adult) Boolean
startYear The release year of the title Ratio
endYear The final year a serie was produced Ratio
Genre1 The first genre of the series (some series have multiple genres) Nominal
Genre2 If available, the second genre of the series Nominal
Genre3 If applicable, the third genre of the series Nominal
title
averageRating The weigthed average of all individual user ratings Interval
numVotes The number of votes a series has received Ratio
Renewed Whether the series has at least a second season (0: not renewed, 1: renewed) Boolean
Genre1_encoded
Genre2_encoded
Genre3_encoded
  • What dataset(s) did you use? How was it obtained?
  • How many observations are there in the final dataset?
  • Include a table of variable description/operstionalisation.

Method

To explore influence of genre, language of the title, and average ratings on the likelihood of series renewal, we will conduct a logistic regression analysis. The renewal status will be the dependent variable, while genre, language of the title, and average rating will serve as the independent variables. According to Lee and Wang (2003), logistic regression is a useful method for analyzing binary variables because it models and predicts the probability of a specific outcome. This method is useful as it can handle both continuous and categorical predictors, making it versatile for various types of data.

For deployment, the results will be communicated through a PDF report, ensuring accessibility and clarity for potential users. The structured format will effectively present conclusions, making it easy to interpret key findings.

Preview of Findings

  • Describe the gist of your findings (save the details for the final paper!)
  • How are the findings/end product of the project deployed?
  • Explain the relevance of these findings/product.

Repository Overview

*Include a tree diagram that illustrates the repository structure

Dependencies

Explain any tools or packages that need to be installed to run this workflow.

Running Instructions

Provide step-by-step instructions that have to be followed to run this workflow.

About

This project is set up as part of the Master's course Data Preparation & Workflow Management at the Department of Marketing, Tilburg University, the Netherlands.

The project is implemented by team < x > members: < insert member details>

About

course-dprep-classroom-spring-2025-team-project-reproducible-workflow-template created by GitHub Classroom

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 99.7%
  • Makefile 0.3%