Skip to content

pm0kjp/IS607_Project3

Repository files navigation

IS607_Project3

##Scope of the Project This project was undertaken as partial fulfillment of the requirements for IS 607, Data Acquisition and Management, at the City University of New York. The [assignment] (Assignment_Description.pdf) was to use data to determine whether Best Film Editing is the best predictor of Best Picture in the Academy Awards (Oscars).

##Participants This implementation was carried out by members of section 2 of the fall 2015 course, namely (and in alphabetical order by last name):

  • Daina Bouquin
  • Matthew Farris
  • Robert Godbey
  • Andrew Goldberg
  • Nabila Hossain
  • Srinivasa Illapani
  • Sanjive Kumar
  • Joy Payton
  • Veneranda Skrelja
  • Maxwell Wagner
  • Karen Weigandt
  • Youqing Xiang

##Methods Research into the history and methods of the Oscars allowed us to create an Academy Award Metadata folder that includes

Data Acquisition was largely from pre-existing tabular databases:

This resulted in untidy .csv files, found in the Untidy Data Folder:

The above data was then tidied ith the following code:

  • [Cleaning and Combining](IS607_Project3/Munging Scripts/Extraction data.R)
  • [Gathering and Spreading](IS607_Project3/Munging Scripts/IS607P3 CombOscar Winner Munge AG.R)

Which resulted in the combined datasets:

R libraries including dplyr and tidyr were used to create tidying scripts and tidy data:

Data design gave us the following tools:

A visualization repository was created

Some screenshots for our presentation can be found in the Images for Presentation folder.

##Findings

Our team discovered that Best Editing is in fact not the best predictor of Best Picture. Putting our efforts solely into the analysis of the Academy Award data and not (at this point) including other film awards, we discerned that Best Director is in fact the best predictor of Best Picture.

##Future Directions

Given that we've pulled data from other awards sources, a logical next step would be to include this data in our analysis and machine learning in order to determine if some combination of awards (either a single factor or a combination of several) could be a still more accurate predictor than Best Director. Additionally, a future direction could be a more robust project management model using tools like Slack.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 7