MichaelScottvisualization

This is visualisation of the character from the tv show "The Office" - "Michael Scott"

I've decided to analyze main character of the great show - ‘Michael Scott”
For the next visualizations I have used few datasets, that I get from kaggle.
First dataset providing spoken text lines during the show for all characters. There are 5 attributes:

index
character
line
season
episode_number

While working with this dataset I have found that some eposides have been missing, but amount of the lost episodes is small enough for continue working with this dataset.
Second dataset gibe us more general information, such as title of the episode, director, writers, original air date.The attributes for this dataset are:

season
episode number of season
episode number in series
title of the episode
director
writers
original air date
production coded
US viewers on orignal air date

As for the third dataset it's pretty much the same as the second one this dataset providing general information. The only attributes, that we are looking for is Average IMDd rating. For connection of all of those dataset I joined using two attributes -'season' and 'episode number'
Because data, that have been provided can’t be directly used for the questions,that I want to answer,I had to preprocessed it. Because I wanted to see the appearance of the famous Michael Scott line ‘That’s what she said’ during the show, I had to store in which episodes are this line appeared and who said that with use of first dataset. As a result we have dataset that provide to us every appearance of this line during the show with next attributes:

season
episode
character

. Also using first dataset, I have counted amount of lines from Michael Scott during the show. As a result I had dataset with next attributes:

season
episode
Michael count (count of Michaels line in this episode)

For the second visualisation - “Michael appearance during the show” I used ‘Season’ as a column, and sum of Michael lines for the seasons with sum of US viewers as a rows. Also I’m marking median IMBd rating for the each season. The plot type is dual lines.

For the last visualisation I’ve made text visualisations using “character” attribute from the ‘the-office-lines’ dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data vis the office		data vis the office
A0228582X_B.pdf		A0228582X_B.pdf
Michael Scott .twb		Michael Scott .twb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MichaelScottvisualization

About

Uh oh!

Releases

Packages

Uh oh!

Sveta151/MichaelScottvisualization

Folders and files

Latest commit

History

Repository files navigation

MichaelScottvisualization

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Packages