This is visualisation of the character from the tv show "The Office" - "Michael Scott"
I've decided to analyze main character of the great show - ‘Michael Scott”
For the next visualizations I have used few datasets, that I get from kaggle.
First dataset providing spoken text lines during the show for all characters. There are 5 attributes:
- index
- character
- line
- season
- episode_number
Second dataset gibe us more general information, such as title of the episode, director, writers, original air date.The attributes for this dataset are:
- season
- episode number of season
- episode number in series
- title of the episode
- director
- writers
- original air date
- production coded
- US viewers on orignal air date
Because data, that have been provided can’t be directly used for the questions,that I want to answer,I had to preprocessed it. Because I wanted to see the appearance of the famous Michael Scott line ‘That’s what she said’ during the show, I had to store in which episodes are this line appeared and who said that with use of first dataset. As a result we have dataset that provide to us every appearance of this line during the show with next attributes:
- season
- episode
- character
- season
- episode
- Michael count (count of Michaels line in this episode) As for my first visualisation “‘that’s what she said’ line” I used ‘season’ as column and ‘episode’ and ‘character’ as rows. To separate Michael from other characters, I’ve made a set with this character and used it as a marks. The visualisation is a Gantt one.

For the second visualisation - “Michael appearance during the show” I used ‘Season’ as a column, and sum of Michael lines for the seasons with sum of US viewers as a rows. Also I’m marking median IMBd rating for the each season. The plot type is dual lines.

For the last visualisation I’ve made text visualisations using “character” attribute from the ‘the-office-lines’ dataset.