Introduction
Adventure Time is an American fantasy animated television series following the adventures of a boy named Finn the Human and his best friend and adoptive brother Jake the Dog. The show ran from April of 2010 until September of 2018, airing 283 episodes across 10 seasons where each episode is about 10 minutes long. The show started out as being mainly a children’s program but after it became a viral hit on the internet, many teens and grown-ups also started watching. As older fans of the show ourselves, we wanted to use this project to examine whether it is possible to determine that Adventure Time matures together with its audience.
Data
The sheer amount of episodes and characters makes this an incredibly interesting subject for network science and text analysis. We got all of our data from The Adventure Time Wiki which has pages for each episode containing lists of major and minor characters together with episode transcripts. There is a severe lack of transcripts for episodes in Season 8-10 so for the text analysis part of this project we only analysed episodes from Season 1-7. If we summarise every character and relation (a relation between two characters meaning that they appeared in the same episode), we now have a dataset consisting of
- 953 characters
- 12,984 relations
- 225 transcripts (Season 1-7)
Project questions
In order to determine whether Adventure Time becomes more and more mature as the show progresses, we wanted to examine the development in the emotional value (sentiment) of the words used for each episode. Our hypothesis was that the show would get more serious with time and that the sentiment would lower. We also wanted to look at the diversity of words used; an increasing lexical diversity across seasons could very well indicate that the show is becoming more mature with time.
We also wanted to look at the sentiment for the individual characters but for this we first needed to find out which out of the 953 characters to focus on. Our hypothesis was that Finn and Jake were by far the most important characters to the show, seeing as these are usually labeled as the main characters. Other important characters could include Princess Bubblegum, Ice King, BMO, and more. We also wanted to see if there were any interesting communities between the characters – perhaps a community of princesses?
With this we were now able to calculate the sentiment on an individual level. We were interested to see which characters contributed to the series getting more positive/negative and why. Our hypothesis was that Finn and Jake would change the least over the course of the series seeing as these are the main characters and have to stay more or less consistent. The farther away from being a main character, the more the character would change.
Seasonal overview
Adventure Time is known to most people as a generally fun and joyful show but we wanted to dig deeper into this belief and examine what exactly made this show so happy (apart from the funny voices and vibrant colour scheme). We were also interested to see how the sentiment of the show progressed over time with the initial hypothesis that as the show becomes more complex and the stakes grow higher, the positivity of the episodes will decrease. We were also interested in analysing the sentiment on a character-by-character base to see which characters have a positive impact on the overall sentiment of the show and which have a negative impact. Lastly, we looked at lexical diversity across seasons.
Episode sentiment
First, we calculated the sentiment for each episode in each season using the LabMT wordlist. Here we see the result for Season 1. There are some big spikes in sentiments but most episodes lie within a range of 5.45 to 5.65. The episode with the lowest score in this season is The Duke with a sentiment of 5.37 and the episode with the highest score is Wizard with a sentiment of 5.75.
We wanted to see more specifically why these two episodes were respectively the most negative and positive episodes of the season. To do this, we created word clouds for each of them. For The Duke we do see that generally negative words such as ‘hate’, ‘crime’, ‘assassin’ and ‘villainous’ are very defining words of the episode, leading us to believe that this is in fact one of the more negative episodes. Looking at the word cloud for Wizard we see some much more positive words such as ‘star’, ‘power’ and ‘magic’, so this is indeed clearly a positive episode.
We now wanted to get a broader perspective of the series as a whole so we calculated the sentiment for each transcribed episode from Season 1 to 7 and added a trendline. First of all, we see that the overall sentiment of the show actually increases with time, which contradicts our initial hypothesis. The most positive season is Season 6 with a mean sentiment of 5.63 and the most negative is Season 3 with a mean sentiment of 5.55. However the sentiments of the episodes in the later seasons vary much more than those of the earlier seasons. The standard deviations of the mean sentiment scores for Season 6 and 7 are 0.13 and 0.14, respectively, while those for Season 2 and Season 3 are both only 0.09. This means that the episodes in the earlier seasons were generally equally negative while in the later seasons they are generally more positive but only because there are both very negative and very positive episodes, the latter slightly outweighing the negative.
This theory is backed by the fact that both the overall most positive and most negative episodes of the show happen in the later seasons. The episode with the lowest sentiment score is On the Lam in Season 6 with a score of 5.28 and the episode with the highest sentiment is Football in Season 7 with a score of 5.98.
Lexical diversity
Another indication of maturity could be the lexical diversity, meaning the amount of unique words in an episode or a season divided by the total amount of words. As we see, the lexical diversity does indeed increase with time meaning that as the viewers of the show grew older, the amount of unique words (and thereby the complexity of the language used) increased.
Network exploration
We want to verify that the characters we believe are the most important actually are from an analytical point of view. From the table below, we see to no surprise that Finn and Jake are the main players given their large out-degree and relatively low in-degree. This is a metric of how many episode in which they’ve had a prominent role.
Out-degree | In-degree | Total degree | |
---|---|---|---|
1 | Finn: 832 | BMO: 186 | Jake: 978 |
2 | Jake: 820 | Candy People: 173 | Finn: 965 |
3 | BMO: 363 | Jake: 158 | BMO: 549 |
4 | Princess Bubblegum: 339 | Finn: 133 | Princess Bubblegum: 461 |
5 | Ice King: 320 | Lady Rainicorn: 132 | Ice King: 448 |
6 | Marceline: 237 | Ice King: 128 | Marceline: 336 |
7 | Lumpy Space Princess: 216 | Princess Bubblegum: 122 | Candy People: 303 |
8 | Lady Rainicorn: 158 | Gunter: 106 | Lumpy Space Princess: 302 |
9 | Gunter: 150 | Tree Trunks: 105 | Lady Rainicorn: 290 |
10 | Peppermint Butler: 142 | Peppermint Butler: 101 | Gunter: 256 |
With our hypothesis in mind we can also make use of the network to say something about whether or not the show becomes more mature over time. With the introduction of many plot heavy elements and moving away from fairly shallow characters which are more often seen in a children’s show we can look at the development of which characters are important throughout the show’s airtime.
Getting a general look at the network there are clusters floating on their own away from the central part of the network. These represent episodes in which there are a lot of characters that are never seen again or only appear in episodes with each other. An example of this is the episode City of Thieves in which Finn and Jake travel to a city where everyone is a thief. These thief characters are never used again and are therefore only interconnected forming a cluster.
The following plot tells us the most about our hypothesis since we see a significant decline in importance for the main characters of the show which later is recovered. This could imply that the they wanted to invest time in other characters and thus build up the universe more thoroughly.
The notisable spike in season 9 is likely caused by the show focusing on serialised episode with a heavy reliance on “mini-series” where the same characters are sure to be present in a few episodes while interacting with a bunch of new characters.
Another intesting takeaway from the network is the measurement “Eigenvector Centrality”. This tells us if important characters are linked to other important characters. With our modelling of the network this leads to some interesting results. Namely that it seems as if ‘secondary’ characters are given a eigenvector centrality. We use the term secondary in the sense that they most definitely are part of the story but not a primary part (with some exceptions). Characters such as ‘The Lich’ which is the main antagonist for a large portion of the show, Cosmic Owl and Prismo which are also characters that drives the plot of the story onwards.
Thus we can see a change from characters such as Tree Trunks, BMO and a few fairly random characters to a select few that has to do with the story that is told in the last few seasons.
season 1 | season 2 | season 3 | season 4 | season 5 | |
---|---|---|---|---|---|
1 | Tree Trunks: 0.302 | Mr. Goose: 0.412 | Two-Headed Duck: 0.388 | Scientific Parasite: 0.328 | Candy People: 0.217 |
2 | King Worm: 0.291 | Mrs. Yoder: 0.39 | Mr. Pig: 0.347 | Gumball Guardian: 0.304 | Gunter: 0.212 |
3 | Lady Rainicorn: 0.271 | Cat: 0.302 | Tree Trunks: 0.31 | Jellyfish: 0.292 | BMO: 0.201 |
4 | Centipede: 0.229 | Chipmunk: 0.261 | Phil: 0.278 | Cow: 0.226 | Gumball Guardian: 0.176 |
5 | Wildberry Princess: 0.207 | Owls: 0.225 | Punch Bowl: 0.249 | Banana Guards: 0.207 | Finn: 0.16 |
season 6 | season 7 | season 8 | season 9 | season 10 | |
---|---|---|---|---|---|
1 | Candy People: 0.272 | Worm: 0.246 | BMO: 0.263 | Lumpy Space People: 0.355 | The Morrow: 0.358 |
2 | King of Ooo: 0.247 | Marceline\'s mom: 0.238 | Candy People: 0.24 | Wyatt: 0.33 | Squirrel: 0.322 |
3 | Penguins: 0.214 | Spider Ghost: 0.221 | The Lich: 0.229 | Me-Mow: 0.307 | Scorcher: 0.29 |
4 | Laser Wizard: 0.21 | Toronto: 0.212 | Slime Princess: 0.206 | Cosmic Owl: 0.286 | Sir Slicer: 0.262 |
5 | Bufo: 0.192 | Coal Man: 0.198 | Gunter: 0.189 | Prismo: 0.266 | Ash: 0.236 |
Community detection
Since there aren’t any predefined communities in Adventure Time we have to be creative. This caused us to think about a group of characters with something very particular in common. The abundance of princesses in the show who each governs their respective kingdom. Do they form a community even though they don’t necessarily interact all that much?
We can plot the communities found by Louvain community detection algorithm to find and color those to get a visual representation of what we’re working with. The labels are for the top ten character degrees.
From a subgraph of the princesses it seems as though they could potentially form a community. We plot the distribution of princesses with respect to their assigned community and find that there most certainly is a community of princesses present.
Character development
The change in sentiment across any tv-show can usually be ascribed to the individual changes in the most important characters, and Adventure Time should be no different. Based on our network analysis, the top 10 most important characters in Adventure Time are Jake, Finn, BMO, Princess Bubblegum, Ice King, Marceline, Candy People, Lumpy Space Princess, Lady Rainicorn and Gunter, so these are the characters we will be examining on an individual basis. However, Candy People, Lady Rainicorn and Gunter barely ever speak so these have been left out of this part of the analysis.
Again we used sentiment analysis on an episode-by-episode basis, this time filtered by the character speaking which specific lines. We see that the average sentiment for the most important characters is 5.57 across the entire series. The main characters Finn and Jake both have pretty average sentiments with Finn’s being a bit below the average (5.55) and Jake’s being a bit above (5.58). Princess Bubblegum is the character with the highest average sentiment and Marceline is the character with the lowest, something that fits very clearly with each character’s aesthetic.
No character is static, though, and from series beginning to series end we do see that the sentiment of each character changes. Both of the main characters Finn and Jake have a positive development across the entire series. However, while Finn starts the show 0.08 lower than Jake, his sentiment score actually ends up surpassing that of Jake. This tells us that out of the two main characters, Jake is much more solid and does not change much while Finn spends the show going through a lot of positive character development.
BMO has the most impressive development, going from the most negative to the most positive character across the series. The only characters that develop in a negative direction are Lumpy Space Princess and Marceline, both ending up as the most negative characters by the end of the show. Although some characters do have negative evolutions they get outweighed by the positive evolutions of the other main characters, leading to the general positive development across the entire show that we saw before.
Discussion
Our main hypothesis for this project was that there would be evidence to support that Adventure Time becomes more mature with age. We see that the sentiment for each episode actually increases over time which goes against our hypothesis that the show would become more negative over time. We do not see this as a direct contradiction to our main hypothesis, however, since the emotional value of words in a show only tells half the truth of the complexity of the subjects explored in each episode. Hence, it could be very possible that even though the episodes generally become happier they could also at the same time be handling more mature themes. This could very well be backed by the fact that as the positivity of the show increases, so does the lexical diversity. This means that the language used in the episodes becomes more diverse and complex which definitely supports our main hypothesis that the show becomes more mature with time.
As we began to explore the network in our data we found that Finn and Jake should definitely be regarded as the main characters. Other important characters include BMO, Princess Bubblegum, Ice King, Marceline, Candy People, Lumpy Space Princess, Lady Rainicorn and Gunter which was no surprise to us. We also did indeed find evidence that there was a community of princesses with our data.
Using this list of most important characters, we could calculate the individual sentiments for each character. Most characters’ sentiments were all around the average with Princess Bubblegum’s being well above average and Marceline’s being well below. We expected for Finn and Jake to be the characters with the least development across the series but actually found that Finn develops a lot throughout the show. This makes sense since part of Finn’s journey is growing up and finding himself.
In conclusion, we did not find evidence to disprove our theory that Adventure Time goes from being a children’s show to being a more mature show for all ages. The episodes increase in sentiment score over time and so does the lexical diversity. The main characters are Finn and Jake and though Jake does not change much in sentiment throughout the show, Finn undergoes a lot of character development as he himself becomes more mature. In the future it would be interesting to analyse the three lasts seasons of the show since episodes for these have not been transcribed yet.