Detective Data/Data Detective

Feb 5, 2023 · 11 min read · R Nancy Drew ·

Let’s do some Nancy Drew analysis!

Back story: I love Nancy Drew - the character, the books, and especially the point-and-click adventure games by HerInteractive. Even though these games are by no means the world’s most well-known computer games, there is a vibrant online presence of ND fans, including, for the past couple years, Vote4Holt. Comprised of brothers Julian and Jameson, Vote4Holt is a YouTube channel dedicated to all sorts of Nancy Drew content, from no-commentary playthroughs, to let’s plays, and, most importantly, the most comprehensive ND game ranking system I’ve seen.

Julian and Jameson started out by created separate rankings of the Nancy Drew games for each of six categories: story, suspects, puzzles, music, atmosphere, endings. While no ranking can be truly objective, I like this system because it forces you to consider the strengths and weaknesses of each game. Let’s say you have a game that’s a personal favorite because you, personally, really like hard puzzles; in the Vote4Holt ranking system, that only counts for 1/6 of the game’s actual score. If the other pieces of the game don’t hold up, that game really shouldn’t rank high overall, even if the puzzles are really good.

So, by virtue of breaking the game rankings into separate categories, we have the making of an awesome data set! Let’s take a look at our initial data:

You’ll notice that I’ve added a column beyond just the rankings called ui_era. The Nancy Drew games have used several different user interfaces over the 20+ years that the games have been made, and I was curious whether that had any impact on their rankings. For more information about the different UIs, and explanations of their names, check out this post on the Nancy Drew Wiki.

Ranking #1: Simple Averages

Alright, now that we have our basic data, we can start doing some basic analysis. The first, most straightforward way to turn these scores into an overall ranking is to average all of the scores, and rank from lowest to highest. In other words, the closer the average is to 1, the better it ranks.

Now we can see how the games rank if we value all categories equally. We see that Danger on Deception Island narrowly beats out a tie for second place between Last Train to Blue Moon Canyon and Legend of the Crystal Skull. Shockingly, Midnight in Salem managed to make it all the way to 27th place instead of being dead last!

Which UI ranks best in each category?

Let’s take a look at what happens if we group the games by their UI and average their category scores:

Here, we see that the Exploration era ranks best overall, but there are some other winners in individual categories.

What if you don’t value all categories the same?

People have very different ideas of what makes a good Nancy Drew game. For me, puzzles are a very important part of the experience, along with suspects. For other people, the overall story might be the most important part. So, with this in mind, let’s think about some alternative rankings.

The Vibes Ranking

Some people love playing Nancy Drew games for the vibes. Nancy travels all over the world, and these games have some of the best video game music I’ve ever heard.

In our data set, the relevant categories for a vibes-based ranking are music and atmosphere. To get a vibes-based ranking, we should calculate a weighted average, in which these two categories are given a higher weight than the others. For the purpose of being able to see a difference in the data, I’ve decided to give these two categories a weight of 3, and the others a weight of 1.

Danger on Deception Island remains the winner, but notice that Crystal Skull has moved from second place all the way down to 6th place - this is because it ranked 20th for music. At the very end of the list, Shattered Medallion has surpassed Secrets Can Kill, as it should for a more vibes-based ranking; if I’m playing a game for the vibes, I’d rather be in New Zealand than a generic high school in Florida!

Does UI factor into this ranking?

Of all of the alternative rankings, this is the one that I hypothesize has most to do with user interface. Let’s see how each UI scores, on average:

So, we see that while games from the Exploration era (games 16-25) rank the best in overall scores, the Short era (games 10-15) rank considerably better based on vibes.

For another visualization of which era’s games rank better on vibes, we can look at a scatterplot comparing each game’s overall score and vibes score.

Games that fall above the line ranked higher on their vibe scores than overall scores, while points below the line scored lower on their vibes scores relative to their overall scores. Looking at this plot, we see that there are more red and peach points above the line, and more light blue points below the line. What does this mean? Games from the Exploration and Short eras tended to score higher on vibes than they do for overall scores, whereas games from the Updated UI era (games 26-32) scored worse on vibes than they did overall. Interestingly, these later games coincide with a change in the games’ music composer: Kevin Manthei composed the music for games 1-25, while Thomas Regin composed for games 26-32. While I don’t necessarily remember the earlier games as having distinctly better music, perhaps this change contributed to these games’ relatively lower scores in the vibe categories.

The Writing Ranking

Another perspective might argue that the quality of writing is what separates the quality of Nancy Drew games. Among the six ranked categories, there are 3 that, together, can indicate the overall rating of writing quality: story, suspects, and endings.

Once again, we will compute weighted averages, giving these three categories a weight of 3 while keeping the other categories at a weight of 1:

Hello, Final Scene!

Does writing quality correspond to UI?

Let’s take another look at UI, this time with relation to writing.

This time, the newer games from the Updated UI era ranked better on average based on their writing in comparison to their overall scores. Let’s look at a scatterplot of this data:

The points are a little more all-over-the-place this time, but we can see that the light blue points, corresponding to the Updated era, are consistently above the line, again suggesting that these games are stronger in the writing categories relative to their other scores.

The Interaction Ranking

The last ranking subset I wanted to look at was a combination of puzzles and suspects, because those are two key ingredients to making a game hold my attention. I’m calling this ranking the Interaction Ranking, because puzzles and character dialogue are two major ways that the play interacts with the game.

Once again, we’ll compute weighted averages, giving these two categories a weight of 3, while keeping the other categories at a 1:

We have yet another new winner! For interaction, Sea of Darkness takes first place.

Are interaction scores related to UI?

Let’s look at UI group averages again, this time for the new interaction scores.

The Updated UI once again scores better in this category compared to its overall scoring average. Notably, the Starter, or original, UI (games 1-9) seems to score worse in this category. Let’s look at another scatterplot to get a better sense of this comparison:

Here, we see three patterns of note:

The Updated era games (light blue) are consistently above the line, meaning they generally score higher in suspects and puzzles than the other categories.
The Starter era games (gold) are mostly just below the line, indicating that they score slightly worse in these categories than in other categories. Having played all 33 ND games, I’d generally say that the puzzles increase in quality and quantity as the series progresses (excluding Midnight in Salem, of course), so it makes sense that this first set of games is ranked lower in the interaction categories.
The Exploration era games (red) are split evenly between above the line, below the line, and exactly on the line. In other words, these games are very consistent in their scores: they don’t, overall, score any higher or lower on interaction categories than they do in the other categories.

Final Comments

Having looked at this data extensively, I still can’t come to any definitive conclusion on what the best Nancy Drew is, or what the best era of ND games is. I think, generally, the Exploration era games are the most consistent in quality. But, as Julian and Jameson certainly found while working on their rankings, so much about ranking these games is subjective, and depends on what you personally value in your entertainment. I’d be really curious to see how the games would rank if we had a larger sample of fans use the same ranking system, and were able to average the data to get rid of some of the bias.

If you’re interested in using this data, it will be up on my GitHub, so you can spare yourself the transcription time! Let me know if you come to any more conclusions, or just other ways of looking at this data.