DS4UX (Spring 2016)/Day 3 coding challenge

From CommunityData

Each of the challenges this week will ask you to modify and work with the code for the Baby Names dataset, which you should have installed and begun working with in class.

As always, it's not essential that you solve or get through all of these; I'm not grading your answers. That said, being able to work through most of them is a good sign that you have mastered the concepts for the week. It is always fine to collaborate or work together on these problem sets.


  1. Are there more boy names or girl names? What about for particular first letters? What about for every first letter?
  2. How many babies are in the dataset (assuming nobody is counted more than once)?
  3. What is the longest name in the dataset?
  4. How many boy names are also girl names? How many girls' names are also boys' names?
  5. How many names are subsets of other names (that is, they appear inside a longer name, as "ann" does in "anna")?
  6. What is the most popular girl name that is also a boy name?
  7. Write a program that will take a name as input and return the number of babies with that name in the girl and boy datasets.
  8. Take a prefix as input and print the number of babies with that prefix in each dataset (e.g., "m" would count babies whose names start with "m" and "ma" would count babies whose names start with "ma").
  9. Which boy and girl names are the most popular across all four years in our dataset? (hint: to solve this challenge, you will need to edit ssadata.py)
  10. Which boy and girl names have increased most in popularity between 2010 and 2013? Which ones have declined most in popularity?
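
If you are not sure where to start, the sketch below shows one possible shape for challenges 4, 7, and 8. It is only a starting point, not the official solution. It assumes the dataset is available as two dictionaries that map each lowercase name to a count of babies; the `boys` and `girls` dictionaries here are tiny made-up stand-ins (not real SSA counts), and the function names are hypothetical.

```python
# Toy stand-ins for the course data. ASSUMPTION: the real dataset is exposed
# as two dictionaries mapping each lowercase name to a count of babies.
# These numbers are invented for illustration only.
boys = {"mason": 120, "milo": 45, "riley": 30, "noah": 200}
girls = {"mia": 150, "madison": 90, "riley": 60, "emma": 210}


def count_for_name(name, data):
    """Return the number of babies called `name` in `data` (0 if absent)."""
    return data.get(name.lower(), 0)


def count_for_prefix(prefix, data):
    """Return the total number of babies whose names start with `prefix`."""
    prefix = prefix.lower()
    return sum(count for name, count in data.items() if name.startswith(prefix))


def shared_names(first, second):
    """Return the set of names that appear in both datasets."""
    return set(first) & set(second)


# Challenge 7: look up one name in both datasets.
print(count_for_name("Riley", boys), count_for_name("Riley", girls))

# Challenge 8: a longer prefix matches fewer names.
print(count_for_prefix("m", boys), count_for_prefix("ma", boys))

# Challenge 4: names found in both datasets.
print(sorted(shared_names(boys, girls)))
```

To try this against the real data, you would replace the two toy dictionaries with whatever the course module provides, and read the name or prefix with `input()` instead of hard-coding it.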


Solutions to this week's coding challenges are available for download.