Most Common Name: aging-curve.csv

This directory contains the code and data behind the story "Dear Mona, What’s The Most Common Name In America?"

The main script file is most-common-name.R.

There are four input files:

  • state-pop.csv - Total population and Hispanic population by state.
  • surnames.csv - Data on surnames from the U.S. Census Bureau, including a breakdown by race/ethnicity.
  • aging-curve.csv - Data from the Social Security Administration on the chances that someone born in the decade shown was still alive in 2013: http://www.ssa.gov/oact/NOTES/as120/LifeTables_Tbl_7.html
  • adjustments.csv - Taken directly from Lee Hartman's article: http://mypage.siu.edu/lhartman/johnsmith.html.

And five output files:

  • adjusted-name-combinations-list.csv - Adjusted estimates for the most common full names.
  • adjusted-name-combinations-matrix.csv - The same data as adjusted-name-combinations-list.csv, but in matrix form. These are the estimates presented in the second (and final) table of the article.
  • independent-name-combinations-by-pop.csv - Matrix of estimates for the top 100 most common first names by the top 100 most common surnames. These were calculated assuming independent odds and are displayed in the first table of the article.
  • new-top-firstNames.csv - Final estimated ranking of top first names.
  • new-top-surnames.csv - Final estimated ranking of top surnames.
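
The "independent odds" estimate mentioned for independent-name-combinations-by-pop.csv can be illustrated with a minimal sketch. It assumes that a first name and a surname are statistically independent, so the expected number of people with a given full name is the population multiplied by the share of each name. The function name, the shares and the population below are made-up placeholders for illustration; they are not taken from the data files or from most-common-name.R.

```python
def independent_estimate(first_share, surname_share, population):
    """Expected number of people with a full name, assuming the first
    name and the surname occur independently of each other."""
    return first_share * surname_share * population


# Hypothetical inputs, for illustration only (not from the data files).
population = 1_000_000
first_share = 0.02      # assumed share of people with a given first name
surname_share = 0.01    # assumed share of people with a given surname

estimate = independent_estimate(first_share, surname_share, population)
print(round(estimate))
```

The adjusted output files presumably exist because real names are not independent of each other, which this simple product ignores.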

Data license: CC Attribution 4.0 License · Data source: fivethirtyeight/data on GitHub · About: simonw/fivethirtyeight-datasette

12 rows

rowid  Decade  Age  Male   Female  Male.1   Female.1
1      1900    113  0      0       0.0      0.0
2      1910    103  45     318     0.00045  0.00318
3      1920    93   4154   11403   0.04154  0.11403
4      1930    83   28836  44336   0.28836  0.44336
5      1940    73   58728  70955   0.58728  0.70955
6      1950    63   78170  86389   0.7817   0.86389
7      1960    53   87064  92468   0.87064  0.92468
8      1970    43   92710  95619   0.9271   0.95619
9      1980    33   96010  97619   0.9601   0.97619
10     1990    23   97802  98570   0.97802  0.9857
11     2000    13   99003  99178   0.99003  0.99178
12     2010    3    99348  99449   0.99348  0.99449
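
Judging from the rows, the Male.1 and Female.1 columns appear to be the Male and Female counts divided by 100,000, and Age appears to be 2013 minus Decade. This reading (counts as survivors per 100,000 births) is an inference from the values, not something the source states. A short sketch that checks both relationships on a few rows copied from the table:

```python
# Rows copied from the table: (Decade, Age, Male, Female, Male.1, Female.1)
rows = [
    (1910, 103, 45, 318, 0.00045, 0.00318),
    (1950, 63, 78170, 86389, 0.7817, 0.86389),
    (1990, 23, 97802, 98570, 0.97802, 0.9857),
    (2010, 3, 99348, 99449, 0.99348, 0.99449),
]

# Assumptions inferred from the data, not documented by the source.
RADIX = 100_000        # counts taken to be survivors per 100,000 births
REFERENCE_YEAR = 2013  # year in which survival is measured


def row_is_consistent(row, tol=1e-9):
    """Check Age == 2013 - Decade and probability == count / RADIX."""
    decade, age, male, female, male_p, female_p = row
    age_ok = age == REFERENCE_YEAR - decade
    male_ok = abs(male / RADIX - male_p) < tol
    female_ok = abs(female / RADIX - female_p) < tol
    return age_ok and male_ok and female_ok


all_consistent = all(row_is_consistent(r) for r in rows)
print(all_consistent)
```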

CREATE TABLE "most-common-name/aging-curve" (
  "Decade" INTEGER,
  "Age" INTEGER,
  "Male" INTEGER,
  "Female" INTEGER,
  "Male.1" REAL,
  "Female.1" REAL
);
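
As a sketch of how this schema can be queried, the following loads a few rows from the table into an in-memory SQLite database using Python's standard sqlite3 module. The query (the earliest birth decade for which more than half of males were still alive in 2013) is an illustrative choice, not something taken from the source. Note that the table name and the "Male.1" column need double quotes because they contain "/", "-" and ".".

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")
conn.execute(
    '''
    CREATE TABLE "most-common-name/aging-curve" (
      "Decade" INTEGER,
      "Age" INTEGER,
      "Male" INTEGER,
      "Female" INTEGER,
      "Male.1" REAL,
      "Female.1" REAL
    )
    '''
)

# A subset of rows copied from the table above.
rows = [
    (1930, 83, 28836, 44336, 0.28836, 0.44336),
    (1940, 73, 58728, 70955, 0.58728, 0.70955),
    (1950, 63, 78170, 86389, 0.7817, 0.86389),
]
conn.executemany(
    'INSERT INTO "most-common-name/aging-curve" VALUES (?, ?, ?, ?, ?, ?)',
    rows,
)

# Illustrative query: earliest decade with male survival above one half.
earliest = conn.execute(
    'SELECT MIN("Decade") FROM "most-common-name/aging-curve" '
    'WHERE "Male.1" > 0.5'
).fetchone()[0]
print(earliest)
```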