home / fivethirtyeight

Menu
  • GraphQL API

State of the State: words.csv

Table actions
  • GraphQL API for state-of-the-state/words

This table contains the data behind the story What America’s Governors Are Talking About.

index.csv contains a listing of each of the 50 speeches, one for each state as well as the name and party of the state's governor and a link to an official source for the speech. If an official government source could not be found, we have linked to a news media source that had a transcript of the speech.

The speeches/ folder contains 50 .txt files containing the text of each of the speeches.

words.csv contains every one-word phrase that was mentioned in at least 10 speeches and every two- or three-word phrase that was mentioned in at least five speeches after a list of stop-words was removed and the word "healthcare" was replaced with "health care" so that they were not counted as distinct phrases. It also contains the results of a chi^2 test that shows the statistical significance of and associated p-value of phrases.

Column Definition
n-gram one-, two- or three-word phrase
category thematic categories for n-grams hand-coded by FiveThirtyEight staff: economy/fiscal issues, education, health care, energy/environment, crime/justice, mental health/substance abuse
d_speeches number of Democratic speeches containing the n-gram
r_speeches number of Republican speeches containing the n-gram
total total number of speeches containing the n-gram
percent_of_d_speeches percent of the 23 Democratic speeches containing the phrase
percent_of_r_speeches percent of the 27 Republican speeches containing the phrase
chi2 chi^2 statistic
pval p-value for chi^2 test

Data license: CC Attribution 4.0 License · Data source: fivethirtyeight/data on GitHub · About: simonw/fivethirtyeight-datasette

2,223 rows

✎ View and edit SQL

This data as json, copyable, CSV (advanced)

Suggested facets: category, d_speeches, r_speeches, percent_of_d_speeches, percent_of_r_speeches

Link rowid ▼ phrase category d_speeches r_speeches total percent_of_d_speeches percent_of_r_speeches chi2 pval
1 1 minimum wage economy/fiscal issues 9 0 9 39.13 0.0 10.56521739 0.001152355
2 2 clean energy energy/environment 11 1 12 47.83 3.7 10.07461084 0.001503264
3 3 climate change energy/environment 13 2 15 56.52 7.41 9.986580784 0.001576851
4 4 gun violence crime/justice 8 0 8 34.78 0.0 9.391304348 0.00218017
5 5 affordable care 10 1 11 43.48 3.7 8.931196018 0.002803407
6 6 international 0 10 10 0.0 37.04 8.518518519 0.003515506
7 7 education need education 7 0 7 30.43 0.0 8.217391304 0.00414908
8 8 universal 11 2 13 47.83 7.41 7.803914282 0.005213318
9 9 affordable care act health care 9 1 10 39.13 3.7 7.793880837 0.005242347
10 10 middle class economy/fiscal issues 9 1 10 39.13 3.7 7.793880837 0.005242347
11 11 sick health care 9 1 10 39.13 3.7 7.793880837 0.005242347
12 12 care act 9 1 10 39.13 3.7 7.793880837 0.005242347
13 13 implemented 1 12 13 4.35 44.44 7.680044593 0.005583479
14 14 students state education 6 0 6 26.09 0.0 7.043478261 0.007955439
15 15 just weeks ago 6 0 6 26.09 0.0 7.043478261 0.007955439
16 16 gun safety crime/justice 6 0 6 26.09 0.0 7.043478261 0.007955439
17 17 protect health 6 0 6 26.09 0.0 7.043478261 0.007955439
18 18 pre existing 6 0 6 26.09 0.0 7.043478261 0.007955439
19 19 set aside 0 8 8 0.0 29.63 6.814814815 0.009040468
20 20 prison crime/justice 5 20 25 21.74 74.07 6.803542673 0.009097718
21 21 business community economy/fiscal issues 8 1 9 34.78 3.7 6.664698515 0.009834129
22 22 economic success economy/fiscal issues 8 1 9 34.78 3.7 6.664698515 0.009834129
23 23 freedom 1 10 11 4.35 37.04 6.032645294 0.014043672
24 24 wind 11 3 14 47.83 11.11 5.979296066 0.014474778
25 25 choices 11 3 14 47.83 11.11 5.979296066 0.014474778
26 26 11 million 0 7 7 0.0 25.93 5.962962963 0.014609469
27 27 doing business economy/fiscal issues 0 7 7 0.0 25.93 5.962962963 0.014609469
28 28 state income economy/fiscal issues 0 7 7 0.0 25.93 5.962962963 0.014609469
29 29 pre existing conditions health care 5 0 5 21.74 0.0 5.869565217 0.015404856
30 30 need make sure 5 0 5 21.74 0.0 5.869565217 0.015404856
31 31 reproductive health health care 5 0 5 21.74 0.0 5.869565217 0.015404856
32 32 educators deserve education 5 0 5 21.74 0.0 5.869565217 0.015404856
33 33 expanding access 5 0 5 21.74 0.0 5.869565217 0.015404856
34 34 energy future energy/environment 5 0 5 21.74 0.0 5.869565217 0.015404856
35 35 economy works economy/fiscal issues 5 0 5 21.74 0.0 5.869565217 0.015404856
36 36 existing conditions health care 5 0 5 21.74 0.0 5.869565217 0.015404856
37 37 sure children 5 0 5 21.74 0.0 5.869565217 0.015404856
38 38 cost health health care 5 0 5 21.74 0.0 5.869565217 0.015404856
39 39 good faith 5 0 5 21.74 0.0 5.869565217 0.015404856
40 40 values 14 5 19 60.87 18.52 5.862276464 0.015468775
41 41 child care education 9 2 11 39.13 7.41 5.681305812 0.017146602
42 42 core 2 12 14 8.7 44.44 5.66873706 0.017269885
43 43 officer 2 12 14 8.7 44.44 5.66873706 0.017269885
44 44 state let 7 1 8 30.43 3.7 5.546698873 0.018515575
45 45 expand medicaid health care 7 1 8 30.43 3.7 5.546698873 0.018515575
46 46 past years ve 7 1 8 30.43 3.7 5.546698873 0.018515575
47 47 list 3 14 17 13.04 51.85 5.501657668 0.018998456
48 48 climate energy/environment 15 6 21 65.22 22.22 5.46652864 0.019384091
49 49 childhood 12 4 16 52.17 14.81 5.417069243 0.019940805
50 50 minimum 12 4 16 52.17 14.81 5.417069243 0.019940805
51 51 ongoing 1 9 10 4.35 33.33 5.217391304 0.022362073
52 52 2014 1 9 10 4.35 33.33 5.217391304 0.022362073
53 53 affordable 20 10 30 86.96 37.04 5.158346753 0.023134909
54 54 thanking 2 11 13 8.7 40.74 4.905363558 0.026773415
55 55 enjoy 2 11 13 8.7 40.74 4.905363558 0.026773415
56 56 law enforcement crime/justice 5 17 22 21.74 62.96 4.796955058 0.028510082
57 57 enforcement crime/justice 5 17 22 21.74 62.96 4.796955058 0.028510082
58 58 necessary 5 17 22 21.74 62.96 4.796955058 0.028510082
59 59 bless great state 3 13 16 13.04 48.15 4.783011272 0.028741818
60 60 16 3 13 16 13.04 48.15 4.783011272 0.028741818
61 61 man 4 15 19 17.39 55.56 4.760488177 0.029120288
62 62 concerns 8 2 10 34.78 7.41 4.653784219 0.030985168
63 63 carbon energy/environment 8 2 10 34.78 7.41 4.653784219 0.030985168
64 64 sustain 8 2 10 34.78 7.41 4.653784219 0.030985168
65 65 pre 14 6 20 60.87 22.22 4.637681159 0.031277237
66 66 starts 11 4 15 47.83 14.81 4.511540526 0.033666905
67 67 afford 19 10 29 82.61 37.04 4.447165306 0.034959231
68 68 looks like 6 1 7 26.09 3.7 4.444674488 0.035010264
69 69 friends neighbors 6 1 7 26.09 3.7 4.444674488 0.035010264
70 70 just weeks 6 1 7 26.09 3.7 4.444674488 0.035010264
71 71 paid family 6 1 7 26.09 3.7 4.444674488 0.035010264
72 72 house senate 1 8 9 4.35 29.63 4.410270174 0.035723183
73 73 savings account economy/fiscal issues 0 5 5 0.0 18.52 4.259259259 0.03903694
74 74 schools safer crime/justice 0 5 5 0.0 18.52 4.259259259 0.03903694
75 75 local law enforcement crime/justice 0 5 5 0.0 18.52 4.259259259 0.03903694
76 76 state law 0 5 5 0.0 18.52 4.259259259 0.03903694
77 77 department commerce 0 5 5 0.0 18.52 4.259259259 0.03903694
78 78 prison population crime/justice 0 5 5 0.0 18.52 4.259259259 0.03903694
79 79 local law crime/justice 0 5 5 0.0 18.52 4.259259259 0.03903694
80 80 state income tax economy/fiscal issues 0 5 5 0.0 18.52 4.259259259 0.03903694
81 81 skills necessary 0 5 5 0.0 18.52 4.259259259 0.03903694
82 82 education workforce education 0 5 5 0.0 18.52 4.259259259 0.03903694
83 83 cost doing 0 5 5 0.0 18.52 4.259259259 0.03903694
84 84 continues grow 0 5 5 0.0 18.52 4.259259259 0.03903694
85 85 line duty 0 5 5 0.0 18.52 4.259259259 0.03903694
86 86 tax rates economy/fiscal issues 0 5 5 0.0 18.52 4.259259259 0.03903694
87 87 long ago 0 5 5 0.0 18.52 4.259259259 0.03903694
88 88 fully funding economy/fiscal issues 0 5 5 0.0 18.52 4.259259259 0.03903694
89 89 thanks leadership 0 5 5 0.0 18.52 4.259259259 0.03903694
90 90 mr chief justice 0 5 5 0.0 18.52 4.259259259 0.03903694
91 91 mr chief 0 5 5 0.0 18.52 4.259259259 0.03903694
92 92 conservative 2 10 12 8.7 37.04 4.156736447 0.041469233
93 93 corrections crime/justice 2 10 12 8.7 37.04 4.156736447 0.041469233
94 94 involved 2 10 12 8.7 37.04 4.156736447 0.041469233
95 95 adults 12 5 17 52.17 18.52 4.137633797 0.041939755
96 96 reach 16 8 24 69.57 29.63 4.126677402 0.042212153
97 97 environment energy/environment 4 14 18 17.39 51.85 4.096976203 0.042959981
98 98 continued 4 14 18 17.39 51.85 4.096976203 0.042959981
99 99 positive 3 12 15 13.04 44.44 4.082125604 0.043339106
100 100 looks 9 3 12 39.13 11.11 4.062801932 0.043837703

Next page

Advanced export

JSON shape: default, array, newline-delimited

CSV options:

CREATE TABLE "state-of-the-state/words" (
"phrase" TEXT,
  "category" TEXT,
  "d_speeches" INTEGER,
  "r_speeches" INTEGER,
  "total" INTEGER,
  "percent_of_d_speeches" REAL,
  "percent_of_r_speeches" REAL,
  "chi2" REAL,
  "pval" REAL
);
Powered by Datasette · Queries took 16.692ms · Data license: CC Attribution 4.0 License · Data source: fivethirtyeight/data on GitHub · About: simonw/fivethirtyeight-datasette