COVID-19 Case Surveillance Public Use Data with 
Geography Profile (2024-05-02 version) 


e Basic Statistics 
e Missing Data Profile 
e Univariate Distribution 


o Bar Chart (with frequency) 


Basic Statistics 


Row Counts 
Item Counts 
String Long 
1 Rows 105869141 
2 Columns 19 
3 Rows With Null or Missing Values 105735504 
4 Rows With No Null or Missing Values 133637 


Missing Data Profile 


Missing/null Data Profile 


underlying_conditions_yn 

death_yn 53.29% 

icu_yn 

hosp yn 48.71% 
symptom_status 48.1% 

current_status 

exposure_yn 

process 

case_onset_interval 

case_positive_specimen_interval 

ethnicity 

race 

Sex 

age_group 

county_fips_code 

res_county 

state _fips code 

res_state 

case_month 


Features 


0% 20% 40% 60% 80% 100% 
% of Total Rows 


Univariate Distributions 


Row count 


Row count 


16,000,000 


8,000,000 


| 
| 
| 
12,000,000 4 
1 
] 


4,000,000 : 


—_ 


55,000,000 4 
50,000,000 4 
45,000,000 4 
40,000,000 4 
35,000,000 4 
30,000,000 4 
25,000,000 1 
20,000,000 4 
15,000,000 4 
10,000,000 4 


5,000,000 | 


2020-01 


~ wo ast fon) a a mo wn Les a 1 a (se) ire) = Dn a onl oO wo 
oO oO [=] oO a oO oO oO [=] oO ta | oO oO oO oO oO a oO oO oO 
SOC OO Oo dA ee ee te AN NNN NAN Om mH Om 
N N N N QN N N N N N N N N N N N N N N N 
oO oO oO fo} oO oO oO oO oO oO oO oO i=) oO oO oO oO oO oO oO 
N N N N N N N N N N N N N N N N N N N N 
case_month 
52,039,021 
19,251,784 
17,515,450 
14,840,430 
1,085,405 

0-17 years 18 to 49 years 50 to 64 years 65+ years Missing 


age_group 


2023-07 


2023-09 


2023-11 
2024-01 
2024-03 


1,137,051 


NA 


NA 


res_state 


oO 
y 
oO 
77 
wn 
x 


2,000,000 4,000,000 6,000,000 8,000,000 10,000,000 12,000,000 14,000,001 


Row count 


Row count 


Row Count 


60,000,000 


50,000,000 


40,000,000 


30,000,000 


20,000,000 


10,000,000 


50,000,000 


40,000,000 


30,000,000 


20,000,000 


10,000,000 


sex 


race 


Row count 


ethnicity 


50,000,000 


40,000,000 


30,000,000 


20,000,000 


10,000,000 


ethnicity 


process 


100,000,000 


80,000,000 


60,000,000 


Row count 


40,000,000 


20,000,000 


process 


exposure_yn 


100,000,000 


98,123,842 
80,000,000 


60,000,000 


Row count 


40,000,000 


20,000,000 


5,554,766 
2,190,533 


Unknown Missing 


exposure_yn 


current_status symptom_status 


55,000,000 
90,000,000 
50,000,000 
80,000,000 sania 
,000, 45,000,000 
70,000,000 40,000,000 
35,000,000 
60,000,000 
€ 
e 3 30,000,000 
© 50,000,000 = 
o 6 25,000,000 
3 ao 
& 40,000,000 20,000,000 
30,000,000 15,000,000 
10,000,000 
20,000,000 
19,395,210 5,000,000 
10,000,000 
0 
Se 
~ 
Laboratory-confirmed Probable Case & 
case cS) 
current_status symptom_status 
hosp_yn Icu_yn 


100,000,000 


50,000,000 


80,000,000 
40,000,000 


60,000,000 


30,000,000 


Row count 
Row count 


40,000,000 
20,000,000 


10,000,000 20,000,000 


Row count 


death_yn 


60,000,000 4 


50,000,000 4 


40,000,000 4 


30,000,000 4 


20,000,000 4 


10,000,000 4 


underlying_conditions_yn 


4 


100,000,000 4 


80,000,000 4 


60,000,000 4 


Row count 


40,000,000 4 


20,000,000 4 


4 3,835,073 


69,507 


io) ° NS 
Ww ~ © 
_yn 


underlying_conditions