The association between smoking or passive smoking and cardiovascular diseases using a Bayesian hierarchical model: based on the 2008-2013 Korea Community Health Survey
Article information
Abstract
OBJECTIVES
Smoking and passive smoking have been extensively reported as risk factors of cardiovascular morbidity and mortality. Despite the biological mechanisms underlying the impact of hazardous chemical substances contained in tobacco in cardiovascular diseases (CVD), studies investigating the association between smoking and passive smoking with morbidity are at an inchoate stage in Korea. Therefore, this study aimed to estimate the risks of smoking and passive smoking on cardiovascular morbidity at the national and regional levels.
METHODS
This study calculated sex-standardized and age-standardized prevalence of CVD and smoking indices in 253 community health centers (si/gun/gu) in Korea using the 2008-2013 Korea Community Health Survey data. Furthermore, a Bayesian hierarchical model was used to estimate the association of smoking and passive smoking with the prevalence of CVD from the national and regional community health centers.
RESULTS
At the national level, smoking was significantly associated with stroke (relative risk [RR], 1.060) and hypertension (RR, 1.016) prevalence, whilst passive smoking at home and work were also significantly associated with prevalence of stroke (RR, 1.037/1.013), angina (RR, 1.016/1.006), and hypertension (RR, 1.010/1.004). Furthermore, the effects of smoking and passive smoking were greater in urban-industrial areas than in rural areas.
CONCLUSIONS
The findings of this study would provide grounds for national policies that limit smoking and passive smoking, as well as regionally serve as the basis for region-specific healthcare policies in populations with high CVD vulnerability.
INTRODUCTION
Smoking and passive smoking have extensively been documented as risk factors of morbidity and mortality by a number of diseases [1-4]. Among more than 4,000 types of chemical substances contained in a cigarette, polycyclic aromatic hydrocarbons and oxidizing gases cause cardiac toxicity [1], having adverse effects on various cardiovascular diseases (CVD), including coronary artery disease, ischemic stroke, non-traumatic subarachnoid hemorrhage [5,6].
The World Health Organization reported that CVD-related death (1.69 million) is the leading cause of smoking-related death. In a study on 360 thousand US men, smoking has been identified as a key risk factor for lung cancer, chronic heart disease, and stroke [3]. In addition, a Norwegian study revealed that smokers are 2.74 times more likely to have a stroke and 6.74 times more likely to die from it than are non-smokers [4]. Similar to the effects of smoking, passive smoking has also been reported to be the cause of respiratory and CVD, various cancers, and premature deaths in non-smoking adults [7]. Other studies have suggested that secondhand smoking in non-smokers increases their risk of developing a heart disease by 25 to 30% [8-10].
In South Korea (hereafter Korea), several studies have reported that smoking increases the risk of CVD-related death in adults [11-13]. As of 2012, the number of smoking-related deaths was 4,148 for stroke and 3,858 for ischemic heart disease, which are ranked the second and third highest causes of death by an illness respectively [14]. Furthermore, it also has been reported that 26.7% of all CVD-related deaths in men and women adults in Korea are caused by smoking [13]. Although adult smoking rate in Korea has consistently been on a decline since 1998 [15], past smoking history may have an impact on death [11]. Moreover, medical expenses and patient population related to hypertension, heart disease, and cerebrovascular disease are continuously on the rise [16], suggesting that smoking is a health hazard in terms of CVD and mortality requiring much attention.
As shown here, though there have been much research investigating the association between smoking and death by CVD in Korea, studies on smoking and morbidity is relatively lacking. Hence, this study aimed at quantitatively analyzing the association between smoking and the prevalence of various CVD (i.e., hypertension, stroke, myocardial infarction, angina), and to evaluate the relative risk (RR) of smoking on the prevalence of CVD using data from a large-scale cross-sectional study with more than 1.5 million participants. We also examined the effects of passive smoking on the prevalence of CVD.
MATERIALS AND METHODS
Data
Data from the 2008-2013 Korea Community Health Survey (KCHS) involving 253 community health centers were used for this study [17]. The KCHS was conducted by the Korea Centers for Disease Control and Prevention, and data collected from adults aged 19 years or older via interviews. The sample was extracted from an average of 900 adults per community (si/gun/gu) based on the type of housing within each dong/eup/myeon. The primary sample region was obtained using a probability proportional to size systematic sampling, after which the secondary sample families were selected [18]. A total of 1,567,930 people were included in the KCHS data used in this study (2008: 200,800; 2009: 220,258; 2010: 230,712; 2011: 229,229; 2012: 229,226; 2013: 228,781), and the participants were randomly extracted every year. Using this data, we calculated the prevalence of 4 CVD (hypertension in adults ≥ 30 years, stroke in adults ≥ 50 years, myocardial infarction in adults ≥ 40 years, angina in adults ≥ 40 years), smoking-related regional parameters (current smoking rate, passive smoking rate at home, and passive smoking rate at work), body mass index (BMI), and monthly drinking rate by community (si/gun/gu), after reflecting gender/age distributions based on the year of study. The prevalence of CVD was measured, based on a self-reported response regarding a physician’s diagnosis, whilst BMI and monthly drinking rates were also measured with self-reported data.
Statistical analysis
A generalized linear mixed model (GLMM) was used to analyze the association among regional and national smoking indices and prevalence of CVD that were repeatedly measured from 2008-2013. The prevalence of 4 CVD (hypertension in adults ≥ 30 years, stroke in adults ≥ 50 years, myocardial infarction in adults ≥ 40 years, and angina in adults ≥ 40 years) was set as the response variables. A Shapiro-Wilk test revealed that the prevalence of all 4 diseases did not follow a normal distribution, but instead showed a gamma distribution. Hence, a GLMM that assumes gamma distribution for the response variables was used. A log function was used as the link function. Mean BMI and monthly drinking rate per community (si/gun/gu) for 2008-2013, were set as the respective confounding variables and covariate, and a linear adjustment was made for annual effects to consider the effects of time. Smoking indices (current smoking rate, passive smoking rate at home, passive smoking rate at work) were used as the response variables, and we predicted associations with response variables with equal confounding variables and covariates.
Furthermore, a Bayesian hierarchical model was used with a GLMM to measure the effects of smoking indices of each community (si/gun/gu) and adjust for the correlations according to distance. The associations between community (si/gun/gu)-specific smoking indices and CVD were examined with random intercept and random slope, and the prior distributions of random intercept and random slope were assumed to follow a normal distribution with mean zero and an assumption that the inverse of variance follows a gamma distribution (parameter= 1, 5*10-5). Spatial correlation was adjusted for, with random intercept using the Besag-York-Mollie method [19], and all random intercepts and random slopes were assumed to be independent. Based on existing studies, the prior distribution of fixed effects was assumed a multivariate normal, which assumes variance to be a Gaussian Markov random field [20], and the posterior credibility interval (CI) was estimated based on the prior distribution. All analyses were performed using the R and R version 3.2.2 (https://cran.r-project.org/bin/windows/base/old/3.2.2/) for integrated nested Laplace approximation (INLA) [21]. More detailed formula and prior conditions are reported in Appendix 1.
RESULTS
Table 1 shows the descriptive statistics for the variables used in this study. Hypertension over 30 years was the most prevalent (18.2%) throughout the 6 years of study (2008-2013), while angina over 40 years was the least prevalent (1.9%). The distributions of prevalence varied widely across the country. The average smoking rate as an interest variable during 6 years was 25.2% (14.9 to 33.4%), with an average passive smoking rate at home of 11.8% (2.3 to 37.3%) and an average passive smoking rate at work of 29.2% (6.0 to 76.7%).
Table 2 shows the top and bottom 5 community health centers in terms of the prevalence of CVD and smoking indices (6-year average). Hypertension over 30 years was generally prevalent in the Gangwon province (21.72 to 22.35%) and less prevalent in community health centers in the Gyeongsang province and Jeolla province (14.03 to 14.72%). Stroke over 50 years was generally prevalent in community health centers in the North Gyeongsang province (4.68 to 5.33%) and less prevalent in the South Jeolla province (1.93 to 2.12%). Myocardial infarction over 40 years tended to be prevalent in community health centers in Seoul and the Gyeonggi province (2.12 to 2.25%) and less prevalent in community health centers in North Jeolla province (0.78 to 0.83%). Angina over 40 years tended to be prevalent in community health centers in Busan (2.88 to 3.25%) and less prevalent in the North Jeolla province (1.03 to 1.08%). In terms of smoking parameters, smoking rate did not vary by region. Community health centers in Jeju Island showed higher passive smoking rates at home (19.88 to 21.80%) while community health centers in Ulsan showed higher passive smoking rates at work (40.93 to 41.70%).
The national average RR of smoking indices on the prevalence of CVD is shown in Table 3, whilst the regional average (RR) for top and bottom 5 regions is shown in Table 4. All RR shown in Tables 3 and 4 represent RR when smoking parameters increase by 5%. As shown in Table 3, smoking rate was most highly associated with stroke (RR, 1.060; 95% CI, 1.022 to 1.100), and it was also significantly associated with hypertension (RR, 1.016; 95% CI, 1.004 to 1.029). Passive smoking at home was most strongly associated with a rise in the prevalence of stroke (RR, 1.037; 95% CI, 1.023 to 1.051), and its RR for hypertension, stroke, and angina (but not myocardial infarction) was statistically significant (RR, 1.010 to 1.037). Passive smoking at work had the greatest effect on stroke (RR, 1.013; 95% CI, 1.007 to 1.019) and also had a significant effect on hypertension (RR, 1.004; 95% CI, 1.002 to 1.006).

Relative risk of smoking/passive smoking (at home)/passive smoking (at work) by cardiovascular diseases (per 5% smoking or passive smoking rate)

Higher and lower 5 community health centers (si/gun/gu) of relative risk (RR) to smoking/passive smoking (at home)/passive smoking (at work)
Table 4 shows the top and bottom three community health centers in terms of the RRs of smoking indices, whilst Figures 1-3 are national maps showing the effects of community-specific smoking rate and passive smoking rate at home and work on the prevalence of CVD. RR of smoking rate for hypertension was high in community health centers in Gyeonggi and Gangwon provinces (RR, 1.042 to 1.046) and low in community health centers in North Jeolla and Gyeonsang provinces (RR, 0.985 to 0.996). Furthermore, smoking was strongly associated with stroke in community health centers in North Gyeongsang province (RR, 1.095 to 1.099) but weakly associated with stroke in community health centers in Busan and Ulsan (RR, 1.033 to 1.034). The RR of smoking for myocardial infarction was high in community health centers in Daejeon (RR, 1.029 to 1.029), but there were no regional differences in terms of low RR (RR, 0.973 to 0.983). The effects of smoking on angina was high in the Gyeonggi region (RR, 1.040 to 1.035) and low in the southern region of the Korean peninsula (North Gyeonsang, North Jeolla, and Jeju Island) (RR, 0.975 to 0.989). The distributions of RR of passive smoking at home and work were similar to those of smoking, and the community rankings were similar as well. Exposure to passive smoking at home (RR, 1.003 to 1.037) had greater impact on all 4 CVD than did exposure to passive smoking at work (RR, 0.998 to 1.013). In addition, the RR of all smoking indices tended to be higher in the Seoul metropolitan area (Figures 1-3).

Geographical distribution of relative risks (RR). The associations between 4 cardiovascular diseases (A) hypertension over 30 years, (B) stoke over 50 years, (C) myocardial infarction over 40 years, and (D) angina over 40 years and smoking.

Geographical distribution of relative risks (RR). The associations between 4 cardiovascular diseases (A) hypertension over 30 years, (B) stoke over 50 years, (C) myocardial infarction over 40 years, and (D) angina over 40 years and passive smoking at home.
DISCUSSION
This study analyzed the associations between regional smoking indices (smoking rate and passive smoking rate at home and work) and 4 CVD (hypertension in adults ≥ 30 years, stroke in adults ≥ 50 years, myocardial infarction in adults ≥ 40 years, angina in adults ≥ 40 years) with the KCHS data using a Bayesian hierarchical model. The results showed that smoking rate was significantly associated with stroke (RR, 1.060) and hypertension (RR, 1.016), and that passive smoking was significantly associated with stroke (RR, 1.013 to 1.037), angina (RR, 1.006 to 1.016), and hypertension (RR, 1.004 to 1.010) at the national level. Furthermore, exposure to passive smoking at home (RR, 1.003 to 1.037) had a greater impact on CVD than did exposure to passive smoking at work (RR, 0.998 to 1.013).
In addition to the national-level analysis, this study also analyzed the effects of smoking regionally in 253 regions, based on the assumption that specific regional features (geographical location, environment, income, population matrix) may have varying effects on the prevalence of CVD and may alter the effects of smoking on CVD. Our findings showed that RR of smoking varied across regions, suggesting that the effects of smoking on CVD varied across regions. The effects of smoking and passive smoking on all CVD were high in urban-industrial regions.
Nicotine, the primary ingredient of tobacco, is a sympathetic stimulant that acts on the central and peripheral nervous systems to stimulate the release of catecholamine and other neurotransmitters. It leads to cardiovascular consequences, such as elevating the heart rate, blood pressure, and cardiac output. Furthermore, it leads to a rise in low-density lipoprotein and decrease in high-density lipoprotein by mobilizing free fatty acids, thereby intensifying vasoconstriction and accelerating the progression of blood epithelial cell injury and atherosclerosis [1]. Carbon monoxide that is inhaled through smoking and passive smoking binds with hemoglobin to induce hypoxia, which increases the number of red blood cells and subsequently, blood viscosity thereby, having a direct impact on thrombosis and atherosclerosis [22]. Through these actions, smoking not only causes coronary artery diseases but also induces structural damage to arterial walls, and has also been associated with ischemic stroke caused by atherosclerosis and nontraumatic subarachnoid hemorrhage caused by formation and rupture of aneurysm [23].
This study presents data that support these biological causes. Although smoking indices were not statistically significantly associated with myocardial infarction and angina, the RR of smoking was above 1 for all CVDs at the national level (RR, 1.004 to 1.010). These results are in line with those reported by US and Norwegian studies on the relationship between smoking and heart diseases [3,4]. According to a foreign 20-year retrospective study, the greater the degree of exposure to passive smoking (cotinine concentration) was, the greater the risk for coronary artery disease and stroke [24]. Another study involving experimental and control groups revealed that the risk for myocardial infarction increases by 68% among passive smokers [25]. Furthermore, a meta-analysis of studies in 6 countries found that the RR for CVDs was between 1.2 and 1.3 [26]. In one study, passive smoking was found to be more of a health-hazard than direct smoking because smoke from the tip of a cigarette is more harmful than filtered smoke [27]. Our study also found similar results, and our results showed that the RR of passive smoking at home on angina (RR, 1.016) was higher than that of direct smoking (RR, 1.007).
Nevertheless, this study has a few limitations. First, this study merely analyzed associations and not the causal relationships between smoking indices and CVD. It is difficult to consider the temporal relations of smoking indices and CVD with KCHS data because this was a cross-sectional study [18]. In addition, this study did not consider risk factors at the individual level, so the results should be interpreted as a regional association and not as causal relations. Second, we were able to predict regions that were vulnerable to smoking and passive smoking but could not identify the biological and epidemiological causes of their vulnerability. More specifically, this study was able to verify that people with CVD in urban regions (including industrial regions) are relatively more vulnerable to smoking and passive smoking, but we could not shed light on the fundamental hypothesis and cause after adjusting for gender and age. We expect follow-up studies to evaluate the cause and to explain these results via diverse investigations and advanced analytical methods. Finally, this study did not consider social indicators. CVD may be influenced by social factors, such as income level and environment, but this study could not take these into considerations due to a lack of appropriate data. However, we partially compensated for this by examining potential regional features through performing a longitudinal analysis using a random effect model [28,29].
Despite a few limitations, this study offers several benefits. First, the large sample size of more than 1.5 million subjects increased the accuracy of our analysis in identifying the associations between CVD and smoking indices. Particularly, most of the 4 CVD studied were strongly associated with the smoking indices, with stroke having the greatest association with smoking and passive smoking (RR, 1.013 to 1.060). Second, by using an advanced statistical method, we were able to predict RRs not only at the national but also regional levels. The INLA method [20] predicts post-distributions as does the Markov Chain Monte Carlo method that is widely used in Bayesian statistical analysis, but the INLA is quicker and is easier for expanding distributions [30]. This study is valuable in that it simultaneously analyzed national and regional level risks by performing a Bayesian hierarchical analysis, considering regional correlations with the INLA.
Furthermore, the findings of this study can be used as the grounds for public healthcare policies that limit smoking and passive smoking nationwide to promote people’s health. Particularly, they will contribute to devising region-specific healthcare policies for regions populated by groups of people who may be vulnerable to CVD. In other words, there is a need to implement more active measures to limit smoking and to respond to the problems of indoor passive smoking in regions with high populations of people vulnerable to CVD, such as the elderly and financially unstable groups.
Acknowledgements
This work was supported by the Korea Ministry of Environment as Climate Change Correspondence Program (grant no. 20140013 10007).
Notes
The authors have no conflicts of interest to declare for this study.
SUPPLEMENTARY MATERIAL
Supplementary material (Korean version) is available at http://www.e-epih.org/.
References
Appendices
Appendix 1. Statistical analysis formulation
Gamma distribution
where Yi is a observed values with i= 1, 2, … N. We assumed Gamma distribution as a distribution of Yi with parameter μ and ν. And we set the prior distribution of log(ν) as logGamma distribution with parameter 1, and 0.01 each.
Integrated nested Laplace approximation (INLA)
The INLA algorithm, proposed by Rue al. (2009) [1] and Martino & Rue [2] is a deterministic algorithm for Bayesian inference rather than simulation based, MCMC (Markov Chain Monte Carlo). β is a set of regression 16 coefficient parameter and Q(ψ) is a sparse precision matrix known as Gaussian Markov random field (GMRF).
Regression model
β0 is an intercept of regression model and X is an interesting variable (smoking, indirect smoking indices, each). Bayesian hierarchical model is used with hyper parameters;ψ=(
Besag-York-Mollie (BYM) model
Considering N areas, each characterized by a set of neighbors Nj. And ajk is 1 if areas j and k are neighbors and 0 otherwise.