Skip to Main Content
Purpose

This paper aims to design a vulnerability assessment model considering the multidimensional and systematic approach to disaster risk and vulnerability. This model serves to both risk mitigation and disaster preparedness phases of humanitarian logistics.

Design/methodology/approach

A survey of 27,218 households in Pueblo Rico and Dosquebradas was conducted to obtain information about disaster risk for landslides, floods and collapses. We adopted a cross entropy-based approach for the measure of disaster vulnerability (Kullback–Leibler divergence), and a maximum-entropy estimation for the reconstruction of risk a priori categorization (logistic regression). The capabilities approach of Sen supported theoretically our multidimensional assessment of disaster vulnerability.

Findings

Disaster vulnerability is shaped by economic, such as physical attributes of households, and health indicators, which are in specific morbidity indicators that seem to affect vulnerability outputs. Vulnerability is heterogeneous between communities/districts according to formal comparisons of Kullback–Leibler divergence. Nor social dimension, neither chronic illness indicators seem to shape vulnerability, at least for Pueblo Rico and Dosquebradas.

Research limitations/implications

The results need a qualitative or case study validation at the community/district level.

Practical implications

We discuss how risk mitigation policies and disaster preparedness strategies can be driven by empirical results. For example, the type of stock to preposition can vary according to the disaster or the kind of alternative policies that can be formulated on the basis of the strong relationship between morbidity and disaster risk.

Originality/value

Entropy-based metrics are not widely used in humanitarian logistics literature, as well as empirical data-driven techniques.

Global warming can increase the higher frequency of natural disasters such as floods and landslides. These, in turn, can cause substantial damage to local livelihoods and civilian infrastructure (Lim et al., 2017; Simpson, 2017; Torres and Pereira, 2017). In this paper, we show that some human actions can magnify the damage of such natural disasters. For example, settling in high-risk areas is a decision made under scarcity conditions. Public policy oriented to mitigate risks must consider this scarcity for the planning of disaster risk mitigation as this shapes the vulnerability of households to disasters.

Risk can be defined as the combination of vulnerability, hazard and exposure (Wisner et al., 2004; Twigg, 2004; United Nations Office for Disaster Risk Reduction, 2015). Vulnerability to disasters conforms the human part in the concept of risk: is defined as the foreseeable consequences of a damaging event on entities such as human lives, health, wealth or environment. The level of vulnerability to natural disasters is a concept that requires operationalization to be measured. A good indicator of vulnerability must consider features that can serve to differentiate the vulnerable unit from those nonvulnerable. In this regard, we argue that vulnerability to disasters has four dimensions: economic (Cutter, 2012; Folke, 2006; Adger, 2006; Eakin and Luers, 2006), health (Djalante et al., 2020) social (Schoon and Bartley, 2008; Cutter and Finch, 2008; Rapeli, 2017), and geographical (Mattea, 2019).

Economic and educational dimensions are important to recognize the vulnerability level and may shape the ability to reactivate the livelihoods after the disaster (Cutter, 2012; Folke, 2006; Adger, 2006; Eakin and Luers, 2006). During a disaster, health and morbidity can magnify the deprivation costs and affect the response and recovery (Djalante et al., 2020). Social capabilities are important for generating resilience by creating cooperation means within a community regarding the disaster response (Schoon and Bartley, 2008; Cutter and Finch, 2008; Rapeli, 2017). Geographical vulnerability depends on household location; those located in high-exposure districts are more likely to suffer from catastrophes Mattea (2019). Vulnerability is affected by all the possible interactions between those dimensions. We will show that all these features configure disaster vulnerability, defining it as multidimensional. We argue that households with an unstable economy, high morbidity regarding both chronic and acute illness, low social capabilities and geographical exposure to disasters are more prone to suffer from disasters, whether they happen or not. This selection of vulnerability dimensions is based on a welfare perspective, specifically in Sen's (1999) capabilities approach. In consequence, a limited set of capabilities in a household results in lower welfare and higher vulnerability. This is because fewer capabilities are traduced in fewer opportunities and this makes the disaster response and recovery tougher. This approach has the advantage that defines welfare in function of not only on economic status but also on educational, health, geographical and social status.

This paper reconstructs the classification of risky households made by the surveyors, in function of all the dimensions listed above that shape multidimensional vulnerability to disasters. Using logistic regression, we will give evidence that shows that vulnerability is a multidimensional problem. Risk is mitigated through the construction of resilience, which is equivalent to reducing vulnerability (UNDRR, 2017), and through preparedness policymaking. We will additionally make an experimental benchmarking to rank vulnerability at the province/district level. For this purpose, we will use entropy-based metrics to make systemic comparisons between communities/districts (Smyth and Hai, 2012; Yoon et al., 2017; Fatemi et al., 2020). Previous works have applied neither multidimensional nor entropy-based metrics for vulnerability assessment (Dong et al., 2016).

We implement the methodology in two Colombian communities: Pueblo Rico and Dosquebradas. We observe high heterogeneity between the cases: Pueblo Rico has a relatively small urban settlement, and the rest is composed of indigenous communities like Embera Chami or Embera Katio; in contrast, Dosquebradas has urban districts that are densely populated and a small rural zone. Both communities face risks of landslides, floods and housing collapse. The research questions are as follows: Which factors are key for mitigating disaster vulnerability for Pueblo Rico and Dosquebradas? With this information, what can humanitarian logisticians do to mitigate risks and improve disaster preparedness policies? The main contribution of this study is the use of informational entropy to support decision-makers in prioritizing public intervention in the risk mitigation and preparedness phases of humanitarian logistics, measuring the vulnerability of communities/districts. Following Alexander (2002), humanitarian logistics analyses four phases for the management of a disaster: the preparedness, the response, the recovery and the risk mitigation phases. This paper only covers risk mitigation and preparedness phases as there are few approaches in these (Overstreet et al., 2011; Leiras et al., 2014). For the former, the results can be useful for assessing vulnerability determinants. In this regard, the results contribute to the long-run planning of resilience in communities/districts. For the latter, a vulnerability assessment can lead to optimal locations for prepositioning stocks for humanitarian aid and determining the actions to be pursued according to the type of disaster; this is especially helpful in the short run. Furthermore, the demand for humanitarian aid is a direct function of vulnerability: the most vulnerable communities/districts will have higher demand and thus higher deprivation costs (Olorumtoba et al., 2018).

The remainder of this study is structured as follows. Section 2 presents the relevant literature and the theoretical framework. Section 3 describes the data collection procedures and the entropy-based methods to estimate disaster vulnerability. Section 4 outlines the main results and discusses them. Section 5 addresses the implications of our results for policymakers and stakeholders of the humanitarian supply chain. Finally, Section 6 presents the conclusions and recommendations for future research.

Here, we focus on the theoretical basis behind the estimation of the entropy-based vulnerability of households to disasters. The study of vulnerability is complex because it is a multidimensional and multi-causal phenomenon. Its evaluation requires processing a large number of variables that adequately characterize it. In this section, we present different concepts regarding entropy and entropy-based methods applied to urban, productive and humanitarian logistics that build an entropy-based vulnerability metric.

First, we briefly review the literature related to entropy, information theory and urban, production and humanitarian logistics. Second, we construct the theoretical framework based on previous works on multidimensional disaster vulnerability assessment. Third, we describe a community/district perspective for disaster vulnerability reduction.

In essence, entropy is said to be an expression of the disorder, the randomness of a system or the lack of information about it (Gray, 2011; Dong et al., 2016). Its measurement is useful for making decisions based on information about the entire system (captured in informational terms by the probability distribution). Considering households' disaster vulnerability, this variable can be defined as a system. This system is composed of interactions between different dimensions of disaster vulnerability. In this study, the entropy for households' disaster vulnerability is estimated from an entropy maximization problem which is restricted to information about the different dimensions affecting the system (Jaynes, 1957). When we want to compare two systems, we use cross-entropy methods (Krauz and Tabrikian, 2019; Ho and Wookey, 2020). Cross-entropy is a measure based on entropy and calculates the differences between two probability distributions, thereby estimating the distance between them. Specifically, we use the divergence of Kullback–Leibler (DKL) to analyze and compare probability distributions (Bouhlel and Dziri, 2019).

In the case of urban systems, Purvis et al. (2019) carried out a systematic literature review and observed that entropy is useful for measuring spatial concentration, dispersion of population growth and diversity of urban land use. In urban logistics, which represents an important part of urban systems, entropy is useful for multi-criteria decision analysis. Guoyi and Xiaohua (2011) evaluated two logistics providers based on multiple criteria, establishing entropy-based weights for each. Greater weight was assigned to those criteria where experts had a more uniform opinion, thereby valuing the consensus. In the field of production logistics, entropy has been used as an indicator of the complexity of the production process (Guoliang et al., 2017; Deng et al., 2017; Zhang and David, 2018). More complex processes imply higher entropy and lower efficiency for general operation.

Following Wisner et al. (2004) and Twigg (2004), we consider vulnerability as the human dimension of disasters and the result of the interaction of physical, economic, social and environmental factors. Disaster risk can be defined as the combination of vulnerability, danger and exposure and expressed as a probability of loss of life, injuries or destroyed or damaged assets (UNDRR, 2015). In the classic “Pressure and Release” (PAR) model (Twigg, 2004), vulnerability is declared as a progression: from root causes to dynamic pressures and ultimately to unsafe conditions. Then, disasters happen when unsafe conditions meet dangers. Table 1 shows the basic definitions that will be considered in this study:

Table 1

Basic definitions regarding disaster risk

ConceptDefinition
VulnerabilityIs the foreseeable consequences of a damaging event on entities such as human lives, health, wealth or environment
HazardThe loss of life or injury, property damage, social and economic disruption or environmental degradation caused by a potentially damaging physical event, phenomenon or human activity
ExposureThe property, people, systems or other elements present in hazard zones
RiskThe combination of vulnerability, hazard and exposure. We consider risk as an outcome of the vulnerability system. High risk is a result of high vulnerability, hazard and exposure

Source(s): UNDRR (2015) and Twigg (2004) 

Although the PAR model succeeds in explaining vulnerability progression, vulnerability metrics fail to capture information systemically. For example, Salazar-Briones et al. (2020) proposed an integrated urban flood vulnerability index composed of social, economic and physical factors. Furthermore, the vulnerability can also be affected by health or geographical factors (Eakin and Luers, 2006; Djalante et al., 2020). Even though a vulnerability index is comprehensive or multidimensional, it is calculated by normalizing each component and then averaging them up into one indicator. For example, Salazar–Briones et al.'s approach (2020) outputs a summary index of vulnerability. However, this index cannot provide information about the distribution of risk among the cities because it considers only the average between observations. The main disadvantage of this approach is that decision-making will not be based on the entire distribution but only on the mean values, thereby ignoring the information of extreme values. To formulate a systemic indicator that shows a measure of the system, we propose an entropy method approach. For this reason, the cross-entropy or the relative entropy (as measured by DKL) is the selected variable for entropy calculation. In this way, it is possible to assess vulnerability for each individual based on the characteristics of the group and the interrelationships of the different dimensions of vulnerability. Figure 1 presents the theoretical model that we use for estimating the vulnerability of households.

Figure 1

A theoretical framework for household vulnerability assessment

Figure 1

A theoretical framework for household vulnerability assessment

Close modal

The selection of vulnerability dimensions is based on a welfare perspective: Sen's (1999) capabilities approach. From this perspective, vulnerability is associated with the set of capabilities of the population. The main idea behind this approach is that a limited set of capabilities in a household results in lower welfare and higher vulnerability. This is because fewer opportunities or capabilities can make disaster response and recovery tougher. This approach is multidimensional because it allows welfare to depend not only on economic status but also on educational, health, geographical and social status. Economic and educational dimensions are important to recognize the vulnerability level and may shape the ability to reactivate the livelihoods after the disaster (Cutter, 2012; Folke, 2006; Adger, 2006; Eakin and Luers, 2006). During a disaster, health and morbidity can magnify the deprivation costs and affect the response and recovery (Djalante et al., 2020). Thus, they are sources of vulnerability. Social capabilities are important for generating resilience by creating cooperation means within a community regarding the disaster response (Schoon and Bartley, 2008; Cutter and Finch, 2008; Rapeli, 2017). Geographical vulnerability depends on household location; those located in high-exposure districts are more likely to suffer from catastrophes. Thus, disasters can have significant spatial patterns (Mattea, 2019). All four dimensions mentioned above are very likely to be related to each other. The vulnerability may also be the result of all those possible interactions.

With this theoretical framework at the household level, we analyze the distribution of vulnerability metrics from a community/district perspective. Decision-making is better informed when we have tools for systemic diagnosis. Hence, we focus on the distribution of vulnerability within districts. Estimates for distribution are made with a statistical “rule of thumb” method: the kernel density estimation at the community/district level. This estimation is summarized via several graphs for community/districts to generate a systemic perspective of the entire distribution of vulnerability among households. Finally, we conduct student t-tests for significant differences between the vulnerability of districts (measured in terms of DKL).

The theoretical framework for household disaster vulnerability is applied for floods, landslides and collapses. One key assumption of this framework is that the vulnerability to those natural disasters has the same dynamics between units of analysis (Cutter, 2012; Folke, 2006; Adger, 2006; Eakin and Luers, 2006). Then, we can state the hypothesis formally:

H1.

There are similar vulnerability dynamics between disasters.

H2.

There are different vulnerability dynamics between disasters.

This hypothesis is theoretically supported by the relationship between welfare, capabilities and vulnerability to disasters. Our regression model tests the “similar vulnerability dynamics” hypothesis by determining the overall correlation of risk for different kinds of disasters for the entire sample.

The central aim of the humanitarian logistics discipline is to minimize the deprivation costs or the costs of suffering the lack of basic goods for disaster aftermath (Holguín-Veras et al., 2013). This is done by planning in four phases: risk mitigation, disaster preparedness, response and recovery (Overstreet et al., 2011; Leiras et al., 2014). To achieve this goal, the entropy-related literature has addressed the supply chain's diagnosis, the interaction between agents that compose a humanitarian response network and the performance of the institutions that provide support and supplies. Natural and anthropogenic disasters' damages are shaped by the level of vulnerability of the affected population. However, there is scarce literature that applies entropy as a framework to evaluate vulnerability regarding those disasters (Yoon et al., 2017). Humanitarian aid networks are special cases in which graph theory is used to study the flows between nodes and emulate a configuration based on the interaction of multiple actors involved in the humanitarian response (Comfort et al., 2009; Liyuan and Qingmei, 2014; Wu et al., 2013; Gomez, 2017). In network theory, entropy can be estimated from nodes, edges or topological measurements. At maximum entropy, the humanitarian response is disordered, every agent or institution is individualist, and there is a low density of cooperation (Wu et al., 2013). Comfort et al. (2009) determined that a network's entropy reaches its lowest value during a disaster and then increases over time. Therefore, it is necessary to focus on the preparedness phase to maintain a low entropy cooperation framework. Other metrics for measuring the performance of aid institutions are built based on the entropy theory, such as Theil's Index. It measures equity of the distribution of help and is especially useful when the minimization of deprivation costs alone is not sufficient to guarantee a low-inequity solution (Gutjahr and Fischer, 2018). The system is informing that maximum equity is reached at the maximum-entropy distribution of aid. Levner and Ptuskin (2015) also used the entropy method to identify vulnerability components within the supply chain related to disruptions, breaks, defects and errors. As is the case for humanitarian aid networks, risks are positively associated with entropy because it implies disorder in the humanitarian response. Entropy metrics depend on system attributes and complexity.

For the risk mitigation phase, the maximum entropy regression model allows the identification of those factors that are most related to disaster risk (Ben-Tal et al., 2011; Mateusz, 2017). Figure 2 shows how our procedure can contribute to risk mitigation.

Figure 2

A framework to estimate vulnerability from an a priori disaster risk classification

Figure 2

A framework to estimate vulnerability from an a priori disaster risk classification

Close modal

When we run a regression, we can extract the vulnerability component from what disaster risk is by definition. Hazard and exposure analysis imply other types of predictive work, which involve disaster forecasts other than population and housing predictors (e.g. see Intrieri et al., 2019; Tshimanga et al., 2016). There is compelling evidence that development efforts address the root causes of vulnerability and can reinforce national, community and individual resistance and resilience to disasters (Goldschmidt and Kumar, 2016). For example, the building materials for households are classic vulnerability drivers. Their improvement is economically and temporally expensive. Furthermore, the multidimensionality of the study allows us to draw alternative policies from the results and propose more cost-effective (in terms of time and economic cost) interventions that contribute to risk mitigation. Based on our results, for the community/districts analyzed, a territorial framework for disaster vulnerability mitigation can be established, allowing for targeted risk mitigation (López-Vargas and Cárdenas-Aguirre, 2017).

Regarding disaster preparedness and humanitarian, Ukkusuri and Yushimito (2008) point out: “humanitarian logistics cannot be improvised at the time of the emergency since very little can be done after a disruption occurs”. A preferred disaster preparedness strategy is the prepositioning of inventory and supplies (Tofigui et al., 2016). Prepositioning enhances the postdisaster response because it has an important impact on the distribution cost (Ukkusuri and Yushimito, 2008). This prior positioning must be based on community/district vulnerability assessment, such as those resulting from the proposed model. Furthermore, the type of goods must match the type of disaster (Olorumtoba et al., 2018). Although the population density in rural areas is low and the location of aid points can be far and dispersed, vulnerability tends to be higher because of rural poverty. Therefore, these areas can be expensive to supply to as more vulnerability leads to more demand. The type of materials that are prepositioned depends on the type of disaster; this discussion is further extended in the “Managerial and research implications” section. Nevertheless, “far from cities” rural areas should not be an exception or receive less aid (Gutjhar and Fischer, 2018). In summary, the results of this model can be used to match the identified vulnerability patterns with the configuration of the humanitarian supply chain and the inventory prepositioning strategies.

This section presents the methodology used for data processing and disaster vulnerability estimation at the household level. First, we explain the data gathering and processing methods. Second, we define the parameters and subscripts (see Table 2). Third, we explain the disaster vulnerability estimation methods in detail.

Table 2

Categories retrieved by the dataset

Dimension of vulnerabilityNumber of binary variables
Economic vulnerability41
Social vulnerability17
Health vulnerability23
Geographical vulnerability24
Total variables for MCA (all except geographical)81

Note(s): Without considering Geographical vulnerability binary variables, we have 81 variables

The data were collected by the Risaralda Health Department considering the framework of the Collective Action Plan, called “Plan de Intervenciones Colectivas” - PIC (for its Spanish acronym), that is used to prioritize the neediest communities in the region. The sample was obtained in 2017 using a simple random sampling method. The sample size was determined as a function of the population size estimated using the census data of the Solidarity and Guarantee Fund (FOSYGA), a health insurance system aimed at the general population. The advantage of this database is its representativeness rate for nonurban inhabitants. This representation is higher than the XVIII Official Population Census and the VII Housing Census (2018).

The data collected contains economic, educational, health and social variables, following the framework presented in Figure 1. Along with the household head's education achievement, the household's physical environment (material and status, wealth, income and assets) was considered a part of its economic vulnerability. A social vulnerability was composed of the set of social abilities of the household members: the use of neighborhood structure, attendance in local associations and group activities. For health vulnerability, the following variables were considered: chronic diseases, morbidity, mortality, disability, nutrition and medical attention. Finally, for geographical vulnerability, we retrieved data for the district to which a household belongs. However, more sophisticated spatial patterns were not analyzed.

The impact of natural disasters in both municipalities is different. While there are more floods in Dosquebradas, there are at least three times more landslides and almost double the collapse of homes in Pueblo Rico. In terms of urban development, Dosquebradas presents an increase in the extent of the urbanized territory (density of 2532.5 “hab./km2”). In Pueblo Rico, there is only a small urbanized city next to other districts or unified shelters where the rural population is spatially dispersed to different degrees (density of 25.5 “hab./km”). Figure 3 shows the territory that Pueblo Rico and Dosquebradas occupy in Risaralda.

Figure 3

Geographical areas of study

Figure 3

Geographical areas of study

Close modal

The Institute of Hydrology, Meteorology and Ambiental Studies (2019) reports rainfall with an annual variation of 2,500–9,000 mm and an average within 200–250 days of rainfall for Pueblo Rico and Dosquebradas. Although there is greater precipitation in Pueblo Rico (7,000–9,000 mm), there is less flooding risk reported in the survey. The more frequent landslides in Pueblo Rico can be explained mainly by the proximity between homes and mountains (an observation made in Google Earth Pro, 2019). Therefore, collapses are a more frequent phenomenon commonly caused by natural disasters because of the precarious construction of households.

The vulnerability dimension contains a high number of categorical and binary variables. Categorical variables were transformed into a set of additional binary variables to apply the multi correspondence analysis (MCA). Thus, Table 2 shows only binary variables. This approach was applied for reducing the dimensions of the original data set to a new set of linear indicators that can represent the maximum amount of data variances, such as in Filmer and Pitchet (2001), Ezzari and Verme (2012), Costa et al. (2013), Haregu et al. (2018), and Parchomenko et al. (2019).

3.2.1 Data processing methods

This section reviews the data processing methods. Table 2 shows a summary of the categories collected for each dimension of disaster vulnerability.

Geographical vulnerability variables are not considered in the MCA model because they cannot have multiple correspondences by definition: a household cannot be located in more than one place at a time. These variables are considered directly in the regression model as binary variables.

The MCA graph of the first two dimensions provides a graphic representation of the multiple correspondences between all the categories of the 81 binary variables. The MCA method can be summarized in three steps as follows.

First, the indicator matrix is recoded into a Burt matrix, where the indicator matrix is the original data set with the 81 binary variables defined as  Z. The Burt matrix is defined as  B=ZZ.

Second, we apply correspondence analysis in the Burt matrix (Rencher and Christensen, 2012). Then, we obtain the unadjusted principal inertias and the total inertia from a spectral or eigen decomposition of the Burt matrix. φl is the obtained eigenvalue. Then, λl=φl is defined as the principal inertia for dimension l, and total inertia is defined as  λ=lλl.

Third, we predict the numeric output for dimensions as a linear function of the standard coordinates. Equation (1) shows this estimation for household  i. Then, Ahl is the matrix of standard coordinates, Zih and is an element of the indicator matrix for the row i and column  h. A similar formula was reported by Ezzari and Verme (2012) for their MCA application to multidimensional poverty.

(1)

where Dil is the dimension l score for household i, and Ahl is the h rows column vector of standard coordinates for dimension l. Dimensions are linear functions of the standard coordinates, and they can be obtained by summing up every score and its respective category, divided by qφl.

Finally, information about disaster risk is contained in three variables that take binary values: the value of these indicators is equal to one if the household has been positively classified as a risk of flooding, landslide or collapse; otherwise, they are equal to zero. These variables reflect both the incidence of natural disasters and the level of exposure to them. Most households located in areas with recurring disasters have a positive score on the risk indicator. We expect households located in such areas to have a positive risk indicator, even if they have a larger set of capabilities. These binary variables are considered as dependent variables in a regression model. This regression model is estimated following the maximum entropy theorem, instead of making assumptions about the distribution of the stochastic error terms or the behavior of the risk classification.

3.2.2 Vulnerability estimation methods

We propose three regression models for each of the three types of disasters in the sample: floods, landslides and collapses. For multinomial discrete choice models, we obtain the probability distribution that maximizes the entropy for the dependent variable. Soofi (1992) established a correspondence between maximum entropy logit and maximum likelihood logit, where the first-order conditions of the former are equivalent to those for the latter. Nevertheless, in small samples, the resulting coefficients will not be the same. Instead, the maximum entropy logit gives a family of logit distributions that are part of the generalized maximum entropy logit distribution (Golan et al., 1996). In the experiments carried out by Golan et al. (1996) for the theorems of Soofi (1992) using large samples (for example, N > 1,000), the generic logistic distribution is always the one that maximizes the entropy. Furthermore, this is also true for the maximum likelihood logit that outputs the same parametric results as the maximum entropy logit. In our case, the binary choice model is a special case in which we observe a binomial distribution for the dependent variable. Based on Soofi (1992) and Golan et al. (1996), we selected the logistic distribution for the estimation of the regression models to obtain a maximum entropy estimation of the distribution of probabilities among the sample.

Instead of estimating a model with a large number of binary variables, we reduced the dimension of the covariables by applying the MCA method. In this regard, the regression models account for a lower number of independent variables. The following regression equation (2) presents the model after the application of the MCA method application. We kept the first two dimensions after the MCA algorithm because they explain 72.6% of the total variance in the dataset.

(2)

The dependent variable of equation (2), Yij, is the risk classification for the disaster j (j=1,2,3) for every household  i in the sample. The first variables, Di1 and Di2, account for the first two dimensions from applying MCA. The geographic district location variables are then considered as independent regressors (housing location can be endogenous, but this issue is beyond the scope of this study) as well as the risk classification for the other k (kj=1,2,3) types of disasters. A discussion of endogeneity regarding the risk classification for the other types of disaster is in  Appendix 2. We proceed to an estimation considering Di1 and Di2 as endogenous regressors. While this is important for the robustness of inference, we show that this change does not have a significant impact on prediction.

The structural model considering endogenous regressors is as follows:

(2.1)
(2.2)
(2.3)

The selection of instruments was based on the assumptions of the two-step estimator for the logit model with endogenous regressors (Avery, 2005). First, corr(Xi,ϵi)=0, and then corr(Di1,Xi)0 and corr(Di2,Xi)0. It is always better if there is a high correlation between instruments Xi, and instrumented variables Di1 and Di2. We test the assumptions and the exogeneity of the regressors.

The next step is to recover the vector of probabilities for risk classification. This is the distribution of the probability for a positive risk classification, which is estimated at the household level. It is important to have this information to generate tools that can provide insights based on the entire distribution, instead of a single metric summarizing the information for a territory. This distribution is later extracted for each territory in the sample. The following equation (3) presents the method used to estimate these probabilities:

(3)

These probabilities are estimated using the parameters obtained from the logistic regression (with endogenous regressors) for all households in the sample. The distribution of probabilities for the district (Equation 4) and the households (Equation 5) are as follows for every type of disaster j:

(4)
(5)

At this point, we must note two distributions: the maximum-logistic probability distribution at the community/district level and the binomial probability distribution at the household level. With both distributions (4) and  (5), we can define the risk metric as a divergence from a low-vulnerability controlled scenario. This scenario is constructed with a simulated distribution for both (4) and  (5) for each community/district. This algorithm is known as PageRank in several branches of data science literature (Sardhara and Lakhataria, 2019; Mo and Lee, 2019). Equation (6) shows the experimental setting:

where low_riskcj is normally distributed with a mean of μ=0.10 and a standard deviation of σ=0.01, with the sub-sample sizes of Nc, and N1+N2,,Nc=N. N is the total sample size.

The next step is the cross-entropy calculus. We can alternatively estimate entropy for a community/district, but this estimation alone is difficult to compare between the districts without having a base scenario, which in this case is simulated as a low-vulnerability scenario as depicted above (Krauz and Tabrikian, 2019; Ho and Wookey, 2020). The following measure is known as the Kullback–Leibler divergence (DKL), which is used to capture systemic divergences between two different distributions of probabilities. This measure can be paired or unpaired. Here, we use the simplest version. Since we have a binary choice classification, it is measured in bits.

(6)

The two inputs for this measure are the two probability distributions, P^cj and  Low_Vulnerabilitycj. Divergence is obtained by summing across the binomial probability distribution (5), and thus, is estimated at the household level. The divergence is greater if P^cj and Low_Riskcj show very different patterns and information. For this case, low_riskij is assigned to a low-vulnerability simulated distribution, which is used as a base scenario. Thus, a greater value of  DKLij will show high vulnerability patterns (or high divergence from low-vulnerability scenario). This divergence metric captures systemic information and is constructed based on the multidimensional perspective of disaster vulnerability.

To build a community/district perspective for vulnerability, we output a graph of the kernel density for DKL metrics. We use the divergence from households within a community/district to generate a density of divergence from low-vulnerability scenarios. The following formula is computed for kernel density:

(7)

In equation (7), wi are weights for the sample. As the sample is random, those weights are all equal to 1. Then,  r=iwi and  d drive how many values are included in estimating the density at each point. We use a classical rule of thumb to estimate  d, which implies minimizing the mean integrated squared error (Silverman, 1986).

With the DKLcj estimation at the household level, we conduct a set of student t-tests of means between the combinatory of 24 communities/districts that support the conclusions drawn from the graphics. The methodology for these tests is described in  Appendix 4.

Figure 4 shows the step-by-step methodology and the value added of each step and estimation:

Figure 4

Summary of the empirical methodology to address disaster vulnerability

Figure 4

Summary of the empirical methodology to address disaster vulnerability

Close modal

From step 1, MCA analysis, we can also condense all the information in the dataset into two variables, thereby simplifying the regression model and obtaining parsimonious estimates of vulnerability (defined by the probability of facing risks for landslides, floods or collapses). The experimental setting supports the postregression estimation of DKL. The more the divergence from a low-vulnerability scenario, the greater the household's vulnerability. With kernel densities of DKL, we make both a graphic analysis and a nonparametric analysis with a student's t-test of means (results are shown in  Appendix 4).

In this section, we present the results of the MCA. We simulated through MATLAB integrated software a basic scenario with a low probability of risk, a normal distribution with a mean value of 0.10 and a standard deviation of 0.01. This simulation was our comparison of the vulnerability distribution. We repeated the procedure for each community/district. Using the simulated and empirically estimated vectors of probabilities, we calculated DKL. Finally, we plot the density of the DKL for the three cases: collapses, floods and landslides, to obtain a district perspective of vulnerability (non-parametric t-student tests of means are available in  Appendix 4).

MCA provides continuous linear indicators and identifies associations between the original variables of our data set. Initially, the predictors contained many categorical variables. Therefore, the application of a regression model with a large number of categorical variables provides results that can be difficult to interpret, and there may also be multicollinearity between the variables. Thus, statistical inferences made about the covariables may not be reliable. Table 3 shows the MCA results.

Table 3

Multiple correspondence analysis

Dimension (Di)Principal inertiaProportion of varianceCumulate percent
D1 (Wealth: assets, household status, access to services)0.006866.0466.04
D2 (Chronic diseases and related social support groups)0.00066.4272.46

In Figure 5, the horizontal axis represents the number of dimensions retained, while the vertical axis shows the principal inertias. We observe that the first two dimensions make the form of an elbow. This method, called “the elbow method” is used for MCA and clustering models to determine the optimal number of dimensions (or clusters) to be retained. In the second dimension, the principal inertias show a significant decrease. We can argue that every additional dimension that we consider will have a negligible impact on the total variance explained, and instead, will lead us to a more complex interpretation of the data.

Figure 5

Scree plot of eigenvalues after MCA

Figure 5

Scree plot of eigenvalues after MCA

Close modal

The size of our sample of households was 27,218, and the total inertia was 0.01. As noted before, we decided to keep the first two dimensions because both dimensions of the MCA analysis have a cumulative percentage of explained variance (which is equal to principal inertia divided by total inertia) that exceeds 72% of the original dataset. D1 had the highest proportion of variance value concerning the 81 binary variables, and D2 added information that captures the information for the remaining variables. The first dimension, D1, represents 66.04% of the variance ratio (according to the resulting principal inertias). This method allows maintaining the multidimensionality of the original data set while reducing its dimensionality. The dimensions were named following the obtained row scores for the standard coordinates (Ahl in Equation 1). Figure 6 plots such standard coordinates in a (D1,D2) graphic space. This MCA plot is a graphic representation of the multiple correspondences between 162 categories of the 81 analyzed binary variables.

Figure 6

MCA for dimensionality reduction (standard coordinates)

Figure 6

MCA for dimensionality reduction (standard coordinates)

Close modal

Figure 6 graphically shows the results from the MCA method. Each point represents a category (out of 162 categories) and is plotted as a function of its standard normalization scores. First, we found a cluster near the (0,0) point where several categories have multiple correspondences. The blue circle represents all the categories associated with D1: negative values of D1 are related to high morbidity outcomes for households, while positive values are related to good household status and good practices of food cooking and waste management. The horizontal axis of Figure 7 presents categories ranked from highest to lowest by their scores. D1 takes values according to these scores. Also, fewer morbidity indicators in D1 mean better household status. These results suggest that this component is a representation of the economic and health dimensions of well-being. The small scores for social dimension variables show that there is a small association between D1 and social dimension. D1 is numerically constructed with economic, health and social variables. Thus, we decided to interpret D1 as “Multidimensional Wealth” (following Ezzari and Verme, 2012), which is also related to well-being in the sense of Sen (1999).

Figure 7

Most relevant categories in D1 (horizontal) and D2 (vertical)

Figure 7

Most relevant categories in D1 (horizontal) and D2 (vertical)

Close modal

On the contrary, negative values of D2 are strongly related to chronic diseases that are less sensitive to changes in economic wealth or other variables affecting D1 (as shown in Figure 7). Some chronically ill people are more likely to belong to many kinds of social groups (the vertical axis of Figure 4 shows these relationships), while this characteristic also has a small incidence in D1 or multidimensional welfare. This aspect suggests that D1 alone fails to represent the multidimensionality proposed in the theoretical framework (economic, health and social vulnerability dimensions). We compensate for this issue by taking D2 as a complementary variable that not only captures chronicity aspects for health dimension but also captures social one which is associated with chronic illness. Thus, D1 and D2 together allow us to reduce the dimensionality of the original data set without losing multidimensional information for disaster vulnerability assessment.

In Table 4, we present the results of the margins of logistic regression evaluated at means for landslides, floods and collapses.

Table 4

Maximum-entropy logistic results for risk classification

Equation(1)(2)(3)
Independent variablesLandslidesFloodsCollapses
D1 (endogenous)−0.013***−0.004*−0.014***
(0.002)(0.002)(0.002)
D2 (endogenous)−0.0000.0000.001
(0.001)(0.001)(0.001)
Floods(a)0.096*** 0.115***
(0.010) (0.034)
Collapses(a)0.212***0.097** 
(0.019)(0.022) 
Landslides(a) 0.068***0.186***
 (0.013)(0.017)
ROC0.860.880.85
Accuracy (%)96.51%96.94%96.90%
Observations27,21827,21827,218

Note(s): Robust standard errors at the district level are in parentheses. We estimated them to evaluate whether there were heteroscedastic patterns between the districts. The geographic district variable's output was suppressed, but it is available in  Appendix 3. P-values associated with t-tests for individual significance of coefficients: ***p < 0.01, **p < 0.05, *p < 0.1

The results of the model for each case – landslides, floods and collapses – had accuracies of 96.51%, 96.94% and 96.90%, respectively. The area below the ROC curve also suggested that a reliable level of fit was achieved, thereby allowing reliable predictions from those estimators. In terms of inference, the estimation of the model with endogenous regressors leads us to a robust significance of the effects. The model for landslides predicts that an increase in 1 unit of D1 (wealth, assets, access to education, housing material and status, access to services and morbidity) would cause a statistically significant decrease of 1.3% in the probability of facing landslides, maintaining all the other factors unchanged. The model for floods predicts a decrease of 0.4% in the probability of facing floods, given the same change in D1. This is not statistically significant, so it may be the opposite for some cases (confidence interval at 95% [−0.0088681, −0.0005697]). Finally, the model for collapses predicts a statistically significant decrease of 1.4% in the probability of facing collapses. The highest effect was for the collapses, which could be a completely foreseeable disaster.

For D2 (chronic diseases and related social support groups), the effects are not significant for all the sample sizes (it is possible to have significant effects on subsamples). A detailed endogeneity discussion for disaster risk classification is provided in  Appendix 2. Our results are robust to the endogeneity of D1 and D2. The correlation matrix of the regressors in Table 4 is reported in  Appendix 3.

Among the dimensions of the set of capabilities, the risk classification is largely explained by wealth: assets, household construction material, state of the home, access to services and morbidity contained in D1. Poor households are built with unstable material and are located in at-risk areas. On the contrary, neither the presence of chronic and infectious diseases nor does belonging to social groups of D2 has strong implications in the risk classification. This rejects the hypothesis constructed by Schoon and Bartley (2008), Cutter and Finch (2008), and Rapeli (2017). In relative terms, collapses are more sensitive to changes in D1 than other types of disasters. This may be explained by the fact that this type of disaster is strongly related to vulnerability. Consequently, to avoid housing collapses, it is necessary to improve the housing infrastructure, access to services and reduce morbidity. This is the same as in the case of landslides. Nevertheless, it is important to give differential treatment to floods. For Pueblo Rico and Dosquebradas, vulnerability to floods is not subject to economic, social, educational or health vulnerability.

Regarding D1 morbidity categories associated with lower wealth, education and poor practices of food cooking and waste disposal (see Figure 7), higher morbidity is associated with a high vulnerability for landslides and collapses. In this sense, the optimal policy should not focus only on the infrastructure but also consider morbidity to improve households' resilience (Djalante et al., 2020). An important result is that if a household is vulnerable to one type of disaster, it is very likely that it will be vulnerable to the other types. Thus, disasters are statistically related to each other. This result favors the “similar vulnerability dynamics” hypothesis for the three types of disasters, which means that landslides, floods and collapses may have similar vulnerability dynamics. Thus, we cannot reject H1 only based on our theoretical framework. Floods seem to be exogenous regarding the dimensions of disaster vulnerability. Therefore, variations in D1 and D2 may not affect the probability of facing floods.

Finally, the results for community/district control binary variables show that there are systematic differences in risk patterns between districts. Therefore, in the next section, we will discuss the results on divergence or level of vulnerability from a district perspective.

The strongest implication of these results is that disaster vulnerability is anthropogenic (at least for landslides and collapses). According to the theory, vulnerability is not only a result of interaction between humans and their physical context but also their economic and health aspects. However, disaster vulnerability does not seem to be strongly affected by differences in social capabilities or chronic illness, as seen in the results for D2. Consequently, a population with low human capital, low income, high morbidity and poor housing status tends to have greater disaster vulnerability regarding landslides and collapses. On average, people with these characteristics are those excluded from capitalist development, like indigenous communities. Morbidity is another vulnerability driver that comes with poverty, as suggested by the MCA results. Policies may have a quicker impact on morbidity than on poverty, considering that poverty reduction may be slower. Table 5 shows a legend for districts that is useful to understand the next subsection on vulnerability patterns from a district perspective.

Table 5

Community/districts subsamples and nomenclature

NomenclatureNameDefinition
Pueblo Rico
KL1Urban Pueblo RicoDistrict
KL14Rural Pueblo RicoDistrict
KL15Embera Chami Zone 1Community
KL16Embera Chami Zone 2Community
KL17Embera Chami Zone 3Community
KL18Embera Chami Zone 4Community
KL19Embera Katio Zone 1Community
KL20Embera Katio Zone 1Community
KL21Santa Cecilia 2District
KL22Santa Cecilia 3District
KL24Villa Clareth 2District
Dosquebradas
KL2Commune 1District
KL3Commune 10District
KL4Commune 11District
KL5Commune 12District
KL6Commune 2District
KL7Commune 3District
KL8Commune 4District
KL9Commune 5District
KL10Commune 6District
KL11Commune 7District
KL12Commune 8District
KL13Commune 9District
KL23Rural DosquebradasDistrict

4.3.1 Pueblo Rico households' distribution of divergence

DKL is always positive by definition. Our results for the three types of disasters and for all households in the sample are also positive. According to Table 6, the maximum divergence is 3.334. We observe smaller mean, median and variance indicators. Thus, we can state that disaster vulnerability is distributed around small values, with few atypical high-vulnerability cases. These divergence estimations also allow us to explore the vulnerability patterns between districts for each type of disaster.

Table 6

Descriptive statistics from Kullback-Leibler divergence

Disaster KLDMeanMedianVarianceMinMax
Landslides0.1170.0910 .0580.0003.334
Floods0.1140.1050 .0330.0003.163
Collapses0.1170.0920 .0520.0003.291

According to Figures 8–10, we observe an overall small divergence (0–0.3 bits) from the low vulnerability–simulated probability distributions for the three types of disasters. This evidence suggests that in Pueblo Rico province, households are more vulnerable to floods and collapses. The vulnerability to landslides is relatively small. Table 7 provides insights into the proportion of households with more than 0.3 bits of divergence in the sample.

Figure 8

Density of Kullback–Leibler divergence for KL1, KL14, KL15, and KL16

Figure 8

Density of Kullback–Leibler divergence for KL1, KL14, KL15, and KL16

Close modal
Figure 9

Density of Kullback–Leibler divergence for KL17, KL18, KL19 and KL20

Figure 9

Density of Kullback–Leibler divergence for KL17, KL18, KL19 and KL20

Close modal
Figure 10

Density of Kullback–Leibler divergence for KL21, KL22, and KL24

Figure 10

Density of Kullback–Leibler divergence for KL21, KL22, and KL24

Close modal
Table 7

The proportion of households with more than 0.3 bits of divergence in Pueblo Rico

Community/DistrictLandslidesFloodsCollapses
Pueblo Rico
KL1Urban Pueblo Rico0.670.690.66
KL14Rural Pueblo Rico0.200.220.22
KL15Embera Chami Zone 10.180.180.18
KL16Embera Chami Zone 20.170.180.18
KL17Embera Chami Zone 30.120.120.11
KL18Embera Chami Zone 40.100.110.11
KL19Embera Katio Zone 10.210.230.20
KL20Embera Katio Zone 20.110.120.11
KL21Santa Cecilia 20.260.280.26
KL22Santa Cecilia 30.170.190.17
KL24Villa Clareth 20.050.050.04

Note(s): Values are in percentage

Representing indigenous communities, districts KL15-KL20 are less vulnerable to landslides but highly vulnerable to floods and collapses. Our evidence suggests that in Pueblo Rico, general disaster vulnerability is higher for floods and collapses. This may be explained by the geography of the territories. Along similar lines, there are relatively few “high vulnerability” cases within Pueblo Rico communities. However, almost every community has such cases, so it is reasonable to think that high vulnerability cases are spatially dispersed in Pueblo Rico.

Based on Figures 8–10, for the case of the districts KL1, KL14, KL22, and KL24, the pattern is the same as that for communities KL15-KL20: high vulnerability to floods and collapses. KL24 is an exception with lower vulnerability to collapses but high vulnerability to floods and landslides. It is important to conduct a deeper analysis of this district to explain these results. The “similar vulnerability dynamics” hypothesis does not seem to be true when we disaggregate the results to the communities/districts level. Thus, we can reject H1 in favor of H2 at this level.

It is important to note that we have higher divergence from low-vulnerability scenarios in Embera Chami Zone 2 (KL22) and Santa Cecilia 3 (KL16) for landslides (see  Appendix 4: mean tests for evidence from formal statistical tests show the t-statistics to be positive and significant for the columns KL22 and KL16). These are cases with lower population densities, which face higher vulnerability than highly populated areas. The obtained results favor the hypothesis that low populated areas can face high risk (with higher vulnerabilities) as densely populated areas. However, rural risk mitigation can be challenging because of the cost of aid supply in terms of transport and the fact that higher vulnerability leads to greater demand requirements.

Stock prepositioning, as a strategy for disaster preparedness, in rural areas can be a way to cover the vulnerability gap in the short term. More stocks and materials in highly vulnerable districts can be located at strategic points where logisticians can start a delivery without losing materials. In the long run, humanitarian logisticians should also consider mitigating risks by reducing vulnerability via improving infrastructure, access to services, reducing morbidity and getting information about what risks can and cannot be mitigated, for example, floods that are exogenous to dimensions of vulnerability.

4.3.2 Dosquebradas households' distribution of divergence

For Dosquebradas, Table 8 gives us insights about the proportion of households with more than 0.3 bits of divergence in the sample:

Table 8

Proportion of households with more than 0.3 bits of divergence in Dosquebradas

Community/DistrictLandslidesFloodsCollapses
Pueblo Rico
KL2Commune 13.693.523.70
KL3Commune 104.274.284.27
KL4Commune 110.490.510.49
KL5Commune 120.540.540.54
KL6Commune 25.205.215.21
KL7Commune 31.521.521.52
KL8Commune 40.510.510.51
KL9Commune 51.081.101.08
KL10Commune 60.360.360.36
KL11Commune 70.530.530.53
KL12Commune 84.974.974.95
KL13Commune 95.505.555.48
KL23Rural Dosquebradas1.601.681.63

Note(s): Values are in percentage

Based on Figures 11–13, we observed a lower vulnerability to landslides for KL3, KL6, KL10, KL11, KL12, and KL13. Relatively to Pueblo Rico's communities/districts, evidence suggests that vulnerability, regarding landslides, floods and collapses, is higher (see  Appendix 4 for the mean tests). Landslides are more frequent in Dosquebradas. Some districts face minimum vulnerability to floods (KL2, KL7, and KL8), but a higher vulnerability to landslides and collapses.

Figure 11

Density of Kullback–Leibler divergence for districts KL2, KL3, KL4 and KL5

Figure 11

Density of Kullback–Leibler divergence for districts KL2, KL3, KL4 and KL5

Close modal
Figure 12

Density of Kullback-Leibler divergence for districts KL6, KL7, KL8, and KL9

Figure 12

Density of Kullback-Leibler divergence for districts KL6, KL7, KL8, and KL9

Close modal
Figure 13

Density of Kullback–Leibler divergence for districts KL10, KL11, KL12, KL13 and KL23

Figure 13

Density of Kullback–Leibler divergence for districts KL10, KL11, KL12, KL13 and KL23

Close modal

The low vulnerability districts (with an average of 0.1 bits of divergence from low vulnerability simulated scenarios) are KL10 and KL3. This can also be seen in the mean test in  Appendix 4. The distribution of divergences suggests that Pueblo Rico had a slightly lower and more scattered (between households) level of vulnerability than Dosquebradas. While Pueblo Rico's vulnerability is higher for floods and collapses, Dosquebradas districts have similar vulnerability patterns for all three disasters (except for KL2, KL4, KL7, KL8, KL9, and KL23). Tables 7 and 8 show that there are more highly vulnerable atypical cases in Dosquebradas than in Pueblo Rico.

For both Pueblo Rico and Dosquebradas, although the risk mitigation recipes may be similar, the results (Figures 8–13) suggest that there is a need for specific materials according to the aftermath of disasters (Shao et al., 2020). Regarding floods, households may suffer from physical damage, but people's lives may be hardly exposed. This kind of disaster may not cause death, but it may cause significant losses in housing. For dealing with this aftermath, we recommend the prepositioning of food and beverage supplies and materials for shelters. We consider that epidemic-prevention drugs are important because of acute diarrheal disease and foodborne illness prevalence. In  Appendix 1, we observe a greater prevalence in Pueblo Rico. For landslides, we have different aftermath. We expect a higher number of deaths. Therefore, we recommend prepositioning rescue materials and assigning rescue teams within the communities/districts. This is important if we want to reduce the risk of those disasters. Collapses could be a consequence of landslides or floods. However, they could also happen by other causes, like the poor housing condition. Following the proposed framework, material prepositioning may also help in dealing with collapses, even if they do not happen after another type of disaster. This is how we can approach resilience for Pueblo Rico and Dosquebradas.

On the one hand, households' vulnerability estimation is important because decision-making can be made more informed on this basis. Furthermore, following this new approach, further academic research can propose alternative data-driven techniques to improve logistics operations. On the other hand, households' disaster risk can be defined as an open system in practice, that is, this risk's configuration may be affected by external forces. Thus, policies focused on reducing disaster risk vulnerability may indeed positively affect this system. We show that entropy is a useful tool to work with and summarize trends for multidimensional data. This work also generates evidence that shows that disaster vulnerability is multidimensional. Humanitarian logistics should consider this multidimensionality when planning for risk mitigation and preparedness.

Based on the graphic results and the student t-tests available in  Appendix 4, we can establish a vulnerability ranking between the communities/districts in Pueblo Rico and Dosquebradas. The more vulnerable communities/districts are those that have the highest divergence. Table 9 presents this ranking:

Table 9

Risk ranking of communities/districts in Pueblo Rico and Dosquebradas

OrderLandslidesFloodsCollapses
1KL23KL1KL23
2KL2KL22KL2
3KL4KL2KL8
4KL5KL23KL22
5KL7KL4KL7
6KL8KL9KL10
7KL22KL14KL4
8KL16KL15KL6
9KL13KL17KL11
10KL19KL21KL14
11KL3KL3KL19
12KL6KL13KL3
13KL11KL5KL13
14KL12KL12KL1
15KL21KL19KL5
16KL1KL11KL9
17KL9KL16KL12
18KL10KL6KL15
19KL14KL10KL16
20KL24KL8KL17
21KL15KL18KL18
22KL17KL20KL20
23KL18KL24KL21
24KL20KL7KL24

Note(s): Yellow highlighted cells belong to Dosquebradas and green highlighted cells, to Pueblo Rico

After disasters pass, risk mitigation policies aim to reduce the vulnerability to future disasters with a primary focus on restoration, returning life to normal and building back better (Goldschmidt and Kumar, 2016; UNDRR, 2017). We found two channels through which policymakers can reduce vulnerability to disasters. First, risk can be mitigated by reducing the vulnerability of households by improving infrastructure that can withstand landslides and collapses and investing in better household materials to strengthen resistance to collapses and landslides. For floods, we can only propose preparedness strategies based on our results and the context of our study (because the model predicted that is very unlikely that risk of floods can be mitigated by reductions in vulnerability). Along similar lines, we propose a constant refinement and effective inspection of building codes. If these do not exist, we recommend building it following UNDRR (2017) to create resilient buildings. Second, morbidity aspects of health have a strong effect on vulnerability to such disasters. Specifically, a higher incidence of acute respiratory infections (ARTIs), acute diarrheal disease (ADT) or malaria is strongly related to greater disaster vulnerability. Our recommendation is to consider patterns of morbidity as an alternative source of policymaking for disaster risk mitigation. Economic wealth and morbidity vary in the same direction for Pueblo Rico and Dosquebradas. Therefore, further research should focus on this relationship to mitigate risks through reductions in vulnerability associated with morbidity. Social capabilities and chronic illness indicators do not seem to be related to disaster risk reduction as their impact on vulnerability is not significant. Except for the two last indicators, the evidence here partially supports our main hypothesis: vulnerability is determined by a set of capabilities composed of economic, educational and health outcomes (Sen, 1999). Thus, a better or greater set of capabilities is important for disaster vulnerability and risk mitigation.

The number of households in a vulnerable situation is typically used as a criterion for location of aid. Thus, vulnerability can shape the decisions about where to preposition supplies. Supplying aid can be expensive if demand points are spatially dispersed (Gutjhar and Fischer, 2018). Location decisions about inventory prepositioning are important for reducing the cost of supplying to disperse rural areas with high vulnerability to disasters (Ukkusuri and Yushimito, 2008; Tofigui et al., 2016). In this regard, our results can serve disaster preparedness planning as demand can be estimated between and within communities/districts. Humanitarian logisticians can locate better, distribute inventory and stock before the disaster. Consequently, the cost of supplies is reduced if preparedness policies adopt inventory prepositioning based on estimated disaster vulnerability.

Prepositioning can also cover the high costs of supply to rural areas and indigenous communities (Ukkusuri and Yushimito, 2008). Thus, decision-makers should consider prepositioning materials to supply dispersed vulnerable areas far from the central warehouses. Furthermore, they should not preposition aid in vulnerable locations at risk of being impacted by the disaster (Goldschmidt and Kumar, 2016). The type of prepositioned supplies should consider the characteristics and consequences of the disaster. This has been a gap in humanitarian logistics research (Sabbaghtorkan et al., 2020). For example, in the case of landslides and collapses, the aftermath response must be fast using search, rescue and excavation equipment. In the case of floods, there are greater needs for shelters, food and health services to the population (de Brito Jr et al., 2020). More detailed evidence and routing problems are left for further investigation because there are few examples of routing in low densely populated rural areas (Huang et al., 2018).

The ranking in Table 9 serves as a policy tool. Humanitarian logisticians can pay more attention to highly vulnerable communities/districts. In both risk mitigation and disaster preparedness phases, the information about which households and which districts are more vulnerable can be used to improve the impact of humanitarian operations. By knowing what type of disaster will affect a community/district, the type of disaster vulnerability can be used for thorough public preparedness drills to increase emergency and management capacities. This strategy involves strengthening disaster preparedness for response, taking action in the anticipation of events and ensuring capacities are in place for effective response and recovery (UNDRR, 2017).

The methodology used in this study was extensive. To replicate the methodology, humanitarian logisticians should work with population and housing data available around the world in the format of household surveys. Disaster risk classification in household surveys is essential. Even if one has disaster risk classification alone for a set of households, one can use this information to replicate DKL and obtain a community/district ranking with a probabilistic approach. While we do work with multidimensional data, the vulnerability can be also estimated using a regression with socio-economic data or wealth and assets data. We argue that this can work because of the robust regression results for wealth predictors. This branch of literature can be further developed.

In this work, we show that entropy theory and cross-entropy metrics can provide insights into the vulnerability dynamics of a closed system, like the households in Pueblo Rico and Dosquebradas. We evaluated the cross-entropy divergence using Kullback–Leibler divergence (DKL) metrics between our measured distribution and the one resulting from a simulated low-vulnerability scenario. These metrics show which households are more vulnerable to disasters using a systemic and multidimensional perspective. This aspect makes the metrics valuable for decision-making. However, the information considered in the disaster vulnerability model was the maximum possible, given that the database could retrieve only categorical variables. Nevertheless, the MCA method showed great capacity to process a large number of categorical variables and summarize them in two dimensions. These dimensions together explained 72.46% of the variance of the original database without losing the multidimensional perspective. Thus, the maximum-entropy logistic regression model can estimate the distribution of risk probabilities as a function of these dimensions at the household level.

We analyzed households from three types of communities/districts: urban, rural and rural indigenous communities. We found overall “similar vulnerability dynamics” for all the samples. However, when we tested this same hypothesis at the community/district level, different patterns for vulnerability were found. Thus, we can reject our hypothesis on similar vulnerability dynamics at more spatially disaggregated levels. This issue is due to the limitations of quantitative research methods, that is, the results obtained by these methods can be invalidated for specific subpopulations. Further qualitative or mixed research methods are needed to gain a deeper understanding of the disaster vulnerability dynamics within communities/districts. Similarly, the identified vulnerability patterns for each type of disaster can be used for communities'/districts' development and enhancing resilience against disasters. Furthermore, the multidimensional vulnerability perspective can provide insights on how to improve well-being and capabilities (Sen, 1999). Multidimensional vulnerability is transversal to adverse situations, like poverty and democratic representation, among others, which are not reduced due to disasters. Future research can investigate this aspect.

Following Wisner et al. (2004) and Twigg (2004), we consider vulnerability as the human part (or anthropogenic) of disaster risk. The dependent variables analyzed were risk classifications for landslides, floods and collapses. Therefore, we first tried to explain the three components of this risk classification: vulnerability, hazard and exposure. We observed that our data covered only the vulnerability part of the risk configuration. Nevertheless, vulnerability alone was a good predictor as we had good measures of model accuracy. However, the model is far from perfect. The obtained results from community/district analysis show that we need more sophisticated spatial data, like the georeferentiation of households, which can account for interactions with hazard and exposure. These aspects can be included in future variants of this model to improve its performance by fitting the data and generating entropy-based insights.

Finally, as proposed in the “Managerial and research implications” section, the results of these models can improve humanitarian operations, allowing data to drive the planning of risk mitigation and preparedness policies in humanitarian operations. The insights allow us to generalize the utility of population and housing surveys to measure the level of vulnerability to natural disasters. With these results, we can both mitigate risks in the long run and trace a preparedness strategy for the short run disaster risk mitigation. This methodology can be applied to any case of study, if the data on population, housing, health, social and geographical dimensions are available. We also argue that vulnerability metrics are a good proxy for the demand for aid, even if a disaster is not happening. We can use this information for risk mitigation and inventory prepositioning.

This research was supported by the Centro de Investigación de la Universidad del Pacifico (CIUP - Research Center of the Universidad del Pacifico) and the Coordination for the Improvement of Higher Education Personnel - Brazil (CAPES), Procad Defesa 88887.387760/2019-00.

Adger
,
W.N.
(
2006
), “
Vulnerability
”,
Global Environmental Change
, Vol.
16
No.
3
, pp.
268
-
281
, doi: .
Alexander
,
D.E.
(
2002
),
Principles of Emergency Planning and Management
,
Oxford University Press
,
Oxford
.
Avery
,
G.
(
2005
),
Endogeneity in Logistic Regression Models
,
Centers for Disease Control and Prevention
.
Ben-Tal
,
A.
,
Chung
,
B.D.
,
Mandala
,
S.R.
and
Yao
,
T.
(
2011
), “
Robust optimization for emergency logistics planning: risk mitigation in humanitarian relief supply chains
”,
Transportation Research Part B: Methodological
, Vol.
45
No.
8
, pp.
1177
-
1189
, doi: .
Bouhlel
,
N.
and
Dziri
,
A.
(
2019
), “
Kullback-Leibler divergence between multivariate generalized Gaussian distributions
”,
IEEE Signal Processing Letters
, Vol.
26
No.
7
, pp.
1021
-
1025
, doi: .
Comfort
,
L.
,
Oh
,
N.
and
Ertan
,
G.
(
2009
), “
The dynamics of disaster recovery: resilience and entropy in hurricane response systems 2005-2008
”,
Public Organization Review
, Vol.
9
, pp.
309
-
323
.
Costa
,
P.S.
,
Santos
,
N.C.
,
Cunha
,
P.
,
Cotter
,
J.
and
Sousa
,
N.
(
2013
), “
The use of multiple correspondence analysis to explore associations between categories of qualitative variables in healthy ageing
”,
Journal of Aging Research
, Vol.
2013
, pp.
1
-
12
, doi: .
Cutter
,
S.L.
(
2012
),
Hazards Vulnerability and Environmental Justice
,
Routledge
,
London
.
Cutter
,
S.L.
and
Finch
,
C.
(
2008
), “
Temporal and spatial changes in social vulnerability to natural hazards
”,
Proceedings of the National Academy of Sciences
, Vol.
105
No.
7
, pp.
2301
-
2306
.
de Brito
,
I.
 Jr
,
Leiras
,
A.
and
Yoshizaki
,
H.T.Y.
(
2020
), “
A multi-criteria stochastic programming approach for pre-positioning disaster relief supplies in Brazil
”,
Production
, Vol.
30
, doi: .
Deng
,
F.
,
Liu
,
C.
,
Liang
,
X.
and
Xu
,
J.
(
2017
), “
Performance evaluation of logistics listed companies based on grey ideal correlation entropy
”, in
2017 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM)
, pp.
1646
-
1650
.
Djalante
,
R.
,
Shaw
,
R.
and
DeWit
,
A.
(
2020
), “
Building resilience against biological hazards and pandemics: COVID-19 and its implications for the Sendai Framework
”,
Progress in Disaster Science
, Vol.
6
, p.
100080
, doi: .
Dong
,
X.
,
Lu
,
H.
,
Xia
,
Y.
and
Xiong
,
Z.
(
2016
), “
Decision-making model under risk assessment based on entropy
”,
Entropy
, Vol.
18
No.
11
, p.
404
.
Eakin
,
H.
and
Luers
,
A.L.
(
2006
), “
Assessing the vulnerability of social-environmental systems
”,
Annual Review of Environment and Resources
, Vol.
31
No.
1
, pp.
365
-
394
, doi: .
Ezzari
,
A.
and
Verme
,
P.
(
2012
), “
A multiple correspondence analysis approach to the measurement of multidimensional poverty in Morocco 2001-2007
”,
The World Bank: Policy Research Working Paper
,
World Development Bank
,
Washington
.
Fatemi
,
M.N.
,
Okyere
,
S.A.
,
Diko
,
S.K.
,
Kita
,
M.
,
Shimoda
,
M.
and
Matsubara
,
S.
(
2020
), “
Physical vulnerability and local responses to flood damage in peri-urban areas of Dhaka, Bangladesh
”,
Sustainability
, Vol.
12
No.
10
, p.
3957
, doi: .
Filmer
,
D.
and
Pritchett
,
L.
(
2001
), “
Estimating wealth effects without expenditure data-or tears: an application to educational enrollments in states of India
”,
Demography
, Vol.
38
No.
1
, pp.
115
-
132
,
available at:
www.jstor.org/stable/3088292.
Folke
,
C.
(
2006
), “
Resilience: the emergence of a perspective for social-ecological systems analyses
”,
Global Environmental Change
, Vol.
16
, pp.
253
-
267
, doi: .
Gómez
,
D.M.
(
2017
),
Evaluación de las capacidades de logística humanitaria para la atención de desastres naturales en la red de ayuda humanitaria, el caso de Manizales
,
Facultad de Ingeniería y Arquitectura, Departamento de Ingeniería Industrial, Universidad Nacional de Colombia
,
Bogotá
.
Golan
,
A.
,
Judge
,
G.
and
Perloff
,
J.M.
(
1996
), “
A maximum entropy approach to recovering information from multinomial response data
”,
Journal of the American Statistical Association
, Vol.
91
, pp.
841
-
853
, doi: .
Goldschmidt
,
K.H.
and
Kumar
,
S.
(
2016
), “
Humanitarian operations and crisis/disaster management: a retrospective review of the literature and framework for development
”,
International Journal of Disaster Risk Reduction
, Vol.
20
, pp.
1
-
13
, doi: .
Google Earth Pro
(
2019
), “
Mapa de Risaralda, Colombia en Google maps
”,
(s.f.), Recuperado el 3 de Octubre, 2019, de
,
available at:
https://www.google.com.co/maps.
Gray
,
R.M.
(
2011
),
Entropy and Information Theory
,
Springer US
,
Washington
, doi: .
Guoliang
,
F.
,
Aiping
,
L.
,
Moroni
,
G.
,
Liyun
,
X.
,
Xuemei
,
L.
and
Moroni
,
G.
(
2017
), “
Operation-based configuration complexity measurement for manufacturing system
”,
Procedia CIRP
, Vol.
63
, pp.
645
-
650
, doi: .
Guoyi
,
X.
and
Xiaohua
,
C.
(
2011
), “
Research on the third-party logistics supplier selection evaluation based on AHP and entropy
”, in
2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC)
, pp.
788
-
792
.
Gutjahr
,
W.J.
and
Fischer
,
S.
(
2018
), “
Equity and deprivation costs in humanitarian logistics
”,
European Journal of Operational Research
, Vol.
270
No.
1
, pp.
185
-
197
, doi: .
Haregu
,
T.N.
,
Mohamed
,
S.F.
,
Muthuri
,
S.
,
Khayeka-Wandabwa
,
C.
and
Kyobutungi
,
C.
(
2018
), “
Body mass index and wealth index: positively correlated indicators of health and wealth inequalities in Nairobi slums
”,
Global Health, Epidemiology and Genomics
, Vol.
3
No.
1
, pp.
1
-
12
.
Ho
,
Y.
and
Wookey
,
S.
(
2020
), “
The real-world-weight cross-entropy loss function: modeling the costs of mislabeling
”,
IEEE Access
, Vol.
8
, pp.
4806
-
4813
, doi: .
Holguín-Veras
,
J.
,
Pérez
,
N.
,
Jaller
,
M.
,
Van Wassenhove
,
L.N.
and
Aros-Vera
,
F.
(
2013
), “
On the appropriate objective function for post-disaster humanitarian logistics models
”,
Journal of Operations Management
, Vol.
31
No.
5
, pp.
262
-
280
, doi: .
Huang
,
Y.
,
Savelsbergh
,
M.
and
Zhao
,
L.
(
2018
), “
Designing logistics systems for home delivery in densely populated urban areas
”,
Transportation Research Part B: Methodological
, Vol.
115
, pp.
95
-
125
, doi: .
Institute of Hydrology, Meteorology and Ambiental Studies
(
2019
),
Atlas Climatológico de Colombia 1981-2010
,
Bogotá
,
Colombia
.
Intrieri
,
E.
,
Carlà
,
T.
and
Gigli
,
G.
(
2019
), “
Forecasting the time of failure of landslides at slope-scale: a literature review
”,
Earth-Science Reviews
, Vol.
193
, pp.
333
-
349
, doi: .
Jaynes
,
E.T.
(
1957
), “
Information theory and statistical mechanics
”,
Physical Review
, Vol.
106
No.
4
, pp.
620
-
630
, doi: .
Krauz
,
O.
and
Tabrikian
,
J.
(
2019
), “
Detection of modeling misspecification using cross-entropy test
”, in
2019 IEEE 8th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP)
, doi: .
Leiras
,
A.
,
de Brito
,
I.
 Jr
,
Queiroz Peres
,
E.
,
Rejane Bertazzo
,
T.
and
Yoshizaki
,
H.
(
2014
), “
Literature review of humanitarian logistics research: trends and challenges
”,
Journal of Humanitarian Logistics and Supply Chain Management
, Vol.
4
No.
1
, pp.
95
-
130
, doi: .
Levner
,
E.
and
Ptuskin
,
A.
(
2015
), “
An entropy-based approach to identifying vulnerable components in a supply chain
”,
International Journal of Production Research
, Vol.
52
No.
22
, pp.
1
-
15
.
Lim
,
J.
,
Loveridge
,
S.
,
Shupp
,
R.
and
Skidmore
,
M.
(
2017
), “
Double danger in the double wide: dimensions of poverty, housing quality and tornado impacts
”,
Regional Science and Urban Economics
, Vol.
65
, pp.
1
-
15
.
Liyuan
,
S.
and
Qingmei
,
T.
(
2014
), “
Supernetwork and entropy theory in quantitative decision-making of emergency logistics
”, in
5th IEEE International Conference on Software Engineering and Service Science (ICSESS)
, pp.
897
-
903
.
López-Vargas
,
J.C.
and
Cárdenas-Aguirre
,
D.M.
(
2017
), “
Gestión de la logística humanitaria en las etapas previas al desastre: revisión sistemática de la literatura
”,
Revista de Investigación, Desarrollo e Innovación
, Vol.
7
No.
2
, pp.
1
-
10
, doi: .
Mateusz
,
J.
(
2017
), “
Japanese and American approach to humanitarian logistics in natural disasters prevention
”,
Scientific Journal of Logistics
, Vol.
13
No.
2
, pp.
171
-
182
, doi: .
Mattea
,
S.
(
2019
), “
Exploring spatial sources of preference heterogeneity for landslide protection
”,
Land Economics
, Vol.
95
No.
3
, pp.
333
-
352
.
Ministry of Information Technology and Communications
(
2019
),
Colombian Shapefiles and Georeferenced Databases
,
Bogotá
.
Mo
,
H.
and
Lee
,
J.
(
2019
), “
Spatial community search using PageRank vector
”, in
2019 International Conference on Computing, Networking and Communications (ICNC)
, doi: .
Oloruntoba
,
R.
,
Sridharan
,
R.
and
Davison
,
G.
(
2018
), “
A proposed framework of key activities and processes in the preparedness and recovery phases of disaster management
”,
Disasters
, Vol.
42
No.
3
, pp.
541
-
570
, doi: .
Overstreet
,
R.E.
,
Hall
,
D.
,
Hanna
,
J.B.
and
Kelly Rainer
,
R.
 Jr
(
2011
), “
Research in humanitarian logistics
”,
Journal of Humanitarian Logistics and Supply Chain Management
, Vol.
1
No.
2
, pp.
114
-
131
, doi: .
Parchomenko
,
A.
,
Nelen
,
D.
,
Gillabel
,
J.
and
Rechberger
,
H.
(
2019
), “
Measuring the circular economy - a multiple correspondence analysis of 63 metrics
”,
Journal of Cleaner Production
, Vol.
210
, pp.
200
-
216
, doi: .
Purvis
,
B.
,
Mao
,
Y.
and
Robinson
,
D.
(
2019
), “
Entropy and its application to urban systems
”,
Entropy
, Vol.
21
No.
1
, pp.
11
-
28
.
Rapeli
,
M.
(
2017
), “
Social capital in social work disaster preparedness plans: the case of Finland
”,
International Social Work
, Vol.
61
No.
6
, pp.
1054
-
1066
, doi: .
Rencher
,
A.C.
and
Christensen
,
W.F.
(
2012
),
Methods of Multivariate Analysis
, (3rd ed.) ,
Wiley
,
Hoboken, NJ
.
Sabbaghtorkan
,
M.
,
Batta
,
R.
and
He
,
Q.
(
2020
), “
Prepositioning of assets and supplies in disaster operations management: review and research gap identification
”,
European Journal of Operational Research
, Vol.
284
No.
1
, pp.
1
-
19
, doi: .
Salazar-Briones
,
C.
,
Ruiz-Gibert
,
J.M.
,
Lomelí-Banda
,
M.A.
and
Mungaray-Moctezuma
,
A.
(
2020
), “
An integrated urban flood vulnerability index for sustainable planning in arid zones of developing countries
”,
Water
, Vol.
12
No.
2
, p.
608
, doi: .
Sardhara
,
R.
and
Lakhataria
,
K.I.
(
2019
), “
An improved PageRank algorithm based on reachability to reduce mutual reinforcement effect
”, in
2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT)
, doi: .
Schoon
,
I.
and
Bartley
,
M.
(
2008
), “
The role of human capability and resilience
”,
The Psychologist
, Vol.
21
No.
1
, pp.
24
-
27
.
Sen
,
A.
(
1999
),
Development as Freedom
,
Oxford University Press
,
Oxford
.
Shao
,
J.
,
Liang
,
C.
,
Wang
,
X.
,
Wang
,
X.
and
Liang
,
L.
(
2020
), “
Relief demand calculation in humanitarian logistics using material classification
”,
International Journal of Environmental Research and Public Health
, Vol.
17
No.
2
, p.
582
, doi: .
Silverman
,
B.W.
(
1986
),
Density Estimation for Statistics and Data Analysis
, Vol.
26
,
CRC Press
,
Washington
.
Simpson
,
A.G.
(
2017
), “
Study: expected increase in tornadoes raises risk to mobile homes
”,
Insurance Journal
,
available at:
https://search.proquest.com/docview/2023356682.
Smyth
,
I.A.
and
Hai
,
V.M.
(
2012
),
The Disaster Crunch Model: Guidelines for a Gendered Approach
,
Oxfam
.
Soofi
,
E.
(
1992
), “
A generalizable formulation of conditional logit with Diagnostics
”,
Journal of American Statistical Association
, Vol.
87
No.
419
, pp.
812
-
816
, doi: .
Tofighi
,
S.
,
Torabi
,
S.A.
and
Mansouri
,
S.A.
(
2016
), “
Humanitarian logistics network design under mixed uncertainty
”,
European Journal of Operational Research
, Vol.
250
No.
1
, pp.
239
-
250
, doi: .
Torres
,
L.
and
Pereira
,
C.A.
(
2017
), “
Post-disaster housing recovery guidelines for development countries based on experiences in the American continent
”,
International Journal of Disaster Risk Reduction
, Vol.
24
, pp.
340
-
347
.
Tshimanga
,
R.M.
,
Tshitenge
,
J.M.
,
Kabuya
,
P.
,
Alsdorf
,
D.
,
Mahe
,
G.
,
Kibukusa
,
G.
and
Lukanda
,
V.
(
2016
), “
A regional perceptive of flood forecasting and disaster management systems for the Congo river basin
”,
Academic Press
, pp.
87
-
124
, doi: .
Twigg
,
J.
(
2004
),
Disaster Risk Reduction: Mitigation and Preparedness in Development and Emergency Programming, Humanitarian Practice Network
,
Overseas Development Institute
,
London
.
Ukkusuri
,
S.V.
and
Yushimito
,
W.F.
(
2008
), “
Location routing approach for the humanitarian prepositioning problem
”,
Transportation Research Record: Journal of the Transportation Research Board
, Vol.
2089
No.
1
, pp.
18
-
25
, doi: .
United Nations Office for Disaster Risk Reduction
(
2015
),
The Sendai Framework for Disaster Risk Reduction
,
United Nation
,
New York, NY
.
United Nations Office for Disaster Risk Reduction
(
2017
),
How to Make Cities More Resilient - A Handbook for Local Government Leaders
,
United Nations
,
Geneva
.
Wisner
,
B.
,
Blaikie
,
P.
,
Cannon
,
T.
and
Davis
,
I.
(
2004
),
At Risk: Natural Hazards, People's Vulnerability and Disasters
,
Routledge
,
London
.
Wu
,
L.
,
Tan
,
Q.
and
Zhang
,
Y.
(
2013
), “
Network connectivity entropy and its application on network connectivity reliability
”,
Physica A: Statistical Mechanics and its Application
, Vol.
392
No.
21
, pp.
5536
-
5541
.
Yoon
,
D.K.
,
Eun
,
J.
and
Jun
,
Y.
(
2017
), “
Measuring community vulnerability to natural disasters in Korea using the entropy method
”,
International Journal of Mass Emergencies and Disasters
, Vol.
35
No.
2
, pp.
8
-
29
.
Zhang
,
Z.
and
David
,
J.
(
2018
), “
An entropy-based approach for assessing the operation of production logistics
”,
Expert Systems with Applications
, Vol.
119
, pp.
118
-
127
, doi: .

The Appendix files are available online for this article.

Licensed re-use rights only

Supplementary data

or Create an Account

Close Modal
Close Modal