This study examines US consumer purchasing behavior for elderberry products, focusing on both the decision to purchase and the frequency of consumption. It investigates how consumer attitudes toward a range of product qualities—such as price, taste, nutritional benefits and environmental impact—as well as socioeconomic characteristics influence purchasing and consumption patterns.
The study uses a sample of 597 respondents from a 2022 national survey conducted in the US. Factor analysis was implemented to condense attitudinal variables, which then informed consumer segmentation via cluster analysis. Five consumer segments were identified: health-indifferent consumers, socially conscious consumers, health-motivated consumers, conventional consumers, and taste- and value-seekers. A double-hurdle model was estimated to separately examine the decision to purchase and the frequency of consumption.
Specific consumer segments—particularly socially conscious and health-motivated consumers—significantly influence the likelihood of purchasing elderberry products. In contrast, a broader set of segments contributes to consumption frequency, indicating that different motivations drive trial versus habitual use. Socioeconomic factors—including age, income, educational attainment, race and family composition—also influence purchasing and consumption decisions.
This study advances the literature by moving beyond stated preferences for specific elderberry products to examine actual purchasing behavior across the product category. By integrating detailed attitudinal consumer segmentation with socioeconomic analysis and applying a double-hurdle model, it links consumer attitudes directly to both market participation and consumption intensity. This approach addresses the common attitude-behavior gap and provides useful insights for targeted marketing and product development in the growing elderberry sector.
1. Introduction
Over the past 2 decades, elderberry (Sambucus spp.) has gradually moved from being a niche crop to gaining broader attention in North American agriculture, attracting interest from consumers and farmers alike. Known for its antioxidant-rich properties and wide range of uses—from immune-boosting remedies to alcoholic beverages—elderberry’s potential continues to grow (Chuang et al., 2014; Curtis et al., 2024; Osman et al., 2023). To build on this momentum, it is important to understand consumer behavior and the factors influencing demand for elderberry products.
In particular, significant questions remain regarding consumer attitudes, market segmentation and the drivers of consumption decisions. To address these issues, this paper asks how consumer segments—defined based on consumer attitudes—along with other socioeconomic characteristics, influence the likelihood of purchasing elderberry products and the frequency of consumption. Specifically, this study uses a 2022 US consumer survey conducted by Cai et al. (2024), which reports on a series of consumer attitudes toward elderberry products, to identify consumer segments through factor and cluster analysis. Then, it investigates how segment membership and other socioeconomic factors influence the demand for elderberry products using a double-hurdle model, which treats the purchasing decision and the frequency of consumption as independent processes (Cragg, 1971; Skevas et al., 2014).
Forecasts predict a 5.8% annual growth rate of the elderberry market between 2023 and 2028 (Technavio, 2024). Yet the literature on this emerging market remains sparse and fragmented. Early work focused predominantly on supply-side challenges, from horticultural practices (Charlebois et al., 2010) to market entry barriers like scale economies and credit constraints identified through stakeholder interviews (Cernusca et al., 2011, 2012; Cernusca and Gold, 2013). On the demand side, foundational studies by Mohebalian et al. (2012, 2013) provided initial segmentation into health-conscious consumers and analyzed preferences for specific products like juice and jelly, finding value in health claims and local production. A recent replication and extension by Cai et al. (2024) confirms a significant rise in consumption and identifies consumer willingness-to-pay for attributes like organic and American-grown labels. However, this existing demand-side research has two key limitations: first, it often focuses on specific product types; and second, it primarily analyzes stated preferences (e.g., via conjoint or discrete-choice experiments), falling short of explaining how these preferences translate into actual purchasing decisions and consumption frequency.
A robust, parallel stream of literature in food economics demonstrates the power of attitudinal segmentation to decode complex consumer markets, moving beyond simple demographics and looking into the underlying psychological drivers. For instance, Sgroi et al. (2024) employ K-means clustering on socioeconomic characteristics of Sicilian consumers (e.g., age, gender and city size) and identify three actionable typologies: small-town young consumers, middle-aged males from medium-sized cities and older females from large cities. In a sustainability framework, Schäufele-Elbers and Janssen (2023) utilize K-means to segment consumers based on their expenditures for organic food (environmental proxy), meat (climate proxy) and sweet snacks (health proxy), revealing that pro-environmental behavior in one area does not guarantee it in another. Similarly, segmentation of food waste behaviors identifies groups like “conscientious conservers” and “guilty carb wasters” (Li and Roe, 2024). Other research used principal component analysis to reduce numerous behavioral traits into three factors—“healthy and sustainable”, “hedonic” and “price-conscious” food choices—and then conducted a hierarchical cluster analysis on these factors to identify consumer segments (Hempel, 2024). These studies underscore that global attitudes such as “health consciousness” are insufficient and that nuanced segmentation is crucial. However, while this literature excels at profiling who consumers are and what they value, it stops short of linking these detailed attitudinal segments directly to observed purchasing and consumption behavior for emerging food products.
To analyze how attitudinally-defined segments might influence purchasing decisions and consumption patterns—whether in terms of frequency or quantity—the double-hurdle model (Cragg, 1971) can be used, as it explicitly separates the initial market participation decision from subsequent consumption intensity. In food demand analysis, this model has been applied to show that different factors can govern each stage. For instance, a product’s visual appearance may drive the decision to purchase, while its taste determines consumption quantity (Ray et al., 2024). Similarly, socioeconomic factors can have divergent effects: while parents’ educational level may increase their willingness to participate in a cost-shared school food program, having more children may decrease the amount they are willing to pay (Gupta et al., 2023). The double-hurdle model’s flexibility to accommodate such complex, two-tiered decision-making processes has been validated across diverse contexts, from tea consumption (Chen et al., 2020) to willingness-to-pay for credence attributes like winemaker gender (Gallais and Livat, 2024). Notably, focusing solely on the purchase decision—or combining purchasers and non-purchasers—may fail to capture the full complexity of consumption behavior.
This paper contributes to the literature in several important ways. First, it advances the elderberry literature by moving beyond stated preferences for specific products (e.g., elderberry juice and jelly) to analyze actual purchasing behavior across the entire product category and by developing a more nuanced, attitudinally-based consumer segmentation that captures dimensions beyond basic health consciousness. Second, it contributes to the consumer segmentation literature by directly linking attitudinally-based segments to observed market behavior, thereby addressing the common attitude–behavior gap—i.e., the disconnect between an individual’s attitudes or intentions and their actual behaviors (Vermeir and Verbeke, 2006; Szeląg-Sikora et al., 2025)—and providing a framework that moves from profiling who consumers are to predicting how they behave. By integrating detailed consumer segments into the analysis of both market participation (purchase decision) and consumption intensity (frequency of use), this study offers a multidimensional perspective on market heterogeneity that can inform targeted marketing and agricultural development strategies for the growing elderberry sector.
The remainder of this paper is structured as follows. Section 2 describes the data from the 2022 consumer survey. Section 3 presents the methodology. Section 4 reports the results, including purchasing and consumption frequency, consumer attitudes and perceptions, segmentation analysis and the determinants of both purchasing decisions and consumption frequency. Section 5 discusses the key findings, linking them to the literature and highlighting their implications for stakeholders. Finally, Section 6 concludes with a summary of contributions, limitations and avenues for future research.
2. Data
This study uses data from a non-probability, online survey of US consumers conducted by Cai et al. (2024) in 2022. The survey was pretested with a small group of 17 consumers in October before full deployment in November using the Qualtrics online survey platform. Participants were recruited through Qualtrics Research Services, which maintains a diverse panel of over 4 million respondents and provides compensation such as e-gift cards, retail shopping points and air miles. Invitations and instructions were sent via email with daily reminders until the predetermined quota of 1,000 responses was met. In total, 1,036 surveys were collected, of which 597 provided complete data on all the variables of interest and were included in our analysis.
The survey was divided into four sections: the first section inquired about consumers’ familiarity with elderberry products, their past purchasing behaviors and their consumption frequency. Specifically, the survey collected consumption patterns for eleven elderberry products, including juice, fresh berries, wine, concentrate, tea, syrup, jelly, pie, vitamins, drinks and others. Respondents were asked whether they had purchased any of these products—a binary response variable—and their overall frequency of consumption, captured through a six-point Likert scale with the following levels: “less than once a year”, “1–2 times a year”, “3–10 times a year”, “once a month”, “multiple times per month” and “once a week or more”. Importantly, the frequency question was asked in relation to elderberry products in general rather than for specific product types.
The second section gauged consumers’ attitudes and perceptions toward elderberry products. Specifically, the consumers were asked about their degree of health consciousness, knowledge of food-related health benefits, awareness of elderberry’s nutritional and antioxidant value, and elderberry’s richness in vitamins and minerals. Responses were recorded using a five-point Likert scale ranging from “strongly disagree” to “strongly agree”. Moreover, respondents were asked about the relative importance of several product attributes, including nutrition, taste, price, ingredients, packaging, brand, environmental benefits and origin (US produced vs imported), all measured on a five-point Likert scale with the options “not at all important”, “slightly important”, “moderately important”, “very important” and “extremely important”.
The third section conducted a discrete-choice experiment to elicit consumer preferences for selected attributes of a 355 mL bottle of elderberry juice. However, this survey component was not used in the present study, since it had already been exploited by Cai et al. (2024). Finally, the fourth section collected socioeconomic characteristics of the respondents, including gender, education, household income, age, number of children, marital status, race and distance to the nearest urban center. Gender was reported in four categories (male, female, non-binary and other). For parsimony, we consolidated them into two groups: male and female/other gender [1]. Educational attainment was collected across eight levels and grouped into four categories based on the typical years of education required to complete each degree: high school, associate’s degree, bachelor’s degree and postgraduate. Household income was originally reported in seven levels and was consolidated into three categories: under $35,000; $35,000 to $74,999; and $75,000 or more. Age was originally reported in seven groups and consolidated into three categories: under 35, 35 to 54 and over 54. The number of children was treated as a discrete numeric variable. Marital status was recorded as a binary variable: married or single. Race included multiple categories but was condensed into three groups—White, Black and other race—due to minimal responses in the remaining categories. Finally, distance to the nearest urban center was categorized into three groups based on proximity to the respondents’ residence: less than 5 miles, 5 to 29 miles and more than 29 miles. Summary statistics of the socioeconomic variables are presented in Table 1.
Descriptive statistics of respondents’ socioeconomic characteristics (n = 597)
| Variables | Units | Mean | S.D. | Min. | Max. |
|---|---|---|---|---|---|
| Gender | |||||
| Male | (1/0) | 0.471 | 0.500 | 0 | 1 |
| Female/other gender | (1/0) | 0.529 | 0.500 | 0 | 1 |
| Education | |||||
| High school | (1/0) | 0.236 | 0.425 | 0 | 1 |
| Associate’s degree | (1/0) | 0.372 | 0.484 | 0 | 1 |
| Bachelor’s degree | (1/0) | 0.240 | 0.427 | 0 | 1 |
| Postgraduate | (1/0) | 0.152 | 0.360 | 0 | 1 |
| Income ($ US) | |||||
| Income under 35 k | (1/0) | 0.317 | 0.466 | 0 | 1 |
| Income 35 k to 75 k | (1/0) | 0.337 | 0.473 | 0 | 1 |
| Income over 75 k | (1/0) | 0.347 | 0.476 | 0 | 1 |
| Age | |||||
| Age under 35 | (1/0) | 0.231 | 0.422 | 0 | 1 |
| Age 35 to 54 | (1/0) | 0.407 | 0.492 | 0 | 1 |
| Age over 54 | (1/0) | 0.362 | 0.481 | 0 | 1 |
| Race | |||||
| White | (1/0) | 0.764 | 0.425 | 0 | 1 |
| Black | (1/0) | 0.114 | 0.318 | 0 | 1 |
| Other race | (1/0) | 0.122 | 0.328 | 0 | 1 |
| Distance (miles) | |||||
| Distance under 5 mi | (1/0) | 0.508 | 0.500 | 0 | 1 |
| Distance 5 mi to 29 mi | (1/0) | 0.325 | 0.469 | 0 | 1 |
| Distance over 29 mi | (1/0) | 0.168 | 0.374 | 0 | 1 |
| Other variables | |||||
| Married | (1/0) | 0.469 | 0.499 | 0 | 1 |
| Children | count | 0.759 | 1.057 | 0 | 5 |
| Variables | Units | Mean | S.D. | Min. | Max. |
|---|---|---|---|---|---|
| Gender | |||||
| Male | (1/0) | 0.471 | 0.500 | 0 | 1 |
| Female/other gender | (1/0) | 0.529 | 0.500 | 0 | 1 |
| Education | |||||
| High school | (1/0) | 0.236 | 0.425 | 0 | 1 |
| Associate’s degree | (1/0) | 0.372 | 0.484 | 0 | 1 |
| Bachelor’s degree | (1/0) | 0.240 | 0.427 | 0 | 1 |
| Postgraduate | (1/0) | 0.152 | 0.360 | 0 | 1 |
| Income ($ US) | |||||
| Income under 35 k | (1/0) | 0.317 | 0.466 | 0 | 1 |
| Income 35 k to 75 k | (1/0) | 0.337 | 0.473 | 0 | 1 |
| Income over 75 k | (1/0) | 0.347 | 0.476 | 0 | 1 |
| Age | |||||
| Age under 35 | (1/0) | 0.231 | 0.422 | 0 | 1 |
| Age 35 to 54 | (1/0) | 0.407 | 0.492 | 0 | 1 |
| Age over 54 | (1/0) | 0.362 | 0.481 | 0 | 1 |
| Race | |||||
| White | (1/0) | 0.764 | 0.425 | 0 | 1 |
| Black | (1/0) | 0.114 | 0.318 | 0 | 1 |
| Other race | (1/0) | 0.122 | 0.328 | 0 | 1 |
| Distance (miles) | |||||
| Distance under 5 mi | (1/0) | 0.508 | 0.500 | 0 | 1 |
| Distance 5 mi to 29 mi | (1/0) | 0.325 | 0.469 | 0 | 1 |
| Distance over 29 mi | (1/0) | 0.168 | 0.374 | 0 | 1 |
| Other variables | |||||
| Married | (1/0) | 0.469 | 0.499 | 0 | 1 |
| Children | count | 0.759 | 1.057 | 0 | 5 |
As shown in Table 1, the sample was relatively balanced by gender (47.1% male, 52.9% female/other) and age, with the largest cohort being middle-aged (35–54 years, 40.7%). Educational attainment was diverse: 37.2% had an associate’s degree, 24% a bachelor’s and 23.6% a high school diploma. Income was fairly evenly distributed across the three categories. The sample was predominantly White (76.4%), with Black (11.4%) and other racial groups (12.2%) also represented. Most respondents lived in urban or suburban areas, with over half (50.8%) within 5 miles of an urban center. Approximately half of the respondents (46.9%) were married, and the average household had fewer than one child (0.76).
3. Methodology
This paper proceeded in four methodological steps. First, the twelve attitudinal variables captured by the survey were consolidated into fewer categories using factor analysis. This technique shrinks the variable space by combining two or more correlated variables into single factors, which are linear combinations of the original variables, allowing the detection of structure in the relationships among them (Skevas et al., 2014). Prior to extraction, the suitability of the data for factor analysis was assessed using the Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy and Bartlett’s test of sphericity (Watkins, 2018). Principal component factoring (PCF) was used as the extraction method, retaining factors with eigenvalues greater than 1. The retained factors were then subjected to an orthogonal varimax (Crawford–Ferguson) rotation to enhance interpretability. Finally, the internal consistency of items loading highly (≥0.6) on each factor was assessed using Cronbach’s alpha (Cronbach, 1951).
Second, after consolidating the attitudinal variables, cluster analysis was conducted (Kaufman and Rousseeuw, 1990) to divide heterogeneous markets into clearly defined groups or segments of consumers based on the factors identified in the previous step (Welk et al., 2025). The optimal number of clusters was determined using the Calinski–Harabasz pseudo-F index, with the solution that maximized this index selected as optimal.
Third, to examine how socioeconomic characteristics varied across clusters, differences in cluster means were assessed using analysis of variance (ANOVA) for the continuous variables (i.e., number of children) and chi-squared tests for all binary variables.
Fourth, a double-hurdle model (Cragg, 1971) was implemented to explore the determinants of both purchasing and consumption frequency. The double-hurdle framework was preferred over alternative approaches such as Tobit or Heckman-type models because it explicitly separates the decision to participate in the market from the decision regarding intensity of consumption. This distinction was crucial in our context, as some consumers did not purchase elderberry products at all, while others varied in how frequently they consumed them. Specifically, the model assumes a two-stage data-generating process: in the first stage or “hurdle”, consumers decide whether to purchase any elderberry product. The first hurdle is considered “crossed” if the respondent reported purchasing any positive quantity. Conditional on a reported purchase, the second stage models overall consumption frequency at the aggregate product level, as the survey did not collect frequency data for individual product types. Thus, the double-hurdle model can be represented by two sets of equations and corresponding censoring rules, as follows:
where is a latent variable defining the purchasing decision and is the observed binary outcome; is a latent variable defining the frequency of consumption and is the observed ordinal frequency; is a vector of explanatory variables for both purchasing and frequency; and are the vectors of parameters to be estimated; and and are independently and identically normally distributed random errors with zero means and constant variances (Engel and Moffatt, 2014). Because the estimated parameters and do not have a direct economic interpretation, it is common to compute and report average marginal effects.
For the first hurdle, in particular, a binomial probability model—specifically, a logit model—was employed, which is appropriate for a binary dependent variable ( in Eq. (1)) equal to 1 if a respondent purchased any elderberry product and 0 otherwise. For the second hurdle, an ordered logistic regression model was used, suitable for a dependent variable with discrete, ordered categories ( in Eq. (2)) representing the frequency of consumption. In this case, the ordered categories were “less than once a year,” “1–2 times a year,” “3–10 times a year,” “once a month,” “multiple times per month,” and “once a week or more”.
The proportional odds assumption of the ordered logistic model was assessed using the Brant test (Brant, 1990) to confirm the appropriateness of this specification. The results derived from these methodological procedures are presented in the subsequent section.
4. Results
4.1 Elderberry purchasing decision and consumption frequency
Figure 1 shows that 57% of respondents in our sample purchased elderberry products, while 43% did not. As noted earlier, a total of eleven product types were considered, including juice, fresh berries, wine, concentrate, tea, syrup, jelly, pie, vitamins, drinks and others. Among the 341 purchasers, the most commonly bought products were elderberry-based vitamin supplements (49%), juice (36%) and wine (26%), as shown in Figure 2—totals exceed 100% because respondents often purchased more than one product type. This distribution reveals a strong preference for vitamin supplements, suggesting a significant appeal of elderberries’ wellness and immune-boosting attributes in medicine-style formats. At the same time, substantial purchases of traditional food and beverage products such as juice, wine and tea indicate that consumers also value incorporating functional benefits into daily consumption.
The pie chart has two colored sectors: “purchased” to the right and “not purchased” to the left. A legend below the chart identifies the sectors and indicates the colors used for “purchased” and “not purchased”. The data from the pie chart is as follows: Purchased: 57 percent. Not purchased: 43 percent.Percentage of respondents who purchased elderberry products (n = 597). Source(s): Authors’ own work
The pie chart has two colored sectors: “purchased” to the right and “not purchased” to the left. A legend below the chart identifies the sectors and indicates the colors used for “purchased” and “not purchased”. The data from the pie chart is as follows: Purchased: 57 percent. Not purchased: 43 percent.Percentage of respondents who purchased elderberry products (n = 597). Source(s): Authors’ own work
The horizontal axis of the horizontal bar graph ranges from 0 percent to 50 percent in increments of 10 percent. The vertical axis lists the product categories from top to bottom: “vitamin supplements”, “juice”, “wine”, “jelly”, “syrup”, “tea”, “fresh berries”, “pie”, “concentrate”, “drinks”, and “others”. There are eleven horizontal bars. The data from the bar graph is as follows: For vitamin supplements: 48.89 percent. For juice: 35.60 percent. For wine: 25.47 percent. For jelly: 21.99 percent. For syrup: 21.99 percent. For tea: 18.04 percent. For fresh berries: 17.88 percent. For pie: 10.60 percent. For concentrate: 8.54 percent. For drinks: 7.27 percent. For others: 3.17 percent. Note: All the numerical data values are approximated.Percentage of consumers who purchased each product type (n = 341). Note: Totals exceed 100%, since respondents often purchased more than one product type. Source(s): Authors’ own work
The horizontal axis of the horizontal bar graph ranges from 0 percent to 50 percent in increments of 10 percent. The vertical axis lists the product categories from top to bottom: “vitamin supplements”, “juice”, “wine”, “jelly”, “syrup”, “tea”, “fresh berries”, “pie”, “concentrate”, “drinks”, and “others”. There are eleven horizontal bars. The data from the bar graph is as follows: For vitamin supplements: 48.89 percent. For juice: 35.60 percent. For wine: 25.47 percent. For jelly: 21.99 percent. For syrup: 21.99 percent. For tea: 18.04 percent. For fresh berries: 17.88 percent. For pie: 10.60 percent. For concentrate: 8.54 percent. For drinks: 7.27 percent. For others: 3.17 percent. Note: All the numerical data values are approximated.Percentage of consumers who purchased each product type (n = 341). Note: Totals exceed 100%, since respondents often purchased more than one product type. Source(s): Authors’ own work
With regard to consumption frequency, six levels were captured by a Likert scale (see Section 2). Figure 3 shows the distribution of consumption frequency among the 341 purchasers. Nearly 40% of respondents consumed elderberry products once or more than once a month, while over 60% consumed them three to ten times per year or less. This suggests that while many consumers engage with elderberry products on a regular basis, a substantial share does so only occasionally.
The horizontal axis of the horizontal bar graph ranges from 0 percent to 25 percent in increments of 5 percent. The vertical axis lists the frequency categories from top to bottom: “once a week or more”, “multiple times per month”, “once a month”, “3 to 10 times a year”, “1 to 2 times a year”, and “less than once a year”. There are six horizontal bars. The data from the bar graph is as follows: For once a week or more: 14.78 percent. For multiple times per month: 15 percent. For once a month: 10 percent. For 3 to 10 times a year: 21.16 percent. For 1 to 2 times a year: 22.05 percent. For less than once a year: 17.40 percent. Note: All the numerical data values are approximated.Frequency of consumption on a six-point Likert scale (n = 341). Source(s): Authors’ own work
The horizontal axis of the horizontal bar graph ranges from 0 percent to 25 percent in increments of 5 percent. The vertical axis lists the frequency categories from top to bottom: “once a week or more”, “multiple times per month”, “once a month”, “3 to 10 times a year”, “1 to 2 times a year”, and “less than once a year”. There are six horizontal bars. The data from the bar graph is as follows: For once a week or more: 14.78 percent. For multiple times per month: 15 percent. For once a month: 10 percent. For 3 to 10 times a year: 21.16 percent. For 1 to 2 times a year: 22.05 percent. For less than once a year: 17.40 percent. Note: All the numerical data values are approximated.Frequency of consumption on a six-point Likert scale (n = 341). Source(s): Authors’ own work
4.2 Respondents’ attitudes and perceptions toward elderberry products
As described in the data section, the survey includes twelve variables capturing respondents’ attitudes toward elderberry products and perceptions of specific product attributes. Figure 4 shows the distribution of responses for each variable on a 5-point Likert scale. Nutrition and antioxidants were considered very or extremely important, with 87% of respondents rating them at levels 4 or 5. Taste was the next most valued attribute, with 82% of respondents rating it very or extremely important. More generally, consumers tended to agree or strongly agree (or to rate as very or extremely important) with statements related to knowledge of food-related health benefits, awareness of elderberry’s vitamin and mineral content, overall health consciousness and nutrition, highlighting a strongly health-motivated consumer base. By contrast, extrinsic attributes such as environmental impact, origin, brand and packaging were rated relatively less important, with fewer respondents assigning them a 4 or 5 level on the Likert scale.
The vertical axis of the stacked horizontal bar chart displays twelve categories, each representing an importance factor: “elderberry nutrition and antioxidants”, “taste importance”, “knowledge of food health benefits”, “elderberry vitamins and minerals”, “health consciousness”, “nutrition importance”, “ingredient importance”, “price importance”, “environmental importance”, “origin importance”, “brand importance”, “packaging importance”. The horizontal axis ranges from 0 percent to 100 percent in increments of 10 percent. Each category contains five stacked bars. A legend at the bottom indicates that the color segments represent: (1) strongly disagree or not at all important, (2) disagree or slightly important, (3) neither agree nor disagree or moderately important, (4) agree or very important, and (5) strongly agree or extremely important. The data from the chart is as follows: Elderberry nutrition and antioxidants: (1) 1 percent, (2) 1 percent, (3) 11 percent, (4) 47 percent, (5) 40 percent. Taste importance: (1) 1 percent, (2) 3 percent, (3) 14 percent, (4) 49 percent, (5) 33 percent. Knowledge of food health benefits: (1) 1 percent, (2) 4 percent, (3) 13 percent, (4) 56 percent, (5) 26 percent. Elderberry vitamins and minerals: (1) 1 percent, (2) 1 percent, (3) 18 percent, (4) 48 percent, (5) 32 percent. Health consciousness: (1) 2 percent, (2) 5 percent, (3) 16 percent, (4) 49 percent, (5) 29 percent. Nutrition importance: (1) 1 percent, (2) 7 percent, (3) 24 percent, (4) 44 percent, (5) 24 percent. Ingredient importance: (1) 2 percent, (2) 5 percent, (3) 27 percent, (4) 44 percent, (5) 23 percent. Price importance: (1) 1 percent, (2) 5 percent, (3) 28 percent, (4) 43 percent, (5) 24 percent. Environmental importance: (1) 7 percent, (2) 20 percent, (3) 34 percent, (4) 27 percent, (5) 12 percent. Origin importance: (1) 12 percent, (2) 16 percent, (3) 35 percent, (4) 26 percent, (5) 11 percent. Brand importance: (1) 15 percent, (2) 29 percent, (3) 34 percent, (4) 16 percent, (5) 6 percent. Packaging importance: (1) 18 percent, (2) 31 percent, (3) 33 percent, (4) 13 percent, (5) 6 percent.Distribution of responses on attitudes toward elderberry products and key product attributes (n = 597). Source(s): Authors’ own work
The vertical axis of the stacked horizontal bar chart displays twelve categories, each representing an importance factor: “elderberry nutrition and antioxidants”, “taste importance”, “knowledge of food health benefits”, “elderberry vitamins and minerals”, “health consciousness”, “nutrition importance”, “ingredient importance”, “price importance”, “environmental importance”, “origin importance”, “brand importance”, “packaging importance”. The horizontal axis ranges from 0 percent to 100 percent in increments of 10 percent. Each category contains five stacked bars. A legend at the bottom indicates that the color segments represent: (1) strongly disagree or not at all important, (2) disagree or slightly important, (3) neither agree nor disagree or moderately important, (4) agree or very important, and (5) strongly agree or extremely important. The data from the chart is as follows: Elderberry nutrition and antioxidants: (1) 1 percent, (2) 1 percent, (3) 11 percent, (4) 47 percent, (5) 40 percent. Taste importance: (1) 1 percent, (2) 3 percent, (3) 14 percent, (4) 49 percent, (5) 33 percent. Knowledge of food health benefits: (1) 1 percent, (2) 4 percent, (3) 13 percent, (4) 56 percent, (5) 26 percent. Elderberry vitamins and minerals: (1) 1 percent, (2) 1 percent, (3) 18 percent, (4) 48 percent, (5) 32 percent. Health consciousness: (1) 2 percent, (2) 5 percent, (3) 16 percent, (4) 49 percent, (5) 29 percent. Nutrition importance: (1) 1 percent, (2) 7 percent, (3) 24 percent, (4) 44 percent, (5) 24 percent. Ingredient importance: (1) 2 percent, (2) 5 percent, (3) 27 percent, (4) 44 percent, (5) 23 percent. Price importance: (1) 1 percent, (2) 5 percent, (3) 28 percent, (4) 43 percent, (5) 24 percent. Environmental importance: (1) 7 percent, (2) 20 percent, (3) 34 percent, (4) 27 percent, (5) 12 percent. Origin importance: (1) 12 percent, (2) 16 percent, (3) 35 percent, (4) 26 percent, (5) 11 percent. Brand importance: (1) 15 percent, (2) 29 percent, (3) 34 percent, (4) 16 percent, (5) 6 percent. Packaging importance: (1) 18 percent, (2) 31 percent, (3) 33 percent, (4) 13 percent, (5) 6 percent.Distribution of responses on attitudes toward elderberry products and key product attributes (n = 597). Source(s): Authors’ own work
Given the multiplicity of attitudes and perceptions, the first step was to consolidate them using factor analysis. As established in the Methodology, the suitability of the data for factor analysis was confirmed: the KMO measure of sampling adequacy was 0.80, classifying the data as “meritorious” according to conventional standards (Watkins, 2018). Bartlett’s test of sphericity was significant (χ2[66] = 2212.36, p = 0.000), indicating that the variables are sufficiently intercorrelated and suitable for factoring. The PCF extraction method and eigenvalue >1 retention rule identified four factors, which were then subjected to varimax rotation to enhance interpretability. Together, these four factors accounted for 66.95% of the common variance. Factor loading scores are presented in Table 2.
Rotated factor loadings (n = 597)
| Attitudinal variable | Factor 1 | Factor 2 | Factor 3 | Factor 4 |
|---|---|---|---|---|
| Health consciousness | 0.068 | 0.739 | 0.311 | −0.100 |
| Knowledge of food health benefits | 0.077 | 0.753 | 0.285 | −0.077 |
| Elderberry nutrition and antioxidants | −0.006 | 0.226 | 0.858 | 0.087 |
| Elderberry vitamins and minerals | 0.129 | 0.149 | 0.879 | 0.039 |
| Nutrition importance | 0.189 | 0.772 | 0.127 | 0.229 |
| Taste importance | 0.034 | 0.058 | 0.176 | 0.784 |
| Price importance | 0.086 | 0.061 | −0.025 | 0.793 |
| Ingredient importance | 0.333 | 0.673 | 0.051 | 0.221 |
| Packaging importance | 0.829 | 0.103 | 0.050 | 0.086 |
| Brand importance | 0.798 | 0.040 | 0.158 | 0.083 |
| Environmental importance | 0.703 | 0.341 | 0.018 | −0.002 |
| Origin importance | 0.693 | 0.181 | −0.019 | −0.013 |
| Attitudinal variable | Factor 1 | Factor 2 | Factor 3 | Factor 4 |
|---|---|---|---|---|
| Health consciousness | 0.068 | 0.739 | 0.311 | −0.100 |
| Knowledge of food health benefits | 0.077 | 0.753 | 0.285 | −0.077 |
| Elderberry nutrition and antioxidants | −0.006 | 0.226 | 0.858 | 0.087 |
| Elderberry vitamins and minerals | 0.129 | 0.149 | 0.879 | 0.039 |
| Nutrition importance | 0.189 | 0.772 | 0.127 | 0.229 |
| Taste importance | 0.034 | 0.058 | 0.176 | 0.784 |
| Price importance | 0.086 | 0.061 | −0.025 | 0.793 |
| Ingredient importance | 0.333 | 0.673 | 0.051 | 0.221 |
| Packaging importance | 0.829 | 0.103 | 0.050 | 0.086 |
| Brand importance | 0.798 | 0.040 | 0.158 | 0.083 |
| Environmental importance | 0.703 | 0.341 | 0.018 | −0.002 |
| Origin importance | 0.693 | 0.181 | −0.019 | −0.013 |
Note(s): Loadings in italic are values greater than or equal to 0.6
Considering loadings above 0.6, four factors were identified: (1) “sustainability and identity orientation”, with high loadings on statements related to branding, packaging, origin and environment; (2) “health awareness”, with high loadings on statements related to overall health; (3) “nutritional awareness”, with high loadings on statements related to nutrition and vitamins; and (4) “taste-price sensitivity”, with high loadings on statements related to taste and price.
The internal consistency of items loading highly (≥0.6) on each factor was assessed using Cronbach’s alpha (Cronbach, 1951). The first three factors demonstrated good internal consistency with alpha values ranging from 0.78 to 0.79, all above the conventional threshold of 0.6. The fourth factor (“taste-price sensitivity”) had a lower but still acceptable alpha of 0.48, common for factors comprised of only two items (Taber, 2018; Ekolu and Quainoo, 2019). The overall reliability for the entire 12-item scale was 0.80.
4.3 Segmentation of respondents by attitudes and perceptions toward elderberry products
After consolidating the attitudinal variables into four factors, a non-hierarchical K-means cluster analysis was conducted to identify consumer groups. The Calinski-Harabasz pseudo-F index was used to identify the optimal number of clusters (Table 3).
Cluster solution fit indices (n = 597)
| Number of clusters | Calinski–Harabasz pseudo-F |
|---|---|
| 2 | 115.48 |
| 3 | 119.95 |
| 4 | 122.26 |
| 5 | 124.03 |
| 6 | 122.19 |
| 7 | 110.64 |
| Number of clusters | Calinski–Harabasz pseudo-F |
|---|---|
| 2 | 115.48 |
| 3 | 119.95 |
| 4 | 122.26 |
| 5 | 124.03 |
| 6 | 122.19 |
| 7 | 110.64 |
The pseudo-F index increased with the addition of more clusters but peaked at five before declining. Hence, five clusters or consumer segments were selected (see Table 4), comprising 16.8%, 21.8%, 19.1%, 25.6% and 16.8% of respondents, respectively.
Respondents’ cluster membership and mean factor deviations (n = 597)
| Factor | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | Cluster 5 | F-value* | Prob > F |
|---|---|---|---|---|---|---|---|
| Sustainability and identity orientation | 0.001 | 1.182 | −0.116 | −0.956 | 0.057 | 175.45 | 0.000 |
| Health awareness | −1.540 | 0.335 | 0.416 | 0.160 | 0.386 | 140.54 | 0.000 |
| Nutritional awareness | −0.137 | 0.521 | 0.078 | 0.384 | −1.216 | 79.53 | 0.000 |
| Taste-price sensitivity | −0.279 | 0.387 | −1.257 | 0.462 | 0.501 | 53.68 | 0.000 |
| Factor | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | Cluster 5 | F-value* | Prob > F |
|---|---|---|---|---|---|---|---|
| Sustainability and identity orientation | 0.001 | 1.182 | −0.116 | −0.956 | 0.057 | 175.45 | 0.000 |
| Health awareness | −1.540 | 0.335 | 0.416 | 0.160 | 0.386 | 140.54 | 0.000 |
| Nutritional awareness | −0.137 | 0.521 | 0.078 | 0.384 | −1.216 | 79.53 | 0.000 |
| Taste-price sensitivity | −0.279 | 0.387 | −1.257 | 0.462 | 0.501 | 53.68 | 0.000 |
Note(s): *Between-groups, one-way ANOVA F-value
To characterize each segment, we compared the factor mean scores across the five clusters. These scores reflect the relative standing of each cluster on the identified factors, as shown in Table 4.
From the information in Table 4, we classify consumers as follows:
Cluster 1: Respondents show the lowest concern for health awareness among all clusters, with slightly below-average or near-average scores on other factors. We label this cluster “health-indifferent consumers.”
Cluster 2: Respondents show the highest concern for sustainability and identity orientation, along with above-average interest in health, nutrition and taste-price factors. We name this cluster “socially conscious consumers.”
Cluster 3: Respondents are primarily motivated by health awareness. They score the highest on health awareness while showing relatively low concern for taste and price sensitivity, as well as low interest in sustainability and nutritional content. We label this group “health-motivated consumers.”
Cluster 4: Respondents show the lowest concern for sustainability and identity while displaying moderate concern for health, nutrition and taste-price factors. They prioritize personal benefits (health, taste and price) over social or environmental issues. We name this cluster “conventional consumers.”
Cluster 5: Respondents show the lowest concern for nutritional awareness, low concern for sustainability and identity, moderate concern for health benefits, and the highest concern for taste and price among all segments. We name this cluster “taste- and value-seekers.”
To understand how respondents’ socioeconomic characteristics vary by segment, statistical tests were conducted (ANOVA for the continuous variable and chi-squared tests for categorical variables) to compare cluster means and assess significant differences across clusters. The results show that 9 of the 17 socioeconomic traits differ significantly across the five clusters (at varying levels of significance), as shown in Table 5, implying that the clusters exhibit distinct profiles.
Characteristics of elderberry consumers by cluster (mean values, n = 597)
| Variables | Health-indifferent | Socially conscious | Health-motivated | Conventional | Taste- and value-seekers |
|---|---|---|---|---|---|
| Purchased | 0.460*** | 0.677*** | 0.632*** | 0.601*** | 0.430*** |
| Frequency | 2.500*** | 3.693*** | 3.556*** | 3.120*** | 3.093*** |
| Male | 0.580** | 0.500** | 0.500** | 0.438** | 0.340** |
| Associate’s degree | 0.380 | 0.385 | 0.316 | 0.392 | 0.380 |
| Bachelor’s degree | 0.190 | 0.185 | 0.281 | 0.288 | 0.240 |
| Postgraduate | 0.080** | 0.215** | 0.184** | 0.137** | 0.130** |
| Income 35 k to 75 k | 0.420 | 0.308 | 0.298 | 0.379 | 0.270 |
| Income over 75 k | 0.210** | 0.362** | 0.404** | 0.366** | 0.370** |
| Age 35 to 54 | 0.440 | 0.446 | 0.439 | 0.366 | 0.350 |
| Age over 54 | 0.290* | 0.323* | 0.333* | 0.399* | 0.460* |
| Married | 0.430 | 0.485 | 0.412 | 0.549 | 0.430 |
| Children | 0.780 | 0.823 | 0.842 | 0.680 | 0.680 |
| White | 0.780*** | 0.662*** | 0.737*** | 0.837*** | 0.800*** |
| Black | 0.090** | 0.177** | 0.132** | 0.059** | 0.120** |
| Distance 5 mi to 29 mi | 0.360 | 0.300 | 0.333 | 0.314 | 0.330 |
| Distance over 29 mi | 0.230 | 0.169 | 0.167 | 0.170 | 0.100 |
| Variables | Health-indifferent | Socially conscious | Health-motivated | Conventional | Taste- and value-seekers |
|---|---|---|---|---|---|
| Purchased | 0.460*** | 0.677*** | 0.632*** | 0.601*** | 0.430*** |
| Frequency | 2.500*** | 3.693*** | 3.556*** | 3.120*** | 3.093*** |
| Male | 0.580** | 0.500** | 0.500** | 0.438** | 0.340** |
| Associate’s degree | 0.380 | 0.385 | 0.316 | 0.392 | 0.380 |
| Bachelor’s degree | 0.190 | 0.185 | 0.281 | 0.288 | 0.240 |
| Postgraduate | 0.080** | 0.215** | 0.184** | 0.137** | 0.130** |
| Income 35 k to 75 k | 0.420 | 0.308 | 0.298 | 0.379 | 0.270 |
| Income over 75 k | 0.210** | 0.362** | 0.404** | 0.366** | 0.370** |
| Age 35 to 54 | 0.440 | 0.446 | 0.439 | 0.366 | 0.350 |
| Age over 54 | 0.290* | 0.323* | 0.333* | 0.399* | 0.460* |
| Married | 0.430 | 0.485 | 0.412 | 0.549 | 0.430 |
| Children | 0.780 | 0.823 | 0.842 | 0.680 | 0.680 |
| White | 0.780*** | 0.662*** | 0.737*** | 0.837*** | 0.800*** |
| Black | 0.090** | 0.177** | 0.132** | 0.059** | 0.120** |
| Distance 5 mi to 29 mi | 0.360 | 0.300 | 0.333 | 0.314 | 0.330 |
| Distance over 29 mi | 0.230 | 0.169 | 0.167 | 0.170 | 0.100 |
Note(s): ***p < 0.01, **p < 0.05, *p < 0.1. Note: For binary variables (e.g., gender), only one category is reported; the complementary category is implied. For multi-category variables (e.g., income, education), the omitted group serves as the reference
These differences are reflected in socioeconomic patterns across the segments. Socially conscious and health-motivated consumers are the most engaged with the elderberry market, exhibiting the highest purchase rates and consumption frequency, as well as the largest shares of high income and postgraduate respondents. In contrast, health-indifferent consumers are the least likely to purchase elderberry products and present the lowest consumption frequency, as well as the lowest shares of postgraduates, high income-earners and older adults (over 54). Taste- and value-seekers stand out for having the highest share of older consumers (over 54) and the lowest share of males. Demographic differences also appear in racial composition: conventional consumers are predominantly White, while the socially conscious cluster has the largest share of Black respondents.
4.4 Determinants of the purchasing decision
We used a double-hurdle model to analyze the purchasing decision and the frequency of consumption of elderberry products. As explained in the methodology section, the first hurdle involves the estimation of a logistic regression, where the dependent variable is a binary indicator of whether the respondent purchased any elderberry product or not. Independent variables include the socioeconomic variables from Table 1, plus dummy variables for four of the five clusters, leaving cluster 1—health-indifferent consumers—as the reference category.
The results of the logistic regression are presented in Table 6. The model shows good fit (p < 0.001; Pseudo R2 = 0.098). In interpreting the results of the logistic regression, we focus on the column reporting average marginal effects, as these provide a more intuitive measure of the change in probability of purchasing elderberry products for a typical respondent.
Logistic regression results (1st hurdle): Probability of purchasing elderberry products (n = 597)
| Variable | Coefficient | Average marginal effect |
|---|---|---|
| Male | 0.0704 | 0.015 |
| (0.190) | (0.041) | |
| Associate’s degree | 0.781*** | 0.167*** |
| (0.236) | (0.049) | |
| Bachelor’s degree | 1.140*** | 0.244*** |
| (0.275) | (0.056) | |
| Postgraduate | 1.088*** | 0.233*** |
| (0.321) | (0.066) | |
| Income 35 k to 75 k | 0.233 | 0.050 |
| (0.223) | (0.048) | |
| Income over 75 k | 0.417* | 0.089* |
| (0.252) | (0.054) | |
| Age 35 to 54 | −0.841*** | −0.180*** |
| (0.251) | (0.052) | |
| Age over 54 | −0.740*** | −0.158*** |
| (0.261) | (0.055) | |
| Children | 0.239** | 0.051** |
| (0.102) | (0.022) | |
| Married | 0.279 | 0.060 |
| (0.206) | (0.044) | |
| White | −0.0531 | −0.011 |
| (0.280) | (0.060) | |
| Black | −0.184 | −0.039 |
| (0.374) | (0.080) | |
| Distance 5 mi to 29 mi | 0.0945 | 0.020 |
| (0.203) | (0.043) | |
| Distance over 29 mi | 0.221 | 0.047 |
| (0.256) | (0.055) | |
| Socially conscious | 0.872*** | 0.186*** |
| (0.287) | (0.060) | |
| Health-motivated | 0.589** | 0.126** |
| (0.299) | (0.063) | |
| Conventional | 0.425 | 0.091 |
| (0.275) | (0.058) | |
| Taste- and value-seekers | −0.160 | −0.034 |
| (0.310) | (0.066) | |
| Log pseudolikelihood | −367.597 | |
| Prob > | 0.000 | |
| Pseudo R2 | 0.098 | |
| Observations | 597 | |
| Variable | Coefficient | Average marginal effect |
|---|---|---|
| Male | 0.0704 | 0.015 |
| (0.190) | (0.041) | |
| Associate’s degree | 0.781*** | 0.167*** |
| (0.236) | (0.049) | |
| Bachelor’s degree | 1.140*** | 0.244*** |
| (0.275) | (0.056) | |
| Postgraduate | 1.088*** | 0.233*** |
| (0.321) | (0.066) | |
| Income 35 k to 75 k | 0.233 | 0.050 |
| (0.223) | (0.048) | |
| Income over 75 k | 0.417* | 0.089* |
| (0.252) | (0.054) | |
| Age 35 to 54 | −0.841*** | −0.180*** |
| (0.251) | (0.052) | |
| Age over 54 | −0.740*** | −0.158*** |
| (0.261) | (0.055) | |
| Children | 0.239** | 0.051** |
| (0.102) | (0.022) | |
| Married | 0.279 | 0.060 |
| (0.206) | (0.044) | |
| White | −0.0531 | −0.011 |
| (0.280) | (0.060) | |
| Black | −0.184 | −0.039 |
| (0.374) | (0.080) | |
| Distance 5 mi to 29 mi | 0.0945 | 0.020 |
| (0.203) | (0.043) | |
| Distance over 29 mi | 0.221 | 0.047 |
| (0.256) | (0.055) | |
| Socially conscious | 0.872*** | 0.186*** |
| (0.287) | (0.060) | |
| Health-motivated | 0.589** | 0.126** |
| (0.299) | (0.063) | |
| Conventional | 0.425 | 0.091 |
| (0.275) | (0.058) | |
| Taste- and value-seekers | −0.160 | −0.034 |
| (0.310) | (0.066) | |
| Log pseudolikelihood | −367.597 | |
| Prob > | 0.000 | |
| Pseudo R2 | 0.098 | |
| Observations | 597 | |
Note(s): Robust standard errors in parenthesis. ***p < 0.01, **p < 0.05, *p < 0.1. Constants omitted
Education emerges as a key factor: respondents with an associate’s degree are 16.7 percentage points more likely to purchase elderberry products than those with only a high school degree. Those with a bachelor’s degree are 24.4 percentage points more likely, and respondents with a postgraduate degree are 23.3 percentage points more likely. This suggests that more educated consumers may have greater health awareness or familiarity with functional foods. Age also plays a significant role. Compared to respondents under 35, those aged 35–54 are 18.0 percentage points less likely to purchase elderberry products and those over 54 are 15.8 percentage points less likely. This result is in line with Mohebalian et al. (2013), who found a similar pattern, and may reflect established consumption habits among older consumers that reduce their willingness to try new products.
Household composition matters as well: having children increases the probability of purchase by 5.1 percentage points, suggesting that families might be more inclined to seek out products perceived as healthy or immune-boosting. Finally, consumer attitudes captured by the segments are strongly associated with the purchase likelihood. Membership in the socially conscious cluster increases purchase probability by 18.6 percentage points, and being in the health-motivated cluster increases it by 12.6 percentage points, compared to the health-indifferent segment. Other variables, including gender, income, marital status, race and distance, were not statistically significant predictors. Overall, these results highlight that education, age, household composition and consumer attitudes are the primary drivers of the likelihood of purchasing elderberry products.
4.5 Determinants of consumption frequency
The second hurdle examines the frequency of elderberry consumption. The Brant test indicates that the proportional odds assumption is not violated ( = 85.55, p = 0.131), confirming that the ordered logistic regression is appropriate. The model shows good overall fit (p < 0.001; Pseudo R2 = 0.051), and the results are presented in Table 7.
Ordered logistic regression results (2nd hurdle): Frequency of consumption (n = 341)
| Variables | Coefficient | Odds ratio |
|---|---|---|
| Male | 0.162 | 1.176 |
| (0.210) | (0.247) | |
| Associate’s degree | −0.153 | 0.858 |
| (0.310) | (0.266) | |
| Bachelor’s degree | −0.400 | 0.670 |
| (0.341) | (0.229) | |
| Postgraduate | −1.003*** | 0.367*** |
| (0.388) | (0.142) | |
| Income 35 k to 75 k | 0.946*** | 2.577*** |
| (0.258) | (0.665) | |
| Income over 75 k | 0.883*** | 2.418*** |
| (0.303) | (0.732) | |
| Age 35 to 54 | 0.324 | 1.383 |
| (0.224) | (0.310) | |
| Age over 54 | −0.295 | 0.744 |
| (0.278) | (0.207) | |
| Children | 0.0287 | 1.029 |
| (0.0995) | (0.102) | |
| Married | 0.0223 | 1.023 |
| (0.244) | (0.250) | |
| White | 0.0808 | 1.084 |
| (0.302) | (0.328) | |
| Black | 1.158*** | 3.184*** |
| (0.368) | (1.172) | |
| Distance 5 mi to 29 mi | −0.174 | 0.841 |
| (0.243) | (0.205) | |
| Distance over 29 mi | −0.373 | 0.689 |
| (0.293) | (0.202) | |
| Socially conscious | 1.404*** | 4.071*** |
| (0.352) | (1.435) | |
| Health-motivated | 1.342*** | 3.827*** |
| (0.366) | (1.403) | |
| Conventional | 0.909** | 2.481** |
| (0.357) | (0.886) | |
| Taste- and value-seekers | 0.872** | 2.392** |
| (0.442) | (1.057) | |
| Log pseudolikelihood | −569.901 | |
| Prob > | 0.000 | |
| Pseudo R2 | 0.051 | |
| Observations | 341 | |
| Variables | Coefficient | Odds ratio |
|---|---|---|
| Male | 0.162 | 1.176 |
| (0.210) | (0.247) | |
| Associate’s degree | −0.153 | 0.858 |
| (0.310) | (0.266) | |
| Bachelor’s degree | −0.400 | 0.670 |
| (0.341) | (0.229) | |
| Postgraduate | −1.003*** | 0.367*** |
| (0.388) | (0.142) | |
| Income 35 k to 75 k | 0.946*** | 2.577*** |
| (0.258) | (0.665) | |
| Income over 75 k | 0.883*** | 2.418*** |
| (0.303) | (0.732) | |
| Age 35 to 54 | 0.324 | 1.383 |
| (0.224) | (0.310) | |
| Age over 54 | −0.295 | 0.744 |
| (0.278) | (0.207) | |
| Children | 0.0287 | 1.029 |
| (0.0995) | (0.102) | |
| Married | 0.0223 | 1.023 |
| (0.244) | (0.250) | |
| White | 0.0808 | 1.084 |
| (0.302) | (0.328) | |
| Black | 1.158*** | 3.184*** |
| (0.368) | (1.172) | |
| Distance 5 mi to 29 mi | −0.174 | 0.841 |
| (0.243) | (0.205) | |
| Distance over 29 mi | −0.373 | 0.689 |
| (0.293) | (0.202) | |
| Socially conscious | 1.404*** | 4.071*** |
| (0.352) | (1.435) | |
| Health-motivated | 1.342*** | 3.827*** |
| (0.366) | (1.403) | |
| Conventional | 0.909** | 2.481** |
| (0.357) | (0.886) | |
| Taste- and value-seekers | 0.872** | 2.392** |
| (0.442) | (1.057) | |
| Log pseudolikelihood | −569.901 | |
| Prob > | 0.000 | |
| Pseudo R2 | 0.051 | |
| Observations | 341 | |
Note(s): Robust standard errors in parenthesis. ***p < 0.01, **p < 0.05, *p < 0.1. Constants and cut-off points omitted
The results reveal clear drivers of elderberry consumption frequency. All consumer segments show significantly higher odds of frequent consumption compared to the health-indifferent reference group. The socially conscious and health-motivated segments are particularly strong predictors, with odds ratios of 4.07 and 3.83, respectively, indicating that these consumers are about four times more likely to report higher consumption frequency. The conventional and taste- and value-seeker segments also show significantly higher odds, with odds ratios of 2.48 and 2.39, respectively.
Among socioeconomic characteristics, Black consumers are about three times more likely than the “other race” baseline to be frequent consumers. Higher income consistently predicts more frequent consumption: consumers earning $35,000–$75,000 and those earning over $75,000 are about 2.5 times more likely to be high-frequency consumers compared to low-income respondents. In contrast, postgraduate education is associated with significantly lower odds of frequent consumption (odds ratio = 0.37), indicating that these individuals are roughly 63% less likely than those with a high school education to report higher consumption frequency. Other variables, including gender, age, number of children, marital status and distance to the nearest urban center, were not significantly associated with consumption frequency.
A more intuitive interpretation is obtained from the average marginal effects in Table 8, which illustrate how each factor shifts probability mass across the consumption spectrum. For instance, socially conscious consumers are 15.6 percentage points less likely to fall in the lowest frequency category and 19.5 percentage points more likely to be in the highest—representing the largest shift among all segments and underscoring their role as the core consumer base. Similarly, Black consumers exhibit a strong tendency toward regular use, being 11.7 percentage points less likely to be low-frequency consumers and 17.4 percentage points more likely to be high-frequency users. This pattern may reflect enduring cultural traditions of using natural or plant-based remedies within African diasporic communities (Gallego, 2019).
Second hurdle average marginal effects
| Frequency | ||||||
|---|---|---|---|---|---|---|
| Variable | 1 | 2 | 3 | 4 | 5 | 6 |
| Postgraduate | 0.155 | 0.061 | −0.020 | −0.031 | −0.070 | −0.095 |
| Income 35 k to 75 k | −0.115 | −0.074 | −0.011 | 0.018 | 0.061 | 0.121 |
| Income over 75 k | −0.114 | −0.067 | −0.002 | 0.020 | 0.058 | 0.106 |
| Black | −0.117 | −0.099 | −0.037 | 0.010 | 0.069 | 0.174 |
| Socially conscious | −0.156 | −0.113 | −0.028 | 0.019 | 0.082 | 0.195 |
| Health-motivated | −0.145 | −0.106 | −0.031 | 0.015 | 0.075 | 0.192 |
| Conventional | −0.110 | −0.066 | −0.010 | 0.014 | 0.053 | 0.119 |
| Taste- and value-seekers | −0.097 | −0.070 | −0.018 | 0.011 | 0.052 | 0.121 |
| Frequency | ||||||
|---|---|---|---|---|---|---|
| Variable | 1 | 2 | 3 | 4 | 5 | 6 |
| Postgraduate | 0.155 | 0.061 | −0.020 | −0.031 | −0.070 | −0.095 |
| Income 35 k to 75 k | −0.115 | −0.074 | −0.011 | 0.018 | 0.061 | 0.121 |
| Income over 75 k | −0.114 | −0.067 | −0.002 | 0.020 | 0.058 | 0.106 |
| Black | −0.117 | −0.099 | −0.037 | 0.010 | 0.069 | 0.174 |
| Socially conscious | −0.156 | −0.113 | −0.028 | 0.019 | 0.082 | 0.195 |
| Health-motivated | −0.145 | −0.106 | −0.031 | 0.015 | 0.075 | 0.192 |
| Conventional | −0.110 | −0.066 | −0.010 | 0.014 | 0.053 | 0.119 |
| Taste- and value-seekers | −0.097 | −0.070 | −0.018 | 0.011 | 0.052 | 0.121 |
The relationship with education is more complex. Postgraduate consumers are 15.5 percentage points more likely to belong to the lowest frequency group and 9.5 percentage points less likely to be in the highest, suggesting a gap between purchase intention and sustained use. This contrast with the first hurdle results—where higher education predicted purchase—may indicate that, while highly educated consumers are open to trying elderberry products, their greater skepticism or critical evaluation may hinder regular use, particularly if their consumption experience does not align with expectations about taste or efficacy (Grunert, 2010).
Finally, income effects show a roughly symmetric shift across the consumption spectrum: middle- and high-income consumers are about 11–12 percentage points less likely to be in the lowest frequency category and approximately the same amount more likely to be in the highest frequency level. Meanwhile, the conventional and taste- and value-seeking clusters show positive but more moderate shifts toward regular consumption, while health-inclined, taste-price indifferent consumers remain among the more engaged segments overall.
5. Discussion
This study set out to answer how consumer segments, defined by their attitudes, along with socioeconomic characteristics, influence the likelihood of purchasing elderberry products and the frequency of their consumption. By integrating attitudinal segmentation with a double-hurdle econometric framework, the analysis provides a multi-layered understanding of consumer behavior, confirming and extending the existing literature on elderberry and functional food consumption.
Regarding consumer segments, the findings advance the elderberry-specific literature. While Mohebalian et al. (2012) established a basic dichotomy of health-conscious consumers, factor and cluster analysis reveals a more complex, five-segment typology. This nuanced segmentation addresses limitations of prior research that relied on broad categorizations and product-specific analyses. The emergence of distinct segments such as socially conscious consumers (valuing sustainability and identity) and taste- and value-seekers demonstrates that motivations in the modern elderberry market extend beyond health considerations alone. This aligns with the broader consumer segmentation literature (e.g., Schäufele-Elbers and Janssen, 2023; Hempel, 2024), which highlights the multi-faceted nature of consumer attitudes.
The double-hurdle model not only links attitudinal profiles to actual behavior but also highlights differences between market entry and consumption intensity. For the initial purchase decision, only the socially conscious and health-motivated segments had a significant effect. This is broadly consistent with Cai et al. (2024), who found that premiums for attributes such as “organic” and “American-grown” influence purchasing behavior. The results here suggest that the socially conscious segment—identified as valuing sustainability and identity—may be particularly responsive to such attributes, indicating convergence between their stated preferences and the motivations captured by attitudinal segmentation. For consumption frequency, all non-baseline segments contributed significantly, indicating that while specific values may be necessary to induce trial, a broader range of motivations supports habitual use. These results help bridge the “attitude-behavior gap” noted in segmentation literature (e.g., Li and Roe, 2024), showing that the influence of attitudes varies across different stages of consumption.
Socioeconomic characteristics reveal additional heterogeneity. Younger consumers are more likely to make initial purchases, consistent with early adoption patterns for emerging functional foods (Mohebalian et al., 2012; Baker et al., 2022). The strong, consistent effect of higher income on both purchase and consumption frequency underscores the role of economic access, highlighting that financial resources facilitate participation in this emerging market (Verbeke, 2005). The most striking demographic result is the consistently high consumption frequency among Black consumers, which may be contextualized by cultural traditions of using natural remedies (Gallego, 2019). This suggests that elderberry producers may find a particularly loyal consumer base among this demographic.
Education presents a dual character. Specifically, postgraduate consumers are more likely to purchase, yet more likely to remain low-frequency consumers, suggesting a gap between trial and sustained consumption. This aligns with Baker et al. (2022), whose literature review on determinants of consumer acceptance toward functional foods notes that education can have mixed effects on consumption, with some studies finding higher education positively associated with use and others negatively. The two-stage model provides insight into this pattern: while higher education may increase health awareness and drive initial purchases, the critical evaluation skills associated with postgraduate education may lead to more discerning and less frequent consumption, consistent with theories that perceived quality evolves with experience (Grunert, 2010).
These findings offer a strategic roadmap for stakeholders in the elderberry industry, highlighting differences between factors influencing customer acquisition and those affecting retention. For acquisition, younger consumers (under 35) and families with children appear particularly responsive, with ethical production resonating with socially conscious consumers and immune-boosting benefits appealing to health-motivated consumers. Patterns of regular use suggest that addressing a range of consumer preferences may be beneficial—for example, premium brand-aligned products for socially conscious consumers, vitamin supplements for health-inclined consumers, and affordable, flavorful options for taste- and value-seekers. Communication that provides clear, science-backed information may help mitigate skepticism among highly educated individuals.
Although the study is based on US survey data, the methodological approach of integrating segmentation with a double-hurdle model and the observed distinction between drivers of purchase and consumption offer a useful framework for research on emerging functional foods in other markets. Context-specific studies would be needed to determine whether these patterns hold in other regions such as the UK or Europe.
6. Conclusions
This study examined how consumer segments defined by attitudes, along with socioeconomic characteristics, influence both the likelihood of purchasing elderberry products and the frequency of their consumption. By integrating attitudinal segmentation with a double-hurdle econometric framework, the findings provide a multi-layered understanding of these behaviors, offering both confirmation and meaningful extensions to the literature on elderberry and functional food consumer behavior.
Five consumer segments emerged: health-indifferent consumers, socially conscious consumers, health-motivated consumers, conventional consumers, and taste- and value-seekers. Socially conscious and health-motivated consumers exhibited the highest likelihood of purchase, while all non-baseline segments showed higher consumption frequency compared to health-indifferent consumers. Across segments, age, income, education, race and family composition also influenced purchasing behavior, suggesting demographic factors that may shape market outreach for elderberry products. These results highlight the importance of distinguishing between drivers of initial purchase and those that sustain regular consumption, providing a framework for dual marketing strategies—targeting socially and health-inclined consumers for market entry, and addressing a broader set of motivations and product formats for market retention.
Several limitations should be noted. First, the analysis relied on a non-probability sample without demographic or geographic quotas or post-stratification weights. Therefore, while the sample is national in scope, it is not statistically representative of the entire US population. Moreover, participants in online surveys may systematically differ from the general public in terms of online literacy or interest in functional foods. Second, all data—including purchasing behavior and consumption frequency—were self-reported, which may introduce recall bias or social desirability bias (e.g., by over-reporting consumption of products perceived as healthy or environmentally responsible). Third, the purchasing decision and frequency of consumption were captured at an aggregate level (elderberry products overall) rather than for specific product types. Although this approach allows for assessment of overall demand, it limits the ability to detect nuanced patterns for individual products such as wines, juices, or vitamin supplements.
Future research could explore the underlying motivations behind consumer preferences, particularly regarding emerging trends such as sustainability and environmental values. Longitudinal studies would be valuable for tracking how consumer attitudes and consumption patterns evolve over time, especially as the elderberry market grows. Additionally, research could assess whether similar consumer segment patterns emerge in other markets or cultural contexts, such as Europe or the UK. Such insights may support more effective marketing strategies, product development and policy decisions, helping businesses and policymakers understand shifting consumer dynamics and plan for future market opportunities.
Note
Because the “non-binary” and “other” categories accounted for only four responses, these were merged with the “female” category to form the reference group. Results are robust to alternative coding, such as merging them with “male”.

