0% found this document useful (0 votes)
80 views

CATA CATA - Intensity Napping

This study compared three sensory profiling methods - CATA, CATA with intensity ratings, and Napping - based on how 135 consumers perceived eight different beers. The three methods were compared based on their ability to discriminate between the beer samples, the degree to which they agreed on descriptive attributes, and the configurational similarity of the sample spaces obtained. ANOVA and PLSR analyses showed that all three methods successfully discriminated between the samples and identified similar numbers of significant descriptive attributes. Multiple factor analysis of the combined data revealed very high agreement between the methods, with RV coefficient values between 0.90 and 0.97, indicating similarity between the sample configurations obtained by each method. The results suggest that CATA provides comparable precision and reproducibility to
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
80 views

CATA CATA - Intensity Napping

This study compared three sensory profiling methods - CATA, CATA with intensity ratings, and Napping - based on how 135 consumers perceived eight different beers. The three methods were compared based on their ability to discriminate between the beer samples, the degree to which they agreed on descriptive attributes, and the configurational similarity of the sample spaces obtained. ANOVA and PLSR analyses showed that all three methods successfully discriminated between the samples and identified similar numbers of significant descriptive attributes. Multiple factor analysis of the combined data revealed very high agreement between the methods, with RV coefficient values between 0.90 and 0.97, indicating similarity between the sample configurations obtained by each method. The results suggest that CATA provides comparable precision and reproducibility to
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Food Quality and Preference 32 (2014) 160166

Contents lists available at SciVerse ScienceDirect

Food Quality and Preference


journal homepage: www.elsevier.com/locate/foodqual

Comparison of three sensory proling methods based on consumer perception: CATA, CATA with intensity and Napping
Helene C. Reinbach 1, Davide Giacalone 1, Leticia Machado Ribeiro, Wender L.P. Bredie, Michael Bom Frst
Department of Food Science, Faculty of Science, University of Copenhagen, Rolighedsvej 30, 1958 Frederiksberg C, Denmark

a r t i c l e

i n f o

a b s t r a c t
The present study compares three proling methods based on consumer perceptions in their ability to discriminate and describe eight beers. Consumers (n = 135) evaluated eight different beers using Check-All-That-Apply (CATA) methodology in two variations, with (n = 63) and without (n = 73) rating the intensity of the checked descriptors. With CATA, consumers rated 38 descriptors grouped in seven overall categories (berries, oral, hoppy, nutty, roasted, spicy/herbal and woody). Additionally 40 of the consumers evaluated the same samples by partial Napping followed by Ultra Flash Proling (UFP). ANOVA- and Discriminant Partial Least Square Regression (A-PLSR, D-PLSR) were used to evaluate the discriminative ability of the methods and descriptors. A-PLSR results showed that all samples were perceived as different in all three methods, whereas D-PLSR showed that all three methods had similar numbers of discriminating descriptors. For the two CATA variants, 29 and 24 descriptors for without and with rating intensity were signicant, for Napping/UFP the number was 26. Multiple Factor Analysis was used to derive an overall product map and to compare it to product congurations from individual methods. Both qualitative and quantitative analysis (comparison of RV coefcients of the MFA congurations) revealed a very high agreement of the three methods in terms of perceived product differences. RV coefcients were used to compare sample congurations obtained in the three descriptive methods. For all comparisons the RV coefcients varied between 0.90 and 0.97, indicating a very high similarity between all three methods. These results show that the precision and reproducibility of sensory information obtained by consumers by CATA is comparable to that of Napping. The choice of methodology for consumer descriptive methods should then be based on whether it is desired to have consumers articulate their own perception of descriptors, or if it sufcient to present them to an existing vocabulary. Napping is slower and more laborious, and better for explorative studies with smaller number of consumers whereas, CATA is faster, less labor-intensive and thus more suitable for larger groups of consumers. 2013 Elsevier Ltd. All rights reserved.

Article history: Received 26 October 2012 Received in revised form 22 February 2013 Accepted 25 February 2013 Available online 7 March 2013 Keywords: Fast sensory methods Napping Check-all-that-apply Descriptive proling Multivariate statistics Consumer perception

1. Introduction Descriptive sensory proling is important for the food industry as it can guide product development and reformulation of products as well as identify key sensory drivers essential for consumer acceptance and marketing of products. Conventional descriptive proling is performed with a trained panel to obtain an objective description of the food products investigated (Lawless & Heymann, 2010). The need for less time-consuming and economical descriptive methods in the food industry has supported the development and use of more dynamic and fast descriptive sensory proling methods assessed by panelists, food experts and consumers (Ares, Deliza, Barreiro, Gimenez, & Gambaro, 2010; Dehlholm, Brockhoff, Meinert, Aaslyng, & Bredie, 2012; Giacalone, Machado Ribeiro, &
Corresponding author. Tel.: +45 35 33 32 07; fax: +45 35 33 3509.
1

E-mail address: [email protected] (M.B. Frst). These authors contributed equally to this work.

Frst, forthcoming; Nestrud & Lawless, 2010). The fast methods include projective mapping (Risvik, McEwan, & Rdbotten, 1997) and Napping (Pages, 2003, 2005), Flash Proling (Dairou & Sieffermann, 2002) based on Free-Choice Proling (Williams & Langron, 1984) and different sorting techniques such as free (Lawless, Sheng, & Knoops, 1995) single (Rosenberg & Kim, 1975) and multiple sorting (Dehlholm et al., 2012). Napping is a method in which food samples are projected on a two-dimensional space based on similarities, and is often combined with Ultra Flash Proling (Perrin & Pages, 2009) to add a semantic description to the product differences. Napping can performed as a global Napping, including all sensory aspects, or as partial Napping focusing on specic sensory modalities (e.g. appearance, taste or mouthfeel) (Dehlholm et al., 2012; Pags, 2005). Other consumer-friendly methods, such as just-about right scales (JAR), attribute liking, emotional questionnaires and checkall-that-apply (CATA) are increasingly used to capture consumer perception of food products. In particular the CATA method, in

0950-3293/$ - see front matter 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.foodqual.2013.02.004

H.C. Reinbach et al. / Food Quality and Preference 32 (2014) 160166

161

which a product is described by selecting appropriate words from a given list, is a simple and valid approach to gather information about sensory and non-sensory perception, and is believed to have smaller effect on liking and consumer perception of the product than similar methods (e.g. JAR) (Adams, Williams, Lancaster, & Foley, 2007; Ares et al., 2010; Giacalone, Bredie, & Frst, 2013; Lado, Vicente, Manzoni, & Ares, 2010). Consumer-elicited CATA proles have shown good agreement with traditional panel-developed sensory proles (Dooley, Lee, & Meullenet, 2010; Ares et al., 2010), suggesting that CATA could be a valuable alternative to understand perception of product sensory attributes. The various methodologies to capture consumer perceptions are generally easier to perform and less time-consuming than traditional descriptive analysis with a trained sensory panel. Some methods are reductionist and based on a predened list of descriptor (e.g. CATA), while other methods are more holistic and explorative (e.g. Napping). One of the suggested drawbacks of CATA is that this method produces relatively impoverished dichotomized data (1/0), which allegedly would mask relative differences between specic attributes. Including intensity scaling of attributes in the CATA method may therefore improve the accuracy of descriptive proling and lead to a better product differentiation. This hypothesis could be tested by comparing CATA with CATA combined with intensity scaling. Data on consumer ratings of intensity generally show large variability and thus it is not clear if the scaling element would actually improve the CATA descriptions made by the consumer. Additionally, it would be of interest to compare how reductionist methods, with and without scale elements (CATA and CATA with intensity ratings), would fare compared to a more holistic and explorative one, such as Napping. The aim of the present study was to compare the effectiveness of three proling methods, CATA, Napping and a novel method combining CATA with intensity scaling in studying consumer perception of a sample of eight beers. Three comparative criteria were considered in this study: (1) Discriminative ability: i.e. the methods ability to successfully discriminate between the samples; (2) Descriptive ability: the degree to which the three proling methods would agree on the sensory characterization; (3) Congurational congruence: the degree to which the sample spaces obtained by the different methods would be closely related to one another. 2. Materials and methods 2.1. Consumers One hundred and thirty-ve consumers between 18 and 65 years were recruited in and around of University of Copenhagen (UCPH), through advertisement on websites, social networks, beer magazines and yers. Approximately half of the consumers (n = 73, 46 males and 27 females) described the avor of the beers using a CATA questionnaire. The other half (n = 62, 46 males, 16 females) completed a modied version of the CATA questionnaire where we introduced the possibility of scaling the intensity of checked attributes. Additionally, some of the consumers (n = 40, 23 males, 17 females) returned after approximately 10 days for a second session to perform a partial Napping focusing on the smell and taste attributes of the eight beers. After the testing, consumers received a token incentive for their participation (a bottle of craft beer, value  6 ). 2.2. Samples Eight beers were chosen for the study (Table 1), ve that represented the avor diversity of the Danish beer marked (e.g. fruity,

oral, woody, nutty or spicy), two beers were developed for the study to represent novel ingredients (sea buckthorn and pine) and nally a standard pilsner was included to represent the most consumed beer type in Denmark. 40 ml of beer was served at approximately 10 C in 24 cl beer glass covered with watch glasses and coded with three-digit random numbers. Serving orders were randomized to balance rst order and carry-over effects (MacFie, Bratchell, Greenhoff, & Vallis, 1989). 2.3. CATA variants Sensory perception of the eight beers was evaluated by respectively CATA and CATA combined with a 15-point intensity scale. On the CATA ballot seven overall avor categories were presented (Table 2). For each avor category consumers were asked to check yes, if the avor was present, and no if the avor was not present. This formulation2 differs from the classical check-all-that-apply, and was adopted in order to enhance the likelihood that consumers actually read through the whole list, reducing the behavior known as satiscing (Krosnick, 1991; Rasinski, Mingay, & Bradburn, 1994). Briey, satiscing is a theory in behavioral decision making maintaining that when most people examine alternative sequentially, they tend to choose the rst alternative that seem reasonable, as opposed to the optimal situation in which they would evaluate all alternatives comprehensively before taking a decision (Simon, 1955). Further, some overall avor descriptors were supplemented with sub-descriptors to enable consumers to specify the exact avor they perceived (Table 2). The list of avor attributes was developed with inspiration from the Danish beer language (Det Danske lakademi [Eng. The Danish Beer Academy], 2006), and the ballot was pre-tested informally to assess that the appropriateness of the attribute list. On the CATA ballot with intensity scaling, the seven overall avor attributes were presented with the yes/no checkboxes, the avor subdescriptors and one horizontally oriented 15-point intensity scale per avor category anchored with very weak and very strong in the ends to enable consumers to rate the intensity of the appropriate beer avors. The choice of including only avor terms, which differs from earlier CATA applications where often more holistic terms (e.g. emotions, usage attributes, conceptual attributes, etc.) are included, was motivated by our aim to restrict the focus on the descriptive proling applicability of CATA. 2.4. Partial Napping Napping was performed as a partial Napping focusing on the smell and taste of the eight beers. Each consumer was provided with a 60 40 cm blank paper (the Napping sheet), a pen, postits, a tray with eight beer samples and a spittoon. The sample order on the individual trays was randomized to counter-act rst order carry-over effect, even though the Napping methodology allows and requires subjects to go back and forth between samples. Consumers were instructed to evaluate the beer samples according to similarities or dissimilarities in smell and taste attributes by placing similar samples close to each other and more dissimilar samples further apart on the Napping sheet. After they had reached a nal conguration, consumers noted down appropriate descriptors for the smells and tastes of the beers on the post-its, which were moved around the Napping sheet, when needed. This procedure is known as Ultra-Flash proling and is commonly used to add a descriptive dimension to a Napping task (Perrin et al., 2008). When
2 A very similar formulation has been recently tested by Ennis and Ennis (2011), who coined their approach applicability scores. Although unaware of this contribution at the time of designing this experiment, it is interesting to notice that we came to very similar conclusions regarding the need to account for unchecked items in CATA questionnaires.

162 Table 1 Description of the eight beers. Beer name Sea buckthorn beer Fynsk forr Bgebryg Pine beer Valnd Hertug Stjernebryg Enebr Stout Thy Pilsner

H.C. Reinbach et al. / Food Quality and Preference 32 (2014) 160166

Main sensory characteristics Berries Floral Woody Woody Nutty Spicy Spicy Neutral

Main avor ingredient Sea buckthorn Elderower Beech tree extract Pine (Pin-Thyrol) Walnut Anise Juniper berries Hops

Beer style Flavored pilsner Pale Ale Amber Ale Flavored pilsner Dark Ale Strong Ale Stout Pilsner

Brewery Indslev/Univ. of Copenhagen (UCPH) rbk Skovlyst Indslev/UCPH Rise Herslev Grauballe Thy

Table 2 Overview of descriptors for the three methods and estimated jack-knife signicance from D-PLSR. Common descriptors BERRIES Blueberry Cranberry Sea buckthorn Rose hip Other berries Floral Elderower Chamomile Lavender Rose Other oral HOPPY NUTTY Hazelnut Almond Walnut Other nutty ROASTED Roasted bread Caramel Coffee Chocolate Other roasted SPICY/HERBAL Juniper berries Bog myrtle Anise Rosemary Cloves Laurel Other spicy/herbal WOODY Piney Birch Beech Maple Other woody Signicant descriptors Optimal PC Validated Y variance CATA (n = 73) 166** 16* n.s. n.s. 58** n.s. 242*** 93*** 50** 61** 36** 27** 345*** 200*** 72*** 47*** 71*** n.s. 271*** 60* 133*** 64*** 36*** n.s. 295*** 52* n.s. 56** 47*** 44* n.s. n.s. 232** 70** 60* 33* 36* n.s. 29 4 16% CATA Int. (n = 62) 140** 9* 31* n.s. n.s. 21** 215*** 77*** 38** n.s. n.s. n.s. 389*** 149*** 47* 33* 48*** 243*** n.s. 111*** 64*** 47*** 17* 274*** 50* 70* 64*** 35*** n.s. n.s. n.s. 205*** 71*** n.s. n.s. n.s. n.s. 24 4 11% Napping (n = 40) 7n.s. Other Napping descriptors (by # of occurrences) Sweet*** (96) Bitter** (74) Sour*** (48) Citrus*** (24) Fresh** (22) Fruityn.s. (20) Strong*** (20) Light* (19) Liquorice** (19) Yeasty** (20) Full-bodied* (18) Alcoholic** (15) Pilsner** (15) Summer* (14) Thin* (13) Burnt* (12) Grainyn.s. (12) Wateryn.s. (11) Heavyn.s. (9) Soapy* (8) Neutraln.s. (8) Springn.s. (7) Regular* (6) Autumnn.s. (6) Applen.s. (5) Low bitternessn.s. (5)

22***

45*** 18*

9n.s. 10* 33*** 8* 8* 19*

12n.s. 19**

26 5 22%

p Values for b coeff. Signicance levels, n.s. non-signicant. p < 0.05. ** p < 0.01. *** p < 0.001.
*

all the samples had been placed on the paper, the assessors substituted the post-its with an X and noted the sample codes and the beer characteristics next to the X. Each beer sample was tasted in the given order and swallowed at least once to get the full perception of the beers. Re-tastings and spitting out the beer were allowed. Water and crackers were used as palate cleansers. 2.5. Data analysis The analysis conducted was divided into two parts: an analysis of the descriptive outputs of the three methods (using unfolded data matrices and Partial Least Squares Regression), and an analy-

sis of the sample spaces (using crosstab matrices and Multiple Factor Analysis). The descriptive proles of the beers obtained from CATA, CATA with intensity and Napping were analyzed for product differences by ANOVA-Partial least square regression (A-PLSR; Martens & Martens, 2001) using the Unscrambler software (version 10.1, CAMO, Oslo, Norway). The matrices thus consisted of the number of beers (8) times the number of consumers in each group. For the A-PLSR analysis, the X-matrix consisted of the eight experimental beers (X = 1/0 design variables) while the Y-matrix consisted of the beer avor descriptors for CATA (Y = 1/0) and CATA with intensity (Y = intensity/0). The Y-matrix for the Napping

H.C. Reinbach et al. / Food Quality and Preference 32 (2014) 160166

163

included the taste and avor descriptors of the beers (Y = 1/0) elicited during the Ultra-Flash proling task. Relations between the avor/taste descriptors (X = 1/0 design variables for CATA and Napping, X = intensity/0 for CATA with intensity) and the experimental beers (Y = 1/0) were studied by Discriminant-PLSR (D-PLSR; Martens & Martens, 2001) with all variables standardized. Cross validation was performed by splitting the datasets into consecutive segments with eight samples (corresponding to one consumer), thus leaving out one consumer at a time. Martens uncertainty test, a jack-knife based extension of cross validation, was performed to estimate the signicance of the model parameters at the optimal number of components, taken at the minimum predicted root mean square error (Martens & Martens, 2000). At last, data were organized into three separate matrices to allow for comparison of the individual methods sample congurations obtained. First, the frequencies of the CATA descriptors were calculated for each of the eight beers and the counts organized in a table crossing beers and descriptors. For CATA with intensity ratings, a similar table was constructed using averaged attributes ratings. For each Napping sheet, the X and Y coordinates of each sample were determined, using the left bottom corner of the sheet as the origin of the coordinate system. Multiple Factor Analysis (MFA) was used to analyze the data, using the three initial data matrices as individual MFA groups. Multiple Factor Analysis is a multivariate statistical technique which aims at integrating different groups of variables describing the same observations. MFA can be regarded as an enriched PCA and is performed in three steps: rst, an initial PCA is computed on each separate group (in our case, CATA, CATA with intensity, and Napping). Secondly, the square root of the rst eigenvalue of each individual PCA is extracted and used as scaling factor for each respective data matrix. Finally, the data are re-merged into a global matrix, and a new PCA is performed on this new and scaled data. The scores plot represents a consensus map of how the samples were perceived on a global level. Additionally, MFA has the important characteristic that it allows a rapid comparison of the overall conguration with the individual group congurations (i.e. the initial PCAs), both qualitatively, i.e. by visual inspections of the partial points via projection matrix and quantitatively, i.e. by computing regressor vector (RV) coefcients (Robert & Escoer, 1976) to measure congurational congruence. In this study, RV coefcients were calculated for all possible combination of methodologies for two dimensions and were used to compare sample congurations for each of the three descriptive methods. Mathematically, it can be shown that the RV coefcient corresponds to the Pearsons correlation coefcient after rearranging the original matrices into vectors (Meyners, 2001). Thus, in the context of the present research high RV coefcients values would indicate that the methods would yield similar information. The MFA was carried out using the FactoMineR package (L, Josse, & Husson, 2008) in the statistical software R 2.14.1.

predictor variables, i.e. the samples since they correspond to columns of a design matrix) and visual inspection of the perturbed sub-models revealed that in all three methods all beers were perceived and described differently (p < 0.05). In agreement all three methods tended to group beers in two clusters (Fig. 1ac) containing beers with oral notes (Fynsk Forr, Sea-buckthorn, Thy Pilsner, Pine beer) versus beers with roasted and caramel notes (Stjernebryg, Enebr Stout, Bgebryg, Valnd Hertug). The congurations differed somewhat between the three methods particularly with regards to the sample correlated with the second PLSR component. 3.2. Sensory characterization (D-PLSR) Data from CATA (n = 73), CATA with intensity (n = 62) and Napping (n = 40) were analyzed by D-PLSR to compare the descriptive ability of the three models in providing a sensory characterization of the samples. The D-PLSR conducted with the two CATA dataset included 38 avor descriptors of which respectively 29 and 24 descriptors for CATA and CATA with intensity were found to have a signicant effect on the descriptive prole of the eight beers (Table 2). The most frequently used avor descriptors for CATA and CATA with intensity were hoppy, spicy/herbal, roasted, oral, woody, nutty, berries, caramel and elderower, words which covered all seven avor categories. For Napping basic taste descriptors were the most frequently used (e.g. sweet, bitter, sour) followed by the avor descriptors hoppy, caramel, fruity, citrus and spicy. The optimal number of component was four for both CATA and CATA with intensity, accounting for 11% and 16% of the validated Y variance respectively. The rst PLSR component described the difference in roasted and nutty avors versus the oral and berry notes whereas the second component explained differences in hoppy versus spicy/herbal and woody avors as well as oral/berries (only for CATA). The third component explained variation due to woody and spicy versus other characteristics (e.g. hoppy, berries, oral, roasted) for CATA and hoppy versus oral for CATA with intensity (not shown in gure). For the Napping/UFP method, consumers used a total of approximately 250 words to describe the avors and tastes of the eight beers. Words mentioned ve times or less were kept out of the analysis and synonyms were semantically grouped, following the guidelines given by Perrin et al. (2009). A total of 37 words were retained, of which 26 were found to have signicant effect on the description of the eight beers (Table 2). Thirteen Napping descriptors were in common with the CATA/CATA with intensity. Ten of these descriptors were signicantly for describing the beers and covered all seven avor categories whereas the remaining 24 descriptors were unique to the Napping method. The Napping data were optimally described by ve PLSR components accounting for 22% of the validated variance (Table 2). The rst PLSR component accounted for variation in sweet, liquorice, alcoholic, caramel, full-bodied and strong avor/tastes versus more sour, fresh, oral, fruity and light notes. In accordance with CATA and CATA with intensity roasted and nutty notes tended to be opposed to oral along the rst component of the D-PLSR for the Napping data. The second component described variation due to bitterness versus low bitterness, fruity, oral and sweet characteristics. The third component described variation due to hoppy and malty versus piney avor notes (gure not shown). Clear common trends in consumer perception of the beers were observed across all three methods (Fig. 2). For all three descriptive methods the major variance in the beers were caused by differences in roasted, nutty and caramel notes as opposed to oral avors along the rst component. The second component described variation due to hoppy versus spicy/herbal (CATA and CATA with

3. Results 3.1. Discriminative ability (A-PLSR) Data from CATA (n = 73), CATA with intensity (n = 62) and Napping (n = 40) were analyzed by A-PLSR to compare the discriminative ability of the three models. The optimal component number derived from the cross-validation was two for the CATA, three for the CATA with intensity method, and two for the partial Napping. These components accounted respectively for 8%, 11%, and 4% of the total variance in avors/tastes of the eight beers (Fig. 1). Jack-knife based estimation of the regression coefcients (of the

164

H.C. Reinbach et al. / Food Quality and Preference 32 (2014) 160166

Fig. 1. Scores plots from A-PLSR models for CATA (a), CATA with intensity (b), and Napping (c).

Fig. 2. Correlation loadings plot of the D-PLSR models including descriptors for CATA (A), CATA with intensity (B), and Napping (C). Circled descriptors have a signicant effect on the sample variation (p < 0.05). The inner and outer ellipses represent R2 = 50% and R2 = 100% respectively.

intensity) or oral and fruity avors for Napping. Interestingly, the D-PLSR analysis on the Napping dataset showed that the basic tastes, sweet and sour accounted for much of the variation in the rst component whereas bitter versus fruity avors explained variation in the second component. These ndings could indicate the additional information could have been gained with CATA/CATA with intensity if basic tastes and the fruity descriptor had been included in the listed descriptors. This also highlights the specic advantage of free descriptors elicitation (such as in Napping/UFP) for uncovering main dimensions relevant for consumers which may be overlooked in methods where subjects are presented with a predened list of descriptors. The ability to identify the main sensory characteristics for each of the eight beers (given in Table 1) varied between the three sensory proling methods. The sea-buckthorn avored pilsner was described as having sea buckthorn, oral (especially chamomile) and berry avors using CATA and CATA with intensity whereas Napping revealed that sour was the most descriptive word for the beer (Fig. 2). The pale ale with elderower (Fynsk Forr) was correctly identied as having a distinct oral and elderower avors by all descriptive methods. In contrast, the prominent avor from the ale with beech twigs (Bgebryg) and dark ale with walnut (Valnd Hertug) were less clearly identied in their descriptive prole. Bgebryg and Valnd Hertug tended to be described as having caramel, chocolate, walnut and roasted notes by all methods as well as having nutty and woody notes (Napping, CATA) as well as almond (CATA, CATA intensity) hoppy (CATA intensity), piney (CATA) and to have juniper berry (CATA) and Other berry avors (CATA intensity). The pine avored pilsner were perceived to have piney or piney-like avors (e.g. woody, birch, rosemary) with all three descriptive methods. However, this beer was best differentiated from the other beers with the CATA with intensity method. The anise and liquorice avors were correctly identied in the strong ale with anise (Stjernebryg) by all descriptive methods and was further characterized as having nutty avors (all methods) as well as having notes of caramel, cloves, walnut and hazelnut

(CATA and CATA with intensity), spicy/herbal avors (CATA) and as having a sweet taste (Napping). None of the descriptive methods identied the Juniper berries in the Stout with Juniper berries (Enebr Stout); however, this beer was perceived as having distinct coffee and chocolate avors by all methods. Finally the Thy pilsner was perceived to be hoppy with a hint of oral when using the CATA and CATA intensity method whereas the it was described as a pilsner type beer with bitter and sour tastes when assessed by Napping. 3.3. Congurational congruence (MFA) The last step of our analysis aimed at evaluating the congurational similarity of the sample spaces obtained by the three methods. This was assessed by MFA performed on three cross tabulation matrices containing frequency of concurrencies for each descriptor (CATA), average ratings over samples (CATA with intensity), and the assessors coordinates (Napping). Fig. 3 below shows the rst two dimensions of the consensus MFA sample map (70.5% of the explained variance). The partial congurations obtained by the individual methods are superimposed to the consensus points. Visual inspection of Fig. 3 shows that all methods provided very similar sample maps. The only partial exception was the Pine beer where the CATA and the CATA with intensity seem to disagree, but only with regards to the variation described by the second MFA component. Accordingly, the three methods correlated similarly with the rst MFA component (CATA = 33.7%, CATA with intensity = 33.4%, Napping = 32.9%. Percentages refer to the contribution of individual groups of variables to the MFA component), and differed slightly with regards to the second component (17.5%, 48.9%, and 33.6% respectively). RV coefcients were used to compare sample congurations obtained in the three descriptive methods. For all method comparisons the RV coefcients varied between 0.90 and 0.97, indicating a very high similarity between all three methods. The highest similarity was found between the two CATA methods (RV = 0.97)

H.C. Reinbach et al. / Food Quality and Preference 32 (2014) 160166

165

CATA CATA_INT Napping

PineBeer

Stjernebryg

ValndHertug SeaBuckthorn FynskForr Bgebryg

EnebrStout

ThyPilsner

-3

-2

-1

0 Dim 1 (54.79 %)

Fig. 3. Consensus MFA sample space (rst two components) with superimposed partial points from individual methods.

followed by Napping and CATA with intensity (RV = 0.93) whereas the similarity between Napping and the CATA method was slightly lower (RV = 0.90).

4. Discussion In the present study, three consumer-based descriptive methods were compared. Two of the methods were verbal-based CATA, and a CATA variant with possibility to rate intensity of checked attributes while one was based on direct sample comparisons (Napping). Despite the different nature, comparative MFA revealed that the overall product map based on data from the three descriptive methods was very similar to the individual beer sample congurations for each method. The high similarity between methods was conrmed by the RV coefcients ranging from 0.90 to 0.97. These results suggest that the precision and reproducibility of sensory information obtained by consumers with CATA is comparable to that of Napping. This is in concordance with ndings by Ares et al. (2010) that projective mapping and CATA questions provide very similar sensory proles of eight milk desserts, supporting high validity of both sensory methods. The high similarity found between the two CATA methods in the present study indicates that quantitative scaling might not improve the accuracy of the CATA prole, but indicates that a quantitative dimension can be added to CATA without complicating the task for the consumer. The latter point is suggested by the equal time to completion (average of 45 min for both CATA versions), the equal number of checked terms (5.5 on average for the straight CATA and 5.7 for the scaled version), and, indirectly, by the fact that on a separate analysis no signicant difference were found on liking ratings between the two CATA groups. A previous comparison between CATA and intensity scaling conrmed a high agreement between the two methods (Parente, Ares, & Manzoni, 2010). Recently Ares, Varela, Rado, and Gimenez (2011a) and Ares, Varela, and Rado (2011b) compared intensity scaling with CATA and projective mapping of orange avored powdered drinks in one study, and added sorting as a fourth method in a second study. Comparing the data by MFA, the authors also found the methods to provide similar information regarding the sensory proles, suggesting that stable product congurations can be obtained across different methodologies and different groups of consumers.

The presented work expanded this approach by including APLSR and D-PLSR analyses to ascertain additional information. This approach turned out to be very useful in comparing the three descriptive methods. First the discriminative ability of the three methods was analyzed by A-PLSR and comparable overall trends were obtained for the three methods revealing that the eight beers tended to group into clusters of oral beers versus beers with roasted and caramel avors. Importantly, the three methods were able to discriminate between the beers. CATA with its clearly dened word descriptors might be faster and make it easier for the consumer to discriminate between and describe the beers and therefore improve discriminative ability when compared to Napping. Conversely CATA have previous been criticized for using relatively impoverished dichotomous data, which would supposedly yield a smaller discrimination power. These claims were however not conrmed in the present study. Next the descriptive ability of the three methods was compared by using D-PLSR, which revealed that the methods tended to vary in both number of words and type of words used. Comparable overall trends for the three methods was conrmed with D-PLSR, revealing that the eight beers spanned the product space that ranged from beers with roasted, nutty and caramel notes (Stjernebryg, Enebr Stout, Valnd Hertug, Bgebryg) to beers with oral, berry and piney avors (Sea buckthorn beer, Fynsk Forr, Thy Pilsner, Pine beer) in one dimension and hoppy (Thy Pilsner) versus spicy/herbal, woody (Pine beer), fruity or oral in the second dimension. The fact that the sweet and sour as well as bitter taste explained much of the variation in the Napping data highlights the importance of the basic tastes in consumer perception of the eight beers and gives additional information that was not captured by the predened CATA words. The ability to identify the main sensory characteristics for each of the eight beers was also similar for the three sensory proling methods. Three ingredients, elderower in the pale ale (Fynsk Forr), pine in one of the avoured pilsners, and anise/liquorice in the strong ale (Stjernebryg) were correctly identied by all descriptive methods, as was the Thy pilsner, which was hoppy and described as a pilsner type beer with bitter taste. The dark ale with walnut (Valnd Hertug) tended to be described as having walnut avors by all methods and was perceived similar to the amber ale with beech twigs (Bgebryg). The beech avor in the amber ale was identied with the CATA method whereas the ale only tended to be woody with the Napping method. The sea-buckthorn in the avored sea buckthorn pilsner was only identied with CATA and CATA with intensity suggesting that novel ingredients might be easier to capture if consumers are inuenced by predened CATA words raising awareness of the novel ingredients. The juniper berries in the Stout (Enebr Stout) were however not identied by any of the descriptive methods despite the help from the CATA words suggesting that some avors (e.g. novel, unfamiliar, unexpected) might be difcult to capture, and might not be identied unless proper training is conducted. Even though the ability to identify and describe key sensory attributes in the beers did not vary much between the three methods, small differences were observed. Napping/UFP naturally tended to facilitate the development of a larger and more diverse vocabulary comprised of both sensory attributes and holistic terms whereas CATA is limited to the listed sensory attributes. Conversely, the predened CATA words aided the description and identication of certain attributes (e.g. woody and sea buckthorn). These ndings suggest that the choice of method should be based on the type of descriptive prole required and the resources available. Napping might be benecial to use when unique, intuitive or creative descriptors are needed (e.g. explorative studies) with a smaller number of consumers, as Napping is slower and more laborious than CATA. CATA might useful in raising awareness of certain

Dim 2 (15.69 %)

-1

166

H.C. Reinbach et al. / Food Quality and Preference 32 (2014) 160166 Ares, G., Varela, P., Rado, G., & Gimenez, A. (2011a). Are consumer proling techniques equivalent for some product categories? The case of orange-avored powdered drinks. International Journal of Food Science and Technology, 46, 16001608. Bruzzone, F., Ares, G., & Gimenez, A. (2012). Consumers texture perception of milk desserts. II Comparison with trained assessors data. Journal of Texture Studies, 43, 214226. Dairou, V., & Sieffermann, J. M. (2002). A comparison of 14 jams characterized by conventional prole and a quick original method, the ash prole. Journal of Food Science, 67, 826834. Dehlholm, C., Brockhoff, P. B., Meinert, L., Aaslyng, M. D., & Bredie, W. L. P. (2012). Rapid descriptive sensory methods Comparison of Free Multiple Sorting, Partial Napping, Napping, Flash Proling and conventional proling. Food Quality and Preference, 26, 267277. Dooley, L., Lee, Y., & Meullenet, J. (2010). The application of check-all-that-apply (CATA) consumer proling to preference mapping of vanilla ice cream and its comparison to classical external preference mapping. Food Quality and Preference, 21, 394401. Ennis, D. M., & Ennis, J. M. (2011). Interpreting applicability scores. IFP Press, 14(4), 34. Giacalone, D., Bredie, W. L. P., & Frst, M. B. (2013). All-in-one test (AI1): A rapid and easily applicable approach to consumer product testing. Food Quality and Preference, 27, 108111. Giacalone, D., Machado Ribeiro, L., & Frst, M. B. (forthcoming). Consumer-based product proling: Application of partial Napping for sensory characterization of specialty beers by novices and experts. Journal of International Food and Agribusiness Marketing (accepted manuscript). Krosnick, J. A. (1991). Response strategies for coping with the cognitive demands of attitude measures in surveys. Applied Cognitive Psychology, 5, 213236. Lado, J., Vicente, E., Manzoni, A., & Ares, G. (2010). Application of a check-all-thatapply question for the evaluation of strawberry cultivars from a breeding program. Journal of the Science of Food and Agriculture, 90, 22682275. Lawless, H. T., & Heymann, H. (2010). Sensory evaluation of food: Principles and practices (2nd ed.). New York, NY: Springer. Lawless, H. T., Sheng, N., & Knoops, S. S. C. P. (1995). Multidimensional-scaling of sorting data applied to cheese perception. Food Quality and Preference, 6, 9198. L, S., Josse, J., & Husson, F. (2008). FactoMineR: an R package for multivariate analysis. Journal of Statistical Software, 25, 118. MacFie, H. J., Bratchell, N., Greenhoff, N. K., & Vallis, I. V. (1989). Designs to balance the effect of order of presentation and rst-order carry-over effects in hall tests. Journal of Sensory Studies, 4, 129148. Martens, H., & Martens, M. (2000). Modied jack-knife estimation of parameter uncertainty in bilinear modelling by partial least squares regression (PLSR). Food Quality and Preference, 11, 516. Martens, H., & Martens, M. (2001). Multivariate analysis of quality: An introduction (1st ed.). Chichester, West Sussex, England: John Wiley & Sons Ltd. Meyners, M. (2001). Permutation tests: Are there differences in product liking? Food Quality and Preference, 12, 345351. Nestrud, M. A., & Lawless, H. T. (2010). Perceptual mapping of apples and cheeses using projective mapping and sorting. Journal of Sensory Studies, 25, 390405. Pags, J. (2003). Direct collection of sensory distances: Application to the evaluation of ten white wines of the Loire Valley. Sciences des Aliments, 23, 679688. Pags, J. (2005). Collection and analysis of perceived product inter-distances using multiple factor analysis: Application to the study of 10 white wines from the Loire Valley. Food Quality and Preference, 16, 642649. Parente, M. E., Ares, G., & Manzoni, A. V. (2010). Application of two consumer proling techniques to cosmetic emulsions. Journal of Sensory Studies, 25, 685705. Perrin, L., & Pages, J. (2009). Construction of a product space from the Ultra-Flash proling method: Application to 10 red wines from the Loire Valley. Journal of Sensory Studies, 24, 372395. Rasinski, K. A., Mingay, D., & Bradburn, N. M. (1994). Do respondents really mark all that apply on self-administered questions? Public Opinion Quarterly, 58, 400408. Risvik, E., McEwan, J. A., & Rdbotten, M. (1997). Evaluation of sensory proling and projective mapping data. Food Quality and Preference, 8, 6371. Robert, P., & Escoer, Y. (1976). Unifying tool for linear multivariate statisticalmethods- RV-coefcient. Journal of the Royal Statistical Society Series C Applied Statistics, 25, 257265. Rosenberg, S., & Kim, M. P. (1975). Method of sorting as a data-gathering procedure in multivariate research. Multivariate Behavioral Research, 10, 489502. Simon, H. A. (1955). A behavioral model of rational choice. Quarterly Journal of Economics, 69, 99118. Williams, A. A., & Langron, S. P. (1984). The use of free-choice proling for the evaluation of commercial ports. Journal of the Science of Food and Agriculture, 35, 558568.

attributes and thereby aid the consumer in performing the descriptive prole. CATA is fast and easy and thus suitable for larger groups of consumers. The CATA with intensity variation investigated here did not show signicant advantages compared to the unscaled version, providing very similar information. However, it should be acknowledged that the samples used in this work were characterized by rather large sensory differences. Scope exists for future research to assess the issue of discriminative power of CATA with less heterogeneous sample sets. Thus we conclude, in agreement with other authors (Bruzzone, Ares, & Gimenez, 2012), that the number of times consumers checked the presence of an attribute provides already a good estimate of its intensity. Importantly, the CATA with intensity method yielded fewer signicant descriptors than the CATA version, possibly due to consumers inconsistency in the use of scales. This latter conclusion should be considered tentative due to the fact that the sample size in the CATA with intensity condition was slightly lower than in the CATA one. However, a separate analysis treating the ratings in the CATA w/intensity data as normal 1/0 CATA revealed extremely similar results to the scaled version, and returned a signicant effect of two variables (lavender and rose) which were found to be non-signicant when intensity ratings were used in the analysis (cf. Table 2). This corroborates the idea that potential advantages of scaling might be outweighed by the higher probability of obtaining a noisier dataset. 5. Conclusion In summary, the combination of MFA and PLSR for data analysis of CATA, CATA with intensity and Napping data revealed very good agreement in terms of inter-perceived product differences. The high inter-method reliability shows that consumer data are highly repeatable, and supports the validity of the three methods employed in the study. Additionally, these ndings conrm that rapid descriptive methods are suitable to capture differences among products and that they can be a useful tool for capturing and understanding consumer perceptions. The choice of methodology should be based on practical considerations, such as ease of use or whether it is desired to have consumers articulate their own perception of descriptors, or if it sufcient to present them to an existing vocabulary. Acknowledgments Support for this work was provided by the Danish Agency for Science, Technology and Innovation and by the Faculty of Science, University of Copenhagen. The organizing committee of the 5th Eurosense Conference is thanked for the bursary award granted to Davide Giacalone. References
Adams, J., Williams, S., Lancaster, B., & Foley, M. (2007). Advantages and uses of check-all-that-apply response compared to traditional scaling of attributes for salty snacks. In 7th Pangborn sensory science symposium. Minneapolis, MN, USA: Hyatt Regency. Ares, G., Deliza, R., Barreiro, C., Gimenez, A., & Gambaro, A. (2010). Comparison of two sensory proling techniques based on consumer perception. Food Quality and Preference, 21, 417426. Ares, G., Varela, P., & Rado, G. (2011b). Identifying ideal products using three different consumer proling methodologies. Comparison with external preference mapping. Food Quality and Preference, 22, 581591.

You might also like