Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The development and psychometric evaluation of the Chinese Big Five Personality Inventory-15

  • Xintong Zhang,

    Roles Methodology, Writing – original draft

    Affiliations Department of Psychology, Guangzhou University, Guangzhou, China, The Center for Psychometrics and Latent Variable Modeling, Guangzhou University, Guangzhou, China

  • Meng-Cheng Wang ,

    Roles Conceptualization, Funding acquisition, Writing – review & editing

    [email protected]

    Affiliations Department of Psychology, Guangzhou University, Guangzhou, China, The Center for Psychometrics and Latent Variable Modeling, Guangzhou University, Guangzhou, China, The Key Laboratory for Juveniles Mental Health and Educational Neuroscience in Guangdong Province, Guangzhou University, Guangzhou, China

  • Lingnan He,

    Roles Data curation, Investigation, Methodology, Project administration, Supervision

    Affiliation School of Communication and Design, Sun Yat-Sen University, Guangzhou, China

  • Luo Jie,

    Roles Methodology, Writing – review & editing

    Affiliation School of Education, Guizhou Normal University, Guiyang, China

  • Jiaxin Deng

    Roles Data curation, Investigation, Project administration

    Affiliations Department of Psychology, Guangzhou University, Guangzhou, China, The Center for Psychometrics and Latent Variable Modeling, Guangzhou University, Guangzhou, China

Abstract

The Chinese Big Five Personality Inventory (CBF-PI), a 134-item self-report scale, and its 40-item brief version (CBF-PI-B) are sound psychometric instruments used to measure the Big Five personality domains in the Chinese population. However, their applicability is limited by their length, as well as restricted by assessment conditions. In this study, we developed and validated a new shortened version with 15 items (CBF-PI-15) through exploratory factor analysis and confirmatory factor analysis in a large sample (Sample 1) of 10,738 Chinese adults (mean = 33.90 years, SD = 9.39 years, range 17–57 years). Measurement invariance results suggested the CBF-PI-15 were invariant across gender and age groups. Convergent, discriminant and criterion validities were tested in Sample 2 (N = 256, mean = 21.62 years, SD = 3.06 years, range 18–35 years) and findings showed an expected correlational pattern with external variables. Results revealed positive correlations of Neuroticism with the Barratt Impulsiveness Scale Brief Version (BIS-Brief), the Patient Health Questionnaire, and the Generalized Anxiety Disorder Screener, as well as a strongly negative correlation between Conscientiousness and BIS-Brief. Additionally, Conscientiousness positively correlated with academic performance as expected. In conclusion, the CBF-PI-15 holds promise as an informative alternative for the original CBF-PI-B when administration time or conditions are limited, and our findings provide preliminary support for the utility of the CBF-PI-15.

Introduction

The Big Five model of personality traits is one of the well-established paradigms for the conceptualization of human personality [1, 2]. However, it is not a comprehensive taxonomy of natural language personality descriptors, and excludes assessable terms and state descriptors [3]. Admittedly, the Big Five has been advocated for as a basic framework for personality description and assessment due to its replicability and ubiquity [4] across numerous cultures over the years. Generally, Extraversion (E), Agreeableness (A), Conscientiousness (C), Neuroticism (N; emotional stability), and Openness to experience (O), each having several facets in a hierarchical structure, are widely accepted as five relatively independent factors which account for phenotypic personality variations between people. The Big Five factor framework has profoundly affected studies about individual personality differences [1], and to date, an extensive body of evidence has corroborated close associations of personality traits with extant variables such as academic performance, impulsiveness, and depression (e.g., [57]).

The wide acceptance of the Big Five framework by personality researchers highlights a need for an efficient, reliable, and valid instrument to measure these traits. The 240-item Revised NEO Personality Inventory (NEO-PI-R; [8]) with six facets per dimension, and the NEO Five-Factor Inventory (NEO-FFI; [8]), a brief version of the NEO-PI-R with 60 items, are two of the most commonly utilized questionnaires to assess personality using the Big Five model. However, these two measures are proprietary instruments, which inhibits their availability for research use [9, 10]. Alternatively, the 44-item Big Five Inventory (BFI-44; [11]) was constructed in the late 1980s as a creatively sufficient personality inventory for non-commercial use. A total of 44 short-phrase items were developed, to be responded to in five minutes or less [12], greatly shortening the assessment time.

Although the NEO-PI-R, NEO-FFI, and BFI-44 continue to be widely used, there has been a recent tendency towards developing shorter measures [13]. Shortened personality questionnaire assessments often have psychometric limitations in certain aspects (e.g., reliability or content validity) as expounded upon by Credé and co-investigators (2012) [14], but show attractive benefits when considering the balance between practice and psychometric properties. Specifically, the shortened forms minimize assessment time, avoiding problems of respondents feeling fatigued and bored, meaning that respondents rate each item with more intention and focus which will then lead to more accurate responses [15]. As such, negative participant reactions (i.e., refusal to respond, careless or random responses due to fatigue) can be avoided to some extent by using the shortened forms [14]. More importantly, the shortened forms retain the original measures’ conceptual focus, reliability, and validity, and have a good convergence with the corresponding facets of the original form (e.g., [1618]).

Examples of this trend of minimal measurements included the 10-item Big Five Inventory (BFI-10; [12]) and the 20-item Mini-IPIP [19]. For instance, the BFI-10 demonstrates good convergent validity with the NEO-PI-R, good external validity with peer ratings, test-retest reliability, and comparative or even improved predictive validity against the BFI-44 [10, 12]. Additionally, Donnellan and colleagues (2006) [19] designed the Mini-IPIP on the basis of two considerations (i.e., the aim to maintain content coverage and acceptable psychometric properties, and the desire to evaluate each personality domain with fewer items), and demonstrated favorable results (e.g., excellent content coverage, high test–retest correlations) using a large Western undergraduate sample. Likewise, Baldasaro and colleagues (2013) [20] replicated the previous findings and suggested that the Mini-IPIP had acceptable reliability, partial or full metric invariance, and exhibited some degree of criterion validity when drawing on a representative sample of American young adults. More recently, two short German forms of the Next Big Five Inventory (BFI-2; [21]), namely the 30-item BFI-2-S and the 15-item BFI-2-XS, have been developed by Soto and John (2017b) [22] to address the common limitations that the original long version might possess in some circumstances, such as being a large-scale survey or for when participants need to complete the same personality measure multiple times. As expected, both the BFI-2-S and BFI-2-XS maintained much of the full measure’s reliability and validity at the level of the Big Five domains, and the BFI-2-S has been proven to be useful when assessing facet personality traits among reasonably large populations [22]. Overall, the quality of psychometric properties of the relevant long or shortened forms measuring the Big Five personality structure have been robustly validated for Western populations [10].

From a cross-cultural perspective, it is important to replicate the Big Five model in a non-Western sample (e.g., Chinese). At present, questionnaire assessments for the Big Five model have been translated and revised, and are being used widely in China. McCrae and colleagues (1996) [23] translated the NEO-PI-R into Chinese, and yielded strong evidence of the cross-cultural replicability of the NEO-PI-R factor structure and the Big Five model in a sample of 352 Hong Kong college students. Subsequently, Dai and colleagues (2004) [24] further confirmed the factor structure and psychometric properties of the revised Chinese version of the NEO-PI-R in Mainland China community adults. Similarly, those findings were also replicated using Chinese college students using the Chinese version of the NEO-FFI [25], and Zheng and colleagues (2008) [26] confirmed the five-factor model when using the Chinese-language IPIP Big Five factor markers. Despite these promising findings, however, there are still a number of limitations in existing research. Specifically, Zheng et al. (2008) [26] evaluated the BFI-44 using Chinese participants, but did not report the details about the resource translation or psychometric properties. Leung and colleagues (2013) [27] translated the BFI-44 into traditional Chinese characters to conduct research with Hong Kong-based participants, but it was difficult to use the same questionnaire widely in Mainland China where people generally use simplified characters. Notably, a previous meta-analysis study [28] has demonstrated that the Cronbach’s αs in Chinese samples were lower than those in the foreign samples when using the same Western Big Five personality inventories. Also, the reliability coefficients of the revised Western Big Five factor personality measures were lower than the self-developed Chinese Big Five factor personality inventory that was applied in China [28]. Moreover, the limitation on the length of the aforementioned measures should be taken into consideration.

To overcome these existing limitations of the Big Five personality measures, Wang and colleagues (2010a; 2010b) [29, 30] developed the Chinese Big Five Personality Inventory (CBF-PI). The CBF-PI consists of 134 items with 22 facets, and participants respond to each item based on a six-point Likert scale that ranges from 1 (“disagree strongly”) to 6 (“agree strongly”). Results indicated the CBF-PI had favorable internal consistency (Cronbach’s α), ranging from .83 (A) to .91 (C), and the test-retest reliability coefficients were above .80 [29]. Additionally, the CBF-PI correlated strongly with relevant dimensions of the NEO-PI-R and BFI, ranging from .45 (A) to .62 (E), and .69 (A) to .82 (O), respectively [30]. An abbreviated measure, the CBF-PI-B, was developed from the CBF-PI. Taking the total score of each dimension as a marker variable, the criteria of selecting adequate items to report on were as follows: (a) the correlation of the item with the marker variable was > .40; (b) the factor loading of the item on the non-target dimension was < .40; (c) the selected items must have come from different facets (i.e., the resulting items should cover multiple content domains). If the selected item did not meet the above criteria, it was replaced until every item satisfied the criteria. Through this process, a total of 40 items were chosen to form the shortened version [31]. Similar to the original version, the CBF-PI-B demonstrated excellent Cronbach’s α coefficients (ranging from .76 [A] to .81 [N]), good test-retest reliability (ranging from .67 [A] to .81 [N]), and its five dimensions correlated robustly with the corresponding dimensions of both the NEO-PI-R (r = .36 [A] to .85 [C]) and BFI-44 (r = .58 [A] to .83 [N]). Equivalence coefficients between the CBF-PI-B and the original being above .85, the shortened version was regarded as a promising self-report tool that would be a good substitute for the original form to assess the Big Five personality traits in Chinese samples. Overall, both the CBF-PI and the CBF-PI-B dovetail with Chinese idiomatic expressions, and have since become widely applied to various Chinese groups (e.g., [32, 33]). Still, concerns regarding the length of the CBF-PI and the CBF-PI-B should still be taken into consideration.

Current study

The primary purpose of the present study was to develop and evaluate the psychometric properties of a more time-effective form of the CBF-PI-B. First, we validated the original CBF-PI-B on a sample of participants, and then used these results to create a more simplified 15-item version of the CBF-PI-B (CBF-PI-15). The factor structure was then tested through confirmatory factor analysis (CFA). Fifteen items were expected to provide adequate coverage of the five factors, as well as excellent psychometric properties, while showing a meaningful reduction in response time, particularly in an epidemiological survey. Second, measurement invariance analyses were then performed to examine whether the factor structure of the CBF-PI-15 was equivalent across gender and age groups by testing configural, metric, scalar, and error invariances. Third, the convergent and discriminant validity of the CBF-PI-15 was examined by correlating the CBF-PI-15 subscale scores to the subscale scores of alternative self-report measures that identify personality traits and other relevant assessments that measure impulsiveness, depression, and anxiety in a new sample. The correlations with the original CBF-PI-B were also calculated. Finally, the criterion validity of the CBF-PI-15 was scrutinized by correlating the CBF-PI-B five factor scores to academic performance, the relevant external variable. We also reported the internal consistency, including the Cronbach’s α and mean inter-item correlations, of the CBF-PI-15. We expected the abbreviated inventory to hold promise as an informative alternative for the original CBF-PI-B, showing favorable psychometric properties.

Materials and methods

Participants

Two independent samples were used. The first sample included 11,218 adult participants (62.2% male, 37.8% female) who had completed the CBF-PI-B online in September 2017. Most participants (93.9%) were residents of Mainland China, with a smaller percentage of them (6.1%) living abroad. Considering most current studies using the CBF-PI-B intend to measure adult personality traits, we excluded data from those under 17 years of age. Finally, a total of 10,738 participants (62.4% males; mean = 33.90 years; SD = 9.39 years; range 17–57 years) were included in the first sample. The second sample of 256 college students (mean = 21.62 years; SD = 3.06 years; range 18–35 years) was recruited anonymously from Guangzhou University in China for partial fulfillment of a course requirement. The gender breakdown was 32.0% male and 67.2% female, with two participants not reporting their gender. Regarding ethnic origin, 96.5% of participants were Han, the majority ethnic group in China, and only four students were from other ethnic groups.

This study was approved by the Human Subjects Review Committee at Guangzhou University. All questionnaires were administered to those who had given informed consent.

Measures

The Chinese Big Five Personality Inventory Brief Version (CBF-PI-B).

The CBF-PI-B [31], an abbreviated form of the CBF-PI [29, 30], was developed to evaluate the five dimensions of the Big Five personality model. The CBF-PI-B is administered as a self-report measure consisting of 40 items. Items 5, 8, 13, 15, 18, 32, and 36 are scored reversely. Each item is answered on a six-point Likert-type scale, ranging from 1 (“disagree strongly”) to 6 (“agree strongly”). The Cronbach’s α coefficients in Sample 1 were .812, .771, .736, .765, and .762, for N, C, A, O, and E, respectively. In Sample 2, the Cronbach’s α coefficients were .859, .779, .745, .781, and .780, respectively.

The Mini-International Personality Item Pool Five-Factor Model (Mini-IPIP).

The Mini-IPIP [19] is based on the 50-item International Personality Item Pool five-factor model (IPIP-BF; [34]). This instrument contains 20 items that are rated on a five-point Likert-type scale, ranging from 1 (“disagree strongly”) to 5 (“agree strongly”). The Cronbach’s α coefficients for Sample 2 were .712, .598, .609, .707, and .734 for N, C, A, O, and E, respectively.

The 10-Item Big Five Inventory (BFI-10).

The BFI-10 [12] is an abbreviated version of the BFI-44 [11]. Its 10 items are short, descriptive phrases that participants rate on a five-point Likert-type scale, ranging from 1 (“disagree strongly”) to 5 (“agree strongly”). The BFI-10 assesses the Big Five model using two items per dimension, one coded to be in a positive direction, and the other in a negative direction on the scale. Previous research has confirmed that the construct validity of the BFI-10 is dependable because it retains an extensive portion of the reliability and validity of the initial BFI-44 [12]. In the current study, Cronbach’s α coefficients in Sample 2 for N, C, A, O, and E were .553, .359, .251, .289, and .549, respectively.

The Barratt Impulsiveness Scale Brief Version (BIS-Brief).

The BIS-Brief [35] is a shortened form of the BIS-11 [36]. A total of eight items are rated to assess general impulsivity, with each item scored on a four-point scale, ranging from 1 (“rarely/never”) to 4 (“almost always/always”). The Chinese version of the BIS-Brief was validated in a sample of Chinese male prisoners and showed good reliability and construct validity [37]. The Cronbach’s α of the BIS-Brief in Sample 2 of the present study was .753.

The Patient Health Questionnaire-9 (PHQ-9).

The PHQ-9 [38] is a nine-item instrument designed to assess depression, and is based on the DSM–IV diagnostic criteria for major depressive disorder. Each item is rated on a four-point scale, ranging from 1 (“not at all”) to 4 (“nearly every day”). In the current study, the Cronbach’s α coefficient of Sample 2 was .823.

The Generalized Anxiety Disorder Screener-7 (GAD-7).

The GAD-7 [39] is a brief self-report scale that uses seven items to identify generalized anxiety in primary care. The items are rated on a four-point scale, ranging from 1 (“not at all”) to 4 (“nearly every day”). The Cronbach’s α coefficient of Sample 2 in this study was .895.

Academic performance.

Participants reported their academic performance as a comparison to their classmates. Responses were rated from 1 (“at the lower percentile”) to 5 (“at the upper percentile”). The results showed that 6.3% (N = 16) were in the lower percentile, 16.8% (N = 43) were in the middle-to-lower percentile, 39.5% (N = 101) were in the middle percentile, 24.6% (N = 63) were in the middle-to-upper percentile, and 12.5% (N = 32) were in the upper percentile, with one participant refusing to respond.

Statistical analyses

First, we examined the factor structure of the CBF-PI-B for Sample 1 with Mplus 7.4 [40]. The data was divided randomly into two parts: Sample A and Sample B. Each sample included 5,369 participants. Sample A was chosen at random for exploratory factor analysis to develop an abbreviated form. Robust Maximum Likelihood (MLR) with oblique rotation was adopted. Factor loadings less than .30 or more than .30 on more than one factor were dropped [41]. Based on the Big Five model, we selected three items from each dimension whose factor loadings were relatively high but whose cross-loadings were distinctly low to be included in the CBF-PI-15. Following the EFA, Sample B was then used for confirmatory factor analysis. Fit indices, including chi-square (χ2), the standardized root mean square residual (SRMR), the root mean square error of approximation (RMSEA), the Tucker-Lewis index (TLI), and the comparative fit index (CFI), were computed to assess the goodness of fit of the model. Conventional guidelines suggest a cutoff value close to .08 for SRMR, and a cutoff value close to .06 for RMSEA [42]. Moreover, CFI and TLI values ≥ .90 indicate an adequate model fit [43].

Second, measurement invariance (MI) tests were conducted across gender and age groups of Sample 1 using a series of multi-group CFAs. MI was examined at four levels (configural invariance, metric invariance, scalar invariance, and error variance invariance), and the differences in CFI (ΔCFI) and TLI (ΔTLI) were regarded as suitable indicators of measurement invariance [44]. Additionally, ΔCFIs ≤ .01 and ΔTLIs ≤ .010 indicated that the invariance hypothesis should be accepted, as mean differences exist when ΔCFIs and ΔTLIs are between or equal to .01 and .02, and definite differences exist when ΔCFIs and ΔTLIs are > .02 [44]. Based on the taxonomy of the invariance tests, latent mean invariance was then performed across gender and age groups in Sample 1 to detect latent mean differences.

Third, to assess internal consistency of the CBF-PI-15 subscales scores, Cronbach’s α [45] was calculated using SPSS (IBM, SPSS version 19, 2010). Ranges of measures were as follows: < .60 = insufficient; .60 to .69 = marginal; .70 to .79 = acceptable; .80 to .89 = good; and .90 or higher = excellent [46]. However, Cronbach’s α depends on inter-item correlations and number of items. Therefore, mean inter-item correlations (MIC) were also computed to be used as straightforward indicators of the scale’s internal consistency (i.e., not simply an effect of a few particular items), and was considered to be adequate if between .15 and .50 [47].

Finally, to assess the convergent, discriminant and criterion validity of the 15-item version, Pearson correlations of the CBF-PI-15 were performed with SPSS 19.0 using Mini-IPIP, BFI-10, BIS-Brief, PHQ-9, GAD-7, and academic performance. Meanwhile, associations between the CBF-PI-15 subscales and the original CBF-PI-B subscales were computed and given further examination. According to Cohen’s guidelines [48], a correlation (r) of ≤ .29 is interpreted as being weak, an r from .30 to .49 is interpreted as being moderate, and an r ≥ .50 is interpreted as being robust. The hypothesis was that the CBF-PI-15 five subscales would correlate strongly with the corresponding subscales of the original and alternative personality measures, and that the CBF-PI-15 would also exhibit significant associations with relevant external variables (i.e., impulsiveness, depression, anxiety, and academic performance), in line with previous findings (e.g., [57]). Finally, to determine whether the strength of the correlation of the abbreviated CBF-PI-15 with the criterion tools differed from that with the original CBF-PI-B or the other personality assessments, we employed the method proposed by Dunn and Clark (1969) [49] (c.f., [50]) using a spreadsheet that was developed by DeCoster and Lselin (2005) [51] and that can be retrieved at: http://stat-help.com/spreadsheets.html.

Results

Preliminary analyses

Descriptive statistics, including Cronbach’s αs, means, standard deviations, and number of items about all relative measures in the current study are presented in Table 1.

thumbnail
Table 1. Descriptive statistics for all scales in the current study.

https://doi.org/10.1371/journal.pone.0221621.t001

Development of the CBF-PI-15

Developing a brief, psychometrically strong personality inventory based on the five-factor model for wide use within the Chinese population was the central purpose of abbreviating the CBF-PI-B. Item reduction was achieved using Sample A through a stepwise selection process with EFA. Given the aforementioned conventional guidelines and the particular five factors, those items with loadings below .30 or higher than .30 on more than one factor were eliminated. A total of 15 items were selected to construct the shortened CBF-PI-15 by identifying the three highest-loading items from each supported factor. We then experimented with different combinations of these 15 items for the CBF-PI-15, taking all indicators (e.g., Cronbach’s α coefficients) into consideration. As a result, the CBF-PI-15 was formed using items 21, 26, and 31 to measure N, items 17, 22, and 37 to measure C, items 3, 23, and 33 to measure A, items 9, 14, and 24 to measure O, and items 5, 15, and 35 to measure E. Items 5 and 15 were scored reversely. Of note, as Table 2 demonstrates, all factor loadings of the selected items were higher than .50.

Sequentially, the factor structure of the CBF-PI-15 was re-examined using Sample B. Fit indices were good for the CBF-PI-15 in the current study (MLRχ2 = 918.882, df = 80, CFI = .946, TLI = .929, RMSEA = .044, SRMR = .040), supporting the factorial validity of the abbreviated scale.

Invariance across gender and age groups

Tests of measurement invariance were conducted in Sample 1 to systematically investigate the extent to which the measurement model was replicated across gender and age groups, and to investigate possible latent differences across these subgroups of participants. The results are shown in Table 3. Gender is divided into two groups: male (N = 6,698) and female (N = 4,040). Age is divided into four groups, 17–20 (N = 611; Group 1), 21–29 (N = 3,391; Group 2), 30–39 (N = 3,772; Group 3), and 40 and older (N = 2,964; Group 4). CFIs and TLIs of these four types of measurement invariance varied in gender groups from .923 to .944, and in age groups from .904 to .918. All ΔCFI and ΔTLI were below or equal to .01 among the four levels of measurement invariance, indicating that there was no significant difference in gender groups or age groups [44], and that the CBF-PI-15 model with cross-group equality constraints was the best fit to the data.

thumbnail
Table 3. Goodness-of-fit indices and model comparisons for CBF-PI-15 measurement invariance models.

https://doi.org/10.1371/journal.pone.0221621.t003

Based on the taxonomy of invariance tests, latent mean differences in gender and age groups were pursued in the current study. Above all, when the latent means of males were fixed to zero for identification purposes, the latent means of females were significantly higher (p ≤ .001) for N (.199) and A (.153), but lower for C (-.267), O (-.461), and E (-.103). Additionally, Fig 1 shows the latent means of the five dimensions across age groups and Group 1, the designated reference group. Specifically, the latent means of the older groups, excluding Group 2 (-.016), were significantly lower for N (Group 3 = -.278, Group 4 = -.602), whereas they were significantly higher for C (Group 2 = .185, Group 3 = .454, Group 4 = .721). The older-aged groups had higher latent means for A (Group 2 = .103, Group 3 = .231, Group 4 = .460), and lower latent means for O (Group 2 = -.206, Group 3 = -.282, Group 4 = -.344) when compared to Group 1. With regards to E, Group 1 showed significant but relatively weak differences in latent means in comparison to the other four groups (Group 2 = -.051, Group 3 = -.171, Group 4 = -.258). Meanwhile, Figs 2 and 3 show the results of invariance across all 10 groups, representing all combinations of the two genders and five age categories (age 17–20 males, age 17–20 females, age 21–29 males, age 21–29 females, etc.).

thumbnail
Fig 1. Latent means of Big Five dimensions results across different age groups.

https://doi.org/10.1371/journal.pone.0221621.g001

thumbnail
Fig 2. Latent means of males’ Big Five dimensions results across different age groups.

https://doi.org/10.1371/journal.pone.0221621.g002

thumbnail
Fig 3. Latent means of females’ Big Five dimensions results across different age groups.

https://doi.org/10.1371/journal.pone.0221621.g003

Internal consistency

The internal consistency of the two samples of the CBF-PI-15 is shown in Table 1. The internal consistency of the CBF-PI-15 factor scores ranged from moderate to good. Namely, Cronbach’s αs ranged from .611 (C) to .803 (O), and all MICs except for the O factor were in the range of .15 to .50 in Sample 1. In Sample 2, the internal consistency of the CBF-PI-15 was acceptable in both Cronbach’s α and MIC, although the internal consistency of the A, N, and O factor scores was only favorable in Cronbach’s α (A = .769, N = .809, O = .811), but not in MIC (MICs > .50).

Convergent and discriminant validity

Table 4 shows the correlations between the CBF-PI-15 and the alternative personality measures. Results demonstrate favorable associations of the CBF-PI-15 with the original CBF-PI-B. The corresponding five dimensions all showed correlations of > .75 (mean = .832), with all discriminant correlations being < .40 (mean = .194). The CBF-PI-15 also showed acceptable convergent correlations with the Mini-IPIP domains (mean = .414, range = .219 to .546), and with the BFI-10 (mean = .439, range = .238 to .590). In contrast, the correlations of the corresponding dimensions between the Mini-IPIP and BFI-10 ranged from .197 to .676 (mean = .516), and the mean discriminant correlation was .208.

thumbnail
Table 4. Correlations between the CBF-PI-15, Mini IPIP, BFI-10, and CBF-PI-B.

https://doi.org/10.1371/journal.pone.0221621.t004

Additionally, Table 5 shows the correlations between the CBF-PI-15 and the BIS-Brief, PHQ-9, and GAD-7. As expected, the BIS-Brief and C demonstrate a significantly negative correlation (r = -.646, p < .001), and there is a modestly but significantly positive correlation between the BIS-Brief and N (r = .303, p < .001). The negative correlation coefficients between the BIS-Brief and A, O, and E ranged from -.035 to -.195. The PHQ-9 exhibited a robustly positive relationship with N (r = .500, p < .001), and negative correlations with C, E, A, and O (r = -.242, p < .001; r = -.252, p < .001; r = -.181, p < .01; and r = -.078, p > .05, respectively). The GAD-7 also showed a strongly positive relationship with N (r = .566, p < .001) and linked negatively to other the four dimensions (r = -.106 to -.257).

Criterion validity

From the results presented in Table 5, academic performance appears to have a relationship with the Big Five personality traits of the CBF-PI-15 (r = .068 to .332), and the correlations between academic performance and C as well as N were significant (r = .332, p < .001; r = -.129, p < .05, respectively). There were also non-significant or trivial relationships of academic performance with A (r = .068), O (r = .106), and E (r = .109). Table 5 also demonstrates correlations between the alternative personality measures (i.e., original CBF-PI-B, Mini-IPIP, and BFI-10) and external criterion variables. Additional analyses revealed that about three quarters of Pearson correlation coefficients were non-significant (see z-tests in Table 5) between the CBF-PI-15 and the CBF-PI-B, as well as between the CBF-PI-15 and the other alternative personality measures, suggesting that the CBF-PI-15 has comparable criterion-related validity with the corresponding measures.

thumbnail
Table 5. Correlations between personality measures and criterion-related variables.

https://doi.org/10.1371/journal.pone.0221621.t005

Discussion

The primary purpose of the present study was to develop and validate an abbreviated form of the CBF-PI-B. Items for a newly developed CBF-PI-15 were selected by balancing concerns for factor structure, internal consistency, and content representativeness. MI tests indicated that the CBF-PI-15 was invariant across gender- and age-differentiated subgroups. Reliability analysis indicated adequate internal consistency with the full CBF-PI-B, and moreover, expected correlation patterns with the shortened form were found with external variables. In general, the CBF-PI-15 can act as an efficient, reliable, and valid assessment of personality traits that can be administered quickly with minimal administration burden while retaining the reliability and validity of the full CBF-PI-B.

Short-form measures of the Big Five Personality traits (e.g., the BFI-10, BFI-2-XS, Mini-IPIP) have been developed and become widely used by researchers over the years, which could derive from their economic value and sound practical results. First, these shorter personality inventories ensure the accuracy of the external validity of research findings to a reliable extent, in that they take less time to complete, thus effectively lessening the likelihood of negative or erroneous participant responses caused by feelings of boredom or fatigue (e.g., refusal to participate in study, careless responses) [14, 15]. Second, the brevity of the shorter measures also plays an important role in the research setting where there is a need to assess a vast amount of related constructs in addition to personality traits within a limited period time. Third, shorter personality inventories improve the accuracy of face validity due to the ease of item distribution (e.g., lack redundant repetitive items [52]) allowing respondents to better understand the content of each item. Finally, the shorter inventories appear to not correspond with substantial psychometric sacrifices, as Credé and colleagues (2012) [14] have pointed out, and have been validated with comparable psychometric properties such as criterion validity and test-retest reliability when compared with longer scales in previous research [1618].

However, there are also several notable psychometric concerns about the reliability and validity of dramatically shortened scales. Credé et al. (2012) [14] noted that the shortened tools appear to be limited by measurement error, which relates to reliability and could therefore increase the risk of criterion invalidity. While adding more items could cancel out the risk of measurement error from a psychometric perspective, the quality of information could decrease if informants are asked to respond to numerous seemingly-repetitive questions, which could also have an inverse impact on the reliability, from cognitive perspective [53]. Another concern about shorter personality measures is the risk that it might prioritize one personality facet over the other facets, rather than covering all facets equally. In this case, the shorter measures could lack substantial content as a consequence of including fewer items, suggesting that construct underrepresentation could result in poor predictive validity [54]. Credé and colleagues (2012) [14] also criticized the use of briefer scales when measuring the Big Five as weak associations of Big Five measures with either the criteria or new predictors will substantially increase both Type 1 and Type 2 error rates. Ultimately, shorter measures are recommended when assessing personality traits only if time and space are limited. Otherwise, it is recommended to use the full-length, well-established measurement instruments [12, 19].

Admittedly, the development of short measures in existing research has typically emerged out of practical necessity. In the current study, the search for a brief self-report based on the five-factor model arose from investigating personality traits of Chinese people. The CBF-PI-B has been difficult to use as often as researchers would like because it is too long to include in already-crowded assessment batteries during short time intervals, or it is unsuitable for use due to its intensive and repetitive nature. Therefore, the factor structure of the CBF-PI-B was reinvestigated using EFA in half of Sample 1, and a total of 15 items with high factor loadings and low cross-loadings were used to construct a new abbreviated scale. In assessments of the CBF-PI-15, results were encouraging. First, the CFA was sequentially conducted with the other half of Sample 1. All fit indices were fully favorable (CFI and TLI > .90, RMSEA < .06, and SRMR < .08), supporting the adequacy of the CBF-PI-15 structure. The 15-item abbreviated scale suitably represented the content and structure of the original CBF-PI-B in a condensed format.

Second, measurement invariance and latent means comparisons were tested across gender and age groups. Prior studies (e.g., [13, 55]) have focused on comparing the scores of the five personality dimensions among different gender or age groups without meeting the requirement of measurement equivalence. In the present study, measurement invariance analysis provided some informative perspectives on the measurement properties of the CBF-PI-15 scores. Results revealed that the CBF-PI-15 had strict measurement invariance (i.e., configural invariance, metric invariance, scalar invariance, and error invariance) across both gender and age groups, suggesting that the CBF-PI-15 scores can be interpreted in the same way for various groups of people.

With a satisfying level of measurement invariance, the latent mean differences across gender and age were examined. In line with clear findings from a previous study [56], women showed higher latent means of N and A than those of men on responses to the Big Five Inventory. Men tended to score higher in the extraversion-related trait as Feingold (1994) [57] found, and differences in O between Chinese males and females were apparent. It is possible that social and economic conditions lead to the differences in measured personality traits between males and females [58]. For age differences, our results are consistent with previous related studies [55, 59], and reveal that A and C show a positive age trend, while E and O show relatively smaller differences when compared to age. N is negatively correlated with age, and there is a curvilinear trend peaking in the 21–29 years of age group, which to some extent mirrors the higher incidence of N-related psychopathology as noted by McCrae, Martin, and Costa (2005) [60]. The maturity principle suggests that as an individual grows older, they tend to mature and become more productive contributors to society rather than focusing only on satisfying their needs for self-actualization, from a humanistic perspective. Accordingly, E and O both appear to decrease after young adulthood [61]. Recent research [56] has also shown clear evidence that the latent means differ systematically across age-gender groups. In the current study, older males demonstrate higher scores for A and C, lower scores for N and O, and showed only the smallest negative difference in E. Females show similar patterns to males in latent mean scores in the five dimensions, except in the case of N and O. Women show lower but more stable levels of O when over the age of 20, whereas the latent means of O shows an obvious decrease from the age of 20. Additionally, a large fluctuation in N appears throughout a woman’s life span, indicating that women in early adulthood are more emotionally unstable, but as a woman’s age increases, the latent means trends lower. Considering the abundant variation in observed gender and age differences across previous research [56], there is a need to examine these differences deeper within the Mainland China context.

Third, the CBF-PI-15 maintains adequate internal consistency, though only in approximately one third of the CBF-PI-B’s total items. Cronbach’s α coefficients of the five subscale scores in both samples were higher than those for the alternative short personality measures (i.e., BFI-10 and Mini-IPIP). It was noteworthy that C ranks lowest and O ranks highest in the current study, which is inconsistent with a general finding in personality research overall. For example, Benet-Martinez and John (1998) [62] suggested that A showed the lowest reliability in both English and Spanish versions of the BFI-44. Likewise, this finding seems to be comparable in the Chinese version of BFI-10 [13] and BFI (29-item; [27]). Gnambs (2014) [63] pointed out that situational differences would be more likely to affect A ratings. In the present study, an adverse finding–namely the reliability of C being lower than A–was obtained, which warrants further investigation. Nunnally and Bernstein (1994) [64] asserted that the purpose of a study determines the choice of the required level of reliability. As such, a cutoff of Cronbach’s α value at .70 or even lower seems appropriate when considering the breadth of the construct and time availability into account [65]. In doing this, all personality domains of the CBF-PI-15 show reliability coefficients at an adequate level.

The MIC is considered to be a more straightforward indicator of a scale’s internal consistency than Cronbach’s α as it minimizes the effects of the number of total items. The MICs of the CBF-PI-15 are higher in contrast to those of the CBF-PI-B, suggesting stronger inter-item correlations in the CBF-PI-15. John and Soto (2007) [66] raised concern that a greater overlap between items is more likely to emerge in a narrow assessment of the Big Five personality types due to items tending to evaluate closely-related patterns of behavior. In order to meet the requirement of acceptable reliability of the CBF-PI-15, the 15 selected items would be highly correlated with each other, meaning that there is a possibility of content overlap. To address this issue, we reviewed the content of every item of the CBF-PI-15, and found that while there was minor overlap of content, it was to a minimal degree. and concluded that the results were within the acceptable range.

The convergent validity of the CBF-PI-15 was supported by significant associations between the scores on the CBF-PI-15 and the scores on the other Big Five personality measures (e.g., CBF-PI-B, BFI-10, and Mini-IPIP). Comparing the personality measures on the basis of the Big Five framework, convergent validity with the domain scales was adequate for N, C, and E, but somewhat lower for O and A, which corresponds with previous research [12] (See Table 4). As each of these scales contain a small number of items, there is inevitably a limit in content coverage, and conceptual differences in definitions of personality between the various measures, rather than the particular item selection, resulted in a lower level of convergence in the O and A dimensions [12].The correlations between the CBF-PI-15 subscale scores and the BFI-10/Mini-IPIP subscale scores were approximately equal to those between the Mini-IPIP and BFI-10 subscale scores, however the correlation between BFI-10-O and Mini-IPIP-O was higher. Overall, the abbreviated version of the CBF-PI-B has kept most of the information of the original long instrument, and showed itself to be superior when compared to the two other short personality measures.

The present study also explored the associations between the CBF-PI-15 scores and impulsion, anxiety, and depression. The BIS-Brief measures impulse levels, and the CBF-PI-15 showed negative correlations with C, A, O, and E, and a positive correlation with N. Mao and colleagues (2018) [7] indicated that greater self-control seemed evident in individuals with higher levels of N and C, or low levels of O of the CBF-PI-B, possibly contributing to the individual exhibiting less impulsivity. Moreover, in line with previous findings that N is linked with anxiety and depression [6, 67], both the PHQ-9 and GAD-7 exhibited negative correlations with C, E, A, and O, with a positive relationship with N. As Watson and colleagues (1988) [68] suggested, neuroticism represents the tendency to experience heightened levels of negative emotionality (e.g., anger, anxiety, depression).

The CBF-PI-15’s criterion validity was also tested using academic performance. Poropat (2009) [69] once employed a meta-analysis to conduct a thorough review of the associations of Big Five personality traits with academic performance, which added credibility to the assertion that academic performance is correlated with A, C, and O. However, the current study resulted in different findings that academic performance showed moderate to robust correlations with the five personality dimensions, but the most strongly significant correlation to academic performance was with C, followed by N, with A showing only a slight correlation with academic performance (See Table 5). In contrast to those with higher scores in C, individuals with high scores in A were more willing to participate in social activities to improve their social desirability, therefore possibly ignoring the importance of knowledge and skills [70]. Alternatively, results of the current study suggest that the Chinese educational style particularly emphasizes independence over co-operative learning, which may also lead to differences in our findings, meaning that students with a higher level of C will show stronger academic performances. As for the negative correlations with N, Uppal and Mishra (2014) [70] also found that individuals with a professional education appeared to have mastered strategies to control emotional reactions, which could be related to their high scores in N or E.

Finally, although the CBF-PI-B seems perform better than the CBF-PI-15 in terms of convergent and criterion validity, additional analysis in the current study revealed that most correlational coefficients were non-significant (see z-tests in Table 5). Consequently, the scores for the CBF-PI-15 and the alternative personality assessments (CBF-PI-B, BFI-10, Mini-IPIP), especially the lengthier version of the CBF-PI-B, present the same substantial pattern of correlations with external variables, including impulsivity, anxiety, depression, and academic performance, supporting the validity of the CBF-PI-15.

In general, our results support the idea that the performance of the CBF-PI-15 is comparable to that of the original CBF-PI-B scores when assessing the five personality dimensions in a Chinese cultural context. The CBF-PI-15 also has additional credence for its utility as a shorter personality assessment. However, the findings of this study should be considered in light of its limitations. First, all data in this study was collected through self-reporting instruments, which can result in shared method variance. Thus, future studies should utilize multiple methods of data collection, as well as make use of multiple informants. Second, this was in essence cross-sectional research, and lacked other more psychometric assessments such as test-retest coefficients. Longitudinal studies evaluating correlations over time, as well as test-retest analysis, should be conducted in future research. Finally, more external criterion-related variables (e.g., self-esteem, stress resistance, concentration; [13, 21]) and other variants in the Chinese sample should be used in future research to provide more robust evidence for the broader validation of the abbreviated CBF-PI-15.

Conclusions

The current study has contributed to the development of the CBF-PI-15, an abbreviated form of the 40-item Chinese Big Five Personality Inventory Brief Version (CBF-PI-B). Results revealed that the CBF-PI-15 has a good factor structure, acceptable internal consistency reliability, and, as expected, convergent, discriminant, and criterion-related validity. Overall, the shortened version of the inventory holds promise as an informative alternative to the original CBF-PI-B, intended for use when there is a limit of time or space available for the research, or where personality is not a main point of study focus. Although researchers (e.g., [12]) will insist on using the full assessment measure to obtain more stable personality results, the CBF-PI-15 enriches the number of brief Chinese personality measurements and offers more variety in available methods to assess the Big Five personality dimensions.

Supporting information

S1 File. The Chinese Big Five Personality Inventory-15.

https://doi.org/10.1371/journal.pone.0221621.s003

(DOCX)

References

  1. 1. Goldberg LR. The structure of phenotypic personality traits. Am Psychol. 1993; 48(1):26–34. pmid:8427480
  2. 2. John OP, Naumann LP, Soto CJ. Paradigm shift to the integrative big five trait taxonomy: History, measurement and conceptual issues. In: John OP, Robbins RW, Pervin LA, editors. Handbook of Personality: Theory and Research (3rd edition). New York, NY: Guilford Press; 2008. pp. 114–158.
  3. 3. Norman WT. 2800 personality trait descriptors: normative operating characteristics for a university population. Ann Arbor: University of Michigan, Department of Psychology; 1967.
  4. 4. McCrae RR, Costa PT Jr. Clinical assessment can benefit from recent advances in personality psychology. Am Psychol. 1986; 41(9):1001–1003.
  5. 5. Caprara GV, Barbaranelli C, Pastorelli C, Cervone D. The contribution of self-efficacy beliefs to psychosocial outcomes in adolescence: predicting beyond global dispositional tendencies. Pers Indiv Differ. 2004; 37(4):751–763.
  6. 6. Kadimpati S, Zale EL, Hooten MW, Ditre JW, Warner DO. Associations between neuroticism and depression in relation to catastrophizing and pain-related anxiety in chronic pain patients. PLoS ONE. 2015; 10(4). pmid:25902153
  7. 7. Mao T, Pan W, Zhu Y, Yang J, Dong Q, Zhou G. Self-control mediates the relationship between personality trait and impulsivity. Pers Indiv Differ. 2018; 129:70–75.
  8. 8. Costa PT Jr, McCrae RR. Revised NEO personality inventory and five factor inventory professional manual. Odessa, FL: Psychological Assessment Resources; 1992.
  9. 9. Goldberg LR, Johnson JA, Eber HW, Hogan R, Ashton MC, Cloninger CR, Gough HG. The international personality item pool and the future of public-domain personality measures. J Res Pers. 2006; 40(1):84–96.
  10. 10. Thalmayer AG, Saucier G, Eigenhuis A. Comparative validity of brief to medium-length big five and big six personality questionnaires. Psychol Assess. 2011; 23(4):995–1009. pmid:21859221
  11. 11. John OP, Srivastava S. The Big Five trait taxonomy: history, measurement, and theoretical perspectives. In: Pervin LA, John OP, editors. Handbook of Personality: Theory and Research (2nd edition). New York: Guilford Press; 1999. pp. 102–138.
  12. 12. Rammstedt B, John OP. Measuring personality in one minute or less: a 10-item short version of the big five inventory in English and German. J Res Pers. 2007; 41(1):203–212.
  13. 13. Carciofo R, Yang J, Song N, Du F, Zhang K. Psychometric evaluation of Chinese-language 44-item and 10-item Big Five Personality Inventories, including correlations with chronotype, mindfulness and mind wandering. PLoS ONE. 2016; 11(2):e0149963. pmid:26918618
  14. 14. Credé M, Harms P, Niehorster S, Gaye-Valentine A. An evaluation of the consequences of using short measures of the Big Five personality traits. J Pers Soc Psychol. 2012; 102(4):874–888. pmid:22352328
  15. 15. Burisch M. Approaches to personality inventory construction: a comparison of merits. Am Psychol. 1984; 39(3):214–227.
  16. 16. Gosling SD, Rentfrow PJ, Swann WB. (2003). A very brief measure of the Big Five personality domains. J Res Pers. 2003; 37(6):504–528.
  17. 17. Robins RW, Hendin HM, Trzesniewski KH. Measuring global self-esteem: construct validation of a single-item measure and the Rosenberg Self-Esteem Scale. Pers Soc Psychol Bull. 2001; 27(2):151–161.
  18. 18. Thalmeyer AG, Saucier G, Eigenhuis A. Comparative validity of brief and medium-length Big Five and Big Six personality questionnaires. Psychol Assess. 2011; 23(4):995–1009. pmid:21859221
  19. 19. Donnellan MB, Oswald FL, Baird BM, Lucas RE. The Mini-IPIP scales: tiny-yet-effective measures of the Big Five factors of personality. Psychol Assess. 2006; 8(2):192–203.
  20. 20. Baldasaro RE, Shanahan MJ, Bauer DJ. Psychometric properties of the Mini-IPIP in a large, nationally representative sample of young adults. J Pers Assess. 2013; 95(1):74–84. pmid:22808913
  21. 21. Soto CJ, John OP. The Next Big Five Inventory (BFI-2): developing and assessing a hierarchical model with 15 facets to enhance bandwidth, fidelity, and predictive power. J Pers Soc Psychol. 2017a; 113(1):117–143. pmid:27055049
  22. 22. Soto CJ, John OP. Short and extra-short forms of the Big Five Inventory–2: the BFI-2-S and BFI-2-XS. J Res Pers. 2017b; 68:69–81.
  23. 23. McCrae RR, Costa PT Jr, Yik MSM. Universal aspects of Chinese personality structure. In: Bond MH, editor. The handbook of Chinese psychology. Oxford, England: Oxford University Press; 1996. pp. 189–207.
  24. 24. Dai X, Yao S, Cai T. Reliability and validity of the NEO- PI- R in Mainland China. Chinese Mental Health Journal. 2004; 18(3):171–174.
  25. 25. Yao R, Liang L. Analysis of the application of simplified NEO-FFI to undergraduates. Chinese Journal of Clinical Psychology. 2010; 18(4):457–459.
  26. 26. Zheng L, Goldberg LR, Zheng Y, Zhao Y, Tang Y, Liu L. Reliability and concurrent validation of the IPIP big-five factor markers in China: consistencies in factor structure between internet-obtained heterosexual and homosexual samples. Pers Indiv Differ. 2008; 45(7):649–654. pmid:20383283
  27. 27. Leung DYP, Wong EML, Chan SSC, Lam TH. Psychometric properties of the Big Five Inventory in a Chinese sample of smokers receiving cessation treatment: a validation study.Journal of Nursing Education and Practice. 2012; 3(6):1–10.
  28. 28. Luo J, Dai X. Meta-analysis of Big-five factor personality tests in China. Chinese Journal of Clinical Psychology. 2011; 19(6):740–752.
  29. 29. Wang M-C, Dai X, Yao S. Development of Chinese Big Five Personality Inventory (CBF-PI): theoretical framework and reliability analysis. Chinese Journal of Clinical Psychology. 2010a; 18(5):545–548.
  30. 30. Wang M-C, Dai X, Yao S. Development of the Chinese Big Five Personality Inventory (CBF-PI): validity analysis. Chinese Journal of Clinical Psychology. 2010b; 18(6):687–690.
  31. 31. Wang M-C, Dai X. Development of the Chinese Big Five Personality Inventory (CBF-PI): psychometric properties of CBF-PI brief version. Chinese Journal of Clinical Psychology. 2011; 19(4):454–457.
  32. 32. Fan J, Zhu X, Tang L, Wang Y, Li L, Yang Y, Lei H. Reliability and validity of Chinese Big Five Personality Inventory brief version in breast cancer women. Chinese Journal of Clinical Psychology. 2013; 21(5):783–785.
  33. 33. Wang M-C, Shou Y, Deng Q, Sellbom M, Salekin RT, Gao Y. Factor structure and construct validity of the Levenson Self-Report Psychopathy scale (LSRP) in a sample of Chinese male inmates. Psychol Assess. 2018; 30(7):882–892. pmid:29565613
  34. 34. Goldberg LR. A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several five-factor models. In: Mervielde I, Deary I, De Fruyt F, Ostendorf F, editors. Personality psychology in Europe. Tilburg, The Netherlands: Tilburg University Press; 1999. pp. 7–28.
  35. 35. Steinberg L, Sharp C, Stanford MS, Tharp AT. New tricks for an old measure: the development of the Barratt impulsiveness scale–brief (BIS-brief). Psychol Assess. 2013; 25(1):216–226. pmid:23148649
  36. 36. Patton JH, Stanford MS, Barratt ES. Factor structure of the Barratt impulsiveness scale. J Clin Psychol. 1995; 51(6):768–774. pmid:8778124
  37. 37. Wang M-C, Deng Q, Shou Y, Deng J, Gao Y, Li Z. Assessing impulsivity in Chinese elaborating validity of BIS among male prisoners. Crim Justice Behav. 2018; Online.
  38. 38. Kroenke K, Spitzer RL, Williams JBW. The PHQ-9: Validity of a brief depression severity measure. J Gen Intern Med. 2001; 16(9):606–613. pmid:11556941
  39. 39. Spitzer RL, Kroenke K, Williams JBW, Löwe B. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med. 2006; 166(10):1092–1097. pmid:16717171
  40. 40. Muthe´n LK, Muthe´n BO. Mplus user’s guide. 7th ed. Los Angeles: Muthen & Muthen; 1998–2015.
  41. 41. Stevens JP. Applied multivariate statistics for the social sciences. 2nd ed. Hillsdale, NJ: Erlbaum; 1992.
  42. 42. Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Modeling. 1999; 6(1):1–55.
  43. 43. Kline RB. Principles and practice of structural equation modeling. 3rd ed. New York: Guilford Press; 2010.
  44. 44. Cheung GW, Rensvold RB. Evaluating goodness-of-fit indexes for testing measurement invariance. Struct Equ Modeling. 2002; 9(2):233–255. _5
  45. 45. Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika. 1951; 16(3):297–334.
  46. 46. Barker C, Pistran N, Elliot R. Research methods in clinical and counselling psychology. Chichester: Wiley; 1994.
  47. 47. Clark LA, Watson D. Constructing validity: basic issues in objective scale development. Psychol Assess. 1995; (3):309–319.
  48. 48. Cohen J. Statistical power analysis for the behavioral sciences. Hillsdale: Lawrence Earlbaum Associates; 1988.
  49. 49. Dunn OJ, Clark V. Correlation coefficients measured on the same individuals. J Am Stat Assoc. 1969; 64(325):366–377.
  50. 50. Steiger JH. Tests for comparing elements of a correlation matrix. Psychol Bull. 1980; 87(2):245–251.
  51. 51. DeCoster J, Iselin A-M. Comparing correlation coefficients [spreadsheet]; 2005. Available from: http://stat-help.com/spreadsheets.html
  52. 52. Wanous JP, Reichers AE, Hudy MJ. Overall job satisfaction: how good are single item measures? J Appl Psychol. 1997; 82(2):247–252. pmid:9109282
  53. 53. Konstabel K, Lönnqvist J, Walkowitz G, Konstabel K, Verkasalo M. The ‘Short Five’ (S5): measuring personality traits using comprehensive single items. Eur J Personal. 2012; 26(1):13–29.
  54. 54. Messick S. Validity of psychological assessment: validation of inferences from persons’ responses and performances as scientific inquiry into score meaning. Am Psychol. 1995; 50(9):741–749.
  55. 55. Soto CJ, John OP, Gosling SD, Potter J. Age differences in personality traits from 10 to 65: big five domains and facets in a large cross-sectional sample. J Pers Soc Psychol. 2011; 100(2):330–348. pmid:21171787
  56. 56. Marsh HW, Nagengast B, Morin AJS. Measurement invariance of Big-Five factors over the life span: ESEM tests of gender, age, plasticity, maturity, and la dolce vita effects. Dev Psychol. 2013; 49(6):1194–1218. pmid:22250996
  57. 57. Feingold A. Gender differences in personality: a meta-analysis. Psychol Bull. 1994; 116:429–456. pmid:7809307
  58. 58. Schmitt DP, Realo A, Voracek M, Allik J. Why can't a man be more like a woman? Sex differences in Big Five personality traits across 55 cultures. J Pers Soc Psychol. 2008; 94(1):168–182. pmid:18179326
  59. 59. Terracciano A, Mccrae RR, Brant LJ, Costa PT Jr. Hierarchical linear modeling analyses of the NEO-PI-R scales in the Baltimore longitudinal study of aging. Psychol Aging. 2005; 20(3):493–506. pmid:16248708
  60. 60. McCrae RR, Martin TA, Costa PT Jr. Age trends and age norms for the NEO personality inventory-3 in adolescents and adults. Assessment. 2005; 12(4):363–373. pmid:16244117
  61. 61. Caspi A, Roberts BW, Shiner RL. Personality development: stability and change. Annu Rev Psyhchol. 2005; 56(1):453–484. pmid:15709943
  62. 62. Benet-Martinez V, John OP. Los Cinco Grandes across cultures and ethnic groups: multitrait-multimethod analyses of the Big Five in Spanish and English. J Pers Soc Psychol. 1998; 75(3):729–750. pmid:9781409
  63. 63. Gnambs T. A meta-analysis of dependability coefficients (test–retest reliabilities) for measures of the Big Five. J Res Pers. 2014; 52:20–28.
  64. 64. Nunnally JC, Bernstein IH. Psychometric theory. 3rd ed. New York: McGraw Hill; 1994.
  65. 65. Konstabel K, Lönnqvist JE, Leikas S, García VR, Qin H, Verkasalo M, et al. Measuring single constructs by single items: constructing an even shorter version of the "Short Five" personality inventory. PLoS ONE. 2017; 12(8):e0182714. pmid:28800630
  66. 66. John OP, Soto CJ. The importance of being valid: reliability and the process of construct validation. In: Robins RW, Fraley RC, Krueger RF, editors. Handbook of research methods in personality psychology. New York, NY: Guilford Press; 2007. pp. 461–494
  67. 67. Kashdan TB. Social anxiety spectrum and diminished positive experiences: teoretical synthesis and meta-analysis. Clin Psychol Rev. 2007; 27(3):348–365. pmid:17222490
  68. 68. Watson D, Clark LA, Tellegen A. Development and validation of brief measures of positive and negative affect: the PANAS scales. J Pers Soc Psychol. 1988; 54(6):1063–1070. pmid:3397865
  69. 69. Poropat AE. A meta-analysis of the five-factor model of personality and academic performance. Psychol Bull. 2009; 135(2):322–338. pmid:19254083
  70. 70. Uppal N, Mishra SK. Moderation effects of personality and organizational support on the relationship between prior job experience and academic performance of management students. Stud High Educ. 2014; 39(6):1022–1038.