Skip to main content Accessibility help
×
Hostname: page-component-f554764f5-fr72s Total loading time: 0 Render date: 2025-04-16T04:04:16.174Z Has data issue: false hasContentIssue false

The κ-Statistics Approach to Income Distribution Analysis

Published online by Cambridge University Press:  11 April 2025

Fabio Clementi
Affiliation:
University of Macerata, Italy
Mauro Gallegati
Affiliation:
Marche Polytechnic University, Italy
Lisa Gianmoena
Affiliation:
University of Pisa, Italy
Giorgio Kaniadakis
Affiliation:
Politecnico di Torino, Italy
Simone Landini
Affiliation:
IRES Piemonte – Socioeconomic Research Institute of Piedmont

Summary

This Element presents the κ-generalized distribution, a statistical model tailored for the analysis of income distribution. Developed over years of collaborative, multidisciplinary research, it clarifies the statistical properties of the model, assesses its empirical validity and compares its effectiveness with other parametric models. It also presents formulas for calculating inequality indices within the κ-generalized framework, including the widely used Gini coefficient and the relatively lesser-known Zanardi index of Lorenz curve asymmetry. Through empirical illustrations, the Element criticizes the conventional application of the Gini index, pointing out its inadequacy in capturing the full spectrum of inequality characteristics. Instead, it advocates the adoption of the Zanardi index, accentuating its ability to capture the inherent heterogeneity and asymmetry in income distributions.
Type
Element
Information
Online ISBN: 9781009446341
Publisher: Cambridge University Press
Print publication: 30 April 2025

1 Introduction

The issue of income and wealth inequality has long been a focal point in economic literature, drawing contributions from numerous esteemed scholars. In the early stages of economic inquiry, when the discipline focused mainly on resource allocation and societal welfare, theoretical and methodological tools were comparatively rudimentary, and data availability was limited. As a result, classical economists concentrated on the “functional distribution” of income, analyzing its allocation among wages, profits, and rents. In contrast, contemporary economists place greater emphasis on the “personal distribution” of income, which looks at how income is distributed among individuals, irrespective of their role in production.

David Ricardo’s “Principles” extensively explored the underlying laws governing income distribution. With the emergence of the marginalist and neoclassical revolutions, economic focus shifted towards the general equilibrium theory (GET), a framework still relevant today under the guise of DSGE (dynamic stochastic general equilibrium) methodology. However, this approach, rooted in the “representative agent” paradigm, tends to overlook income distribution dynamics. While income distribution and inequality have never been entirely sidelined, they have often taken a backseat to the pursuit of proving market equilibrium’s existence, uniqueness, and stability under idealized assumptions.

Notably, applied mathematicians and statisticians, many hailing from Italy, have shown significant interest in the realm of income distribution and inequality. Figures such as Max Otto Lorenz, Vilfredo Pareto, Gaetano Pietra, Umberto Ricci, Corrado Gini, and Giampaolo Zanardi have made substantial contributions to this field, highlighting the interdisciplinary nature of the study of economic disparities. These scholars, among others, will be acknowledged in this Element.

Various definitions exist for income or wealth inequality, yet they converge on the idea that unequal distribution signifies uneven access to opportunities among individuals. The root causes of this disparity can vary widely, spanning global, local, and individual factors.

In many countries, there is a growing acknowledgment of widening gaps in income and wealth distribution, indicating a concerning trend towards heightened inequality. This consensus is underscored by various studies and reports, highlighting the significance of this issue within both academic circles and among policymakers.

The issue of inequality in the distribution of income and wealth has recently taken center stage in political debates, sparking renewed interest among scholars. Key contributions to this discussion include works by Atkinson (Reference Atkinson2015), Atkinson & Piketty (Reference Atkinson and Piketty2007), Piketty (Reference Piketty2014), and Stiglitz (Reference Stiglitz2012, Reference Stiglitz2015).

For policymakers in particular, it is imperative to develop theories and models rooted in robust empirical evidence. Thus, there is a need to utilize precise methodologies to determine the most accurate parametric distributions of income and wealth, as well as to identify the most reliable estimators for measuring inequality.

Inequality is a phenomenon that fluctuates across time and space, with a persistent presence throughout history and likely into the future. It stems from various factors, including the behaviors of individuals holding a transferable quantity like income or wealth, as well as broader environmental conditions such as political, cultural, and economic contexts. The uneven distribution of these resources underscores the disparity in opportunities among individuals.

Some contend that inequality is an inevitable outcome, while others see it as fulfilling a social function by incentivizing advancement and progress. However, the crux of the matter lies in the detrimental effects of inequality, as it perpetuates cycles of poverty and deprivation. While complete eradication may prove elusive, effective control through redistributive or protective policies is essential. These policies should prioritize crucial aspects such as ensuring access to quality healthcare and education, particularly for disadvantaged groups, as these are vital for unlocking better job prospects and opportunities.

The management of inequality is critical for its mitigation. Achieving this goal necessitates the development and application of robust theoretical and methodological frameworks to comprehend and gauge its extent accurately.

For example, while economic growth is typically seen as positive news, its benefits are not always evenly distributed among individuals. In some rapidly growing economies, we have observed an exacerbation of inequality, as only a select few reap substantial rewards, particularly when considering both the financial and real sectors of the economy. Therefore, true economic growth should extend its benefits to all members of society, ideally favoring those who are most disadvantaged. Sustainable growth is characterized by a balanced distribution of wealth, steering clear of extreme social stratification and ensuring a high quality of life for the majority, if not all, citizens. However, achieving this requires a thorough understanding of inequality, underpinned by robust theoretical frameworks and analytical tools that can accurately measure its extent within the mechanisms of economies.

Over the years, numerous theoretical models and inequality estimators have been proposed and explored, making it impractical to list or review them all here. Each proposal has its strengths and weaknesses, contributing to the ongoing discourse without yielding a definitive solution. In this context, this Element presents the most effective and up-to-date solutions available for inferring both the parametric distribution of income and wealth, as well as the appropriate measures of inequality. Specifically, it introduces the κ-generalized distribution and the Zanardi index of asymmetry as prominent tools in this endeavor. These approaches offer valuable insights into understanding and addressing the complexities of economic inequality in contemporary society.

A quantity is deemed “positive” if all its values are nonnegative. Furthermore, a positive quantity is considered “transferable” if a portion held by one observation unit can be transferred to another without altering the total available quantity through interaction. This transferable quantity is termed “diffused” when the ensemble of N units possesses nonuniform shares of the total amount. Conversely, if each unit holds an identical share of 1/N, the quantity is deemed “equi-distributed” among individuals.

When the distribution disproportionately favors certain individuals or groups, it is categorized as “concentrated.” Thus, while all transferable quantities are inherently diffused, the degree and shape of the distribution can vary widely, ranging from equi-distribution with minimal concentration to maximum concentration, where one unit possesses nearly the entire quantity while the remaining N1 units possess negligible amounts.

We can unambiguously refer to equi-distribution when each individual holds an equal fraction, representing the scenario with minimal concentration. However, the opposite extreme, characterized by maximum concentration, lacks a specific term. In essence, between these theoretical extremes, lies a spectrum of situations with varying degrees of concentration. While the term equi-distribution implies equality, other distribution scenarios inherently involve some degree of inequality. Although this terminology is intuitive, it may not capture the nuanced variations effectively.

Equality can be conceptualized as a state of distribution devoid of any disparities. Conversely, when a transfer mechanism operates within a heterogeneous population, interacting through a complex network of relationships, inequality emerges.Footnote 1 This inequality can either intensify over time as distributive imbalances exacerbate or diminish as such imbalances ease. Unlike the idealized state of equality, which represents homogeneity, inequality is a persistent condition characterized by varying degrees of heterogeneity.

The κ-generalized distribution, initially proposed by Kaniadakis (Reference Kaniadakis2001) in the field of nonlinear particle physics kinetics and further refined over the following two decades, has recently found application in economics, thanks to studies by, among others, Clementi, Di Matteo, Gallegati, and Kaniadakis (Reference Clementi, Di Matteo, Gallegati and Kaniadakis2008), Clementi and Gallegati (Reference Clementi and Gallegati2016), and Clementi, Gallegati, and Kaniadakis (Reference Clementi, Gallegati and Kaniadakis2007, Reference Clementi, Gallegati and Kaniadakis2009, Reference Clementi, Gallegati and Kaniadakis2010, Reference Clementi, Gallegati and Kaniadakis2012a, Reference Clementi, Gallegati and Kaniadakis2012b). These works demonstrate that this distribution offers a superior fit compared to well-known functional forms such as the Singh–Maddala, Dagum type I, and GB2 models.

The asymmetry index of the Lorenz curve, introduced by Zanardi (Reference Zanardi1964, Reference Zanardi1965), employs a geometric decomposition method popularized by Tarsitano (Reference Tarsitano1987, Reference Tarsitano1988). Recent research by Clementi, Gallegati, Gianmoena, Landini, and Stiglitz (Reference Clementi, Gallegati, Gianmoena, Landini and Stiglitz2019) and Gallegati, Landini, and Stiglitz (Reference Gallegati, Landini and Stiglitz2016) has shown that this index outperforms the traditional Gini concentration index in measuring inequality.

Both the κ-generalized distribution and the Zanardi index serve as essential analytical and theoretical tools for understanding and quantifying income and wealth distribution inequality. They are invaluable for academic research as well as applied studies aimed at informing policy-making efforts.

This Element is structured as follows.

In Section 2, we provide a brief overview of existing methods for analyzing income distribution, focusing primarily on the Lorenz curve and traditional inequality measures. We then introduce novel insights into inequality measurement by addressing the asymmetry inherent in this curve. Specifically, we introduce the Zanardi index of asymmetry as a superior measure of inequality. Unlike other measures, this index considers both the intensity and direction of inequality within the distribution, facilitating comparisons even when Lorenz curves intersect. Empirical space-time estimates and comparisons with other indices, notably the Gini index, which only addresses concentration, support the superiority of the Zanardi index.

Section 3 presents the κ-generalized distribution, offering comprehensive mathematical details regarding its origin, limit cases, definitions, and fundamental properties. It explores the parametric specification of the Lorenz curve and various indices associated with this distribution. Additionally, the methodology for parameter estimation is elucidated, along with insights gleaned from applying this model to real income distribution data.

Section 4 provides many up-to-date applications of the κ-generalized distribution to real-world income data, showing fitting results and comparing them with those obtained from other parametric models.

Section 5 concludes this Element with essential reflections. While acknowledging that no result can be deemed definitively absolute, akin to physics, some findings can be deemed sufficiently robust until proven otherwise, especially if they offer superior explanations for known phenomena. This Element aspires to contribute to such advancements, and we invite readers to engage in further improving the results presented herein.

2 New Insights on the Measurement of Inequality

2.1 The Inequality Measures

The growing attention to economic inequality has revived debates on the most suitable metric or index for measuring income inequality. Since the introduction of the Lorenz curve in Reference Lorenz1905, which shows the share of income or wealth accruing to the bottom x percent of the population, numerous indices have been proposed to assess economic inequality.

The Gini (Reference Gini1914) coefficient stands as the predominant and extensively employed measure of income inequality. This metric measures inequality by calculating the area between the Lorenz curve, representing income distribution, and the line of perfect equality.

Based on the Lorenz curve, the literature on inequality has introduced alternative metrics, including the Pietra–Ricci index (Pietra, Reference Pietra1915; Ricci, Reference Ricci1916) and the Zanardi index (Zanardi, Reference Zanardi1964, Reference Zanardi1965), which provide new insights into income inequality. The Pietra–Ricci focuses on the redistribution needed to achieve equality, while the Zanardi index explores the asymmetry on the income distribution.Footnote 2

Other indices, like the Theil index, are grounded in information theory (Shorrocks, Reference Shorrocks1980; Theil, Reference Theil1967), while some, like the Atkinson (Reference Atkinson1970) index, are welfare-based measures of inequality and are more responsive to value judgments regarding inequality aversion. Though these indices offer valuable insights into the complex nature of inequality, each comes with its strengths and limitations. The choice of an index depends on the specific aspect of inequality under consideration and the societal values deemed most important.

In particular, indices that satisfy the three axiomatic conditions of (i) symmetry, (ii) scale invariance, and the (iii) transfer principle produce identical rankings for distinct income distributions only if the Lorenz curves do not intersect.Footnote 3 Likewise, rankings based on the Atkinson index remain consistent regardless of the level of risk aversion, only if the Lorenz curves do not intersect. If the Lorenz curves intersect, then different indices can yield different results depending on the type of inequality to which each index is most sensitive.

With the aim of discerning the nature of inequality, the Zanardi index distinguishes itself from the aforementioned indices by its capability to gauge the asymmetry of Lorenz curves. Unlike the previous measures, the Zanardi index delves into the shape of the distribution, aiding in the identification of whether inequality predominantly stems from the conditions of the poor or from the concentration of wealth among the richest.

2.2 Exploring the Lorenz Curve and the Gini Index Logic

It is well known that the most popular tool used to represent income distribution is the Lorenz curve Lp, whose upward and convex trend is strongly affected by how income is concentrated/distributed among individuals. The Lp, whose formalization is discussed in Section 3.2.2, tells us which proportion of the total income is in the hands of a given percentage of population. If income is distributed homogeneously among individuals, the Lorenz curve coincides with the main diagonal of a unit square, showing an absence of income concentration.

However, real-world income distribution is characterized by disparities between poor and rich people. This implies that less affluent individuals hold a share of total income below an equi-distributed allocation, while wealthier individuals hold more. Consequently, for typical income distributions, the Lorenz curve deviates from the diagonal, assuming the shape of a convex curve.

Closely tied to the representation of the Lorenz curve Lp, the Italian statistician Corrado Gini, in 1914, introduced a synthetic measure known as the Gini coefficient. This coefficient quantifies the ratio of the area between the Lorenz Curve and the equi-distribution line (hereinafter referred to as the concentration area) to the area of maximum concentration. It is crucial to emphasize that the Gini index specifically conveys information about the concentration of transferable quantities. However, it falls short in capturing other critical dimensions of inequality, such as the extent of heterogeneity, concentration, and asymmetry inherent in income distribution (Clementi et al., Reference Clementi, Gallegati, Gianmoena, Landini and Stiglitz2019; Gallegati et al., Reference Gallegati, Landini and Stiglitz2016).

The concept of heterogeneity underscores the presence of a nonuniform distribution of economic resources among different socioeconomic groups (poor and rich), with implications for understanding the dynamics of income inequality within a society; the concentration means considering different degrees of disparity between the classes, while the concept of asymmetry refers to the directional disparity in the distribution of endowments.

In particular, asymmetry implies that the distribution reveals a certain direction of the income imbalance which could be “right-wards” if the rich class of the distribution is more heterogeneous than the poor one, or “left-wards” in the opposite scenario.Footnote 4 These three concepts – concentration, heterogeneity, and asymmetry – are crucial for a thorough understanding of inequality. Consequently, it is necessary to consider alternative measures that can examine for the shape of Lorenz curves.

2.3 The Zanardi Asymmetry Index of the Lorenz Curve

In what follows, we provide a description and an interpretation of the Zanardi (Reference Zanardi1964, Reference Zanardi1965) index, which can be considered as a superior measure for examining income inequality, particularly in the presence of asymmetric income distributions. As highlighted by Clementi et al. (Reference Clementi, Gallegati, Gianmoena, Landini and Stiglitz2019), Gallegati et al. (Reference Gallegati, Landini and Stiglitz2016) and Park, Kim, and Ju (Reference Park, Kim and Ju2021) inequality is characterized not only by its “intensity” but also by its “direction.” In this framework, the “direction” of the inequality is discussed in terms of a positive and transferable quantity, such as income, which is unevenly distributed among recipients. This uneven distribution implies a form of “right-wing” or “left-wing” inequality concentration, highlighting the asymmetry of the Lorenz curve Lp.

Graphically, this situation is illustrated in Figure 2.1(a). Consider two intersecting Lorenz curves: one with a “bulge” in the lower-income segment (solid line) and the other with a “bulge” in the wealthier segment (dashed line); in this case, even though the areas enclosed by the two Lorenz curves and the diagonal of the unit square are the same, the income share (qth) of the poorest fraction (pth) of the population is lower in the first distribution than in the second, although the income concentration is exactly equal.

(a) Two Lorenz curves with the same Gini index and opposite asymmetry

(b) the characteristic point, discriminant point, and critical point of the Lorenz curve

Figure 2.1 Note: See also Clementi et al. (Reference Clementi, Gallegati, Gianmoena, Landini and Stiglitz2019).

Figure 2.1(a) shows two contrasting scenarios: one where the rich are very rich and get a high share of the total income, represented by the dashed curve, and another where the poor are very poor and get a very small share of the income, depicted by the solid curve.

In this framework, the discussion of the asymmetry of the Lp will be in terms of the discriminant point Dpd,qd, which separates the poor class from the rich one, given by the intersection of the Lorenz curve with the negative bisector q=1p, that is, the “axis of symmetry.” Thus, the share of poor income earners within the population is given by pd and accumulates a share qd of total income, which is larger than the income share 1qd accumulated by the 1pd rich. The discriminant point Dpd,qd plays a relevant role, since it separates the total area under the Lorenz curve in two sub-areas from which the Gini indexes for the poor and the rich (Gp and Gr, respectively) can be estimated (see Figure 2.1(b)).

Based on the discriminant point Dpd,qd, the Zanardi (Reference Zanardi1964, Reference Zanardi1965) index can be defined as

Z=2KδG,(2.1)

where K=pdqd2, δ=GrGp denotes the disparity of concentration between the rich and poor, and G is the overall Gini ratio. The index varies between −1 and 1, where Z<0 means that the Lorenz curve is negatively asymmetric (or asymmetric to the left), while for Z>0 the curve is positively asymmetric (or asymmetric to the right).

Therefore, if Gr>Gp then Z>0: the poor side is less concentrated than the rich one, or similarly the rich are more heterogeneous and concentrated than the poor, hence the distribution disadvantages the poor as they are more within-homogeneously poor than the rich side. In this case we face distributional imbalance toward the top (top-inequality), as described by the dashed curve in Figure 2.1(a). If Gr<Gp then Z<0 and the opposite interpretation holds: the poor exhibit greater heterogeneity and concentration compared to the rich, suggesting an imbalance distribution toward the bottom (bottom-inequality) – in other words, there are more ways of being poor than rich. This corresponds to the solid curve in Figure 2.1(a). Obviously, Z=0 if the Lorenz curve is symmetric and no distributional imbalances are found, therefore inequality can be analyzed looking at the overall Gini index.

In this scenario, the Zanardi index may provide more insightful information than the Gini index, especially for distributions that exhibit the same G (i.e. the same level of concentration of Lp) but differ in the sign of Z (i.e. Z>0, indicating “right-wing” skewness of Lp, or Z<0, indicating “left-wing” skewness of Lp).

2.4 Empirical Insights on Inequality

This section briefly presents some empirical evidence on inequality measures estimated using the Luxembourg Income Study (LIS) Database. The LIS Database provides public access to granular household-level income data for 52 countries, including both developed and developing nations over a period spanning 1963 to 2022.Footnote 5 Using a harmonized and equivalized dataset,Footnote 6 we explore the dynamics of disposable household income across 822 distributions.Footnote 7

Figure 2.2 shows the relationship between the concentration index G and the entropy inequality indexes: mean logarithmic deviation (MLD), Theil, and Atkinson (with inequality aversion parameter equal to 1).Footnote 8 Although each index provides a distinct viewpoint on inequality, it is possible to observe that higher levels of concentration are associated with higher levels of distributive imbalances. For lower levels of concentration, all the indices are almost equivalent in classifying distributions, but for higher levels of concentration, the indices diverge while maintaining the same order. This pattern confirms a strong relationship between concentration and inequality, however, it does not offer insights into the specific nature or direction of the inequality.

Figure 2.2 Relationship between concentration and entropy indexes of inequality (×1,000)

Figure 2.3 illustrates the relationship between the general concentration index G and the concentration levels on the rich (Gr) and poor (Gp) side. The graph unveils a distributional imbalance stemming from varying concentration levels between the two segments of the distribution – that is, the rich and poor segments – indicating a directional aspect of inequality not fully captured by the Gini index. Furthermore, it appears that the Gini index makes little distinction between sampling from the rich or the poor sides of the distribution. The estimates provided in Figure 2.3 show that a 1% increase in concentration on the rich side increases the overall concentration by approximately 0.64%, while a similar increase on the poor side results in an approximately 0.74% boost in overall concentration. This preliminary analysis underscores that inequity is slightly driven by a higher concentration on the poor side rather than the rich side. The presence of such asymmetry enables a deeper understanding of the inequality directionality, particularly in this scenario where it is driven by the disadvantaged group. Consequently, exploring the concentration gap between the rich and the poor through the Zanardi index could offer valuable insights.

Figure 2.3 Overall Gini index, G, together with the concentration on the rich, Gr, and the poor, Gp, sides (×100)

For this purpose, Figure 2.4 shows the Gini concentration index (horizontal axis) and the Zanardi asymmetry index (vertical axis), both estimated on the Lorenz curves for all distributions. Gray-triangular values denote cases of equivalent Gini paired with opposite Zanardi, indicating distributions with similar concentration but opposite asymmetry. For such observations, an analysis based solely on the Gini index would be misleading because, behind the same concentration value, different forms of inequality may be hidden, that is, top-inequality cases (above the horizontal axis) and bottom-inequality cases (below the horizontal axis).

Figure 2.4 Overall Gini index, G, together with the Zanardi index, Z

At this point in the analysis, we can draw some preliminary conclusions:

  1. 1. The income distributions for the entire sample turn out to be very heterogeneous in terms of income concentration (Gini values ranging from a maximum of 0.6626 to a minimum of 0.1887).

  2. 2. Such distributions exhibit significant inequality or asymmetry in both the rich and poor segments, as described by the Zanardi index (with values ranging from a maximum of 0.1253 to a minimum of −0.1932).

In order to eliminate these distributional gaps, wealth transfers should be implemented to nullify both concentration (G=0) and inequality (Z=0). The Pietra–Ricci index (H) takes this aspect into account.

With reference to Figure 2.1(b), the Pietra–Ricci index measures the maximum distance of the Lorenz curve q=Lp from the equi-concentration segment, that is, the vertical distance of the critical point from the line of perfect equality. Starting with an income distribution characterized by G>0 and Z0, it is possible to calculate the amount of income to transfer from a group (whether rich or poor) to achieve a distribution in which G=0 and Z=0, indicating the absence of inequality. As shown in Figure 2.5(a), as concentration levels increase, it will be necessary to transfer larger amounts of income to eliminate inequality. Thus, with reference to Figure 2.5(b), for values of Z<0, it will be necessary to transfer a portion H of the total income from the poor class to the rich class to completely eliminate concentration and inequality – akin to a “Matthew” effect. In the case of Z>0, on the other hand, the opposite will be necessary: transferring a portion of the total income from the rich class to the poor class – a kind of “Robin Hood” effect.

Figure 2.5

(a) Overall Gini index against the Pietra–Ricci index

(b) the Zanardi index against the Pietra–Ricci

2.4.1 Examining the Inequality over Time

The analysis of inequality presented so far, while informative, is quite general and does not fully allow for an understanding of the time evolution of inequality. Furthermore, a misleading understanding of the concept of inequality becomes evident when solely focusing on the overall Gini index. A country-specific analysis provides a deeper understanding of the various forms of inequality within an economy, highlighting trends and potential divergences between the Gini and Zanardi indices.

With this purpose, Figures 2.62.9 show the time series of the Gini and Zanardi indices for a subset of countries with the highest number of consecutive years of data within the time period from 1990 to 2021. For the sake of exposition, we classify these economies into four groups: Anglo-Saxon countries (Canada, the United Kingdom, and the United States); Western European countries (France and Germany); Southern European countries (Italy and Spain); and Northern European countries (Denmark, Norway, and Sweden).

Figure 2.6 Temporal evolution of the Gini (bullets, left scale) and Zanardi (squares, right scale), Anglo-Saxon countries

Note: The horizontal line represents the zero reference line for the Zanardi index.

The United States (Figure 2.6(a)) shows an almost constant increase in the G concentration index throughout the period considered, with the exception of the last few years, where there is a marked decrease in the G index from values close to 0.39 to values close to 0.37. However, concerning the type of the inequality, there is a cyclical fluctuation between negative values of the Z index and values close to zero for almost the entire period. This increased inequality on the poor side, as indicated by negative Z values, is attributed to greater heterogeneity within the lower-income group, where individuals experience higher levels of poverty. These disparities on the side of the poor are mitigated in three time periods: 2000–2001, 2005–2006, and 2014–2015, during which the Zanardi index approaches zero, indicating a reduction in income distribution asymmetry, albeit with persistently high-income concentration values as indicated by the Gini index. In 2020–2021, however, the distribution undergoes a shift, as indicated by the positive Z values associated with an increase of inequality, this time in favor of the rich-side. The income distribution shows an increase in heterogeneity within the richer class, with the emergence of a segment of wealthy individuals holding significant shares of income within the richer class (i.e. the super-rich).

A similar trend, albeit with lower values, is observed for Canada (Figure 2.6(b)). The concentration levels of G increase throughout the first decade, stabilizing at values around 0.32 in the following ten years, and then fluctuating with a tendency to decrease in the last period. Although the Gini index shows an almost constant trend, the Z index values move further away from the symmetry line, especially in the last ten years, with a significant increase in concentration (hence heterogeneity) among the poor (bottom-inequality).

The case of the United Kingdom in Figure 2.6(c) differs and is, in some respects, the opposite of the United States and Canada, showing a downward trend in both the G and Z indices. The Zanardi index shows how, in the United Kingdom, the distributive imbalance has always favored the richer, with a tendency towards values close to zero only in recent years. We are therefore seeing a general reduction in concentration and inequality, particularly within the wealthy class, indicating a shift from a top-inequality profile to a bottom-inequality one.

Countries in Western Europe, like France and Germany, generally have lower levels of income concentration (Figure 2.7). France’s Gini index shows a relatively stable trend over time, while Germany’s Gini index depicts a rising trend. Both countries exhibit positive imbalances in income distribution, as reflected by consistently positive values of Z. Consequently, a significant portion of total income in these economies is unequally distributed, with a larger proportion going to the rich while a smaller proportion is shared among the poor.

Figure 2.7 Temporal evolution of the Gini (bullets, left scale) and Zanardi (squares, right scale), Western European countries

Note: The horizontal line represents the zero reference line for the Zanardi index.

The situation in Southern European countries, specifically Italy and Spain, presents another interesting pattern (Figure 2.8). The Gini index shows relatively stable values over the years, around 0.33–0.35 for both countries. Conversely, the Zanardi index consistently falls below the symmetry line for both economies, indicating strongly negative Z values. The two countries have seen a notable rise in bottom-inequality, leading to increased poverty among individuals. While the trend in Spain does not seem to be improving, Italy has experienced a significant decline in bottom-inequality, reflected in progressively less negative Z values, at least until the first half of the 2000s, before reverting to worsening. Once more, this particular case highlights the limit of the Gini index in accurately capturing inequality. The similar and constant concentration values might erroneously imply a similarity between the two economies, which is not what real data tell us.

Figure 2.8 Temporal evolution of the Gini (bullets, left scale) and Zanardi (squares, right scale), Southern European countries

Note: The horizontal line represents the zero reference line for the Zanardi index.

The same conclusion applies to the Northern European countries referred to in Figure 2.9, where despite initially displaying relatively low levels of G, there has been a significant shift in inequality dynamics between the rich and the poor over time. For instance, in countries like Denmark, there has been a transition from highly negative Zanardi index values to values approaching the symmetry line. On the other hand, Norway and Sweden have observed increased bottom-inequality since 2006, with Norway experiencing a notable shift in trend in the last year.

Figure 2.9 Temporal evolution of the Gini (bullets, left scale) and Zanardi (squares, right scale), Northern European countries

Note: The horizontal line represents the zero reference line for the Zanardi index.

This simple empirical analysis offers a clear demonstration of the importance of incorporating asymmetry into the analysis of inequality. It offers essential insights for a discussion on the accurate interpretation of inequality. In conclusion, these results call for a more detailed approach to understanding income inequality, moving beyond the conventional analysis focused solely on Lorenz curve concentration or entropy indices, in favor of a more detailed analysis that can also emphasize asymmetries in income distributions.

3 The κ-Generalized Distribution

3.1 Introduction

Mathematical statistics is dominated by two large families of distributions. The first is the exponential family, whose main characteristic is that it contains distributions that decay exponentially. Three important members of the exponential family are mentioned below. The first distribution is the generalized gamma distribution, with the probability density function (PDF) fx given by fγx=Nxαν1expHx, where N is the normalization constant and Hx=δxα, with α,δ>0, ν>0, x>0. A second distribution from the family of exponential distributions is the generalized logistic distribution, which is defined starting from its survival function Sx=λ/λ1+expHx, where λ>0, which is connected to fx by . The standard logistic distribution corresponds to λ=2 and α=1. A third distribution is the Weibull distribution, which corresponds to the case λ=1 of the second distribution, and its survival function is given by Sx=expHx.

Besides the exponential family, a second family is that of distributions whose tails are described by the Pareto law fxAxp, with p>1. This family of distributions contains three commonly used distributions defined by the survival function as follows: the first is the log-logistic distribution, with S=1/1+H; the second is the Burr Type XII or Sing-Maddala distribution, with S=1+Hr and r>0; finally, the third distribution is that of Dagum, with S=1Hr1+Hr.

In the distributions of the second family, the monomial function Hx is the same as the one that occurs in the first family. Apart from this common point, there is no other correlation between the two families of distributions. It can also be noted that the three distributions of the second family are the simplest to construct, but there are a variety of other distributions that have asymptotically Pareto tails.

This dichotomy between these two families of statistical distributions leads to some problems on a theoretical level if we keep in mind that there are a large number of systems that are well described by exponential models for low values of the variable x while these models gradually turn into power-law tail models for increasing values of x.

The problem thus arises as to whether the second family of models must be proposed independently of the first family, as has been done in the past, or whether the models of the second family must be replaced by another family which is a deformation of the models of the first family. In the last two decades, this idea has been developed and it has shown that is possible to propose a unique family of models, different from the two above-discussed families. The new family of models, for x0 reduces to the first exponential family, while for x behaves differently to the second family but it also presents Pareto power-law tails. This was possible thanks to the physical mechanism emerging in special relativity which deforms the ordinary exponential function and replaces it with a new function, the so-called κ-exponential function defined as

expκx=1+κ2x2+κx1κ,(3.1)

with 0<κ<1 (Kaniadakis, Reference Kaniadakis2001, Reference Kaniadakis2002, Reference Kaniadakis2005). For x0 or equivalently for κ0, the κ-exponential reduces to the Euler ordinary exponential, that is, expκxexpx, whereas for x the κ-exponential reduces to Pareto law, that is, expκx2κx±1κ.Footnote 9

The κ-deformed version of the three distributions of the exponential family can be introduced easily as follows (Kaniadakis, Reference Kaniadakis2021):

  1. (i) κ-deformed generalized gamma distribution; it is defined through its PDF as

    fκx=1+κν2κνΓ12κ+ν2Γ12κν2αδνΓνxαν1expκδxα,(3.2)
    with 0<ν<1κ. In the κ0 limit, the density function reduces to the ordinary Generalized Gamma density fx=αδνΓνxαν1expδxα. Asymptotically for x, the density function behaves according to fκxxαν1ακ.
  2. (ii) κ-deformed generalized logistic distribution; the survival function of the distribution is given by

    Sκx=λλ1+expκδxα,(3.3)
    with λ>0. In the κ0 limit, the function (3.3) reduces to ordinary generalized logistic survival function Sκx=λ/λ1+expδxα, while for x it behaves according to Sκxxακ.
  3. (iii) κ-deformed Weibull distribution; the special case corresponding to λ=1 of the previous distribution is the κ-deformed Weibull distribution with survival or reliability function given by

    Sκx=expκδxα.(3.4)

The κ-deformed Weibull distribution is the κ-deformed distribution that undoubtedly has found extensive applications, particularly in economics (Clementi, Reference Clementi2023; Clementi et al., Reference Clementi, Di Matteo, Gallegati and Kaniadakis2008; Clementi & Gallegati, Reference Clementi, Gallegati, Kaniadakis and Landini2016, Reference Clementi and Gallegati2017; Clementi et al., Reference Clementi, Gallegati and Kaniadakis2007, Reference Clementi, Gallegati and Kaniadakis2009, Reference Clementi, Gallegati and Kaniadakis2010, Reference Clementi, Gallegati and Kaniadakis2012a, Reference Clementi, Gallegati and Kaniadakis2012b; Clementi et al., Reference Clementi, Gallegati, Kaniadakis and Landini2016; Clementi & Gianmoena, Reference Clementi and Gallegati2017) and in seismology (Hristopulos & Baxevani, Reference Hristopulos and Baxevani2022; Hristopulos, Petrakis, & Kaniadakis, Reference Hristopulos, Petrakis and Kaniadakis2014, Reference Hristopulos, Petrakis and Kaniadakis2015). Its cumulative distribution function Fκ=1Sκ writes as

Fκx=1expκδxα,(3.5)

while the related PDF fκx=dFκxdx becomes

fκx=αδxα11+κ2δ2x2αexpκδxα.(3.6)

After noticing that

fκx=hκxSκt,(3.7)

with

hκx=αδxα11+κ2δ2x2α,(3.8)

we deduce that hκx represents the hazard function of the model.

The derivation with respect x of the survival function permits to obtain easily its rate equation in the form

dSκxdx=hκxSκx.(3.9)

The integration of this first-order linear ordinary differential equation with the initial condition Sκ0=1 permits to write the survival function in the form

Sκx=expHκx,(3.10)

where Hκx is the cumulative hazard function, defined by means of the integral

Hκx=0xhκudu.(3.11)

After performing the latter integral, the explicit form of the cumulative hazard function is obtained as

Hκx=1κarcsinhκδxα,(3.12)

which in the κ0 limit reduces to the standard Weibull cumulative hazard function H0x=Hx=δxα.

A direct comparison between Equations (3.4) and (3.10), incorporating the expression of the cumulative hazard function as provided in Equation (3.12), yields the already established second representation of the κ-exponential function, namely

expκx=exp1κarcsinhκx,(3.13)

so that the κ-Weibull survival function assumes the form

Sκx=exp1κarcsinhκδxα.(3.14)

The remainder of the section is devoted to elucidating the main statistical properties of the κ-deformed Weibull distribution, also known in econophysics literature as the κ-generalized distribution after Clementi et al. (Reference Clementi, Gallegati and Kaniadakis2007). This distribution, employing a marginally distinct parameterization from the κ-deformed Weibull, where δ=βα, offers a cohesive framework for describing real-world data, encompassing the power-law tails observed in empirical distributions of income and wealth.Footnote 10

The κ-generalized distribution has showcased remarkable efficacy and is frequently viewed as a superior alternative to other commonly utilized parametric models. Initially introduced in 2007 and subsequently refined in subsequent years, this model traces its roots back to the realm of κ-generalized statistical mechanics (Kaniadakis, Reference Kaniadakis2001, Reference Kaniadakis2002, Reference Kaniadakis2005, Reference Kaniadakis2009a, Reference Kaniadakis2009b, Reference Kaniadakis2013). It possesses a bulk closely resembling that of the Weibull distribution, with an upper tail that follows a Pareto power law for high levels of income and wealth. This characteristic enables it to offer an intermediate perspective between the two previously mentioned descriptions.

3.2 The κ-Generalized Model for Income Distribution

3.2.1 Definitions and Fundamental Properties

A variable X following a κ-generalized distribution, denoted Xκ-genα,β,κ, is characterized by a PDF given byFootnote 11

fx;α,β,κ=αβxβα1expκx/βα1+κ2x/β2α,x>0,(3.15)

where α,β>0 and κ0,1. The cumulative distribution function (CDF) of this distribution is formulated as

Fx;α,β,κ=1expκx/βα.(3.16)

Figure 3.1 visually depicts the properties of the κ-generalized PDF and the complementary CDF (a.k.a. the survival function), referred to as 1Fx;α,β,κ, under various parameter configurations. In each pair of plots, two parameters remain constant while the third is adjusted to demonstrate its effect on the distribution.

Figure 3.1 κ-generalized PDF (top) and complementary CDF (bottom) across different parameter values

Note: The complementary CDF is plotted on double-log axes to accentuate the right-tail behavior of the distribution.

The constant β acts as a scale factor, representing the income dimension. Thus, it incorporates the monetary unit, facilitating adjustments for inflation and allowing for cross-country comparisons of income distributions expressed in different currencies. Increases in β correspond to global rises in individual income and average income levels.

On the other hand, the parameters α and κ are dimensionless and shape the distribution. α primarily affects the region near the distribution’s origin, while both α and κ influence the distribution’s upper tail. A higher κ leads to a thicker upper tail, while increasing α narrows both tails and concentrates probability mass around the distribution’s peak.

As κ approaches 0, the κ-generalized distribution tends towards the Weibull distribution.Footnote 12 This convergence is evident from

limκ0fx;α,β,κ=αβxβα1expx/βα(3.17)

and

limκ0Fx;α,β,κ=1expx/βα.(3.18)

As x approaches 0 from the positive side, the distribution behaves like the Weibull model. Conversely, for large x, it tends towards a Pareto distribution of the first kind, characterized by a scale parameter k=β2κ1α and a shape parameter a=ακ, expressed as

fx;α,β,κx+akaxa+1(3.19)

and

Fx;α,β,κx+1kxa.(3.20)

Thus, it adheres to the weak Pareto law, as defined by Mandelbrot (Reference Mandelbrot1960).Footnote 13

The closed-form expression for the quantile function, derived from Equation (3.16), is

F1u;α,β,κ=βlnκ11u1α,0<u<1,(3.21)

where lnκ represents the deformed logarithmic function, defined as the inverse function of (3.1), namely lnκexpκx=expκlnκx=x. It is expressed as

lnκx=xκxκ2κ,xR+.(3.22)

Thus, random numbers from a κ-generalized distribution can be easily generated using the inversion method.

The median of the distribution is

xmed=F10.5;α,β,κ=βlnκ21α,(3.23)

and the mode occurs at

xmode=βα2+2κ2α12κ2α2κ212α×1+4κ2α2κ2α12α2+2κ2α12112α(3.24)

if α>1; otherwise, the distribution is zero-modal with a pole at the origin.

The r th raw moment of the κ-generalized distribution is given by

xr=0xrfx;α,β,κdx=βr2κrαΓ1+rα1+rακΓ12κr2αΓ12κ+r2α,(3.25)

where Γ denotes the gamma function, and it exists for α<r<ακ. Particularly,

x=β2κ1αΓ1+1α1+1ακΓ12κ12αΓ12κ+12α(3.26)

represents the mean of the distribution, and

x2x2=β22κ2αΓ1+2α1+2καΓ12κ1αΓ12κ+1αΓ1+1α1+καΓ12κ12αΓ12κ+12α2(3.27)

denotes the variance.

3.2.2 Assessing Income Inequality through the κ-Generalized Distribution

In economics, the notion of inequality traces its roots to Pareto’s early investigations (Pareto, Reference Pareto1895, Reference Pareto and Busino1896, Reference Pareto1897a, Reference Pareto1897b), which revealed that roughly 80 percent of total income/wealth was held by the top 20 percent. Subsequently, Lorenz (Reference Lorenz1905) introduced the Lorenz curve, a widely employed method for gauging income/wealth inequality. This curve compares the actual income or wealth distribution with an equal distribution. Under perfect equality, the Lorenz curve aligns with the diagonal of a unit square. Any deviation from this diagonal signifies a more unequal distribution.

The Lorenz curve for a positive and transferable random variable X with a CDF Fx and a finite mean x=xdFx is defined as presented by Gastwirth (Reference Gastwirth1971)

Lu=1x0uF1tdt,u0,1.(3.28)

By employing the closed-form expression of the quantile function F1u of the κ-generalized distribution, the Lorenz curve can be represented as indicated by Okamoto (Reference Okamoto2013)

Lu=Ix1+1α,12κ12α,x=11u2κ,(3.29)

where Ix, denotes the regularized incomplete beta function, defined in terms of the incomplete beta function and the complete beta function, as Ix,=Bx,B,. The curve exists if and only if ακ>1. Specifically, if Xiκ-genαi,βi,κi for i=1,2, the conditions for the Lorenz curves of X1 and X2 not to intersect are elaborated in Clementi et al. (Reference Clementi, Gallegati and Kaniadakis2010) as

α1α2andα1κ1α2κ2.(3.30)

The Lorenz curves of two κ-generalized distributions X1 and X2 with parameters chosen according to (3.30) are illustrated in Figure 3.2. The depicted curves indicate that X1 exhibits less inequality than X2, as the Lorenz curve of X1 neither intersects nor falls below that of X2.

Figure 3.2 Lorenz curves for two κ-generalized distributions

Economists have employed different statistical metrics to measure income and wealth inequality. Among these, the coefficient introduced by Gini (Reference Gini1914) is prominent. Starting from the general definition provided by Arnold and Laguna (Reference Arnold and Laguna1977), the Gini coefficient associated with the κ-generalized distribution is derived as

G=12α+2κ2α+κΓ1κ12αΓ1κ+12αΓ12κ+12αΓ12κ12α.(3.31)

Employing the Stirling approximation for the gamma function and taking the limit as κ0 in Equation (3.31), and after simplification, we obtain G=121α, representing the explicit form of the Gini coefficient for the Weibull distribution. Additionally, for κ0 and α=1, the exponential distribution emerges as a special limiting case of the κ-generalized distribution with a Gini coefficient of one half (Drăgulescu & Yakovenko, Reference Drăgulescu and Yakovenko2001).

Although the Gini coefficient is widely used, it assumes specific patterns of income disparities across different distribution segments. It is most sensitive to transfers around the middle of the distribution and less responsive to changes among the extremely wealthy or impoverished (Allison, Reference Allison1978). In contrast, the generalized entropy class of inequality measures (Cowell, Reference Cowell1980a, Reference Cowell1980b; Cowell & Kuga, Reference Cowell and Kuga1981a, Reference Cowell and Kuga1981b; Shorrocks, Reference Shorrocks1980) offers a range of indices sensitive to inequality across various distribution segments. The formula for this class of inequality indices in terms of the κ-generalized parameters is provided by (Clementi et al., Reference Clementi, Gallegati and Kaniadakis2009)

GE(θ)=1θ2θβmθ2κθα1+θακΓ12κθ2αΓ12κ+θ2αΓ1+θα1,(3.32)

where θ0,1 and m=x denotes the mean of the distribution. This class allows different forms of inequality measures depending on the value assigned to θ, indicating the index’s sensitivity to income differences across various distribution segments – a more positive or negative θ corresponds to greater sensitivity of GEθ to income differences at the top or bottom of the distribution. Notable limiting cases, derived when θ is set to 0 and 1, are the MLD index

MLD=limθ0GE(θ)=1αγ+ψ12κ+ln(2κ)αlnβm+κ,(3.33)

where γ=ψ1 is the Euler-Mascheroni constant and ψz=Γz/Γz is the digamma function, and the Theil (Reference Theil1967) index:

T=limθ1GE(θ)=1αψ1+1α12ψ12κ12α12ψ12κ+12αln(2κ)+αlnβmακα+κ,(3.34)

where the former is more sensitive to changes in the middle of the distribution, while the latter is more responsive to variations in the upper tail (Jenkins, Reference Jenkins2009; Sarabia, Jordá, & Remuzgo, Reference Sarabia, Jordá and Remuzgo2017).Footnote 14

Lastly, the inequality measures family introduced by Atkinson (Reference Atkinson1970) can be derived from (3.32) using the relationship (Cowell, Reference Cowell2011; Jenkins, Reference Jenkins2009)

Aϵ=1ϵϵ1GE1ϵ+111ϵ,ϵ>0,ϵ1,(3.35)

where ϵ=1θ represents the inequality aversion parameter. As ϵ increases, Aϵ becomes more sensitive to changes in lower incomes and less responsive to alterations in top incomes (Allison, Reference Allison1978). The limiting expression of (3.35) is A1=1exp(MLD).Footnote 15

3.2.3 Estimation

The estimation of parameters in the κ-generalized distribution can be achieved through maximum likelihood estimation, providing estimators known for their advantageous statistical properties (Ghosh, Reference Ghosh1994; Rao, Reference Rao1973). For a set of independent sample observations x=x1,,xn,, the likelihood function is expressed as

Lx;θ=i=1nfxi;θwi=i=1nαβxiβα1expκxi/βα1+κ2xi/β2αwi,(3.36)

where fxi;θ represents the PDF, θ=α,β,κ denotes the vector of unknown parameters, wi is the weight assigned to the i th observation, and n is the sample size. This formulation leads to determining the partial derivatives with respect to α, β, and κ for the log-likelihood function

lx;θ=lnLx;θ=i=1nwilnfxi;θ,(3.37)

which translates into solving the following system of equations

i=1nwiαlnfxi;θ=0,(3.38)
i=1nwiβlnfxi;θ=0,(3.39)
i=1nwiκlnfxi;θ=0.(3.40)

However, obtaining explicit expressions for the maximum likelihood estimators of the three κ-generalized parameters poses a challenge due to the absence of feasible analytical solutions. Therefore, resorting to numerical optimization algorithms becomes imperative to tackle the maximum likelihood estimation problem.

3.2.4 Utilizations of κ-Generalized Models in Analyzing Income and Wealth Data

Over the past two decades, the κ-generalized model has found extensive application in analyzing income and wealth data across various real-world contexts.

The initial investigation, led by Clementi et al. (Reference Clementi, Gallegati and Kaniadakis2007), scrutinized household incomes in Germany, Italy, and the United Kingdom during 2001–2002. Their study revealed a notable agreement between the model and empirical distributions across all income tiers, particularly within the intermediate range where deviations were noted when using the Weibull model and pure Pareto law for interpolation.

Subsequent studies extended the application of the κ-generalized distribution to Australian household incomes in 2002–2003 (Clementi et al., Reference Clementi, Di Matteo, Gallegati and Kaniadakis2008) and US family incomes in 2003 (Clementi et al., Reference Clementi, Di Matteo, Gallegati and Kaniadakis2008, Reference Clementi, Gallegati and Kaniadakis2009). In both instances, the model provided a comprehensive depiction of the income spectrum and yielded accurate estimations of inequality measures, such as the Lorenz curve and Gini coefficient.

Comparative analyses, pivotal for assessing relative performance, were also undertaken. For example, Clementi et al. (Reference Clementi, Gallegati and Kaniadakis2010) examined household income distributions in Italy spanning from 1989 to 2006. Their results showcased the superior performance of the κ-generalized model over three-parameter competitors, such as the Singh–Maddala and Dagum type I distributions, except for the GB2 distribution, which features an additional parameter. Similar evaluations were conducted for household income datasets from Greece, Germany, the United Kingdom, and the United States, demonstrating the superiority of the κ-generalized model, particularly in modeling the right tail of the data (Clementi, Reference Clementi2023; Clementi et al., Reference Clementi, Gallegati and Kaniadakis2012a). Additionally, Clementi and Gallegati (Reference Clementi and Gallegati2016) concluded that the κ-generalized distribution offered a superior fit to the data and more precise estimates of income inequality compared to alternatives, leveraging household income data from 45 countries extracted from the LIS Database.

The application of the κ-generalized distribution extends to examining peculiarities within survey data on net wealth, defined as gross wealth minus total debt (Clementi & Gallegati, Reference Clementi and Gallegati2016; Clementi et al., Reference Clementi, Gallegati and Kaniadakis2012b). These datasets often feature significant occurrences of households or individuals with either null or negative wealth. The model for wealth distribution, based on the κ-generalized distribution, comprises a mixture of an atomic and two continuous distributions. The atomic distribution caters to economic units with zero net worth, while negative net worth data are described by a Weibull function. Conversely, positive net worth values are characterized by the κ-generalized model outlined in Equation (3.15). Analyzing US net worth data from 1984 to 2011 (Clementi et al., Reference Clementi, Gallegati and Kaniadakis2012b), the κ-generalized mixture model for wealth distribution demonstrated remarkable accuracy, surpassing finite mixture models based on the Singh–Maddala and Dagum type I distributions for positive net worth values. A similar examination carried out by Clementi and Gallegati (Reference Clementi and Gallegati2016) explored net wealth data from nine distinct countries.

4 Modeling Income Data Using the κ-Generalized Distribution

In the following, we explore the ability of the κ-generalized model in describing real-world income distributions. Initially, we present the outcomes of fitting the κ-generalized distribution to LIS income data, demonstrating its accuracy in representing real-world data. Subsequently, we assess the performance of the κ-generalized distribution against alternative parametric models proposed in the literature for income distribution. These models are applied to all available national datasets within the income micro-database currently utilized. As previously noted in Section 2.4 of this Element, the statistical analyses presented here are based on LIS microdata updated in March 2024, incorporating additional datasets into the database and thereby broadening the scope of distributions that can be analyzed.

4.1 Results of Fitting to Empirical Distributions

Figures 4.14.4 illustrate the outcomes of fitting the κ-generalized model to empirical income data, reflecting the household income distribution in key countries categorized into the four groups discussed in Section 2.4.1: the United States, emblematic of the Anglo-Saxon countries group; Germany, representing the Western European countries group; Italy, representing the Southern European countries group; and Sweden, representing the Northern European countries group. These four cases correspond to the latest available data years for the respective countries within the LIS Database, specifically 2022 for the United States, 2020 for Germany and Italy, and 2021 for Sweden.

Figure 4.1 The κ-generalized distribution fitted to household income data for the United States in 2022

Note: The solid line depicts the κ-generalized model, which fits the data across the entire income spectrum, from low to high incomes, including the middle-income range. This model is contrasted with the Weibull (dashed line) and Pareto power-law (dotted line) distributions. The complementary cumulative distribution is plotted on a double-log scale, emphasizing the distribution’s behavior in the right tail. The Lorenz curve plot compares the empirical and theoretical curves, where the solid gray line represents the Lorenz curve of a society with equal income distribution. The Q–Q plot of sample percentiles versus theoretical percentiles of the fitted κ-generalized model shows an excellent fit, with corresponding percentiles closely aligned along the 45-degree line from the origin.

Figure 4.2 The κ-generalized distribution fitted to household income data for Germany in 2020

Note: See note to Figure 4.1.

Figure 4.3 The κ-generalized distribution fitted to household income data for Italy in 2020

Note: See note to Figure 4.1.

Figure 4.4 The κ-generalized distribution fitted to household income data for Sweden in 2021

Note: See note to Figure 4.1.

The best-fitting parameter values were determined using maximum likelihood estimation, as discussed in Section 3.2.3. This yielded the parameter estimates shown in Table 4.1. The very small errors indicate precise parameter estimation. By comparing the observed and fitted probabilities in panels (a) and (b) of the figures, it becomes apparent that the κ-generalized distribution holds great potential for accurately describing the data across their entire range, from the low-to-medium income region to the high-income Pareto power-law regime, encompassing the intermediate region where a clear deviation is evident when using two different curves.

Table 4.1 Estimated κ-generalized parameters for selected LIS country datasets

CountryYearαβκ
Germany20202.57930,999.9500.735
(0.028)(161.972)(0.020)
Italy20202.07618,634.8500.569
(0.030)(159.347)(0.026)
Sweden20212.564351,933.2400.619
(0.031)(2,104.757)(0.022)
United States20221.79056,138.9100.636
(0.009)(189.766)(0.009)

Note: Estimated standard errors in parentheses.

Panel (c) of the same figures displays the empirical data points for the Lorenz curve, overlaid with the theoretical curve derived from Equation (3.29) using the parameter estimates in place of α and κ. This curve is represented by the solid line in the plots and exhibits an exceptional fit to the data. Additionally, the plots juxtapose the empirical Lorenz curve with the theoretical curves associated with the Weibull and Pareto distributions, respectively defined as

limκ0Lu=P1+1α,ln1u,(4.1)

where P, denotes the lower regularized incomplete gamma function, and

limxLu=11u11a.(4.2)

As evident, these curves capture only a fraction of the overall narrative.

The linear development observed in the quantile–quantile (Q–Q) plot of sample percentiles against the fitted κ-generalized distribution, along with its limiting cases, depicted in panels (d) of Figures 4.1 through 4.4, confirms the validity of the model. It also highlights that the Weibull and Pareto distributions offer only partial and incomplete descriptions of the data.

To indirectly evaluate the precision of parameter estimation, we computed predicted values for mean and median disposable household income, along with two inequality measures: the Gini coefficient and the Atkinson coefficient, where the latter’s inequality aversion parameter was set to 0.5 and 1. These computations entailed substituting the estimated parameters into the relevant expressions detailed in Sections 3.2.1 and 3.2.2. The results of these calculations are shown alongside their respective empirical counterparts in Figure 4.5 (for mean and median values) and Figure 4.6 (for inequality measures).Footnote 16 The empirical data were obtained from statistics provided by LIS staff under the LIS Inequality and Poverty Key Figures for the countries and years considered.Footnote 17

Figure 4.5 Observed mean and median values of disposable household income compared with the predicted values based on the κ-generalized model

Note: To directly compare absolute monetary values across different LIS datasets, mean and median monetary values have been converted into 2017 USD PPPs by dividing them by the corresponding year’s LIS PPP, which combines CPI (Consumer Price Index) and PPP (Purchasing Power Parity) deflators to compare real amounts across countries and over time.

Figure 4.6 Observed Gini and Atkinson indices of disposable household income compared with the predicted values based on the κ-generalized model

As can be observed, the predictions derived from the κ-generalized distribution are largely encompassed by the confidence intervals constructed around the empirical values in the vast majority of cases.Footnote 18 This confirms the remarkable agreement between the model and the sample observations. Furthermore, the agreement between the estimates derived from the κ-generalized distribution and the sample values of the considered statistics remains excellent even when considering their temporal trend. Indeed, as shown in Table 4.2, the correlation coefficients between the time series of the considered statistics calculated from the data and those obtained from the estimation of the κ-generalized model are close to unity and all highly significant (the corresponding p-value is less than 0.001 in all cases).

Table 4.2 Correlation between selected distributional statistics computed from the data and the corresponding estimates implied by the fitted κ-generalized model

CountryTime spanStatistic
MeanMedianGA(0.5)A(1)
Germany1973–20200.9990.9970.9920.9690.974
(0.000)(0.000)(0.000)(0.000)(0.000)
Italy1977–20200.9990.9970.9880.9730.982
(0.000)(0.000)(0.000)(0.000)(0.000)
Sweden1975–20211.0001.0000.9980.9900.988
(0.000)(0.000)(0.000)(0.000)(0.000)
United1963–20220.9990.9990.9850.9620.980
(0.000)(0.000)(0.000)(0.000)(0.000)

Note: Values in parentheses denote p-values for testing whether the correlation coefficient is statistically significant.

The good performance of the κ-generalized distribution can also be appreciated by considering its ability to replicate the temporal trend of the Zanardi asymmetry index. As shown in Figure 4.7, the temporal pattern of the predicted Zanardi index values closely mirrors that of the corresponding empirical estimates calculated from the sample data and discussed in Section 2.4.1 of this Element.Footnote 19 The correlation coefficient values between the empirical Zanardi index and the corresponding estimates from the κ-generalized model, as presented in Table 4.3, corroborate the findings from the visual examination of the figure: all correlation coefficients indeed indicate a strong association between these series, being highly significant with p-value lesser than 0.001 in all cases.

Table 4.3 Correlation between the implied and sample estimates of the Zanardi asymmetry index

CountryTime spanCorrelationp-value
Canada1990–20190.9910.000
Denmark1992–20160.9670.000
France1990–20180.6410.001
Germany1990–20190.8950.000
Italy1991–20200.9750.000
Norway1991–20210.9760.000
Spain1990–20190.9800.000
Sweden1992–20210.9270.000
United Kingdom1990–20210.9270.000
United States1990–20210.8530.000

Note: The p-value is for testing whether a correlation coefficient is significantly different from zero.

Figure 4.7 The Zanardi asymmetry index based on the κ-generalized model

Note: The horizontal line represents the zero reference line for the Zanardi index.

4.2 Comparison of Alternative Income Parametric Models

In this section, we present comparisons of the performance between the κ-generalized distribution and other parametric models introduced in existing literature. Specifically, we evaluate the κ-generalized model’s ability to approximate empirical income distributions against the three-parameter Singh–Maddala (Singh & Maddala, Reference Singh and Maddala1976) and Dagum type I (Dagum, Reference Dagum1977) functional forms. These models are characterized, respectively, by the following probability densities

fx;a,b,q=aqxa1ba1+x/ba1+q,x>0,a,b,q>0,(4.3)

fx;a,b,p=apxap1bap1+x/bap+1,x>0,a,b,p>0.(4.4)

Kleiber and Kotz (Reference Kleiber and Kotz2003) offer analytical expressions for distribution functions, moments, and tools for inequality measurement, such as the Lorenz curve and the Gini coefficient. Additionally, Chotikapanich, Griffiths, Hajargasht, Karunarathne, and Rao (Reference Chotikapanich, Griffiths, Hajargasht, Karunarathne and Rao2018) and Jenkins (Reference Jenkins2009) provide formulas for generalized entropy measures of the GB2 distribution, from which the Singh–Maddala and Dagum versions can be readily derived.Footnote 20

Our comparisons involve all country datasets included in the LIS Database after the March 2024 update, except for those that are excluded from the calculation of LIS Inequality and Poverty Key Figures. The total number of income distributions considered amounts to 858. For space reasons, maximum likelihood estimates for the parameters of the three models under consideration are not presented here but are available upon request.

Given that a “good” model of income distribution should not only accurately capture the shape of empirical distributions but also closely align with them in a manner consistent with economic welfare considerations, we assess the performance of the three scrutinized models by analyzing the accuracy of selected distributional statistics implied by parameter estimates. These statistics include: the mean; the Gini index, G; the mean logarithmic deviation, MLD; the Theil index, T; the Atkinson index with the inequality aversion parameter set to 1, A1. For each examined model, the accuracy of predictions regarding these statistics is determined by computing the absolute percentage error using the formula

APE=PAA×100.(4.5)

Here, P represents the predicted values and A represents the actual sample estimates.

The results are summarized in Figure 4.8, which illustrates the percentage of cases where each model achieves the lowest APE for the distributional statistics under consideration. The κ-generalized distribution consistently exhibits a closer match between implied and sample estimates of the mean compared to the Singh–Maddala and Dagum type I models. Conversely, the Gini coefficient estimates are often more accurate for the latter models than for the competing κ-generalized distribution. Therefore, while the estimation procedure of the κ-generalized parameters appears to have generally preserved the mean characteristic across most analyzed country datasets, it has often poorly modeled intra- and/or inter-group variation. However, when examining income disparities across different parts of the income distribution, the κ-generalized model yields relatively more accurate estimates than its competitors for the mean logarithmic deviation, the Theil index, and the Atkinson inequality measure A(1). Given that the MLD index is sensitive to differences in middle incomes, while the T and A(1) indices are more attuned to variations at the upper and lower ends of the income distribution respectively, these findings consistently support the κ-generalized model’s closest approximation to the income distribution shape in a significant number of cases.

Figure 4.8 Percentage of occurrences in which the Singh–Maddala, Dagum type I and κ-generalized models achieve the lowest absolute percentage error between predicted values for key distributional summary measures and their sample counterparts

5 Conclusion

The κ-generalized distribution model was originally proposed by Kaniadakis (Reference Kaniadakis2001) within the context of developing a relativistic interpretation of statistical mechanics for kinetic particles. Since then, it has garnered significant attention in the field of economics, particularly through the works of Clementi et al. (Reference Clementi, Gallegati and Kaniadakis2007, Reference Clementi, Di Matteo, Gallegati and Kaniadakis2008, Reference Clementi, Gallegati and Kaniadakis2009, Reference Clementi, Gallegati and Kaniadakis2010, Reference Clementi, Gallegati and Kaniadakis2012a, Reference Clementi, Gallegati and Kaniadakis2012b). Notably, in the realm of income and wealth distribution analysis, Clementi and Gallegati (Reference Clementi and Gallegati2016) provide a comprehensive reference on this topic.

This probabilistic model defines a new family of distributions characterized by some real parameters θ, indicated by KθK. It includes either the exponential family EθE and the power-law family PθP. It grasps either concentration phenomena about the expected value and inequality induced by the tails. This is due to the fact that, beyond numerical transformations of data, this model is based on functional deformations of the limited and finite support XR of a positive and transferable quantity for which the probability field is not isotropic. Differently said, for a given positive-transferable quantity X and a finite sample Ωn={ω} of n observation units, this model well describes the distribution of masses X(ω)X as if their order of magnitude were able to deform the shape of the probability field I=[0,1] mainly on the right tail, where few observations far from the expected value weigh more than the majority on the left side fairly below the mean. The κ-generalized manages such quantities that behave strangely:Footnote 21 that is, cases where below a given threshold ξX the quantity follows the probabilistic principles of the exponential family, i.e., when Xωξ, while beyond it the quantity follows the probabilistic principles of the power-law family, that is, when Xω>ξ.

The superior capability of the κ-generalized model in describing positive-transferable quantities, which arise from countless repeated interactions among diverse individuals, such as exchanging portions of a given endowment like income or wealth, stems precisely from its relativistic nature. This characteristic underscores that the probabilistic principles governing such quantities vary across their spectrum of values. This differentiation, akin to a weak analogy with Einstein’s special relativity, underpins the κ-generalized model’s ability to shape the probability landscape significantly, particularly impacting concentration and inequality measures.

Conceptually, this can be illustrated as follows: within the observation units ωΩn, those where Xωξ exhibit bosonic behavior, akin to particles not subject to an exclusion principle. Here, many different individuals may possess similar levels of wealth within the range 0,ξX, implying XωEθE. Conversely, observations where Xω>ξ display fermionic behavior, adhering to an exclusion principle. In this regime, only a few individuals may attain similar levels of wealth within the interval ξ,+X. Although bosonic-like incomes are prevalent while fermionic-like ones are rare, the latter typically accumulate the majority of wealth compared to the former.

While it is commonly assumed that the scenario described above is typical, exceptions do exist. Indeed, it is a matter of record that a fermion-like recipient such as Elon Musk may possess more wealth than a boson-like individual, like many of us, and that there are generally more bosons than fermions. However, there are instances, be it in certain countries or at different times within the same country, where the rich-fermions exhibit greater heterogeneity than the poor-bosons, or vice versa.

When the fermions demonstrate greater heterogeneity in wealth compared to the poor-bosons, concentration on the affluent side of the distribution surpasses that on the impoverished side, denoted as Gr>Gp. This situation arises when there are more pathways to wealth than to poverty, characterizing the top-inequality scenario identifiable through the Zanardi index of inequality. Specifically, in cases where the asymmetry index of the Lorenz curve is positive, Z>0.

Conversely, in scenarios where the bosons are more heterogeneously impoverished than the rich-fermions are wealthy, concentration on the impoverished side of the distribution outweighs that on the affluent side, denoted as Gp>Gr. This circumstance occurs when there are more avenues to poverty than to wealth, delineating the bottom-inequality case discernible through the Zanardi index, which reveals a negative asymmetry, Z<0, of the associated Lorenz curve.

Considering this perspective, the Zanardi asymmetry index of the Lorenz curve emerges as a fitting measure of inequality for several reasons. Firstly, the Z index aligns with the econophysics interpretation of the κ-generalized probabilistic model, which capitalizes on the differentiated concentrations of bosons and fermions to effectively model distributions of positive-transferable quantities such as income and wealth. This alignment underscores its superiority over alternative models in capturing the nuances of wealth distribution.

Secondly, the Z index possesses the capability to identify distributional imbalances across both space and time, a feature unmatched by any other index, including the renowned Gini index, which is confined solely to assessing the overall concentration of the distribution.

Thirdly, the Z index remains robust for space-time comparisons, even in scenarios involving intersecting Lorenz curves. In contrast, traditional measures like the Gini index may yield inaccurate results in such cases, as they disregard the directional aspects of inequality stemming from imbalances in concentration between rich fermions and poor bosons.

In conclusion, this Element presents two significant analytical findings within the framework of econophysics theory, bolstered by empirical estimates drawn from a comprehensive and up-to-date database encompassing countries worldwide across various time periods.

Firstly, this Element establishes compelling evidence supporting the suitability of the κ-generalized model for fitting distributions of strange quantities. These quantities, representing positive-transferable assets, arise from the intricate interactions among heterogeneous agents within a complex economic system, exchanging portions of their endowments. Through empirical analysis, it becomes evident that the κ-generalized effectively captures the dynamics of such distributions, offering a robust framework for understanding the underlying mechanisms.

Secondly, this Element provides clear insights into the Zanardi asymmetry index of the Lorenz curve, demonstrating its efficacy as the most appropriate measure of inequality for strange quantities such as income and wealth. By offering a comprehensive interpretation, it underscores the importance of this index in accurately quantifying and assessing the disparities within these economic quantities.

In summary, these findings not only contribute to advancing our understanding of economic systems from a physics-inspired perspective but also offer practical insights into effectively modeling and measuring the distribution and inequality of vital economic resources.

Acknowledgments

The authors gratefully acknowledge Joe Stiglitz for fruitful discussions, comments, and critiques over the last five years of cooperation or so. Opinions and comments do not involve the responsibility of IRES Piemonte.

Series Editors

  • Rosario Nunzio Mantegna

  • University of Palermo

  • Rosario Nunzio Mantegna is Professor of Applied Physics at the University of Palermo and an external faculty member of the Complexity Science Hub in Vienna. He is one of the pioneers of econophysics and economic networks, and he co-authored the first book on the topic (‘Introduction to Econophysics’, Cambridge, 1999).

  • Bikas K. Chakrabarti

  • Saha Institute of Nuclear Physics

  • Bikas K. Chakrabarti is Emeritus Professor at Saha Institute of Nuclear Physics and visiting Professor of Economics in the Indian Statistical Institute, Kolkata. He has co-authored more than two hundred papers and ten books (including ‘Econophysics of Income & Wealth Distributions’, Cambridge, 2013). In 1995, he organized a conference in Kolkata, where the term "econophysics" was first coined.

  • Mauro Gallegati

  • Polytechnic University of Marche, Ancona

  • Mauro Gallegati is Professor of Economics at Polytechnic University of Marche, Ancona, Ancona. He has previously held visiting scholarships at Cambridge, Stanford, MIT, Columbia, the Santa Fe Institute, the Brookings Institution, and ETH Zurich. His research includes business fluctuations, nonlinear dynamics, models of financial fragility, and heterogeneous interacting agents.

  • Irena Vodenska

  • Boston University

  • Irena Vodenska is Professor and Director of Finance Programs at Boston University Metropolitan College. Her research is focused on network theory and complexity science in macroeconomics, particularly the modeling of early warning indicators and systemic risk propagation throughout interconnected financial and economic networks. She is a co-editor of the book ‘Econophysics and Sociophysics: Recent Progress and Future Directions’ (Springer, 2017)

About the Series

  • Econophysics is a dynamic research field at the interface of physics and economics, in which analytical and computational techniques from physics are employed to study the properties of complex economic, financial, and social systems. Elements in Econophysics explores recent developments within this multidisciplinary research area and covers key topics including: big data in econophysics, financial networks, income and wealth distributions, market microstructure, stylized agent-based models, and sociophysics.

Footnotes

1 For instance, refer to the zero-intelligence agents model by Yakovenko and Rosser (Reference Yakovenko and Rosser2009).

2 The Pietra–Ricci index represents the proportion of income needed for redistribution to achieve perfect equality, visually represented as the maximum vertical distance between the Lorenz curve and the 45-degree line. The index ranges from 0 to 1, where 0 represents perfect equality (all incomes are the same) and 1 represents perfect inequality (one person earns all the income).

3 “Symmetry” requires that, as income levels among individuals change, a society’s assessment of inequality remains unchanged. “Scale invariance” requires the inequality index to be invariant to equi-proportional changes of the original incomes. Finally, the “transfer principle” requires the inequality measure to change when income transfers occur among individuals in the income distribution – in particular, inequality indices should fall with a progressive transfer, that is, an income transfer from richer to poorer individuals, and rise with a regressive transfer, that is, an income transfer from poorer to richer individuals.

4 If a rightward asymmetry of the Lorenz curve is detected, it indicates that the concentration on the rich side of the distribution is greater than that on the poor side. Conversely, if we observe a leftward asymmetry, it indicates a greater concentration on the poor side.

5 To preserve confidentiality, access to microdata is conducted remotely, meaning that the program code is sent to LIS rather than being run directly by the user on the data. All code for processing LIS data has been developed using R, the open-source statistical software by R Core Team (2024), and is made available as supplementary material accompanying this Element.

6 The LIS equivalence scale (square root of the number of household members) has been used. Before equivalization, top and bottom coding is applied by setting boundaries for extreme values of log-transformed disposable household income at the 75th percentile (Q3) plus 3 times the interquartile range (Q3−Q1), and at the 25th percentile (Q1) minus 3 times the interquartile range. To ensure representativeness, our methodology employs person-level adjusted weights.

7 The country datasets that are publicly accessible can be found listed on the LIS website at the following URL: www.lisdatacenter.org/our-data/lis-database/. The results presented in this section are the outcome of analyses conducted prior to the update of the LIS Database in March 2024, which included an additional number of datasets into the database. The subsequent Section 4 will refer to this updated version of the LIS Database for the results of the analyses presented therein.

8 Each index offers a different perspective on inequality, with varying sensitivities to different parts of the income distribution and different societal preferences regarding inequality aversion. The MLD gives more weight to deviations of incomes at the lower end of the distribution, making it sensitive to changes in the bottom part of the income distribution. The Theil index is often used in socioeconomic contexts to understand both overall inequality and how it is distributed across different subgroups, while the Atkinson index penalizes high incomes more heavily than other measures like the Gini coefficient, making it sensitive to changes in the top end of the income distribution.

9 It is important to point out once again that the function expκx arises naturally within the theory of special relativity. The recently published review paper by Kaniadakis (Reference Kaniadakis2024) on κ-statistical mechanics focuses on its foundations and collects the most important applications appeared in the literature in this field in recent years. The self-duality property of the κ-exponential, that is, expκ−xexpκx=1, which is identical to that of the ordinary exponential, makes expκ−x a special deformation of the ordinary exponential. By sacrificing the self-duality property, it is possible to define other deformations of the exponential function. The best known in the literature is that of expq−x that leads to the non-extensive statistical mechanics that has found important applications in econophysics (e.g. Ribeiro, Reference Ribeiro2020, and references within). Recall that given two arbitrary functions f−x and g−x, it is always possible to correlate them so that f−x=gT−x. If the transformation T−x is particularly simple, one can imagine a possible relationship between the theoretical models based on the functions f−x and g−x. The transformation T−x that connects the functions expκ−x and expq−x is already known (Kaniadakis, Reference Kaniadakis2001) and has no physical relevance. Therefore, κ-statistical mechanics, which arises in the framework of special relativity, and q-statistical mechanics, which describes the so-called non-extensive phenomenology, are different theories, even though both predict statistical distributions that tend asymptotically to the Pareto law. Rather, it is very interesting that the transformations T−x connecting expq−x to the Burr-type XII or Sing-Maddala distribution and to the Dagun distribution are particularly simple, as they are a reparametrization or a power-law transformation. This emphasizes the very close connection between the above statistical distributions, which have been used extensively in econophysics.

10 Econophysics has significantly contributed to the study of income and wealth distributions by adopting a data-driven approach inspired by statistical physics, diverging from the traditional axiomatic methods of economics. This interdisciplinary effort introduced new models to describe income and wealth distributions. Among these, Chami Figueira, Moura, and Ribeiro (Reference Chami Figueira, Moura and Ribeiro2011) proposed a comprehensive model combining the Gompertz curve and Pareto’s law to fit the entire income distribution. The Gompertz curve represents the majority of lower-income individuals, while the Pareto law captures the wealthiest segment, with an exponential approximation for middle incomes. Despite these advances, finding a universally accepted single function remains an open challenge.

11 From now on, the subscript κ will be dropped in formulas unless it is essential for clarity. For a comprehensive understanding of the properties of the κ-generalized distribution, readers can refer to Clementi and Gallegati (Reference Clementi and Gallegati2016) and the additional sources cited therein. A heuristic derivation of the κ-generalized density, showcasing its natural emergence within the framework of κ-deformed analysis, is presented in Clementi et al. (Reference Clementi, Gallegati, Kaniadakis and Landini2016) and Landini (Reference Landini, Clementi and Gallegati2016).

12 The Weibull distribution, primarily studied in engineering literature, is known as the stretched exponential distribution in physics when α<1. Although sporadic, there have been instances of its use in economics for modeling income data. Some applications can be found in the works of Atoda, Suruga, and Tachibanaki (Reference Atoda, Suruga and Tachibanaki1988), Bartels (1977), Bartels and van Metelen (Reference Bartels and van Metelen1975), Bordley, McDonald, and Mantrala (Reference Bordley, McDonald and Mantrala1996), Brachmann, Stich, and Trede (Reference Brachmann, Stich and Trede1996), Espinguet and Terraza (Reference Espinguet and Terraza1983), McDonald (1984), and Tachibanaki, Suruga, and Atoda (Reference Tachibanaki, Suruga and Atoda1997).

13 Alternative formulations of the Pareto law were proposed by Kakwani (Reference Kakwani1980), expressed as limx→+∞xfx1−Fx=a, and by Esteban (Reference Esteban1986), stated as limx→+∞1+xf′xfx=−a. Given that

limx→+∞xfx;α,β,κ1−Fx;α,β,κ=ακ=aandlimx→+∞1+xf′x;α,β,κfx;α,β,κ=−ακ=−a,

it follows that the κ-generalized distribution also adheres to these alternate versions of the weak Pareto law.

14 Equation (3.32) encounters limitations when θ=0 and θ=1 due to the expression θ2−θ becoming zero in both cases. To address this issue, we resort to l’Hôpital’s rule, which facilitates the evaluation of limits of indeterminate forms through derivatives. Expressions for these specific values of θ are thus derived using this rule. For any GEθ index other than the cases θ=0,1, straightforward derivation through substitution is feasible. For further elucidation on this matter, refer to, for instance, Clementi and Gallegati (Reference Clementi and Gallegati2016).

15 All the measures discussed here depend on distributional moments, the existence of which relies on conditions ensuring the convergence of relevant integrals. Specifically, the Gini coefficient (3.31) is valid if and only if the distribution’s mean x=∫0∞xfx;α,β,κdx converges, a requirement fulfilled only when ακ>1. As noted by Kleiber (Reference Kleiber1997), parametric income distribution models face similar challenges of existence as those encountered by popular inequality measures.

16 Here, we consider the entire time span for which data for the four countries under consideration are available in the LIS Database.

17 The full set of LIS Inequality and Poverty Key Figures is accessible in an Excel workbook, which can be downloaded from www.lisdatacenter.org/data-access/key-figures/.

18 In the figures, the vertical bars denote symmetric 95 percent normal-approximation confidence intervals for the empirical values of the statistics, calculated using the bootstrap resampling method with 999 replications.

19 Figure 4.7, which considers the same countries and time intervals as the analysis presented in Section 2.4.1, does not display the empirical estimates of the Zanardi index to avoid overcrowding the graphs. Zanardi index estimates derived from the κ-generalized distribution were obtained through numerical approximation of areas under the Lorenz curve expressed in the Gini coordinate system (see e.g. Clementi et al., Reference Clementi, Gallegati, Gianmoena, Landini and Stiglitz2019, and references therein).

20 Let X be a random variable following the generalized beta distribution of the second kind (GB2) with parameters a, b, p, and q, denoted as X∼GB2a,b,p,q. The Singh–Maddala distribution arises as a special case of the GB2 distribution when p=1, while the Dagum type I distribution emerges when q=1. For further insights into other special cases, refer to Kleiber and Kotz (Reference Kleiber and Kotz2003) and McDonald (Reference McDonald1984).

21 For the mathematics of strange quantities, see Landini (Reference Landini, Clementi and Gallegati2016).

References

Allison, P. D. (1978). Measures of inequality. American Sociological Review, 43, 865880.CrossRefGoogle Scholar
Arnold, B. C., & Laguna, L. (1977). On Generalized Pareto Distributions with Applications to Income Data. Ames: Iowa State University Press.Google Scholar
Atkinson, A. B. (1970). On the measurement of inequality. Journal of Economic Theory, 2, 244263.CrossRefGoogle Scholar
Atkinson, A. B. (2015). Inequality: What Can Be Done? Cambridge, MA: Harvard University Press.CrossRefGoogle Scholar
Atkinson, A. B., & Piketty, T. (Eds.). (2007). Top Incomes over the Twentieth Century: A Contrast between European and English-Speaking Countries. Oxford: Oxford University Press.CrossRefGoogle Scholar
Atoda, N., Suruga, T., & Tachibanaki, T. (1988). Statistical inference of functional forms for income distribution. Economic Studies Quarterly, 39, 1440.Google Scholar
Bartels, C. P. A. (1977). Economic Aspects of Regional Welfare: Income Distribution and Unemployment. Leiden: Martinus Nijhoff.CrossRefGoogle Scholar
Bartels, C. P. A., & van Metelen, H. (1975). Alternative Probability Density Functions of Income: A Comparison of the Longnormal-, Gamma- and Weibull-Distribution with Dutch Data (Research Memorandum No. 29). Amsterdam: Department of Quantitative Studies, Faculty of Economics, Vrije Universiteit.Google Scholar
Bordley, R. F., McDonald, J. B., & Mantrala, A. (1996). Something new, something old: Parametric models for the size distribution of income. Journal of Income Distribution, 6, 91103.Google Scholar
Brachmann, K., Stich, A., & Trede, M. (1996). Evaluating parametric income distribution models. Allgemeines Statistisches Archiv, 80, 285298.Google Scholar
Chami Figueira, F., Moura, N. J., & Ribeiro, M. B. (2011). The Gompertz-Pareto income distribution. Physica A: Statistical Mechanics and Its Applications, 390, 689698.CrossRefGoogle Scholar
Chotikapanich, D., Griffiths, W., Hajargasht, G., Karunarathne, W., & Rao, D. (2018). Using the GB2 income distribution. Econometrics, 6, 21.CrossRefGoogle Scholar
Clementi, F. (2023). The Kaniadakis distribution for the analysis of income and wealth data. Entropy, 25, 1141.CrossRefGoogle ScholarPubMed
Clementi, F., Di Matteo, T., Gallegati, M., & Kaniadakis, G. (2008). The κ-generalized distribution: A new descriptive model for the size distribution of incomes. Physica A: Statistical Mechanics and Its Applications, 387, 32013208.CrossRefGoogle Scholar
Clementi, F., & Gallegati, M. (2016). The Distribution of Income and Wealth: Parametric Modeling with the κ-Generalized Family. Cham: Springer International Publishing AG.CrossRefGoogle Scholar
Clementi, F., & Gallegati, M. (2017). New economic windows on income and wealth: The κ-generalized family of distributions. Journal of Social and Economic Statistics, 6, 115.Google Scholar
Clementi, F., Gallegati, M., Gianmoena, L., Landini, S., & Stiglitz, J. E. (2019). Mis-measurement of inequality: A critical reflection and new insights. Journal of Economic Interaction and Coordination, 14, 891921.CrossRefGoogle Scholar
Clementi, F., Gallegati, M., & Kaniadakis, G. (2007). κ-generalized statistics in personal income distribution. The European Physical Journal B, 57, 187193.CrossRefGoogle Scholar
Clementi, F., Gallegati, M., & Kaniadakis, G. (2009). A κ-generalized statistical mechanics approach to income analysis. Journal of Statistical Mechanics: Theory and Experiment, 2009, P02037.CrossRefGoogle Scholar
Clementi, F., Gallegati, M., & Kaniadakis, G. (2010). A model of personal income distribution with application to Italian data. Empirical Economics, 39, 559591.CrossRefGoogle Scholar
Clementi, F., Gallegati, M., & Kaniadakis, G. (2012a). A new model of income distribution: The κ-generalized distribution. Journal of Economics, 105, 6391.CrossRefGoogle Scholar
Clementi, F., Gallegati, M., & Kaniadakis, G. (2012b). A generalized statistical model for the size distribution of wealth. Journal of Statistical Mechanics: Theory and Experiment, 2012, P12006.CrossRefGoogle Scholar
Clementi, F., Gallegati, M., Kaniadakis, G., & Landini, S. (2016). κ-generalized models of income and wealth distributions: A survey. The European Physical Journal Special Topics, 225(10), 19591984.CrossRefGoogle Scholar
Clementi, F., & Gianmoena, L. (2017). Modeling the joint distribution of income and consumption in Italy. In Gallegati, M., Palestrini, A., & Russo, A. (Eds.), Introduction to Agent-Based Economics (pp. 191228). Cambridge, MA: Academic Press.CrossRefGoogle Scholar
Cowell, F. A. (1980a). Generalized entropy and the measurement of distributional change. European Economic Review, 13, 147159.CrossRefGoogle Scholar
Cowell, F. A. (1980b). On the structure of additive inequality measures. Review of Economic Studies, 47, 521531.CrossRefGoogle Scholar
Cowell, F. A. (2011). Measuring Inequality. New York: Oxford University Press.CrossRefGoogle Scholar
Cowell, F. A., & Kuga, K. (1981a). Additivity and the entropy concept: An axiomatic approach to inequality measurement. Journal of Economic Theory, 25, 131143.CrossRefGoogle Scholar
Cowell, F. A., & Kuga, K. (1981b). Inequality measurement: An axiomatic approach. European Economic Review, 15, 287305.CrossRefGoogle Scholar
Dagum, C. (1977). A new model of personal income distribution: Specification and estimation. Economie Appliquée, 30, 413436.CrossRefGoogle Scholar
Drăgulescu, A., & Yakovenko, V. M. (2001). Evidence for the exponential distribution of income in the USA. The European Physical Journal B, 20, 585589.CrossRefGoogle Scholar
Espinguet, P., & Terraza, M. (1983). Essai d’extrapolation des distributions de salaires français. Economie Appliquée, 36, 535561.CrossRefGoogle Scholar
Esteban, J. M. (1986). Income-share elasticity and the size distribution of income. International Economic Review, 27, 439444.CrossRefGoogle Scholar
Gallegati, M., Landini, S., & Stiglitz, J. E. (2016). The Inequality Multiplier (Research Paper No. 16–29). New York: Columbia Business School. (https://ssrn.com/abstract=2766301).CrossRefGoogle Scholar
Gastwirth, J. L. (1971). A general definition of the Lorenz curve. Econometrica, 39, 10371039.CrossRefGoogle Scholar
Ghosh, J. K. (1994). Higher Order Asymptotics. Hayward, CA: Institute of Mathematical Statistics and American Statistical Association.CrossRefGoogle Scholar
Gini, C. (1914). Sulla misura della concentrazione e della variabilità dei caratteri. Atti del Regio Istituto Veneto di Scienze, Lettere ed Arti, 73, 12011248.Google Scholar
Hristopulos, D. T., & Baxevani, A. (2022). Kaniadakis functions beyond statistical mechanics: Weakest-link scaling, power-law tails, and modified lognormal distribution. Entropy, 24, 1362.CrossRefGoogle ScholarPubMed
Hristopulos, D. T., Petrakis, M. P., & Kaniadakis, G. (2014). Finite-size effects on return interval distributions for weakest-link-scaling systems. Physical Review E, 89, 052142.CrossRefGoogle ScholarPubMed
Hristopulos, D. T., Petrakis, M. P., & Kaniadakis, G. (2015). Weakest-link scaling and extreme events in finite-sized systems. Entropy, 17, 11031122.CrossRefGoogle Scholar
Jenkins, S. P. (2009). Distributionally-sensitive inequality indices and the GB2 income distribution. Review of Income and Wealth, 55, 392398.CrossRefGoogle Scholar
Kakwani, N. (1980). Income Inequality and Poverty: Methods of Estimation and Policy Applications. New York: Oxford University Press.Google Scholar
Kaniadakis, G. (2001). Non-linear kinetics underlying generalized statistics. Physica A: Statistical Mechanics and Its Applications, 296, 405425.CrossRefGoogle Scholar
Kaniadakis, G. (2002). Statistical mechanics in the context of special relativity. Physical Review E, 66, 056125.CrossRefGoogle ScholarPubMed
Kaniadakis, G. (2005). Statistical mechanics in the context of special relativity. II. Physical Review E, 72, 036108.CrossRefGoogle ScholarPubMed
Kaniadakis, G. (2009a). Maximum entropy principle and power-law tailed distributions. The European Physical Journal B, 70, 313.CrossRefGoogle Scholar
Kaniadakis, G. (2009b). Relativistic entropy and related Boltzmann kinetics. The European Physical Journal A, 40(3), 275287.CrossRefGoogle Scholar
Kaniadakis, G. (2013). Theoretical foundations and mathematical formalism of the power-law tailed statistical distributions. Entropy, 15, 39834010.CrossRefGoogle Scholar
Kaniadakis, G. (2021). New power-law tailed distributions emerging in κ-statistics. Europhysics Letters, 133, 10002.CrossRefGoogle Scholar
Kaniadakis, G. (2024). Relativistic roots of κ-entropy. Entropy, 26, 406.CrossRefGoogle ScholarPubMed
Kleiber, C. (1997). The existence of population inequality measures. Economics Letters, 57, 3944.CrossRefGoogle Scholar
Kleiber, C., & Kotz, S. (2003). Statistical Size Distributions in Economics and Actuarial Sciences. New York: John Wiley & Sons.CrossRefGoogle Scholar
Landini, S. (2016). Mathematics of strange quantities: Why are κ-generalized models a good fit to income and wealth distributions? An explanation. In Clementi, F. & Gallegati, M., The Distribution of Income and Wealth: Parametric Modeling with the κ-Generalized Family (pp. 93132). Cham: Springer International Publishing AG.Google Scholar
LIS Inequality and Poverty Key Figures. (April 15, 2024). https://www.lisdatacenter.org/data-access/key-figures/. Luxembourg: LIS.Google Scholar
Lorenz, M. O. (1905). Methods of measuring the concentration of wealth. Publications of the American Statistical Association, 9, 209219.CrossRefGoogle Scholar
Luxembourg Income Study (LIS) Database. (Multiple countries; February 2024 – March 2024). https://www.lisdatacenter.org/. Luxembourg: LIS.Google Scholar
Mandelbrot, B. (1960). The Pareto-Lévy law and the distribution of income. International Economic Review, 1, 79106.CrossRefGoogle Scholar
McDonald, J. B. (1984). Some generalized functions for the size distribution of income. Econometrica, 52, 647665.CrossRefGoogle Scholar
Okamoto, M. (2013). Extension of the κ-Generalized Distribution: New Four-Parameter Models for the Size Distribution of Income and Consumption (Working Paper No. 600). Luxembourg: LIS Cross-National Data Center. (www.lisdatacenter.org/wps/liswps/600.pdf).Google Scholar
Pareto, V. (1895). La legge della domanda. Giornale degli Economisti, 10, 5968.Google Scholar
Pareto, V. (1896). La courbe de la répartition de la richesse. (Reprinted in Busino, G. (Ed.). (1965). Œeuvres complètes de Vilfredo Pareto, Tome 3: Écrits sur la courbe de la répartition de la richesse (pp. 115). Geneva: Librairie Droz).Google Scholar
Pareto, V. (1897a). Cours d’économie politique. London: Macmillan.Google Scholar
Pareto, V. (1897b). Aggiunta allo studio della curva delle entrate. Giornale degli Economisti, 14, 1526.Google Scholar
Park, J., Kim, Y., & Ju, A.-J. (2021). Measuring income inequality based on unequally distributed income. Journal of Economic Interaction and Coordination, 16, 309322.CrossRefGoogle Scholar
Pietra, G. (1915). Delle relazioni tra gli indici di variabilità: Nota I. Atti del Regio Istituto Veneto di Scienze, Lettere ed Arti, 74, 775792.Google Scholar
Piketty, T. (2014). Capital in the Twenty-First Century. Cambridge, MA: The Belknap Press of Harvard University Press.CrossRefGoogle ScholarPubMed
R Core Team. (2024). R: A Language and Environment for Statistical Computing [Computer software manual]. Vienna. (www.R-project.org/).Google Scholar
Rao, C. (1973). Linear Statistical Inference and Its Applications. New York: John Wiley & Sons.CrossRefGoogle Scholar
Ribeiro, M. B. (2020). Income Distribution Dynamics of Economic Systems: An Econophysical Approach. Cambridge: Cambridge University Press.CrossRefGoogle Scholar
Ricci, U. (1916). L’indice di variabilità e la curva dei redditi. Giornale degli Economisti e Rivista di Statistica, 53, 177228.Google Scholar
Sarabia, J. M., Jordá, V., & Remuzgo, L. (2017). The Theil indices in parametric families of income distributions – a short review. Review of Income and Wealth, 63, 867880.CrossRefGoogle Scholar
Shorrocks, A. F. (1980). The class of additively decomposable inequality measures. Econometrica, 48, 613625.CrossRefGoogle Scholar
Singh, S. K., & Maddala, G. S. (1976). A function for size distribution of incomes. Econometrica, 44, 963970.CrossRefGoogle Scholar
Stiglitz, J. E. (2012). The Price of Inequality: How Today’s Divided Society Endangers Our Future. New York: W. W. Norton.Google Scholar
Stiglitz, J. E. (2015). The Great Divide: Unequal Societies and What We Can Do About Them. New York: W. W. Norton.Google Scholar
Tachibanaki, T., Suruga, T., & Atoda, N. (1997). Estimations of income distribution parameters for individual observations by maximum likelihood method. Journal of the Japan Statistical Society, 27, 191203.CrossRefGoogle Scholar
Tarsitano, A. (1987). Fitting the Lorenz curve. Statistica, 47, 437451.Google Scholar
Tarsitano, A. (1988). Measuring asymmetry of the Lorenz curve. Ricerche Economiche, 42, 507519.Google Scholar
Theil, H. (1967). Economics and Information Theory. Amsterdam: North-Holland.Google Scholar
Yakovenko, V. M., & Rosser, J. B. (2009). Colloquium: Statistical mechanics of money, wealth, and income. Reviews of Modern Physics, 81, 17031725.CrossRefGoogle Scholar
Zanardi, G. (1964). Della asimmetria condizionata delle curve di concentrazione. Lo scentramento. Rivista Italiana di Economia, Demografia e Statistica, 18, 431466.Google Scholar
Zanardi, G. (1965). L’asimmetria statistica delle curve di concentrazione. Ricerche Economiche, 19, 355396.Google Scholar
Figure 0

Figure 2.1(a) Two Lorenz curves with the same Gini index and opposite asymmetry

Figure 1

Figure 2.1(b) the characteristic point, discriminant point, and critical point of the Lorenz curve

Figure 2

Figure 2.2 Relationship between concentration and entropy indexes of inequality (×1,000)

Figure 3

Figure 2.3 Overall Gini index, G, together with the concentration on the rich, Gr, and the poor, Gp, sides (×100)

Figure 4

Figure 2.4 Overall Gini index, G, together with the Zanardi index, Z

Figure 5

Figure 2.5(a) Overall Gini index against the Pietra–Ricci index

Figure 6

Figure 2.5(b) the Zanardi index against the Pietra–Ricci

Figure 7

Figure 2.6

Figure 8

Figure 2.6

Figure 9

Figure 2.6

Figure 10

Figure 2.7

Figure 11

Figure 2.7

Figure 12

Figure 2.8

Figure 13

Figure 2.8

Figure 14

Figure 2.9

Figure 15

Figure 2.9

Figure 16

Figure 2.9

Figure 17

Figure 3.1

Figure 18

Figure 3.1

Figure 19

Figure 3.1

Figure 20

Figure 3.2 Lorenz curves for two κ-generalized distributions

Figure 21

Figure 4.1

Figure 22

Figure 4.1

Figure 23

Figure 4.1

Figure 24

Figure 4.1

Figure 25

Figure 4.2

Figure 26

Figure 4.2

Figure 27

Figure 4.2

Figure 28

Figure 4.2

Figure 29

Figure 4.3

Figure 30

Figure 4.3

Figure 31

Figure 4.3

Figure 32

Figure 4.3

Figure 33

Figure 4.4

Figure 34

Figure 4.4

Figure 35

Figure 4.4

Figure 36

Figure 4.4

Figure 37

Table 4.1 Estimated κ-generalized parameters for selected LIS country datasets

Figure 38

Figure 4.5

Figure 39

Figure 4.5

Figure 40

Figure 4.5

Figure 41

Figure 4.5

Figure 42

Figure 4.6

Figure 43

Figure 4.6

Figure 44

Figure 4.6

Figure 45

Figure 4.6

Figure 46

Table 4.2 Correlation between selected distributional statistics computed from the data and the corresponding estimates implied by the fitted κ-generalized model

Figure 47

Table 4.3 Correlation between the implied and sample estimates of the Zanardi asymmetry index

Figure 48

Figure 4.7

Figure 49

Figure 4.7

Figure 50

Figure 4.7

Figure 51

Figure 4.7

Figure 52

Figure 4.8 Percentage of occurrences in which the Singh–Maddala, Dagum type I and κ-generalized models achieve the lowest absolute percentage error between predicted values for key distributional summary measures and their sample counterparts

Save element to Kindle

To save this element to your Kindle, first ensure no-reply@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

The κ-Statistics Approach to Income Distribution Analysis
Available formats
×

Save element to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

The κ-Statistics Approach to Income Distribution Analysis
Available formats
×

Save element to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

The κ-Statistics Approach to Income Distribution Analysis
Available formats
×