MathematicsThe formula[2] is where xi is the income of the ith person, The Theil index is derived from Shannon's measure of information entropy. Letting T be the Theil index and S be Shannon's information entropy measure, Shannon derived his entropy measure in terms of the probability of an event occurring. This can be interpreted in the Theil index as the probability a dollar drawn at random from the population came from a specific individual. This is the same as the first term, the individual's share of aggregate income. With reference to information theory[3], Theil's measure is a redundancy rather than an entropy. The redundancy of a system at a given time is the difference between its maximum entropy and its present entropy at that time.[4] Application of the Theil indexTheil's index takes an equal distribution for reference which is similar to distributions in statistical physics. An index for an actual system is an actual redundancy, that is, the difference between maximum entropy and actual entropy of that system. Theil's measure can be converted[4] into one of the indexes of Anthony Barnes Atkinson. The result of the conversion also is called normalized Theil index[5]. James E. Foster[6] used such a measure to replace the Gini coefficient in Amartya Sen's welfare function W=f(income,inequality). The income e.g. is the average income for individuals in a group of income earners. Thus, Foster's welfare function can be computed directly from the Theil index T, if the conversion is included into the computation of the average per capita welfare function: Here the "Theil-L" index should be used. The difference to the "Theil-T" index will be described later. Theil index and Hoover index
For the income distributions provided by the The World Income Inequality Database (2007-05)[7] the difference between their symmetrized Theil indices and their Hoover indices are plotted over their respective Gini indices. The difference illustrates the impact of the different inequalities on the information generated by them. Negative values occur for Theil indices, which are smaller than the respective Hoover indices.
For the following formulas, a notation[8] is used, where the amount N of quantiles only appears as upper border of summations. Thus, inequities can be computed for quantiles with different widths Ai. For example, Ei could be the income in the quantile #i and Ai could be the amount (absolute or relative) of earners in the quantile #i. Etotal then would be the sum of incomes of all N quantiles and Atotal would be the sum of the income earners in all N quantiles. Theil indexComputation of the (asymmetric) Theil index T [9]: A first variant of the Theil index refers to E as a base. With normalized data, E'i = Ei / Etotal and A'i = Ai / Atotal would apply. This would simplify the formula: The second variant of the Theil index refers to A as a base[10]. With normalized data, A'i = Ai / Atotal and E'i = Ei / Etotal would apply. This would simplify the formula: Computation of the symmetrized Theil index Ts: This leads to: Hoover indexThe formula for the Hoover index (also called Robin Hood index) H is: Difference between both indicesThe difference between the Hoover index and the symmetrized Theil index only is the operation in the deviation from equity Ei / Etotal − Ai / Atotal. A comparison of the Hoover index and the Theil index gives sense to both indices:
Pareto principleUnderstanding the range of the Theil indexThe property of not being a measure with a closed scale between 0 and 1 (or 0% and 100%), like in case of the Gini index, is a barrier, which to overcome seems to be difficult even for famous scientists: Theil's index "is not a measure that is exactly overflowing with intuitive sense," wrote Amartya Sen in a book[6], in which his co-author James Foster used the Theil index nevertheless. One way to overcome this obstacle is the normalized[5] Theil index Tnormalized = 1 − e − T. The alternative is, not to normalize the index and to use it as it is due to an interesting property of that index: For resource distributions described by only two quantiles, the Theil index is 0 for 50:50 distributions and reaches 1 at 82:18[11], which is very close to a distribution often referred to as "Pareto Principle". Higher inequities yield Theil indices above 1. This leads to a comparison, which yields to intuition:
Illustration of the relation between Theil index T and the Hoover index H for societies devides into two quantiles, where a share of A dollars is assigned to a share of B people and a share of B dollars is assigned to a share of A people, with A+B=1 (e.g. “Pareto Principle” with A=0.8 and B=0.2). For societies grouped in such a manner, the Theil index, the Theil index with swapped parameters and the symmetrized Theil index are equal to each other. Also the Gini Index G is equal to the Hoover Index H.
Computing the Theil index from an A:B distributionA Theil index T can be found for any A:B distribution in societies, which are split into two quantiles. The height A of the 1st quantile is the height B of the 2nd quantile. The width B of the 1st quantile is the width B of the 2nd quantile. First the Gini index G (which in this case is similar to the Hoover index) is calculated from the A:B distribution (the range of the variables is 0 to 1 instead of 0% to 100%): Then Reverse computationThe reverse computation is a recursion:
DecomposabilityOne of the advantages of the Theil index is that it is a weighted average of inequality within subgroups, plus inequality among those subgroups. For example, inequality within the United States is the average inequality within each state, weighted by state income, plus the inequality among states. If for the Theil-T index the population is divided into m certain subgroups and si is the income share of group i, TTi is the Theil-T index for that subgroup, and The formula for the Theil-L index is:
Welfare functionAmartya Sen proposed to use the Gini Index to compute a welfare function which would yield the per capita income earned by anyone who is randomly selected from a population within which the total income is distributed inequally: Later James E. Foster proposed as co author in the second edition of Amartya Sen's On Economic Inequality[12] written together with Amartya Sen to use one of the entropy inequality measures from Atkinson. Due to the relation between that measure and the Theil index, Fosters proposel can be implemented by this formula: The same welfare function can be computed from the right term of the Theil-L formula: (As the Theil index is decomposavle, in this formula as well as in the following formulas Theil indices also can be specified for the individual groups. But usually that index is not known. In that case its value is zero.) For the Welfare function, the Theil-L index is used. It yields an per capita income which is close to the lower end of middle class incomes. The inverse value of a welfare function computed with the Theil-T index yields an income, which is close to the upper end of middle class incomes: Example: The average monthly per capita income before taxes in Germany (2001)[13] was 2800€. A welfare function with a Theil-L index of 0.578 yields 1570€ per month. Using a Theil-T index of 0.520, the inverse value of the monthly welfare function was 4700€. In comparison, tarif agreements between the labor union and the employers of the electrical and metal industry in Bavaria cover the salary range between 1649€ und 4000€</ref>. This example does not use welfare functions to define the bounds of middle class incomes. It just puts the welfare functions into relation to real world incomes.
See also
External links
References
| |