Taxation under Autocracy: Theory and Evidence from Late Imperial

Taxation under Autocracy:
Theory and Evidence from Late Imperial China
Qiang Chen
School of Economics, Shandong University
Jinan, China
[email protected]
Yijiang Wang
CK Graduate School of Business
Beijing, China
[email protected]
Chun-Lei Yang
RCHSS, Academia Sinica
Taipei, Taiwan
[email protected]
Abstract: We model a game to show that the taxation level in an autocracy reflects
the state’s coercive power relative to people’s capacity for violence. The model also
specifies the mechanisms through which various factors affect relative state power.
The model predicts that taxation level increases with state coercion level, efficiency of
coercion technology, cost of rebelling, and likely labor incomes. Data from late
imperial China are used to test the hypotheses, and the findings are entirely consistent
with the predictions of the theory.
JEL: D74; H20; N45; O11
Key words: Taxation; Autocracy; State power; Rebellion; Imperial China

Qiang Chen is grateful for financial support from Shandong University Independent Innovation
Fund (FW12043), and Shandong University Humanities and Social Sciences Major Research
Project “Western Cliometrics and Its Applications in China” (12RWZD12). Chun-lei Yang
appreciates funding by National Science Council of Taiwan (NSC 101-2410-H-001-001-MY2).
1 1. Introduction
Taxation impacts the welfare of billions of people living in autocracies. It is a core
issue of the political economics of development.1 In this paper, we study a question
of great theoretical and practical importance: What determines the taxation level in an
autocracy?
To answer the question, we first model a game between a ruler and people who
pay taxes on their work incomes. We then use a unique set of data from late imperial
China to test the hypotheses derived from the model. The model shows that taxation
level is determined jointly with relative state power, i.e., the balance between state
coercion and the capacity of the people for violent tax rebellions. The model identifies
factors affecting relative state power and specifies a mechanism through which they
determine taxation level, generating testable hypotheses. The empirical findings are
entirely consistent with the predictions of the theory.
There is no doubt that taxation in an autocracy is determined by complex factors.
Any effort to explain it must confront two challenges: it must identify at least the
main factors; and it must specify the mechanisms through which they affect taxation.
Our model accomplishes the task by capturing the essence of taxation: it is a process
by which the state uses coercion to extract revenue from society, and people can either
pay or rebel against taxes. In our model the net-revenue maximizing ruler can
strategically spend resources to enhance state coercive power (as in Besley and
1
North et al. (2009) refer to non-democracies as “natural states” emphasizing that they are the
dominant political system of the world, both historically and contemporarily. Acemoglu and
Robinson (2006) offer an excellent study of distributive politics centering on taxation.
2 Persson, 2011a, b). The spending is a cost that needs to be justified by tax revenue.
Our analysis shed lights on the interplay between state power and taxation, based on
the cost-and-benefit calculus of the ruler’s spending on state coercion.
Specifically, we find that the upper limit of taxation is a function of the state’s
coercive power, i.e., how much a state can maximally tax depends on how effectively
it can deter people’s potential for violent tax rebellions. It follows that any factor
affecting relative state power also affects taxation. Among these factors are violence
factors such as the odds of a rebellion’s success, coercion technology and costs of
violence; and income factors such as wages and the ruler’s benefits from non-tax
sources.
This work belongs to the political economics of development and benefits from
many previous works. The model is adapted from Wang (2012), which builds on the
landmark contributions of Acemoglu and Robinson (2006), North et al. (2009) and
Besley and Persson (2011a, b) on distributive politics. Balance in violence is central
in Acemoglu and Robinson (2006) to the transition to democracy, and in Besley and
Persson’s (2011b) on the clustering of institutions. It is also central in Barzel (2002),
North et al. (2009) and Acemoglu et al. (2013) to the relationship among the elites,
and in Aoki (2014) to a three-party taxation game with a middleman between the ruler
and the peasants as taxpayers. The main contribution of the present work to the
literature is twofold: it adopts a model of relative state power to explain taxation
levels in an autocracy and derives testable hypotheses from the model; and it uses a
unique data set to test the hypotheses.
3 This work is also closely related to those studying the predatory behaviors of the
state (see Shleifer and Vishny 2008, North et al. 2009, and Wang 2012, for reviews).
Our contribution to this literature is to observe that violence is a two-way threat, and
taxpayers’ capacity for violence sets a limit to taxation. We show how the threat of
violence changes the ruler’s behavior. The logic is exactly the same as that of entry
games studied by Milgrom and Roberts (1982).
The empirical part of our work, in which we use a unique data set from late
imperial China to test the hypotheses derived from the model, is closely related to Sng
(2014), who explains the regional variations in land taxation in late imperial China
through the lens of agency cost and predicts lower taxes in prefectures more distant
from Beijing. In contrast, our theory predicts lower taxes in prefectures where state
coercion is less effective, e.g., prefectures that are more distant from the state’s major
military bases. It is also worth emphasizing that, although both works explain
variations in taxation levels across prefectures within an autocracy, our theory can
also apply to international variations, i.e., it can explain variations in taxation levels
across countries. Our empirical findings also cast doubt on Fukuyama (2011)'s
contention that emperors in imperial China may best be characterized as benevolent
dictators rather than as stationary bandits who set tax rates optimally to maximize
extraction of revenue (Olson, 2000)2.
The remainder of this paper is organized as follows. Section 2 introduces a simple
2
It is interesting to note that Olson (2000) has the Chinese warlords in the 1920s as his lead
example for the notion of “stationary bandits”.
4 game-theoretic model where the levels of state spending for coercion and taxation are
jointly determined in a sub-game perfect equilibrium.
Section 3 describes the data from 261 prefectures in late imperial China circa
1820, and focuses on the determinants of per capita land tax in each prefecture. We
observe a lot of variations in per capita land taxes across prefectures3. Among
explanatory variables, we include army size and postal distance to Beijing as proxies
for state coercion and agency costs respectively, various measures of allegiance,
cropland per capita, population density, agricultural suitability, as well as a rich set of
geographic and climate controls. In fact, the choice of historical China as an
interesting case study is largely driven by this unusual data availability.
Section 4 presents empirical results, which are consistent with the theory of
relative state power. The empirical strategy exploits exogenous variations in army
sizes due to the need for national defense and natural geographical conditions, and
uses optimal GMM estimation under over-identification. We also conduct a variety of
robustness checks with similar results. Section 5 concludes the paper.
2. The Model and Hypotheses
2.1. Players and Technology
The taxation game is played between two types of players: the ruler and the
3
In particular, the standard deviation of land tax per capita was almost as large as the mean,
implying a much larger variance than the mean (see Table 3).
5 worker class. All players are selfish and risk-neutral income maximizers with moves
and payoffs as follows.
The ruler has incomes from two sources: direct benefit from power (benefit) and
taxation of workers’ wages.4 Direct benefit f is determined outside of the model so
that the ruler concentrates on maximizing tax revenue.
The ruler is not liquidity constrained. She spends e  0 to build state capacity in
violence (state coercive power), which would inflict cost c(e) to a rebelling worker.5
The technology c(e) turns the ruler's spending e into effective coercive power, which is
increasing at an ever lower rate, i.e. c()  0 and c()  0 . Moreover, we assume
c(0)  0 . Relying on c(e) , the ruler taxes the workers on their wages at rate r   0,1 .
There are two types of workers, worker A and worker B, and each has mass one.
Let w  0 be the wage for a worker, where w is exogenously given and reflects the
level of development or labor productivity. The cost of rebellion is  c(e)  z  for
worker A, and  c(e)  z  for worker B, where c(e) is the systematic cost from state
violence, while z  0 and z  0 are the idiosyncratic costs specific to each type of
worker. It is assumed that z is prohibitively high such that worker B, a loyalist, never
rebels. On the other hand, z is low enough that worker A, an opportunist, may decide
4
The phrase “direct benefit from power” is from Maskin and Tirole (2004). Examples of it are
incomes from corruption, natural resource rent and foreign aid (as in Besley and Persson, 2011a, b)
and large gaps between illicit rent and tax revenue (as in Acemoglu, 2005, and Sng, 2014). It also
includes psychological satisfaction derived from being in power, such as national and historical
legacy (Olson, 2000, p.13f)
5
Although referred to as technology, c(e) is a mix of technology and factors such as will power
and social norms, e.g., c(e) is smaller if killing is not as easy because of social norms. Note that
the ruler may also spend tax revenue on public goods that help increase his tax base (Olson, 2000,
p.9f). For better focus, this is omitted from our simple model.
6 to work (W) or to rebel (R), i.e. his action a  W , R . If both workers work, the total
tax revenue would be 2rw and net revenue g  (2rw  e) . If worker A rebels, the total
tax revenue would be rw and net revenue g  (rw  e  h) , where h  0 is loss caused by
the rebels.
Assumption: For simplicity, assume h is prohibitively high such that the ruler
never would want worker A to rebel in equilibrium6.
If worker A rebels, he has probability    0, 1 to usurp power from the ruler,
where  is exogenously given7. If his rebellion is successful, once in power he can tax
worker B for revenue rw and receive f, just as his predecessor would do. His income
would then be ( rw  f ) . Weighting it by  and subtracting from it  c(e)  z  , his
expected rebelling income is  (rw  f )  c(e)  z  . On the other hand, if worker A
works, his after-tax income is (1  r ) w .
The payoffs for the ruler and worker A are summarized in Table 1 (worker B is
silent and ignored).
Table 1. Payoffs
Ruler
2rw  e  f
Ir  
 rw  e  h
Worker A
W
if a  
R
 (1  r ) w
W
IA  
if a  
 (rw  f )  c(e)  z
R
r   0,1 ; e, z  0 ;  , w, f , h  0 ; c(0)  0 , c  0 , c  0
6
This assumption does not mean that popular uprisings would never occur. Instead, it means that
the ruler is not a roving bandit, and he never expects to have rebellions ex ante.
7
Wang (2012) discusses how to endogenize the probability of successful rebellion. However, the
central messages from the theory remain the same. Thus, this assumption is made for brevity. 7 Information and the moves: Information is complete and perfect. The ruler moves
first to decide on spending e and tax rate r. These decisions are truthfully announced
and irreversible. After that, worker B sets to work, while worker A chooses to work or
rebel, with respective consequences as specified above.
2.2. Subgame Perfect Equilibrium
By backward induction, we know that the condition to ensure worker A’s choice of
work, a  W is
(1  r ) w   (rw  f )  c(e)  z .
(1)
Define
r0 
1
c ( e)  z   f

1 
w(1   )
(2)
as the highest tax rate which worker A will accept. In other words, the ruler can tax
maximally up to r  r 0 without causing a rebellion. Clearly, r 0 is increasing in the
spending e for state coercion. Given e (hence also c(e) ) and all the other parameters,
there is an r 0 such that worker A’s preference is to work if r  [0, r 0 ] , but to rebel
if r  (r 0 ,1] .
To maximize tax revenue, the ruler always sets tax rate at r 0 . Inserting (2) into the
ruler’s objective function, the optimization problem is
max 2r 0 (e) w  e  f .
e
The first-order condition for optimization is
8 (3)
c(e* ) 
1 
.
2
(4)
The second-order condition is satisfied iff c  0 as we assumed. From (4), it is easy
to see that
e*
1

 0 . What this means is that, intuitively, when the odds of
 2c(e* )
successful rebellion  become higher, spending e is less effective at deterring worker
A from rebelling. This leads to a lower optimal level of e. Lemma The pair (r * , e* )   r 0 (e* ), e*  that results from (2) and (4) constitutes a
Subgame Perfect Equilibrium (SPE) of the game.
2.3. Comparative Statics and Hypotheses
The above model generates a number of interesting testable hypotheses.
Hypothesis 1. The equilibrium tax rate r * increases with the equilibrium spending
on coercion e* .
This is evident from equation (2). In fact, a central message of our theory is that a
state with greater coercive power can tax more. So if there is an exogenous shock to
c’(e) resulting in a higher equilibrium spending e* , the shock also leads to a higher
equilibrium tax rate r * .
Additional hypotheses can be easily derived through comparative statics. Note
that the SPE (r * , e* ) is a function of the structural parameters  , w, f , z . Applying the
total differentiation method to (2) and (4), which is a special variation of the Implicit
Function Theorem, yields the following comparative statics results straightforwardly
9 (see Table 2)8.
Table 2. Comparative Statics
r *
1

0
z w(1   )
 c(e* )
r *
1
*

  *  f  wr   0
 w(1   )  2c (e )

r *


0
f
w(1   )

r * 1  1
1
 
 r *   0 if r * 
w w  (1   )
1 

From Table 2, we establish the following additional hypotheses.
Hypothesis 2. The equilibrium tax rate r * increases with z , the idiosyncratic cost
of rebellion to the rebels.
The intuition is simple. The state can tax a citizen more heavily if he is loathe to
engage in popular uprisings. An implication of Hypothesis 2 is that the tax rate should
be set lower for a region dominated by minority groups with little allegiance to the
central government, hence lower z (e.g. lower moral cost of rebellion).
Moreover, the equilibrium tax rate r * decreases with  , the probability of
successful rebellion. The reason is that better odds of successful rebellion provide a
stronger motivation for people to rebel. A lower tax rate is needed to offset the
stronger motivation and appease them.
8
For illustration, from equation (2) we get (1   ) wr  w  c(e)  z   f . Apply total
differentiation with respect to  , we then get (1   )wdr  wrd  cde  f d , i.e.,
dr
de
(1   ) w
 wr  c
 f . The rest follows in the same manner.
d
d
10 Hypothesis 3. When the coercion technology becomes more efficient, the
equilibrium coercion level and the equilibrium tax rate r * both rise.
Without loss of generality, assume for example that the coercion technology has
the special form of kc(e) . It is straightforward to obtain that
 e*
c

 0 and
k
kc
r *
c2

 0 . In other words, if the state can hit the rebel more effectively at
k
w(1   )c
any given level of e, then it is worthwhile for the ruler to increase the spending for
coercion and tax more.
The relation between the equilibrium tax rate r * and the exogenously given wage
rate w is more complicated. Theoretically, it can go either way. As wage increases, the
rebel has more to gain once he is in power, hence he is more motivated to rise up
against taxation (known as the "greed" motive), and a lower tax rate is needed to
appease the potential rebel. On the other hand, at higher wages the opportunity cost of
soldiering is also higher (known as the "grievance" motive), implying that rich people
are less willing to take to arms. When this latter motive dominates, higher taxes can
be levied on those earning higher wages. Thus in general the net effect of wages on
taxation is ambiguous depending on the relative strength of these two motivations.
However, in our model, equilibrium tax rate r * increases in wage
whenever r * 
1
1
holds. If   0.05 , this requires that r * 
 0.95 , which is
1 
1  0.05
a condition quite easy to satisfy in practice. Even in the extreme case
where   1 (successful uprising guaranteed), this inequality requires only r *  0.5 ,
11 which is still reasonable. These numerical examples suggest the following hypothesis.
Hypothesis 4. With parameter values in reasonable ranges, the equilibrium tax
rate r * increases with w.
Below we will test the above hypotheses, all of which turn out to be consistent
with empirical evidence.
3. Data
In this section we take the theory of relative state power to a unique data set from
261 prefectures in late imperial China circa 1820, and focus on the determinants of
per capita land tax across prefectures. The definitions of variables and data sources are
described below.
3.1. Cropland, Population, and Tax
The empirical counterpart of the theoretical tax rate is land tax per capita in late
imperial China, where land tax was the primary source of revenue in an agricultural
society. The data for total land tax in 1820 (tax) for each prefecture in China is
derived from Liang (1980), which compiles information about cropland and land tax
in the Qing dynasty from the Grand Gazetteer of the Qing during the Reign of Jiaqing
(Jiaqing Chongxiu Yitong Zhi). The payment of land tax could take three forms, i.e.
land tax payable in taels of silver, grain, or cereal. We use Grain Price Database in
the Qing Dynasty (Wang, 2009)9 to retrieve the prices of grain and cereal for each
9
This data set is available at http://140.109.152.38/DBIntro.asp. Since 1736, provincial
governments were required to report grain prices in each prefecture within their province. Thanks
12 prefecture in 1820, and convert all land tax into taels of silver.
The population of each prefecture in 1820 in thousands (pop) is derived from Cao
(2001), which is widely considered the most authoritative study on historical Chinese
demography. The dependent variable of this study, land tax per capita (taxpc), is
simply defined as total land tax in 1820 (tax) divided by population in 1820 (pop).
Since land tax in late imperial China only changed slowly over time (Wang, 1974; Liu
and Fei, 1977), we define an alternative measure of land tax per capita taxpc1776 as
total land tax in 1820 (tax) divided by population in 1776 (pop1776) as a robustness
check.
The data for cropland in mu (land) are also taken from Liang (1980). Dividing
land by pop yields cropland per capita (landpc), which may be used as a proxy for
wage, since income per capita depends heavily on land per capita in an agricultural
society. The data for the area of each prefecture in km2 (area) are from Cao (2001).
Dividing pop by area yields population density (pdensity), which is another proxy for
wage, or level of development in general (Acemoglu et al., 2002). Alternatively, we
define pdensity1776 as the ratio of pop1776 to area. These two proxies of wage are
used to test Hypothesis 4 that the equilibrium tax rate increases with the wage.
All croplands were not quite the same, since some were fertile and blessed with
an auspicious climate, while others were not. Following ecologists' approach, we
to decades of work by Yeh-chien Wang, data from the archives became a complete database,
which has been used by Shiue and Keller (2007) to examine market integration in 18th-century
China. 13 measure the agricultural suitability as the product of climate suitability and soil
suitability. The original data are drawn from Ramankutty et al. (2002), which provides
agricultural suitability data at the 0.5-degree grid level10. We define the agricultural
suitability (agrisuit) of each prefecture as the average value of grid cells within its
boundary.
3.2. Army
A key explanatory variable is the state's spending on coercive power proxied by
the size of army, which is used to test Hypothesis 1 that the equilibrium tax rate rises
with equilibrium spending on coercion. Since the data for army size are not available
at the prefectural level, we use the size of the provincial army (army) instead11, which
is available from An Investigation of the Royal Military System (Wen, 1861). As an
alternative measure, we also consider the direct distance from each prefecture to the
nearest military center where the (Manchu) Eight Banners Army was stationed. The
locations of these military centers are also available from Wen (1861), while the direct
distance is computed with Chinese Historical GIS (CHGIS, Harvard Yenching
Institute, version 5). However, this alternative measure performs rather poorly, and for
a good reason, i.e. a military center typically did not have jurisdiction over prefectures
in neighboring provinces. Therefore, we focus on the size of provincial army (army)
as a proxy for state coercive power.
10
The agricultural suitability data can be downloaded from
http://www.sage.wisc.edu/atlas/maps.php?datasetid=19&includerelatedlinks=1&dataset=19.
11
A provincial army included the (Manchu) Eight Banners Army and the (Han) Green Standard
Army. From an administrative perspective, the maintenance of local stability was mostly the duty
of the provincial government, led by the provincial governor. Therefore, for our purpose the size
of the provincial army mattered more than the size of the army stationed in each prefecture.
14 An obvious challenge in estimating the effect of army size on tax rate is its
potential endogeneity. As clear from the model in Section 2, the size of the army as a
proxy for coercive spending and the tax rate are jointly determined, so the causality
can go both ways. We use two instrumental variables to overcome the endogeneity of
army. First, we exploit the exogenous variation of army size due to the need for
national defense, and use a dummy representing whether a prefecture is located in a
province on the frontier with a foreign country, or a potentially rebellious autonomous
region (frontier) as an IV for army. Specifically, frontier takes on the value of one for
any prefecture in Zhili, Shanxi, Shaanxi, Gansu, Guangxi and Yunan provinces, and
zero otherwise. Obviously, frontier is correlated with army, since a frontier province
typically had a larger army for national defense. The correlation coefficient is 0.352 in
the sample, which is significant at the 1% level. On the other hand, the variation in
army size due to the need for foreign defense arguably had nothing to do with
domestic tax issues.
The second IV for army is the ruggedness of the terrain. We used visual
inspection in Google Earth to determine whether the dominant terrain of a prefecture
is rugged (rugged). The size of the army is correlated with rugged, because a rugged
terrain typically discourages the deployment of an army. The correlation coefficient is
-0.182 in the sample, and significant at the 1% level. On the other hand, rugged
appears to be exogenous, since we have already controlled for cropland per capita,
agricultural suitability, and population density as alternative channels for rugged to
affect the tax rate.
15 In addition, we follow Nunn and Puga (2011) to define an alternative measure of
terrain ruggedness12, rugged2, which first computes a terrain ruggedness index for
each 30-by-30 arc-second cell as the square root of the sum squared differences in
elevation in eight directions, then averages this index over the entire portion of the
prefecture not covered by water. However, rugged2 only measures small-scale terrain
irregularities, and the correlation coefficient between rugged2 and army is just -0.105
and only significant at the 10% level. Therefore, rugged2 appears to be a weaker IV
than rugged, and the former is only used as a robustness check. The correlation
coefficient between these two measures of terrain ruggedness is 0.584.
3.3. Agency Cost
Following Sng (2014), we use the postal distance from the provincial capital to
Beijing (pdprov), and the direct distance from the prefectural seat to the provincial
capital (dpref), as two proxies for agency costs13. The data for pdprov are derived
from Collected Statutes of the Qing Dynasty by Imperial Order (QDZH, 1985, v. 121),
which describes royal postal routes from Beijing to each provincial capital, and the
corresponding postal distance in 'li' units, which is converted into kilometers14. The
data for dpref are computed in CHGIS. We also define a dummy capturing whether
the prefectural seat housed the provincial capital (prov_capital) according to China
12
We thank Ting Chen and Chicheng Ma for sharing the data, which are computed with SGS
Digital Elevation Model (DEM) at 90 square-meter-cell grid resolution, matched with CHGIS,
Version 4, 2007.
13
Sng (2014) uses the sum of dprov and dpref as an explanatory variable. However, more
information may be gained if dprov and dpref are included as two separate regressors. Moreover,
the nature of postal distance and directional distance are not quite the same.
14
The formula for conversion is 1 li = 0.576 km according to Yang et al. (2008), p.2248. 16 Historical Atlas (Tan,1982). Obviously, for prefectures with prov_capital = 1, we
have dpref = 0.
While the direct distance is merely a geographic feature, the postal distance was
not. The imperial postal routes might be designed to pass through regions of higher
population density and greater tax potential. To deal with the potential problem of
endogeneity, we follow Sng (2014) and use the direct distance from the provincial
capital to Beijing (dprov) as an IV for the postal distance from the provincial capital to
Beijing (pdprov). These two measures of distance are highly correlated, with a
correlation coefficient of 0.98. On the other hand, there is no obvious reason for the
direct distance to affect tax per capita other than through the postal distance, thus
dprov appears to be exogenous.
3.4. Allegiance
To test Hypothesis 2 that tax rate rises with the cost of rebelling, we exploit the
fact that a prefecture dominated by minority groups usually showed little allegiance to
the central government, and had few scruples in contemplating a rebellion. To capture
this effect, we define minority1 as an indicator of the presence of minority groups in a
prefecture during 1661-1820 according to Atlas for Chinese History (Guo, 1996, v.2,
p.103-106). However, the definition of minority1 may be too loose, since the
presence of minority groups in a prefecture need not imply their dominance. As an
alternative measure, we define minority2 as whether there was any autonomous
county, autonomous zhou (city), or autonomous region (province) after 1949
17 according to China Historical Atlas (Tan,1982) and Atlas of China (Dizhi Press
Editorial Office, 2011). The advantage of minority2 is that it does indicate the
presence of dominant minority groups, while its disadvantage lies in the time
mismatch, since the distribution of minority groups could have changed due to
migration during 1820-1949. Nevertheless, since minority1 outperforms minority2,
and these two measures are highly correlated, with a correlation coefficient of 0.65,
we only use minority1 in the regressions.
In the Qing dynasty, municipalities under direct control of the provincial
government, known as "zhili ting", were often set up in regions dominated by
minorities. Hence, we define another proxy for minority groups, ting, to indicate
whether the prefectural unit was a zhili ting.
Moreover, we look at the frequency of past uprisings as an ad hoc measure of a
prefecture's readiness to take to arms, or absence of allegiance. For this purpose, we
define riot1 as the number of peasant uprisings in a prefecture since the beginning of
the Qing (Manchu) dynasty's rule in mainland China in 1644 until 1820. However, the
invasion of the Manchu army was met with sustained resistance by the Han majority,
which did not die down until the suppression of the Revolt of the Three Feudatories in
1682. Thus, many peasant uprisings before 1682 were actually part of the
anti-Manchu resistance movement. Hence, we define an alternative measure riot2 as
the number of peasant uprisings in a prefecture during 1682-1820. Both riot1 and
riot2 are derived from A Chronology of Warfare in Dynastic China (China’s Military
18 History Editorial Committee, 2003), which has been used by Chen (2014) and Jia
(2014) to study peasant uprisings in historical China. Since riot2 outperforms riot1,
and these two measures are highly correlated with a correlation coefficient of 0.71, we
only use riot2 in the regressions.
3.5. Geography
As geographic controls, we define latitude, longitude, and elevation as the
latitude, longitude and elevation of the prefectural seat respectively. Both latitude and
longitude are directly available from CHGIS. With the latitude and longitude of the
prefectural seat at hand, we use Google Earth to determine its elevation from the sea
level. Moreover, we define a dummy capturing whether a prefecture had a coastline
(coast), and a dummy for whether a prefecture was passed through by the Yangtze
River or the Grand Canal (river). The data for both coast and river are derived from
China Historical Atlas (Tan,1982). The variable coast is used as a proxy for less
efficient coercion technology, since rebels in a coastal prefecture had the option of
taking to the sea to avoid punishment. An additional geographic control is the size of
prefecture, measured by its area (area) as noted before.
3.6. Climate
As climate controls, we derive historical weather data from Collected Maps of
Droughts and Floods in China in the Past Five Hundred Years (State Meteorological
Society, 1981), which provides annual information on the weather for locations
19 throughout China dating back to 147015, and which has been used by Shiue and Keller
(2007) and Jia (2014). This data set contains a variable dryness, a discrete indicator of
the degree of aridity, which is coded in the following way:
1
2

dryness  3
4

 5
if exceptional flood
if limited flood
if normal weather
if limited drought
if exceptional drought
(5)
We compute the average dryness during 1644-1820 for each prefecture, and still
denote it as dryness. Since climate volatility may also matter, we compute the
standard deviation of dryness, and denote it as dry_std.
Summary statistics for the above variables are presented in Table 3.
Table 3. Summary Statistics
Variable
taxpc
army
frontier
rugged
rugged2
pdprov
dprov
dpref
prov_capital
minority1
ting
riot2
coast
landpc
pdensity
Observation
261
261
261
261
261
261
261
261
261
261
261
261
261
261
261
Mean
110.49
46.57
0.36
0.59
2.30
1970.63
1193.33
198.58
0.07
0.49
0.06
0.18
0.14
1911.88
134.11
Std. Dev.
95.35
45.05
0.48
0.49
1.82
985.43
563.07
123.44
0.25
0.50
0.23
0.52
0.35
1444.97
144.34
Min
1.15
9.442
0
0
0.04
190.08
141.12
0
0
0
0
0
0
17.41
0.45
Max
611.22
208.61
1
1
9.72
3415.68
2090.19
872.90
1
1
1
4
1
6974.89
874.1
15
The sources of State Meteorological Society (1981) include rainfall records from weather
stations, official documents of the Ming and Qing dynasties (the Veritable Records of the Ming
and Qing Dynasties, the History of the Ming Dynasty and the Qing Dynasty), as well as more than
2200 local gazetteers. 20 agrisuit
area
latitude
longitude
elevation
river
dryness
dry_std
261
261
261
261
261
261
261
261
0.68
16029.46
30.67
111.64
470.79
0.17
2.94
0.86
0.58
19404.51
4.99
5.81
606.75
0.38
0.10
0.25
0.003
1270
20.01
95.79
6
0
2.71
0.25
9.34
192200
40.97
121.54
2966
1
3.18
1.29
4. Empirical Results
Based on the above discussions, we specify the benchmark equation for the
determination of land tax per capita in prefecture i as follows,
taxpci   0  1armyi  β2di  β3 w i  β4 z i  β5 xi   i ,
(6)
where taxpc is land tax per capita, army is the size of the provincial army as a proxy
for the state's coercive spending, di = (pdprov, dpref, prov_capital) is a vector of
proxies for agency costs measuring the distance from a prefecture to political
centers, wi = (landpc, pdensity, agrisuit) is a vector of proxies for wage measuring
cropland per capita, population density, and agricultural suitability respectively, z i =
(minority1, ting, riot2) is a vector of proxies for allegiance measuring the presence of
minority groups and the frequency of past riots respectively16, and xi = (area, latitude,
longitude, elevation, river, coast, dryness, dry_std) is a vector of geographic and
climate controls.
To tackle the endogeneity of army and pdprov (the postal distance from provincial
16
Since minority1 and minority2 are highly correlated, and the former outperforms the latter, we
only keep minority1 in the regression. Similarly, we keep riot2 in the equation, but drop riot1.
21 capital to Beijing), we use three IVs as discussed in Section 3, i.e. frontier (dummy
for frontier province), rugged (dummy for rugged terrain), and dprov (direct
distance from the provincial capital to Beijing). With three valid IVs at hand for two
endogenous variables, we conduct efficient GMM estimation under overidentification.
The results are presented in column (1) of Table 4.
First and foremost, the coefficient of army is positively significant at the 1% level,
which supports the central message of Hypothesis 1, i.e. a state with more coercive
power can set a higher tax rate. On the other hand, the coefficients of the three
distance measures (pdprov, dpref, prov_capital) as proxies for agency cost are all
insignificant, which casts doubt on the role of agency costs emphasized by Sng (2014).
Intuitively, the importance of state coercion over agency costs may be due to the fact
that it was much more costly to move an entire army around than simply sending a
couple special envoys to audit local officials.
Among proxies for lack of allegiance, the coefficients of minority1 and ting are
both negatively significant at the 5% level, which lends support to Hypothesis 2, i.e.
the tax rate was set lower for regions dominated by minority groups with little
allegiance. However, the coefficient of the number of prior peasant uprisings (riot2) is
not significant.
The coefficient of coast is negatively significant at the 1% level. One explanation
is to recognize that rebels in a coastal prefecture had the option of taking to the sea as
22 pirates17, making it more difficult to defeat them completely. Thus, it is more difficult
for the state to punish rebels in a coastal prefecture, which means that the coercive
technology c(e) is less efficient for a coastal prefecture. Therefore, the negative effect
of coast is consistent with Hypothesis 3, which posits that the equilibrium tax rate
rises with the efficiency of coercion technology.
Among proxies for wage, the coefficients of cropland per capita (landpc) and
population density (pdensity) are both positively significant at the 1% level, which is
consistent with Hypothesis 4, i.e. the tax rate was set progressively higher for richer
regions, where the opportunity cost of rebellion was higher since there was more to
lose from a failed rebellion. However, while the coefficient of agricultural suitability
(agrisuit) is positive, it is not significant. Perhaps the effect of agrisuit has already
been represented through cropland per capita and population density.
Table 4. Determinants of Land Tax Per Capita
Dependent Variable: taxpc
(1) GMM
***
army
pdprov
dpref
prov_capital
minority1
ting
riot2
0.973
(0.300)
0.00513
(0.0150)
0.0561
(0.0495)
19.04
(24.72)
-28.74**
(13.40)
-43.16**
(18.57)
0.0901
(2) GMM
***
0.974
(0.299)
0.00806
(0.0152)
0.0409
(0.0493)
16.33
(24.50)
-27.67**
(13.21)
-41.37**
(17.46)
-1.446
(2) IGMM
***
0.973
(0.300)
0.00512
(0.0150)
0.0564
(0.0494)
19.21
(24.71)
-28.71**
(13.40)
-43.12**
(18.54)
0.0472
17
Piracy was a serious problem during the Ming Dynasty (1368-1644), the immediate
predecessor of the Qing dynasty. See, for example, Kung and Ma (2014).
23 (3) LIML
0.980***
(0.303)
0.00551
(0.0151)
0.0487
(0.0508)
18.67
(24.72)
-29.54**
(13.46)
-42.60**
(18.28)
0.0876
coast
landpc
pdensity
agrisuit
area
latitude
longitude
elevation
river
dryness
dry_std
_cons
N
R2
p-value for Hansen J
p-value for GMM C
F-Stat for army
F-Stat for pdprov
(8.495)
-76.14***
(22.64)
0.0249***
(0.00440)
0.332***
(0.0994)
4.331
(3.765)
-0.000127
(0.000317)
-6.108**
(2.464)
2.790
(2.292)
0.0466***
(0.0116)
9.686
(18.48)
63.14
(45.19)
118.5***
(32.57)
-458.1
(305.3)
(8.529)
-82.71***
(22.62)
0.0242***
(0.00434)
0.289***
(0.0939)
4.021
(3.739)
-0.0000472
(0.000326)
-5.309**
(2.402)
3.377
(2.180)
0.0413***
(0.0117)
7.520
(18.16)
44.13
(44.84)
111.3***
(32.21)
-480.5
(293.4)
(8.497)
-76.26***
(22.64)
0.0249***
(0.00440)
0.331***
(0.0994)
4.337
(3.764)
-0.000125
(0.000317)
-6.102**
(2.463)
2.788
(2.291)
0.0465***
(0.0116)
9.636
(18.47)
63.01
(45.17)
118.5***
(32.57)
-457.6
(305.1)
(8.567)
-78.62***
(22.81)
0.0248***
(0.00442)
0.322***
(0.0998)
4.359
(3.753)
-0.000133
(0.000322)
-6.025**
(2.475)
2.805
(2.268)
0.0446***
(0.0120)
7.593
(18.55)
56.73
(45.78)
117.4***
(32.55)
-438.7
(302.8)
261
0.327
0.409
0.000
10.695
418.436
261
0.324
0.236
0.000
10.707
435.658
261
0.327
0.410
0.000
10.695
418.436
261
0.325
10.695
418.436
Note: Robust standard errors in parentheses. * p < 0.1, ** p < 0.05, *** p < 0.01.
Among geographic controls, the coefficients of area, longitude and river are all
insignificant. The coefficient of latitude is negatively significant at the 5% level,
implying that south China, with its lower latitude, paid more tax than north China. It
is widely recognized that south China had higher agricultural productivity than north
24 China in pre-modern times18. Therefore, lower latitude could proxy for higher
productivity or labor wages, which were associated with a higher tax rate according to
Hypothesis 4.
The coefficient of elevation is positively significant at the1% level, implying that
a prefecture with a higher altitude paid more tax. A possible explanation is that a
high-elevation prefecture was typically located in a frontier province with a larger
provincial army, and hence paid more tax. In fact, elevation and frontier are positively
correlated, with a correlation coefficient of 0.49 in the sample.
Among climate controls, the coefficient of dryness is insignificant, while the
coefficient of dry_st , a measure of rainfall volatility, is positively significant at the
1% level, which appears to be counterintuitive at first sight. A possible reconciliation
has to do with China's climate and geography. While the eastern part of China is
generally more fertile and productive, it is also more heavily influenced by the fickle
monsoon, with larger climate volatility. In fact, dry_st and longitude are highly
correlated in the sample, with a correlation coefficient of 0.64. Therefore, climate
volatility (dry_std ) may proxy for higher agricultural productivity, which had a
positive effect on the tax rate.
A number of test statistics are reported at the bottom of Table 4. For the baseline
specification in column (1), the p-value for the Hansen J statistic for overidentification
18
Since south China is typically more mountainous than north China, it makes more sense to
define an alternative measure of population density, pdensity1, as the ratio of population to
cropland as a proxy for productivity and economic development. By this measure, latitude and
pdensity1 are positively correlated with a correlation coefficient of 0.155 significant at 5%
(p-value 0.0121).
25 test is 0.281, which accepts the null hypothesis that all instrumental variables are
exogenous. On the other hand, the p-value for the GMM C statistic testing for the
endogeneity is 0.006, which strongly rejects the null hypothesis that both army and
pdprov are exogenous. The F statistic for the first-stage regression with army as the
dependent variable is 10.649, while the F statistic for the first-stage regression with
pdprov as the dependent variable reaches as high as 414.657. Since both F statistics
exceed the rule-of-thumb critical value of 10, we conclude that these instruments are
not weak.
Column (2) of Table 4 uses rugged2 instead of rugged as an IV for army size, and
the results are very similar. Column (3) uses iterative GMM instead of two-step GMM,
and the results still barely change. Since the F statistic for the first-stage regression
with army as the dependent variable is only slighter larger than 10, column (4) reports
estimation by Limited Information Maximum Likelihood (LIML), which is known to
have better finite sample properties with weak instruments. The results are again quite
similar.
Table 5 conducts additional robustness checks. Since the tax quota only changed
slowly over time (Wang, 1974; Liu and Fei, 1977), in column (1) of Table 5 we
change the dependent variable to taxpc1776, which is computed as the ratio of land
tax in 1820 to population in 1776; similarly, we use pdensity1776 (population in 1776
divided by area) instead of pdensity as a regressor. The results remain similar to those
in Table 4.
26 Historically, land tax quota may have been set in consideration of cropland as
well as population (Wang, 1974). In column (2) of Table 5, we change the dependent
variable to land tax per mu of cropland (taxpl). The results are again qualitatively
similar to Table 4, with the exception that minority1, ting, and dry_std are no longer
significant.
27 Table 5. Robustness Checks
(1) taxpc1776
army
pdprov
dpref
prov_capital
minority1
ting
riot2
coast
landpc
***
1.378
(0.460)
0.000391
(0.0181)
0.0775
(0.0586)
19.92
(31.51)
-46.36***
(17.33)
-55.71**
(22.77)
10.36
(16.24)
-89.05***
(30.82)
0.0253***
(0.00669)
pdensity
pdensity1776
agrisuit
area
longitude
latitude
elevation
river
dryness
dry_std
_cons
0.605***
(0.133)
5.355
(4.683)
-0.000144
(0.000334)
0.183
(2.362)
-9.181***
(3.141)
0.0485***
(0.0156)
5.769
(19.76)
39.63
(87.44)
150.7***
(39.43)
-34.83
(371.5)
28 (2) taxpl
0.000629**
(0.000302)
-0.00000468
(0.0000129)
0.0000572
(0.0000447)
0.00687
(0.0160)
-0.0151
(0.0104)
-0.0252
(0.0227)
0.0182
(0.0168)
-0.0495***
(0.0189)
-0.0000190***
(0.00000312)
0.000163**
(0.0000733)
0.00248
(0.00307)
-0.000000211
(0.000000245)
0.00306*
(0.00175)
-0.00532**
(0.00211)
0.0000426***
(0.0000125)
0.00861
(0.0125)
0.00636
(0.0338)
0.0300
(0.0278)
-0.169
(0.247)
N
R2
p-value for Hansen J
p-value for GMM C
F-Stat for army
F-Stat for pdprov
261
0.0839
0.281
0.006
10.649
414.657
261
0.177
0.679
0.0197
10.695
418.436
Note: Robust standard errors in parentheses. * p < 0.1, ** p < 0.05, *** p < 0.01.
5. Conclusion
We have introduced a theory of relative state power to explain taxation levels under
autocracy. The model used to advance the theory sheds light on a most important issue
in the political economics of development, i.e., taxation under autocracy. We have also
empirically tested the hypotheses derived from the model. The findings are entirely
consistent with the predictions of the theory.
The findings of this study have important policy implications. One of them is that,
to avoid political violence, a revenue-maximizing ruler should adopt different policies
on taxation across regions, imposing differentiated tax rates across regions according
to local conditions of incomes, effectiveness of state coercion and attitudes towards
violence. Tax rates should also be adjusted in response to changes in these conditions,
e.g., an adversarial income shock resulting from a natural disaster. It is also politically
beneficial to adopt lower taxes in earlier stages of development.
As fruitful as it is, this work is only a first step towards understanding the
critically important issue of taxation in autocracies in light of the theory of relative
state power. Some of the assumptions made in the model can also be relaxed to allow
richer results. For example, we can relax the assumption that the ruler never wants a
29 rebellion, to study when she might tax people like a roving bandit. The model can also
be extended in different directions for additional insights, e.g., asymmetric
information on worker types, endogenous odds of successful rebellion, the role of
public goods and that of technology (productivity). On the empirical side, it would be
most important and interesting to use cross-country data to test the predictions of the
theory in an international setting.
References
Acemoglu, Daron, 2005. Politics and economics in weak and strong states. Journal of
Monetary Economics 52, 1199-1226.
Acemoglu, Daron, Simon Johnson, and James A. Robinson, 2002. Reversal of fortune:
Geography and institutions in the making of the modern world income
distribution. Quarterly Journal of Economics 117, 1231-1294.
Acemoglu, Daron, Suresh Naidu, Pascual Restrepo, James A. Robinson, 2014.
Democracy does cause growth. NBER working paper, No. 20004.
Acemoglu, Daron, and James A. Robinson, 2006. Economic Origins of Dictatorship
and Democracy. Cambridge University Press, New York.
Acemoglu, Daron, James A. Robinson and Rafael, J. Santos, 2013. The monopoly of
violence: Evidence from Columbia. Journal of the European Economic
Association 11, Issue Supplement s1, 5-44.
Acemoglu, Daron, Davide Ticchi and Andrea Vindigni, 2010. A theory of military
dictatorships. American Economic Journal: Macroeconomics 2, 1-42.
Aoki, Masahiko, 2014. Transitions from pre-modern to modern states of politicoeconomic play analyzed through the Meiji Restoration and Xinhai Revolution.
Working paper, Stanford University.
Bai, Ying, and Kung, James, 2011. Climate shocks and Sino-nomadic conflict.
Review of Economics and Statistics 93, 970-981.
Barzel, Yoram, 2002. A Theory of the State: Economic Rights, Legal Rights and the
Scope of the State. Cambridge University Press, New York.
Besley, Timothy, and Torsten Persson, 2011a. The logic of political violence.
Quarterly Journal of Economics 126, 1411-1445.
Besley, Timothy, and Torsten Persson, 2011b. The Pillars of Prosperity: The Political
Economics of Development Clusters. Princeton University Press, Princeton.
Cao, Shuji (2001). A Demographic History of China (Zhongguo Renkou Shi), v. 5.
30 Fudan University Press, Shanghai.
Chen, Qiang, 2014. Climate shocks, state capacity, and peasant uprisings in north
China during 25-1911 CE. Economica, forthcoming.
China’s Military History Editorial Committee (Eds.), 2003. A Chronology of Warfare
in Dynastic China (Zhongguo Lidai Zhanzheng Nianbiao), 2nd edition.
Jiefangjun Press, Beijing.
Dizhi Press Editorial Office, 2011. Atlas of China, 6th edition. Dizhi Press, Beijing.
Fukuyama, Francis, 2011. The Origins of Political Order: From Prehistory Times to
the French Revolution. Farrar, Straus and Giroux, New York.
Guo, Moruo (Eds.), 1996. Atlas for Chinese History (Zhongguo Shigao Ditu Ji).
Zhongguo Ditu Press, Beijing.
Jia, Ruixue, 2014. Weather shocks, sweet potatoes and peasant revolts in historical
China. Economic Journal, forthcoming.
Kung, James, and Chicheng Ma, 2014. Autarky and the rise and fall of piracy in Ming
China. Journal of Economic History, forthcoming.
Liang, Fanzhong, 1980. Statistics on Chinese Historical Demography, Land, and Land
Tax (Zhongguo Lidai Hukou, Tiandi, Tianfu Tongji). Shanghai People Press,
Shanghai.
Liu, Ts'ui-jung, and John C. H. Fei, 1977. An analysis of the land tax burden in China,
1650-1865. Journal of Economic History 37, 359-381.
Maskin, Eric, and Jean Tirole, 2004. The politician and the judge: Accountability in
government", American Economic Review 94, 1034-1054.
Milgrom, Paul R., and John Roberts, 1982. Limit pricing and entry under incomplete
information: An equilibrium analysis. Econometrica 50, 443-59.
QDZH, 1985. Collected Statutes of the Qing Dynasty by Imperial Order (Qinding
Daqing Zeli Huidian). Taiwan Shangwu Yinshu Guan, Taibei.
North, Douglass C., 1990. Institutions, Institutional Change and Economic
Performance. Cambridge University Press, Cambridge.
North, Douglass C., John Joseph Wallis, and Barry R. Weingast, 2009. Violence and
Social Orders: A Conceptual Framework for Interpreting Recorded Human History.
Cambridge University Press, New York.
Nunn, Nathan, and Diego Puga, 2011. Ruggedness: The blessing of bad geography in
Africa. Review of Economics and Statistics 94, 20-36.
Olson, Mancur, 2000. Power and Prosperity: Outgrowing Communist and Capitalist
Dictatorships. Basic Books, New York.
Ramankutty, Navin, Jonathan A. Foley , John Norman, and Kevin McSweeney, 2002.
The global distribution of cultivable lands: Current patterns and sensitivity to
possible climate change," Global Ecology and Biogeography 11, 377-392.
Shiue, Carol, and Wolfgang Keller, 2007. Markets in China and Europe on the eve of
the Industrial Revolution. American Economic Review 97, 1189-1216.
Shleifer, Andrei, and Robert W. Vishny, 1997. The Grabbing Hand. Harvard
University Press, Cambridge, MA.
Sng, Tuan-Hwee, 2014. Size and dynastic decline: The principal-agent problem in late
imperial China, 1700-1850. Explorations in Economic History, forthcoming.
31 State Meteorological Society, 1981. Collected Maps of Droughts and Floods in China
in the Past Five Hundred Years (Zhongguo Jin Wubai Nian Hanlao Fenbu Tuji).
Ditu Press, Beijing.
Tan, Qixiang (Eds.), 1982. China Historical Atlas (Zhongguo Lishi Ditu Ji).
Zhongguo Ditu Press, Beijing.
Wang, Yeh-Chien, 1974. Land Taxation in Imperial China (1750-1911). Harvard
University Press, Cambridge.
Wang, Yeh-Chien, 2009. Grain Price Database in the Qing Dynasty (Qingdai Liangjia
Shujuku). Academia Sinica, Taibei.
Wang, Yijiang, 2012. Relative state power and taxation in non-democracies. Working
paper, Cheung Kong Graduate School of Business.
Wen, Tongjue, 1861. An Investigation of the Royal Military System (Huangchao
Bingzhi Kaolue).
Yang, Kuang, Shiming Fang, Yingmu Cheng, Xulu Chen, and Qiwei Shen (Eds.),
2008. A Dictional of General Chinese History (Zhongguo Tongshi Cidian).
Shanghai Renmin Press, Shanghai.
Zhao, Hongjun, 2012. Did climate change affect the social stability of Chinese
agrarian economy in the past 2000 years? (Qihou Bianhua Shifou Yingxiang Le
Woguo Guoqu Liangqianqianjian De Nongye Shehui Wending?). China
Economic Quarterly (Jingjixue Jikan) 11, 691-722.
32