History
In ancient civilizations
Given visually complex social relationships, humans presumably have always observed and speculated about the physical differences among individuals and groups. But different societies have attributed markedly different meanings to these distinctions. For example, the Ancient Egyptian sacred text called Book of Gates identifies four categories that are now conventionally labeled "Egyptians", "Asiatics", "Libyans", and "Nubians", but such distinctions tended to conflate differences as defined by physical features such as skin tone, with tribal and national identity. Classical civilizations from Rome to China tended to invest much more importance in familial or tribal affiliation than with one's physical appearance (Dikötter 1992; Goldenberg 2003). Ancient Greek and Roman authors also attempted to explain and categorize visible biological differences among peoples known to them. Such categories often also included fantastical human-like beings that were supposed to exist in far-away lands. Some Roman writers adhered to an environmental determinism in which climate could affect the appearance and character of groups (Isaac 2004). In many ancient civilizations, individuals with widely varying physical appearances became full members of a society by growing up within that society or by adopting that society's cultural norms (Snowden 1983; Lewis 1990). Julian the Apostate was an early observer of the differences in humans, based upon ethnic, cultural, and geographic traits, but the ideology of "race" had not yet been constructed, but he believed that they were the result of "Providence":
Medieval models of "race" mixed Classical ideas with the notion that humanity as a whole was descended from Shem, Ham and Japheth, the three sons of Noah, producing distinct Semitic (Asian), Hamitic (African), and Japhetic (European) peoples. Age of DiscoveryThe word "race", along with many of the ideas now associated with the term, were products of European imperialism and colonization during the age of exploration. (Smedley 1999) As Europeans encountered people from different parts of the world, they speculated about the physical, social, and cultural differences among various human groups. The rise of the Atlantic slave trade, which gradually displaced an earlier trade in slaves from throughout the world, created a further incentive to categorize human groups in order to justify the subordination of African slaves. (Meltzer 1993) Drawing on Classical sources and upon their own internal interactions — for example, the hostility between the English and Irish was a powerful influence on early thinking about the differences between people (Takaki 1993) — Europeans began to sort themselves and others into groups associated with physical appearance and with deeply ingrained behaviors and capacities. A set of folk beliefs took hold that linked inherited physical differences between groups to inherited intellectual, behavioral, and moral qualities. (Banton 1977) Although similar ideas can be found in other cultures (Lewis 1990; Dikötter 1992), they appear not to have had as much influence upon their social structures as was found in Europe and the parts of the world colonized by Europeans. However, often brutal conflicts between ethnic groups have existed throughout history and across the world, and racial prejudice against Africans also exists today in non-colonised countries such as China and Japan. Scientific concepts
The first scientific attempts to classify humans by categories of race date from the 17th century, along with the development of European imperialism and colonization around the world. The first post-Classical published classification of humans into distinct races seems to be François Bernier's Nouvelle division de la terre par les différents espèces ou races qui l'habitent ("New division of Earth by the different species or races which inhabit it"), published in 1684. 17th and 18th centuryAccording to philosopher Michel Foucault, theories of both racial and class conflict can be traced to 17th century political debates about innate differences among ethnicities. In England radicals such as John Lilburne emphasised conflicts between Saxon and Norman peoples. In France Henri de Boulainvilliers argued that the Germanic Franks possessed a natural right to leadership, in contrast to descendants of the Gauls. In the 18th century, the differences among human groups became a focus of scientific investigation (Todorov 1993). Initially, scholars focused on cataloguing and describing "The Natural Varieties of Mankind," as Johann Friedrich Blumenbach entitled his 1775 text (which established the five major divisions of humans still reflected in some racial classifications, i.e., the Caucasoid race, Mongoloid race, Ethiopian race (later termed the Negroid race), American Indian race, and Malayan race). From the 17th through the 19th centuries, the merging of folk beliefs about group differences with scientific explanations of those differences produced what one scholar has called an "ideology of race" (Smedley 1999). According to this ideology, races are primordial, natural, enduring and distinct. It was further argued that some groups may be the result of mixture between formerly distinct populations, but that careful study could distinguish the ancestral races that had combined to produce admixed groups. 19th centuryThe 19th century saw attempts to change race from a taxonomic to a biological concept. In the 19th century a number of natural scientists wrote on race: Georges Cuvier, Charles Darwin, Alfred Wallace, Francis Galton, James Cowles Pritchard, Louis Agassiz, Charles Pickering, and Johann Friedrich Blumenbach. As the science of anthropology took shape in the 19th century, European and American scientists increasingly sought explanations for the behavioral and cultural differences they attributed to groups (Stanton 1960). For example, using anthropometrics, invented by Francis Galton and Alphonse Bertillon, they measured the shapes and sizes of skulls and related the results to group differences in intelligence or other attributes (Lieberman 2001). These scientists made three claims about race: first, that races are objective, naturally occurring divisions of humanity; second, that there is a strong relationship between biological races and other human phenomena (such as forms of activity and interpersonal relations and culture, and by extension the relative material success of cultures), thus biologizing the notion of "race", as Foucault demonstrated in his historical analysis; third, that race is therefore a valid scientific category that can be used to explain and predict individual and group behavior. Races were distinguished by skin color, facial type, cranial profile and size, texture and color of hair. Moreover, races were almost universally considered to reflect group differences in moral character and intelligence. The eugenics movement of the late 19th and early 20th centuries, inspired by Arthur Gobineau's An Essay on the Inequality of the Human Races (1853-1855) and Vacher de Lapouge's "anthroposociology", asserted as self-evident the biological inferiority of particular groups (Kevles 1985). In many parts of the world, the idea of race became a way of rigidly dividing groups by culture as well as by physical appearances (Hannaford 1996). Campaigns of oppression and genocide were often motivated by supposed racial differences (Horowitz 2001). In Charles Darwin's most controversial book, The Descent of Man, he made strong suggestions of racial differences and European superiority. In Darwin's view, stronger tribes of humans always replaced weaker tribes. As savage tribes came in conflict with civilized nations, such as England, the less advanced people were destroyed.[13] Nevertheless, he also noted the great difficulty naturalists had in trying to decide how many "races" there actually were (Darwin was himself a monogenist on the question of race, believing that all humans were of the same species and finding "race" to be a somewhat arbitrary distinction among some groups):
Modern debatesModels of human evolutionIn a recent article, Leonard Lieberman and Fatimah Jackson have suggested that any new support for a biological concept of race will likely come from another source, namely, the study of human evolution. They therefore ask what, if any, implications current models of human evolution may have for any biological conception of race.[15] Today, all humans are classified as belonging to the species Homo sapiens and sub-species Homo sapiens sapiens. However, this is not the first species of hominids: the first species of genus Homo, Homo habilis, evolved in East Africa at least 2 million years ago, and members of this species populated different parts of Africa in a relatively short time. Homo erectus evolved more than 1.8 million years ago, and by 1.5 million years ago had spread throughout the Old World. Virtually all physical anthropologists agree that Homo sapiens evolved out of Homo erectus. Anthropologists have been divided as to whether Homo sapiens evolved as one interconnected species from H. erectus (called the Multiregional Model, or the Regional Continuity Model), or evolved only in East Africa, and then migrated out of Africa and replaced H. erectus populations throughout the Old World (called the Out of Africa Model or the Complete Replacement Model). Anthropologists continue to debate both possibilities, and the evidence is technically ambiguous as to which model is correct, although most anthropologists currently favor the Out of Africa model. Multiregional modelAdvocates of the Multiregional model, primarily Milford Wolpoff and his associates, have argued that the simultaneous evolution of H. sapiens in different parts of Europe and Asia would have been possible if there were a degree of gene flow between archaic populations.[16] Similarities of morphological features between archaic European and Chinese populations and modern H. sapiens from the same regions, Wolpoff argues, support a regional continuity only possible within the Multiregional model.[17] Wolpoff and others further argue that this model is consistent with clinal patterns of phenotypic variation (Wolpoff 1993). Lieberman and Jackson have related this theory to race with the following statement:
Out of Africa model
According to the Out of Africa Model, developed by Christopher Stringer and Peter Andrews, modern Homo sapiens evolved in Africa 200,000 years ago. H. sapiens began migrating from Africa around 50,000 years ago and eventually replaced existing hominid species in Europe and Asia.[19][20] This model has gained support by recent research using mitochondrial DNA (mtDNA). After analysing genealogy trees constructed using 133 types of mtDNA, they concluded that all were descended from a woman from Africa, dubbed Mitochondrial Eve.[21] Lieberman and Jackson have related this theory to race with the following comment:
Comparison of the two modelsLieberman and Jackson have argued that while advocates of both the Multiregional Model and the Out of Africa Model use the word race and make racial assumptions, none define the term.[23] They conclude that "Each model has implications that both magnify and minimize the differences between races. Yet each model seems to take race and races as a conceptual reality. The net result is that those anthropologists who prefer to view races as a reality are encouraged to do so" and conclude that students of human evolution would be better off avoiding the word race, and instead describe genetic differences in terms of populations and clinal gradations.[24] Race as subspecies
With the advent of the modern synthesis in the early 20th century, many biologists sought to use evolutionary models and populations genetics in an attempt to formalise taxonomy below the species level. The term subspecies is used by biologists when a group of organisms are classified in such a way. In biology the term "race" is very rarely used because it is ambiguous, "'Race' is not being defined or used consistently; its referents are varied and shift depending on context. The term is often used colloquially to refer to a range of human groupings. Religious, cultural, social, national, ethnic, linguistic, genetic, geographical and anatomical groups have been and sometimes still are called 'races'".[25] Generally when it is used it is synonymous with subspecies.[26][25][27] One of the main obstacles to identifying subspecies is that, while it is a recognised taxonomic term, it has no precise definition.[26] Some species of organisms do not appear to fragment into subgroups, while others do seem to form such subspecific groups. A monotypic species comprises a single group or rather a single subspecies. Monotypic species can occur in several ways:
A polytypic species has two or more subspecies. These are separate populations that are more genetically different from one another and that are more reproductively isolated, gene flow between these populations is much reduced leading to genetic differentiation. Morphological subspeciesTraditionally subspecies are seen as geographically isolated and genetically differentiated populations.[26] Or to put it another way "the designation 'subspecies' is used to indicate an objective degree of microevolutionary divergence"[25] One objection to this idea is that it does not identify any degree of differentiation, therefore any population that is somewhat biologically different could be considered a subspecies, even to the level of a local population. As a result it is necessary to impose a threshold on the level of difference that is required for a population to be designated a subspecies.[26] This effectively means that populations of organisms must have reached a certain measurable level of difference in order to be recognised as subspecies. Dean Amadon proposed in 1949 that subspecies would be defined according to the seventy-five percent rule which means that 75% of a population must lie outside 99% of the range of other populations for a given defining morphological character or a set of characters. The 75 percent rule still has defenders but other scholars argue that it should be replaced with 90 or 95 percent rule.[28][29][30] When biologists study non-human populations, the standard threshold at which morphological diversity between two different populations is considered differentiated enough to be classified as subspecies is set at 70-75%. Smith et al. write:
Thus, according to the seventy-five percent rule two populations represent different subspecies if the morphological differences between them reach between 25-30%. According to Sewall Wright, who was born in 1889, there is no question that human populations that have long inhabited separated parts of the world should, in general, be considered to be of different subspecies by the usual criterion that most individuals of such populations can be allocated correctly by inspection. It does not require a trained anthropologist to classify an array of Englishmen, West Africans, and Chinese with 100% accuracy by features, skin color, and type of hair in spite of so much variability within each of these groups that every individual can easily be distinguished from every other. However, it is customary to use the term race rather than subspecies for the major subdivisions of the human species as well as for minor ones.[31] Humans can be correctly assigned to races at much greater than 75% accuracy on the basis of morphological traits while chimpanzee subspecies are morphologically indistinct, and difficult or impossible to classify when raised in captivity.[32][33][34] On the other hand in practice subspecies are often defined by easily observable physical appearance, but there is not necessarily any evolutionary significance to these observed differences, so this form of classification is generally not accepted by evolutionary biologists.[26][25] Because of the difficulty in classifying subspecies morphologically, many biologists began to reject the concept altogether, citing problems such as:[25]
Subspecies as isolated differentiated populationsGenetic differences between populations of organisms can be determined using the fixation index of Sewall Wright, which is often abbreviated to FST. This statistic is used to compare differences between any two given populations. For example it is often stated that the fixation index for humans is about 0.15. This means that about 85% of the variation measured in the human population is within any population, and about 15% of the variation occurs between populations.[26][25] Citing Smith, Templeton asserts that an FST value of 0.25 or 0.30 between populations is a “standard criterion” for subspecies classification. However, Smith doesn't mention the concept of FST in his paper, he is discussing variation in morphology and discusses the 75% rule. Templeton reported that the white-tailed deer has an FST of about 60% and the grey wolf has an FST approaching 90% for mtDNA. The FST of grey wolves is 0.168 in autosomal loci, however.[35][36] On the other hand, in a paper reporting on the phylogenetic structure of the Leopard Panthera pardus species of Africa and Asia, Uphyrkina et al. found that 76.04% of mtDNA variation was distributed between leopard populations and 23.96% within populations, for microsatellite autosomal data, 0.358 (35.8%) of the variation was found between populations.[37] This compares to an mtDNA FST of between 0.24-0.27 (24-27%), and a genomic FST of about 0.15 (15%) for humans,[38] and an FST of 0.09-0.32 for autosomal microsatellite DNA between three Chimpanzee (Pan troglodytes) populations and of 0.51-0.68 between these three populations and the bonobo (Pan paniscus) populations.[39] The following guidelines were suggested by Sewall Wright for interpreting FST:
Wright found differences in FST for various species from 0.023-0.501.[40] For humans the FST is usually given as 0.15, of this 15% that is distributed between populations about 3-6% is distributed between geographically close populations occupying the same continent and about 6-10% is distributed between more distant continental groups, these figures vary somewhat depending on the type of genetic systems used, but the general observation has been reproduced in numerous studies.[41] This indicates that some of the between population variation for humans is found within any "race" and about 6-10% of variation is found between "races", giving an FST of 0.06-0.1 for human "races".[42] It has also been noted that:
Population genetics: population and clineAt the beginning of the 20th century, anthropologists questioned, and eventually abandoned, the claim that biologically distinct races are isomorphic with distinct linguistic, cultural, and social groups. Shortly thereafter, the rise of population genetics provided scientists with a new understanding of the sources of phenotypic variation. This new science has led many mainstream evolutionary scientists in anthropology and biology to question the very validity of race as a scientific concept describing an objectively real phenomenon. Those who came to reject the validity of the concept of race did so for four reasons: empirical, definitional, the availability of alternative concepts, and ethical (Lieberman and Byrne 1993). The first to challenge the concept of race on empirical grounds were anthropologists Franz Boas, who demonstrated phenotypic plasticity due to environmental factors (Boas 1912), and Ashley Montagu (1941, 1942), who relied on evidence from genetics. Zoologists Edward O. Wilson and W. Brown then challenged the concept from the perspective of general animal systematics, and further rejected the claim that "races" were equivalent to "subspecies" (Wilson and Brown 1953). ClinesOne of the crucial innovations in reconceptualizing genotypic and phenotypic variation was anthropologist C. Loring Brace's observation that such variations, insofar as it is affected by natural selection, migration, or genetic drift, are distributed along geographic gradations or clines (Brace 1964). This point called attention to a problem common to phenotype-based descriptions of races (for example, those based on hair texture and skin color): they ignore a host of other similarities and differences (for example, blood type) that do not correlate highly with the markers for race. Thus, anthropologist Frank Livingstone's conclusion that, since clines cross racial boundaries, "there are no races, only clines" (Livingstone 1962: 279). In a response to Livingston, Theodore Dobzhansky argued that when talking about "race" one must be attentive to how the term is being used: "I agree with Dr. Livingston that if races have to be 'discrete units,' then there are no races, and if 'race' is used as an 'explanation' of the human variability, rather than vice versa, then the explanation is invalid." He further argued that one could use the term race if one distinguished between "race differences" and "the race concept." The former refers to any distinction in gene frequencies between populations; the latter is "a matter of judgment." He further observed that even when there is clinal variation, "Race differences are objectively ascertainable biological phenomena .... but it does not follow that racially distinct populations must be given racial (or subspecific) labels."[48] In short, Livingston and Dobzhansky agree that there are genetic differences among human beings; they also agree that the use of the race concept to classify people, and how the race concept is used, is a matter of social convention. They differ on whether the race concept remains a meaningful and useful social convention. In 1964, biologists Paul Ehrlich and Holm pointed out cases where two or more clines are distributed discordantly—for example, melanin is distributed in a decreasing pattern from the equator north and south; frequencies for the haplotype for beta-S hemoglobin, on the other hand, radiate out of specific geographical points in Africa (Ehrlich and Holm 1964). As anthropologists Leonard Lieberman and Fatimah Linda Jackson observe, "Discordant patterns of heterogeneity falsify any description of a population as if it were genotypically or even phenotypically homogeneous" (Lieverman and Jackson 1995). Patterns such as those seen in human physical and genetic variation as described above, have led to the consequence that the number and geographic location of any described races is highly dependent on the importance attributed to, and quantity of, the traits considered. For example if only skin colour and a "two race" system of classification were used, then one might classify Indigenous Australians in the same "race" as African people, and European people in the same "race" as East Asian people, but biologists and anthropologists would dispute that these classification has any scientific validity. On the other hand the greater the number of traits (or alleles) considered, the more subdivisions of humanity are detected, due to the fact that traits and gene frequencies do not always correspond to the same geographical location, or as Ossario and Duster (2005) put it:
PopulationsPopulation geneticists have debated as to whether the concept of population can provide a basis for a new conception of race. In order to do this a working definition of population must be found. Surprisingly there is no generally accepted concept of population that biologists use. It has been pointed out that the concept of population is central to ecology, evolutionary biology and conservation biology, but also that most definitions of population rely on qualitative descriptions such as "a group of organisms of the same species occupying a particular space at a particular time"[49] Waples and Gaggiotti identify two broad types of definitions for populations, those that fall into an ecological paradigm and those that fall into an evolutionary paradigm. Examples such definitions are:
Richard Lewontin, claiming that 85 percent of human variation occurs within populations, and not among populations, argued that neither "race" nor "subspecies" were appropriate or useful ways to describe populations (Lewontin 1973). Nevertheless, barriers—which may be cultural or physical— between populations can limit gene flow and increase genetic differences. Recent work by population geneticists conducting research in Europe suggests that ethnic identity can be a barrier to gene flow.[50][51][52][53] Others, such as Ernst Mayr, have argued for a notion of "geographic race" [3]. Some researchers report the variation between racial groups (measured by Sewall Wright's population structure statistic FST) accounts for as little as 5% of human genetic variation². Sewall Wright himself commented that if differences this large were seen in another species, they would be called subspecies.[54] In 2003 A. W. F. Edwards argued that cluster analysis supersedes Lewontin's arguments (see below). These empirical challenges to the concept of race forced evolutionary sciences to reconsider their definition of race. Mid-century, anthropologist William Boyd defined race as:
Lieberman and Jackson (1994) have pointed out that "the weakness of this statement is that if one gene can distinguish races then the number of races is as numerous as the number of human couples reproducing." Moreover, anthropologist Stephen Molnar has suggested that the discordance of clines inevitably results in a multiplication of races that renders the concept itself useless (Molnar 1992).
Molecular genetics: lineages and clustersWith the recent availability of large amounts of human genetic data from many geographically distant human groups scientists have again started to investigate the relationships between people from various parts of the world. One method is to investigate DNA molecules that are passed down from mother to child (mtDNA) or from father to son (Y chromosomes), these form molecular lineages and can be informative regarding prehistoric population migrations. Alternatively autosomal alleles are investigated in an attempt to understand how much genetic material groups of people share. This work has led to a debate amongst geneticists, molecular anthropologists and medical doctors as to the validity of conceps such as "race". Some researchers insist that classifying people into groups based on ancestry may be important from medical and social policy points of view, and claim to be able to do so accurately. Others claim that individuals from different groups share far too much of their genetic material for group membership to have any medical implications. This has reignited the scientific debate over the validity of human classification and concepts of "race". Molecular lineages, Y chromosomes and mitochondrial DNAMitochondria are small organelles that lie in the cytoplasm of eucaryotic cells, such as those of humans. Their primary purpose is to provide energy to the cell. Mitochondria are thought to be the vestigial remains of symbiotic bacteria that were once free living. One indication that mitochondria were once free living is that they contain a relatively small circular segment of DNA, called mitochondrial DNA (mtDNA). The overwhelming majority of a human's DNA is contained in chromosomes in the nucleus of the cell, but mtDNA is an exception. An individual inherits their cytoplasm and the organelles it contains exclusively from their mother, as these are derived from the ovum (egg cell), sperm only carry chromosomal DNA due to the necessity of maintaining motility. When a mutation arises in mtDNA molecule the mutation is therefore passed in a direct female line of descent. These mutations are derived from copying mistakes, when the DNA is copied it is possible that a single mistake occurs in the DNA sequence, these single mistakes are called single nucleotide polymorphisms (SNPs). Human Y chromosomes are male specific sex chromosomes, any human that possesses a Y chromosome will be morphologically male. Y chromosomes are therefore passed from father to son, although Y chromosomes are situated in the cell nucleus, they only recombine with the X chromosome at the ends of the Y chromosome, the vast majority of the Y chromosome (95%) does not recombine. Therefore, as with mtDNA, when mutations (SNPs) arise in the Y chromosome they are passed on directly from father to son in a direct male line of descent.
Ancestral Haplogroup Haplogroup A (Hg A) Haplogroup B (Hg B) All of these molecules are part of the ancestral haplogroup, but at some point in the past a mutation occurred in the ancestral molecule, mutation A, which produced a new lineage, this is haplogroup A and is defined by mutation A, at some more recent point in the past a new mutation, mutation B, occurred in a person carrying haplogroup A, mutation B defined haplogroup B, haplogroup B is a subgroup, or subclade of haplogroup A, both haplogrups A and B are subclades of the ancestral haplogroup.
The Y chromosome and mtDNA therefore share certain properties. Other chromosomes, autosomes and X chromosomes in women, share their genetic material (called crossing over leading to recombination) during meiosis (a special type of cell division that occurs for the purposes of sexual reproduction). Effectively this means that the genetic material from these chromosomes gets mixed up in every generation, and so any new mutations are passed down randomly from parents to offspring. The special feature that both Y chromosomes and mtDNA display is that mutations can accrue along a certain segment of both molecules and these mutations remain fixed in place on the DNA. Furthermore the historical sequence of these mutations can also be inferred. For example, if a set of ten Y chromosomes (derived from ten different men) contains a mutation, A, but only five of these chromosomes contain a second mutation, B, it must be the case that mutation B occurred after mutation A. Furthermore all ten men who carry the chromosome with mutation A are the direct male line descendants of the same man who was the first person to carry this mutation. The first man to carry mutation B was also a direct male line descendant of this man, but is also the direct male line ancestor of all men carrying mutation B. Series of mutations such as this form molecular lineages. Furthermore each mutation defines a set of specific Y chromosomes called a haplogroup. All men carrying mutation A form a single haplogroup, all men carrying mutation B are part of this haplogroup, but mutation B also defines a more recent haplogroup (which is a subgroup or subclade) of its own which men carrying only mutation A do not belong to. Both mtDNA and Y chromosomes are grouped into lineages and haplogroups, these are often presented as tree like diagrams. Groundbreaking work by molecular biologists such as Cann et al. (1987)[55] on mtDNA produced three interesting observations relevant to race and human evolution. Firstly, by estimating the rate at which mutations occur in mtDNA Cann et al. were able to estimate the age of the common ancestral mtDNA type: "the common ancestral mtDNA (type a) links mtDNA types that have diverged by an average of nearly 0.57%. Assuming a rate of 2%-4% per million years, this implies that the common ancestor of all surviving mtDNA types existed 140,000-290,000 years ago." This observation is robust, and this common direct female line ancestor (or mitochondrial most recent common ancestor (mtMRCA)) of all extant humans has become known as mitochondrial eve. The observation that the mtMRCA is the direct matrilineal ancestor of all living humans should not be interpreted as meaning that either she was the first anatomically modern human, nor that there were no other female humans living concurrently with her. A more reasonable explanation is that other women who lived at the same time as mtMRCA did indeed reproduce and pass their genes down to living humans, but that their mitochondrial lineages have been lost over time, probably due to random events such as producing only male children. It is impossible to know to what extent these non-extant lineages have been lost or how much they differed from the mtDNA of our mtMRCA. Cann et al. Secondly Cann et al. postulate that their work supports an African origin for modern human mtDNA: "We infer from the tree of minimum length... that Africa is a likely source of the human mitochondrial gene pool. This inference comes from the observation that one of the two primary branches leads exclusively to African mtDNAs... while the second primary branch also leads to African mtDNAs... By postulating that the common ancestral mtDNA (type a in Fig. 3) was African, we minimize the number of intercontinental migrations needed to account for the geographic distribution of mtDNA types." Thirdly the study shows that mtDNA types (haplogroups) do not cluster by geography, ethnicity or race, implying multiple female lineages were involved in founding modern human populations, with many closely related lineages spread geographically and many populations containing distantly related lineages: "The second implication of the tree (Fig. 3) -that each non-African population has multiple origins-can be illustrated most simply with the New Guineans. Take, as an example, mtDNA type 49, a lineage whose nearest relative is not in New Guinea, but in Asia (type 50). Asian lineage 50 is closer genealogically to this New Guinea lineage than to other Asian mtDNA lineages. Six other lineages lead exclusively to New Guinean mtDNAs, each originating at a different place in the tree (types 12, 13, 26-29, 65, 95 and 127-134 in Fig. 3). This small region of New Guinea (mainly the Eastern Highlands Province) thus seems to have been colonised by at least seven maternal lineages (Tables 2 and 3). In the same way, we calculate the minimum numbers of female lineages that colonised Australia, Asia and Europe (Tables 2 and 3). Each estimate is based on the number of region-specific clusters in the tree (Fig. 3, Tables 2 and 3). These numbers, ranging from 15 to 36 (Tables 2 and 3), will probably rise as more types of human mtDNA are discovered." The Y chromosome is much larger than mtDNA, and is relatively homogeneous, therefore it has taken much longer to find distinct lineages and to analyse them. Conversely, because the Y chromosome is so large by comparison it can hold a great deal more genetic information. With regard to the three observations made by Cann et al. concerning mtDNA, Y chromosome studies show similar patterns. The estimate for the age of the ancestral Y chromosome for all extant Y chromosomes is given at about 70,000 years ago and is also placed in Africa, this individual is sometimes referred to as Y chromosome Adam. The difference in dates between Y chromosome Adam and mitochondrial Eve is usually attributed to a higher extinction rate for Y chromosomes due to greater differential reproductive success between individual men, that means that a small number of very successful men may produce a great many children, while a larger number of less successful men will produce far less children. Keita et al. (2004) say, with reference to Y chromosome and mtDNA and concepts of race:
How much are genes shared? Clustering analyses and what they tell us
Clustering analysis from Rosenberg (2006), there are seven clusters and most individuals belong to several clusters. Note the Kalash have majority membership in a cluster in which individuals from other groups have only a tiny membership.
Genetic data can be used to infer population structure and assign individuals to groups that often correspond with their self-identified geographical ancestry. Recently, Lynn Jorde and Steven Wooding argued that "Analysis of many loci now yields reasonably accurate estimates of genetic similarity among individuals, rather than populations. Clustering of individuals is correlated with geographic origin or ancestry."[57] In 2003 A. W. F. Edwards wrote a paper called Lewontin's Fallacy, rebuking the argument that because most of the variation is within group classification of humans is not possible. He claimed that this conclusion ignores the fact that most of the information that distinguishes populations is hidden in the correlation structure of the data and not simply in the variation of the individual factors. Edwards concludes that "It is not true that 'racial classification is ... of virtually no genetic or taxonomic significance' or that 'you can't predict someone’s race by their genes'."[58] Likewise Neil Risch of Stanford University has proposed that self-identified race/ethnic group could be a valid means of categorization in the USA for public health and policy considerations.[59][60] While a 2002 paper by Noah Rosenberg's group makes a similar claim "The structure of human populations is relevant in various epidemiological contexts. As a result of variation in frequencies of both genetic and nongenetic risk factors, rates of disease and of such phenotypes as adverse drug response vary across populations. Further, information about a patient’s population of origin might provide health care practitioners with information about risk when direct causes of disease are unknown."[61] Researchers such as Neil Risch and Noah Rosenberg have argued that a person's biological and cultural background may have important implications for medical treatment decisions, for example an opinion paper by Neil Risch's group in 2002 states:
While another 2002 paper by Noah Rosenberg's group makes a similar claim
This work used samples from the Human Genome Diversity Project (HGDP), a project that has collected samples from individuals from 52 ethnic groups from various locations around the world. The HGDP has itself been criticised for collecting samples on an "ethnic group" basis, on the grounds that ethnic groups represent constructed categories rather than categories which are solely natural or biological. The molecular anthropologist Jonathan Marks states:
In the same issue of Science that published the Rosenberg data, Mary-Claire King and Arno G. Motulsky give a similar warning regarding the HGDP data:
Another study by Neil Risch in 2005 used 326 microsatellite markers and self-identified race/ethnic group (SIRE), white, African-American, Asian and Hispanic (individuals involved in the study had to choose from one of these categories), to representing discrete "populations", and showed distinct and non-overlapping clustering of the white, African-American and Asian samples. The results were claimed to confirm the integrity of self-described ancestry: "We have shown a nearly perfect correspondence between genetic cluster and SIRE for major ethnic groups living in the United States, with a discrepancy rate of only 0.14%." But also warned that: "This observation does not eliminate the potential for confounding in these populations. First, there may be subgroups within the larger population group that are too small to detect by cluster analysis. Second, there may not be discrete subgrouping but continuous ancestral variation that could lead to stratification bias. For example, African Americans have a continuous range of European ancestry that would not be detected by cluster analysis but could strongly confound genetic case-control studies. (Tang, 2005) Studies such as those by Risch and Rosenberg use a computer program called STRUCTURE to find human populations (gene clusters). It is a statistical program that works by placing individuals into one of two clusters based on their overall genetic similarity, many possible pairs of clusters are tested per individual to generate multiple clusters.[66] These populations are based on multiple genetic markers that are often shared between different human populations even over large geographic ranges. The notion of a genetic cluster is that people within the cluster share on average similar allele frequencies to each other than to those in other clusters. (Edwards, 2003 but see also infobox "Multi Locus Allele Clusters") In a test of idealised populations, the computer programme STRUCTURE was found to consistently under-estimate the numbers of populations in the data set when high migration rates between populations and slow mutation rates (such as single nucleotide polymorphisms) were considered.[67] Nevertheless the Rosenberg et al. (2002) paper shows that individuals can be assigned to specific clusters to a high degree of accuracy. One of the underlying questions regarding the distribution of human genetic diversity is related to the degree to which genes are shared between the observed clusters. It has been observed repeatedly that the majority of variation observed in the global human population is found within populations. This variation is usually calculated using Sewall Wright's Fixation index (FST), which is an estimate of between to within group variation. The degree of human genetic variation is a little different depending upon the gene type studied, but in general it is common to claim that ~85% of genetic variation is found within groups, ~6-10% between groups within the same continent and ~6-10% is found between continental groups. For example The Human Genome Project states "two random individuals from any one group are almost as different [genetically] as any two random individuals from the entire world."[68] On the other hand Edwards (2003) claims in his essay "Lewontin's Fallacy" that: "It is not true, as Nature claimed, that 'two random individuals from any one group are almost as different as any two random individuals from the entire world'" and Risch et al. (2002) state "Two Caucasians are more similar to each other genetically than a Caucasian and an Asian." It should be noted that these statements are not the same. Risch et al. simply state that two indigenous individuals from the same geographical region are more similar to each other than either is to an indigenous individual from a different geographical region, a claim few would argue with. Jorde et al put it like this:
Whereas Edwards claims that it is not true that the differences between individuals from different geographical regions represent only a small proportion of the variation within the human population (he claims that within group differences between individuals are not almost as large as between group differences). Bamshad et al. (2004) used the data from Rosenberg et al. (2002) to investigate the extent of genetic differences between individuals within continental groups relative to genetic differences between individuals between continental groups. They found that though these individuals could be classified very accurately to continental clusters, there was a significant degree of genetic overlap on the individual level, to the extent that, using 377 loci, individual Europeans were about 38% of the time more genetically similar to East Asians than to other Europeans. The results obtained by clustering analyses are dependent on several criteria:
Distribution of European clusters identified by Bauchet. When two clusters are identified there is a north-southeast cline that may be due to demic diffusion during the European Neolithic
Additionally two studies of European population clusters have been produced. Seldin et al. (2006) identified three European clusters using 5,700 genome-wide polymorphisms. Bauchet et al. (2007) used 10,000 polymorphisms to identify five distinct clusters in the European population, consisting of a south-eastern European cluster (including samples from southern Italians, Armenian, Ashkenazi Jewish and Greek "populations"); a northern-European Cluster (including samples from German, eastern English, Polish and western Irish "populations"); a Basque cluster (including samples from Basque "populations"); a Finnish cluster (including samples from Finnish "populations") and a Spanish cluster (including samples from Spanish "populations"). Most "populations" contained individuals from clusters other than the dominant cluster for that population, there were also individuals with membership of several clusters. The results of this study are presented on a map of Europe. (Bauchet, 2007) The existence of allelic clines and the observation that the bulk of human variation is continuously distributed, has led some scientists to conclude that any categorization schema attempting to partition that variation meaningfully will necessarily create artificial truncations. (Kittles & Weiss 2003). It is for this reason, Reanne Frank argues, that attempts to allocate individuals into ancestry groupings based on genetic information have yielded varying results that are highly dependent on methodological design.[69] Serre and Pääbo (2004) make a similar claim:
In a response to Serre and Pääbo (2004), Rosenberg et al. (2005) make three relevant observations. Firstly they maintain that their clustering analysis is robust. Secondly they agree with Serre and Pääbo that membership of multiple clusters can be interpreted as evidence for clinality (isolation by distance), though they also comment that this may also be due to admixture between neighbouring groups (small island model). Thirdly they comment that evidence of clusterdness is not evidence for any concepts of "biological race".
Similarly Witherspoon et al. (2007) have shown that while it is possible to classify people into genetic clusters this does not resolve the observation that any two individuals from different populations are often genetically more similar to each other than to two individuals from the same population:
Risch et al. (2002) state that "two Caucasians are more similar to each other genetically than a Caucasian and an Asian", but Bamshad et al (2004)[71] used the same data set as Rosenberg et al. (2002) to show that Europeans are more similar to Asians 38% of the time than they are to other Europeans when only 377 microsatellite markers are analysed.
If a landmass is considered with variation distributed in one dimension (west-east). Top: Distribution of genetic variation if a small island model is considered, there are two "populations" with a narrow region of hybridisation where migration occurs, this pattern is clustered. Bottom: Distribution of genetic variation if isolation by distance is considered, all variation is gradual over the extent of the landmass, this pattern is clinal.
In agreement with the observation of Bamshad et al. (2004), Witherspoon et al. (2007) have shown that many more than 326 or 377 microsatellite loci are required in order to show that individuals are always more similar to individuals in their own population group than to individuals in different population groups, even for three distinct populations.[68] In 2007 Witherspoon et al. sought to investigate these apparently contradictory observations. In their paper Genetic similarities within and between human populations[68] they expand upon the observation of Bamshad et al. (2004). They show that the observed clustering of human populations into relatively discrete groups is a product of using what they call "population trait values". This means that each individual is compared to the "typical" trait for several populations, and assigned to a population based on the individual's overall similarity to one of the populations as a whole: "population membership is treated as an additive quantitative genetic trait controlled by many loci of equal effect, and individuals are divided into populations on the basis of their trait values." They therefore claim that clustering analyses cannot necessarily be used to make inferences regarding the similarity or dissimilarity of individuals between or within clusters, but only for similarities or dissimilari |