Privacy Policy
Merchant Information
Thursday, March 30, 2006
New Sample: Modal Romany Profile

New additions to our Global Population Database include a Romany sample from eastern Hungary. The Roma, often referred to as Gypsies, are an ethnic group that entered Europe approximately 1,000 years ago from the Indian Subcontinent. Their traditional language, Romani, is related to Sanskrit. This transcontinental history is recorded in the modal genetic profile (click here to open PDF analysis) for this Hungarian Romany population sample.

The top 10 Native Match results for this profile are:
  1. Bhutia (Sikkim, India): 50
  2. Munda (Chotanagpur Plateau, India): 24
  3. Albanian (Kosovo, Albania): 24
  4. Ashkenazi Jewish (Budapest, Hungary): 16
  5. Csango (Romania): 15
  6. Szekler (Romania): 13
  7. Slovenian: 11
  8. Southern Croatian: 11
  9. Bosnian: 11
  10. Kayast (Bihar, India): 11


The top 10 Global Match results for this profile are:
  1. Bhutia (Sikkim, India): 50
  2. Romany (Eastern Hungary): 37
  3. Munda (Chotanagpur Plateau, India): 24
  4. Albanian (Kosovo, Albania): 24
  5. Ashkenazi Jewish (Budapest, Hungary): 16
  6. Csango (Romania): 15
  7. Szekler (Romania): 13
  8. Slovenian: 11
  9. Southern Croatian: 11
  10. Bosnian: 11


The top Continent Match results for this profile are:
  1. European: 33
  2. East and South Asian: 2
  3. Sub-Saharan: 0
  4. Native American: 0
These results strongly corroborate the known history of this ethnic group's origins in India and subsequent admixture with Southeastern European populations. This addition to our database can provide valuable insight for customers who have traditions or speculations of Gypsy ancestry in their family histories.


Wednesday, March 29, 2006
New Populations for March 29, 2006

We are pleased to announce the addition of 33 new populations to our Global Population Database, for a current total of 384 population, including 234 indigenous population samples and representing 96,165 individuals. Sample sizes are listed in parentheses.

New African populations:
  • African (Cape Town, South Africa) (98)
  • Berber (Asni, Morocco) (105)
  • Berber (Bouhria, Morocco) (104)
  • Kesra Berber (Central Tunisia) (44)
  • Tutsi (Rwanda) (114)
  • Zriba Arab (Central Tunisia) (45)
New European populations:
  • Ashkenazi (Budapest, Hungary) (178)
  • Romany/"Gypsy" (Eastern Hungary) (110)
  • Russian Orthodox Old Believers (Suwalki, Poland) (136)
New East Asian populations:
  • Nepalese (953)
  • Taiwanese (Taiwan) (597)
  • Tibetan (Lhasa) (196)
New Near Eastern populations:
  • Berber (Asni, Morocco) (105)
  • Berber (Bouhria, Morocco) (104)
  • Kesra Berber (Central Tunisia) (44)
  • Zriba Arab (Central Tunisia) (45)
New South Asian populations:
  • Lower Caste (Andhra Pradesh, India) (488)
  • Lower Middle Caste (Andhra Pradesh, India) (580)
  • Tribal (Andhra Pradesh, India) (214)
  • Upper Caste (Andhra Pradesh, India) (176)
  • Upper Middle Caste (Andhra Pradesh, India) (494)
New Diaspora populations from around the world:
  • Buenos Aires, Argentina (511)
  • Chaco, Argentina (55)
  • Chubut, Argentina (320)
  • Corrientes, Argentina (43)
  • Costa Rican (496)
  • European (Cape Town, South Africa) (98)
  • Formosa, Argentina (78)
  • Mendoza, Argentina (287)
  • Misiones, Argentina (169)
  • Romany (Eastern Hungary) (110)
  • Salta, Argentina (83)
  • Santa Fe, Argentina (560)
Previous customers who would like to update their results to include these new populations can order an Update through our secure online checkout.


Wednesday, March 15, 2006
Questions About Modal Profiles

The question has been asked: Why do modal profiles for populations sometimes obtain their most likely matches with other populations?

To make a simple analogy: imagine we are comparing three boxes, each of which contains ten pieces of fruit.

Box A contains 2 apple, 0 bananas, 8 oranges, and 0 pears.
Box B contains 5 apples, 3 bananas, 2 oranges, and 0 pears.
Box C contains 2 apples, 4 bananas, 3 oranges, and 1 pear.

Imagine that each box is like a population, and each type of fruit is like an allele value. Suppose that instead of looking for genetic profiles, we are looking for types of fruit.

First, we will obtain a modal fruit for each box. In Box A, oranges are most frequent, and so the modal fruit is an orange. In Box B, the modal fruit is an apple. In Box C, the modal fruit is a banana.

Now, suppose we have an orange, and we want to predict where it is most likely. The orange is from Box C, but we don't know that. The orange is most likely in Box A, then Box C, then Box B. This is analogous to a modal genetic profile for one population that obtains its highest match in another population. The reason it doesn't match its actual population of origin is that it is not distinctively typical enough of Box A. However, in a statistical sense, an orange is more typical of A than C, and more "A-like" than "C-like." If fruits were alleles (genes), then it is possible that oranges originated in Box A or another unknown box closer to Box A than to Box C. In this case, the likelihood match with A would yield information previously unknown from the recent origins of the fruit in Box C.

Now, suppose we have a pear. That pear is most likely in Box C; in fact, since pears are not found in any other boxes, it must be from Box C or else a box for which we don't have fruit data. This is analogous to a rare allele. Note that a pear is not the modal fruit for Box C, but that it nevertheless is distinctively typical of Box C.

Genetic data is considerably more complex, because: (1) there are many more kinds of fruit (allele values at 13 loci); (2) each genetic profile is not one fruit, but a set of 26; (3) there are many more boxes (currently over 300 populations); (4) fruit moves between boxes (admixture between populations); (5) fruit randomly disappear or multiply in each box over time (genetic drift); (6) fruit spontaneously acquire new characteristics (mutation); (7) two fruits can randomly generate an intermediate kind of fruit (recombination); and many other factors.

Note that genetic data is fundamentally different than genealogical information. Genetic data includes material inherited far back into prehistory; genealogical data extends back only a few generations. Genetic material is transmitted as a partial set of each parent's genes at each generation; genealogical data simply knows who the parents are, not what was inherited from them. Genetic material is transmitted somewhat unequally at each generation; genealogical data assumes a person has an equal share of inheritance from each ancestor. Genetic data concerns what genes actually are carried inside a person; genealogical data considers only where a person is from, their name, their religion, and other external information.

To continue the boxes and fruits analogy: imagine a person is from Box A (Box A is their ethnicity), and their genetic profile is an apple. Their profile (apple) is most likely in Box B, and equally likely to occur in Box A or Box C. However, just because their most likely match is Box B does not require they are 3/10 pear or 2/10 orange. Further, it does not imply that they are not from Box A. It is a likelihood calculation.

However, if that person from Box A is a pear, then their most likely match will be Box A. In fact, their matches for other boxes will be of virtually zero likelihood. This is like a rare allele value highly distinctive of Box A relative to the other boxes.

DNA Tribes uses highly distinct genetic profiles that contain complex and detailed information about each person. For any individual profile, the likelihood of obtaining an exact match anywhere in the world is infinitesimal. Matches are obtained by finding where each profile is most likely. Because genes (allele values) correlate strongly with geography, individual profiles tend to exhibit precise geographical patterns of matches. However, top matches will not always correspond to known ethnicity. However, when they do not, the most likely match gives an important clue as to a person's strongest genetic affinities that paper genealogy might not reveal.

Genes contain detailed information that human memory omits or forgets. Genetics and genealogy ask two fundamentally different kinds of questions. However, the information derived from each source can enrich understanding of the other. Genetic data can put paper genealogy in a broader context, showing how a person's family history relates to world history. Genealogical information can put genetic data in context, showing how genetic material was recently transmitted to an individual.


Tuesday, March 14, 2006
Updated Sample Modal Profiles

All sample modal profiles have been updated to reflect improvements in our algorhithm. These sample profiles can be viewed under Sample Results.


All materials
copyright © 2006-
2009 DNA Tribes.
DNA Tribes™.  All
rights reserved. All
satellite images
courtesy NASA
Earth Observatory.