Summary of public results of haplogroup R subclade SNP results

 

John McEwan

 

1st Oct 2005 Last updated 8th July 2007

 

Background

The R haplogroup comprises a large proportion of individuals in public genealogical (STR based) databases. This website is largely devoted to an analysis of the Ysearch STR database to identify clusters within R especially R1b associated with historic and geographically based populations. Haplogroups are defined by SNP mutations and are not commonly tested by the genotyping companies. Within R1b there are a series of SNPs that divide this haplogroup, but at the current time little is known about the fraction of R1b these subclades define, nor what relationship they have with STR defined clusters.

 

Currently, the only major database storing general SNP genotype results in a searchable form is DNA-Fingerprint. Individuals also often release information on the Genealogy DNA list-server or store it in Charles Kerchner’s Yahoo DNA-ANTHROGENEALOGY database tables. For many following the list server discussions, these results can be confusing and difficult to search. Similar comments apply to discussions about the DYS385 Kittler test, DYF399S1, and 464x test available from FTDNA. The most comprehensive R1b subclade SNP tests are available from Ethnoancestry although selected R1b SNPs are available from other suppliers.

 

This page attempts to address the confusion by presenting posted SNP results in a clear and consistent format in a single location. It will be updated until such time as the public databases include a provision to: enter, store and update in a searchable format individual SNP genotypes, include Ysearch IDs and are widely utilized. Results posted to the genealogy listserver, or sent direct to myself at mcewanj at xtra.co.nz (replace the at with @) will be included in this list. Please provide your Ysearch ID as well to ensure unambiguous identification. However, you are strongly encouraged to also upload your data into one of the databases storing SNP results.

 

Fig 1. Summary of the interim R1b haplogroup tree, based on SNP results available to date. For older trees see YCC 2002, Jobling 2003 , FTDNA 2005 , Athey 2005 R1b and YCC 2002 conversion table from systems in use prior to 2002. The current official tree is at ISOGG 2007 (Last updated 10th June 2007)

 

Results

 

R1a results table

R (xR1a) results table

 

 

Distribution of derived SNPs within 37 marker STR haplotype clusters

Many people often want to predict what haplogroup they are in from their STR markers. This can be done for most people with high confidence to the major haplogroup categories, but for most R1b subclades it cannot be done reliably. The table below summarizes the results to date. It is expected that as the number of STR markers genotyped increases more reliable haplogroup classification will be possible.

 

Table  1. Summary of Haplogroup classification by haplotype STR cluster assignment, R1b1c* had to have been typed for the majority of the subclade SNPs including S21 (Jul 2007)

 

Haplogroup

 

 

 

 

 

 

 

 

 

 

 

 

 

STRcluster

R1b1

R1b1b

R1b1c*

R1b1c4

R1b1c6

R1b1c7

R1b1c8

R1b1c9

R1b1c9a

R1b1c9b

R1b1c10

R1b1c11

R1b1c/464x+

R2

Grand Total

1

 

 

5

 

2

 

 

 

 

 

3

 

 

 

10

2

 

 

6

 

1

 

 

 

 

 

1

 

 

 

8

3

 

 

1

 

 

 

 

1

 

11

 

 

 

 

13

4

 

 

2

 

 

 

 

 

 

 

 

 

 

 

2

5

 

 

3

 

 

 

 

 

 

 

 

 

 

 

3

6

 

 

1

 

2

 

 

1

 

 

1

 

 

 

5

7

 

 

 

 

 

 

 

 

 

 

 

 

10

 

10

8

 

 

1

 

 

 

 

 

 

 

 

 

 

 

1

9

 

 

2

 

 

 

 

 

 

 

 

 

 

 

2

10

 

 

2

 

 

 

 

 

 

 

 

 

 

 

2

11

 

 

 

 

 

 

 

1

 

 

 

 

 

 

1

14

 

 

1

 

1

 

 

 

 

 

 

 

 

 

2

15

 

 

2

 

1

 

 

 

 

 

 

 

 

 

3

16

 

 

3

 

 

 

 

 

 

 

 

 

 

 

3

17

 

 

3

3

 

 

 

 

 

 

 

 

 

 

6

18

 

 

2

 

 

 

 

3

 

 

 

 

 

 

5

19

 

 

1

 

 

129

 

 

 

 

 

 

 

 

130

20

 

 

2

 

 

 

 

2

 

 

 

 

 

 

4

21

 

 

7

 

 

 

 

 

 

 

 

 

 

 

7

22

 

 

 

 

 

 

 

18

 

 

 

 

 

 

18

23

 

 

1

 

 

 

 

 

 

 

 

 

 

 

1

24

 

 

1

 

 

 

 

1

 

 

 

 

 

 

2

25

 

 

1

 

 

 

 

3

 

 

1

 

 

 

5

25.5

 

 

 

 

 

 

 

2

 

 

 

 

 

 

2

26

 

 

1

 

 

 

 

 

 

 

2

 

 

 

3

27

 

 

 

 

 

 

 

9

 

 

 

 

 

 

9

28

 

 

1

 

2

 

 

2

 

 

 

 

 

 

5

29

 

 

1

 

 

 

 

2

 

 

2

 

 

 

5

30

 

 

 

 

 

 

 

 

 

 

3

 

 

 

3

32

 

 

 

 

 

 

 

 

4

 

1

 

 

 

5

33

 

 

1

 

 

 

 

 

 

 

 

 

 

 

1

35

 

 

2

 

 

 

 

 

 

 

1

 

 

 

3

36

 

 

1

 

 

 

 

 

 

 

1

 

 

 

2

37

 

 

2

 

1

 

 

1

 

 

1

 

 

 

5

38

 

 

1

 

 

 

 

1

 

 

1

 

 

 

3

39

 

 

1

 

1

 

 

2

1

1

 

 

 

 

6

40

 

 

3

 

 

 

 

 

 

 

1

 

 

 

4

41

 

 

 

 

 

 

 

 

 

 

 

1

 

 

1

42

 

 

5

 

 

 

 

4

 

 

 

 

 

 

9

43

 

 

4

 

 

 

 

 

 

 

1

 

 

 

5

44

 

 

3

 

1

 

 

1

 

 

1

 

 

 

6

45

 

 

3

 

1

 

 

2

 

 

2

 

 

 

8

46

 

 

1

 

 

 

 

1

 

 

1

 

 

 

3

47

 

 

7

 

 

 

 

2

 

 

 

 

 

 

9

48

 

 

3

 

2

 

 

1

 

 

 

 

 

 

6

49

 

 

1

 

 

 

 

1

 

 

 

 

 

 

2

49.5

 

 

3

 

 

 

 

 

 

 

 

 

 

 

3

99

5

2

8

 

4

 

 

8

 

1

6

 

 

12

46

ND

 

 

1

1

4

 

1

5

 

 

4

 

 

 

16

Grand Total

5

2

100

4

23

129

1

74

5

13

34

1

10

12

413

 

Many people who test positive for a SNP want to know their likely origin, or alternatively based on their origin should they test for a certain SNP. The table below attempts to answer that question. However, be aware that sampling is biased especially for R1b1c7 (should be 1-3% of R1b1c) and the true percentage of R1b1c* is understated perhaps by twofold.

 

Table 2. Summary of Haplogroup classification by percentage geographic location. R1b1c* had to have been typed for the majority of the subclade SNPs including S21 (Jul 2007)

 

Haplogroup

 

 

 

 

 

 

 

 

 

 

 

 

 

Origin

R1b1

R1b1b

R1b1c*

R1b1c4

R1b1c6

R1b1c7

R1b1c8

R1b1c9

R1b1c9a

R1b1c9b

R1b1c10

R1b1c11

R1b1c/464x+

R2

num

Asia

0

0

0

0

0

0

0

0

0

0

0

0

0

100

4

CentEur

17

0

0

0

0

0

0

0

0

0

67

0

0

17

6

EastEur

25

0

13

0

0

0

0

13

0

0

13

0

0

38

8

England

0

0

26

0

11

5

0

24

5

12

15

0

2

2

66

Iberia

0

0

0

33

67

0

0

0

0

0

0

0

0

0

3

Ireland

0

0

17

0

3

70

0

3

0

0

0

0

7

0

100

NorthEur

0

0

31

0

0

6

0

38

0

6

19

0

0

0

16

Scotland

0

0

37

0

0

19

0

28

0

0

11

2

4

0

57

SouthEur

0

22

0

0

0

0

0

22

0

0

44

0

0

11

9

unknown

2

0

27

2

7

34

1

19

2

3

3

0

0

2

128

Wales

0

0

33

0

0

0

0

67

0

0

0

0

0

0

6

WestEur

0

0

30

10

20

0

0

20

0

0

20

0

0

0

10

percent/hap

1

0

24

1

6

31

0

18

1

3

8

0

2

3

413