Haplogroup H (Y-DNA)

Haplogroup H (Y-DNA), also known as H-L901/M2939, is a Y-chromosome haplogroup.

Haplogroup H (Y-DNA)
Haplogroup H map
Possible time of origin~48,500 ybp
Possible place of originSouth Asia or West Asia[1] or Southern Central Asia[2][3]
AncestorHIJK
DescendantsH1 (L902/M3061);
H2 (P96);
H3 (Z5857)
Defining mutationsL901/M2939
Highest frequenciesSouth Asians and Roma

The primary branch H1 (H-M69) and its subclades is one of the most predominant haplogroups amongst populations in South Asia, particularly its descendant H1a1 (M52). A primary branch of H-M52, H1a1a (H-M82), is found commonly among the Roma, who originated in South Asia and migrated into the Middle East and Europe, around the beginning of the 2nd millennium CE, and the Khmer people who got under influence from Indian populations.[4] The much rarer primary branch H3 (Z5857) is also concentrated in South Asia.

However, the primary branch H2 (P96) seems to have been found in sparse levels primarily in Europe and West Asia since prehistory. It has been found in remains of the Pre-Pottery Neolithic B (PPNB), which is part of the Pre-Pottery Neolithic, a Neolithic culture centered in upper Mesopotamia and the Levant, dating to c. 10,800 – c. 8,500 years ago, and also the later Linear Pottery culture and Neolithic Iberia.[5][6] H2 likely entered Europe during the Neolithic with the spread of agriculture.[6][7] Its present distribution is made up of various individual cases spread out throughout Europe and West Asia today.[8]

Structure edit

H-L901/M2939 is a direct descendant of Haplogroup GHIJK. There are, in turn, three direct descendants of H-L901/M2939 – their defining SNPs are as follows:

  • H1 (L902/M3061)
    • H1a previously haplogroup H1 (M69/Page45, M370)
    • H1b B108, Z34961, Z34962, Z34963, Z34964
  • H2 previously haplogroup F3,[9] (P96, L279, L281, L284, L285, L286, M282)
    • H2a FGC29299/Z19067
    • H2b Z41290
    • H2c Y21618, Z19080
  • H3 (Z5857)
    • H3a (Z5866)
    • H3b (Z13871)


Ancient distribution edit

H-L901/M2939 is believed to have split from HIJK 48,500 years before present.[10] It seems to represent the main Y-Chromosome haplogroup of the paleolithic inhabitants of South Asia. Possible site of introduction may be South Asia, since it is highly concentrated there.[11]

H1a edit

Shahr-i Sokhta and Gonur sites

Shahr-e Sukhteh, Iran and Gonur, Turkmenistan. H1a1d2 - Bronze Age, 3200-1900 BCE. [12] [13]

H1a ANCIENT SAMPLES
Sample IDLocationRadiocarbon AgeY-DNA
I11459Shahr-i Sokhta, Iran2875-2631 calBCEH1a1d2
I10409Gonur, Turkmenistan2280-2044 calBCEH1a1d2


Gogdara and Barikot sites

With limited ancient DNA testing in South Asia, accordingly there is a limited amount of ancient samples for H1a, despite it being a populous and well distributed haplogroup today. The first set of ancient DNA from South Asia was published in March 2018.[14] 65 samples were collected from the Swat Valley of northern Pakistan, 2 of which belonged to H1a.[14]

H1a ANCIENT SAMPLES
DateSubcladeLocationCountryCultureAccompanying haplogroupsSource
1100-900 BCH1a1Gogdara, Swat ValleyPakistanUdegram Iron AgeE1b1b1b2, E1b1b1b2a[14]
1000-800 BCH1a1Barikot, Swat ValleyPakistanBarikot Iron Age[14]

H2 edit

The earliest sample of H2 is found in the Pre-Pottery Neolithic B culture of the Levant 10,000 years ago.[15] From ancient samples, it is clear that H2 also has a strong association with the spread of agriculture from Anatolia into Europe, and is commonly found with haplogroup G2a.[16] H2 was found in Neolithic Anatolia, as well as in multiple later Neolithic cultures of Europe, such as the Vinča culture in Serbia,[17] and the Megalith culture of Western Europe.[17]

The 2021 study "Using Y-chromosome capture enrichment to resolve haplogroup H2 shows new evidence for a two-path Neolithic expansion to Western Europe"[7] found that while H2 is less than 0.2% in modern-day western European populations it was more common during the Neolithic, between 1.5 and 9%. They identified two major clades H2m and H2d. With respect to the current ISOGG nomenclature, H2m appears to be defined by a mix of H2, H2a, H2a1 and H2c1a SNPs while H2d appears to be defined by two H2b1 SNPs, and four additional SNPs which were previously undetected. They estimated TMRCA for H2d and H2m was  ~15.4 kya with H2m and H2d estimated TMRCAs of  ~11.8 and  ~11.9 kya respectively. H2 diversity probably existed in Near-Eastern hunter-gatherers and early farmers, and subsequently spread via the Neolithic expansion into Central and Western Europe. H2d was found along the inland/Danubian route into central Europe, but most H2m individuals are found along the Mediterranean route into Western Europe, the Iberian Peninsula and ultimately, Ireland.

There were also two occurrences of H2a found in the Neolithic Linkardstown burials in the southeast Ireland.[18] More Neolithic H2 samples have been found in Germany and France.[19]

H2 ancient samples
DateLocationCountryCultureAccompanying haplogroupsSource
7300-6750 BCMotzaIsraelLevantine Pre-Pottery Neolithic BE1b1b1b2, T1a1, T1a2a (PPNB from Jordan)[15]
6500-6200 BCBarcin site, Yenişehir ValleyTurkeyAnatolian NeolithicG2a, I2C, C1a, J2a[20]
6500-6200 BCBarcin site, Yenişehir ValleyTurkeyAnatolian NeolithicG2a, I2C, C1a, J2a[20]
5832–5667 BCСтарчевоSerbiaVinčaG2a[17]
5710–5662 BCTell Kurdu, Amik ValleyTurkeyAnatolian NeolithicJ1a2a, G2a2[21]
5702–5536 BCСтарчевоSerbiaVinčaG2a[17]
5400–5000 BCSzemelyHungaryVinčaG2a2a, G2a2b2a1a[17]
3900–3600 BCLa Mina site, SoriaSpainMegalithicI2a2a1[17]
3500–2500 BCMonte San Biagio, LatiumItalyRinaldone culture/Gaudo culture[22]
3925–3715 BCArslantepeTurkeyEarly Bronze AgeJ2a1a1a2b2a, J1a2b1, E1b1b1b2a1a1, G2a2b1, J2a1a1a2b1b, R1b1a2[21]
3366–3146 BCArslantepeTurkeyEarly Bronze AgeJ2a1a1a2b2a, J1a2b1, E1b1b1b2a1a1, G2a2b1, J2a1a1a2b1b, R1b1a2[21]
3336–3028 BCDzhulyunitsaBulgariaBulgarian Bronze AgeG2a2a1a2[23]
2899–2678 BCEl Portalon caveSpainPre-Bell BeakerI2a2a[5]
2470–2060 BCBudapest-BekasmegyerHungaryKurgan Bell BeakerR1b1a1a2a1a2b1[24]
1881–1700 BCAlalakhTurkeyLevantine Middle Bronze Age IIJ1a2a1a2, J2b2, T1a1, L2-L595, J2a1a1a2b2a1b[25]
550–332 BCBeirutLebanonIron Age III Achaemenid periodG2a2a1a2, G2a2b1a2, J1a2a1a2, I2a1b, Q1b[26]

Modern distribution edit

H1a edit

South Asia edit

H-M69 is common among populations of Bangladesh, India, Sri Lanka, Nepal and Pakistan, with lower frequency in Afghanistan.[2] The highest frequency of Halpogroup H found in tribal groups such as 87% among Koraga, 70% among Koya and 62% among Gond.[27][28] The high frequencies of H-M69 are in India, in both Dravidian and Indo-Aryan castes (32.9%).,[4][29] in Dhaka, Bangladesh (35.71%),[30] and H-M52 among Kalash (20.5%) in Pakistan.[31][28]

Haplogroup H is typically found among Indo-Aryan, Dravidian and Tribal (Indian as well as Pakistani Kalash) populations in the Indian subcontinent. In Europe it is mostly found among Roma, who belong predominantly (between 7% and 50%) to the H1a (M82) subclade.

Haplogroup H-M69 has been found in:

  • Bangladesh - 35.71% (15/42) in samples from Dhaka,[30] and 17.72% (115/649) among Bengali samples from various areas of the country.[32]
  • India - Divided into zones below:
    • South India – 27.2% (110/405) of a sample of unspecified ethnic composition.[33][34] Halpogroup H found among Dravidian tribal groups in highest frequency. Koraga people carry 87% and Koya tribe have 70% halpogroup H.[27][28] Gondi people carry around 62% of halpogroup H.[35] Another study has found haplogroup H-M69 in 26.4% (192/728) of an ethnically diverse pool of samples from various regions of India.[4]
    • West India - In Maharashtra - 33.3% of a sample of (68/204)[36] and in Gujarat, 20.69% (12/58) among Gujaratis in USA.[30] 13.8% (4/29) among unspesified Gujaratis in India,[37] 26% (13/50)among Dawoodi Bohra,[38] 27.27 (6/22) among Bhils,[39] 1.56% among Gujarati Brahmins[39]
    • North India - According to a study (Trivedi2007) it is found 24.5% (44/180) in both Caste and tribal population of North India. Most frequently found : 44.4% (8/18) among Uttar Pradesh (UP) Chamar,[4] 20.7% (6/29) among UP Rajputs,[4] 18.3% (9/49) among New Delhi Hindus,[40] 16.13% (5/31) among UP Brahmins,[39] 11.11% (6/54) among UP Kols,[39] 10.53% (2/19) among Himachal Brahmin,[39] 10.2% (5/49) in J&K Kashmiri Gujars,[39] 9.8% (5/51) in J&K Kashmiri Pandits,[39]
    • East India 14.81% (4/27) among Bihar Paswan,[39] 9.7% (3/31) among Bengalis of West Bengal (WB),[28] 5.56% (1/18) among West Bengal Brahmins.[4]
  • Sri Lanka – in 25.3% (23/91) of a sample of unspecified ethnic composition[33][34] and in 10.3% (4/39) of a sample of Sinhalese.[28]
  • Nepal – one study has found Haplogroup H-M69 in approximately 12% of a sample of males from the general population of Kathmandu (including 4/77 H-M82, 4/77 H-M52(xM82), and 1/77 H-M69(xM52, APT)) and 6% of a sample of Newars (4/66 H-M82).[41] In another study, Y-DNA that belongs to Haplogroup H-M69 has been found in 25.7% (5/37 = 13.5% H-M69 from a village in Morang District, 9/57 = 15.8% H-M69 from a village in Chitwan District, and 30/77 = 39.0% H-M69 from another village in Chitwan District) of Tharus in Nepal.[42]
  • Pakistan – in 4.1% Burusho, 20.5% Kalash, 4.2% Pashtun, and 6.3% in other Pakistanis.[4][31] Another study has found haplogroup H-M69 in approximately 8% (3/38) of a sample of Burusho (also known as Hunza), including 5% (2/38) H-M82(xM36, M97, M39/M138) and 3% (1/38) H-M36.[43]
  • Afghanistan – in 6.1% Pashtun,[2] in 7.1% Tajik.[2]

Roma people edit

Haplogroup H-M82 is a major lineage cluster in the Roma, especially Balkan Roma, among whom it accounts for approximately as high as 60% of males.[44] A 2-bp deletion at M82 locus defining this haplogroup was also reported in one-third of males from traditional Roma populations living in Bulgaria, Spain, and Lithuania.[45] High prevalence of Asian-specific Y chromosome haplogroup H-M82 supports their Indian origin and a hypothesis of a small number of founders diverging from a single ethnic group in India (Gresham et al. 2001).

Important studies show a limited introgression of the typical Roma Y-chromosome haplogroup H1 in several European groups, including approximately 0.61% in Gheg Albanians and 2.48% in Tosk Albanians.[46]

H1a in Roma populations
Populationn/Sample sizePercentageSource
Bulgarian Roma98/24839.5[45]
Hungarian Roma34/10731.8[47]
Kosovar Roma25/4259.5[48]
Lithuanian Roma10/2050[45]
Macedonian Roma34/5759.6[44]
Portuguese Roma21/12616.7[49]
Serbian Roma16/4634.8[48]
Slovakian Roma19/6230.65[47]
Spanish Roma5/2718.5[45]

Europe, Caucasus, Central Asia & Middle East edit

Haplogroup H1a is found at much lower levels outside of the Indian subcontinent and the Roma populations but is still present in other populations:

  • Europe - 0.9% (1/113) H-M82 in a sample of Serbians,[44] 2% (1/57) H-M82 in a sample of Macedonian Greeks,[40] 1% (1/92 H-M82)[40] to 2% (1/50 H-M69)[50] of Ukrainians, H1a2a in 1.3% (1/77) of a sample of Greeks.[31]
  • Caucasus- 2.6% (1/38) H-M82 in a sample of Balkarians,[40]
  • Central Asia - 12.5% (2/16) H-M52 in a sample of Tajiks from Dushanbe,[51] 5.19% (7/135) H-M69 in a sample of Salar from Qinghai,[52] 5.13% (2/39) H (including 1/39 H(xH1,H2) and 1/39 H1) in a sample of Uyghurs from Darya Boyi Village, Yutian (Keriya) County, Xinjiang,[53] 4.65% (6/129) H-M69 in a sample of Mongols from Qinghai,[52] 4.44% (2/45) H-M52 in a sample of Uzbeks from Samarkand,[51] 3.56% (17/478) H-M69 and 0.84% (4/478) F-M89(xG-M201, H-M69, I-M258, J-M304, L-M20, N-M231, O-M175, P-M45, T-M272) in a sample of Uyghurs from the Hotan area, Xinjiang,[52] 2.86% (2/70) H-M52 in a sample of Uzbeks from Xorazm,[51] 2.44% (1/41) H-M52 in a sample of Uyghurs from Kazakhstan,[51] 1.79% (1/56) H-M52 in a sample of Uzbeks from Bukhara,[51] 1.71% (3/175) H-M69 in a sample of Hui from the Changji area, Xinjiang,[52] 1.59% (1/63) H-M52 in a sample of Uzbeks from the Fergana Valley,[51] 1.56% (1/64) H1 in a sample of Uyghurs from Qarchugha Village, Yuli (Lopnur) County, Xinjiang,[53] 1.32% (1/76) H2 in a sample of Uyghurs from Horiqol Township, Awat County, Xinjiang,[53] 0.99% (1/101) H-M69 in a sample of Kazakhs from the Hami area, Xinjiang.[52]
  • West Asia- 6% (1/17) H-M52 in a sample of Turks,[50][51] 5% (1/20) H-M69 in a sample of Syrians,[50] 4% (2/53) H-M52 in a sample of Iranians from Samarkand,[51] 2.6% (3/117) H-M82 in a sample from southern Iran,[54] 4.3% (7/164) of males from the United Arab Emirates,[55] 2% of males from Oman,[56] 1.9% (3/157) of males from Saudi Arabia,[57] 1.4% (1/72 H-M82) of males from Qatar,[55] and 0.6% (3/523) H-M370 in another sample of Turks.[58]

East & South-East Asia edit

At the easternmost extent of its distribution, Haplogroup H-M69 has been found in Thais from Thailand (1/17 = 5.9% H-M69 Northern Thailand;[59] 2/290 = 0.7% H-M52 Northern Thai;[60] 2/75 = 2.7% H-M69(xM52) and 1/75 = 1.3% H-M52(xM82) general population of Thailand[61]), Balinese (19/551 = 3.45% H-M69),[34] Tibetans (3/156 = 1.9% H-M69(xM52, APT)),[41] Filipinos from southern Luzon (1/55 = 1.8% H-M69(xM52)[61]), Bamars from Myanmar (1/59 = 1.7% H-M82, with the relevant individual having been sampled in Bago Region),[62] Chams from Binh Thuan, Vietnam (1/59 = 1.7% H-M69),[59] and Mongolians (1/149 = 0.7% H-M69).[33] The subclade H-M39/M138 has been observed in the vicinity of Cambodia, including one instance in a sample of six Cambodians[4] and one instance in a sample of 18 individuals from Cambodia and Laos.[43] A genome study about Khmer people resulted in an average amount of 16,5% of Khmer belonging to y-DNA H.[4]

H1b edit

H1b is defined by the SNPs - B108, Z34961, Z34962, Z34963, and Z34964.[63] Only discovered in 2015, H1b was detected in a single sample from an individual in Myanmar.[64] Due to only being classified recently, there are currently no studies recording H1b in modern populations.

H2 edit

H2 (H-P96), which is defined by seven SNPs – P96, M282, L279, L281, L284, L285, and L286 – is the only primary branch found mainly outside South Asia.[63] Formerly named F3, H2 was reclassified as belonging to haplogroup H due to sharing the marker M3035 with H1.[65] While being found in numerous ancient samples, H2 has only been found scarcely in modern populations across West Eurasia.[5]

H2 in modern populations
RegionPopulationn/Sample sizePercentageSource
Central AsiaDolan1/761.3[66]
West AsiaUAE1/1640.6[67]
West AsiaSouth Iran2/1171.7[68]
West AsiaAssyrian1/1810.5[69]
West AsiaArmenia5/9000.6[70]
Southern EuropeSardinia2/11940.2[71]

H3 edit

H3 (Z5857) like H1, is also mostly centered in South Asia, albeit at much lower frequencies.[64]

Like other branches of H, due to it being newly classified it is not explicitly found in modern population studies. Samples belonging to H3 were likely labeled under F*.[64] In consumer testing, it has been found principally among South Indians and Sri Lankans, and other areas of Asia such as Arabia as well.[10]


The following gives a summary of most of the studies which specifically tested for the subclades H1a1a (H-M82) and H2 (H-P96), formerly F3, showing its distribution in different part of the world.[72]

Continent/subcontinental regionCountry &/or ethnicitySample sizeH1a1a (M82) freq. (%)Source
East/Southeast AsiaCambodia616.67Sengupta et al. 2006
East/Southeast AsiaCambodia/Laos185.56Underhill et al. 2000
South AsiaNepal1884.25Gayden et al. 2007
South AsiaAfghanistan2043.43Haber et al. 2012
South AsiaMalaysian Indians30118.94Pamjav et al. 2011
South AsiaTerai-Nepal19710.66Fornarino et al. 2009
South AsiaHindu New Delhi4910.2Fornarino et al. 2009
South AsiaAndhra Pradesh Tribals2927.6Fornarino et al. 2009
South AsiaChenchu Tribe India4136.6Kivisild et al. 2003
South AsiaKoya Tribe India4170.7Kivisild et al. 2003
South AsiaWest Bengal India319.6Kivisild et al. 2003
South AsiaKonkanastha Brahmin India439.3Kivisild et al. 2003
South AsiaGujarat India2913.8Kivisild et al. 2003
South AsiaLambadi India358.6Kivisild et al. 2003
South AsiaPunjab India664.5Kivisild et al. 2003
South AsiaSinhalese Sri Lanka3910.3Kivisild et al. 2003
South AsiaNorthwest India84214.49Rai et al.2012
South AsiaSouth India184520.05Rai et al.2012
South AsiaCentral India86314.83Rai et al.2012
South AsiaNorth India62213.99Rai et al.2012
South AsiaEast India17068.44Rai et al.2012
South AsiaWest India50117.17Rai et al.2012
South AsiaNortheast India10900.18Rai et al.2012
South AsiaAndaman Island200Thangaraj et al. 2003
Middle East and North AfricaSaudi Arabia1570.64Abu-Amero et al. 2009
Middle East and North AfricaTurkish5230.19Cinnioglu et al. 2004
Middle East and North AfricaIran1502Abu-Amero et al. 2009
Middle East and North AfricaIran9381.2Grugni et al. 2012

See also edit

References edit

External links edit