Sri Lanka · Genetic Ancestry Report · 2025

One Island,
Many Origins

A scientific journey through the DNA, haplogroups, and shared ancestry of every ethnic group that calls Sri Lanka home, told for everyone, not just scientists.

40,000 Years of Human Presence
8+ Distinct Ethnic Groups
6 Major Origin Streams
1 Common Human Root
Understanding the Science

What is DNA, and why does it tell us who we are?

Before we explore Sri Lanka's ethnic groups, here is everything you need to understand what DNA ancestry actually means, in plain language.

🧬 Your DNA is a biological diary written across generations

Inside every cell in your body sits a complete set of instructions, your DNA. It contains roughly 3 billion "letters" that determine everything from your eye colour to how your body processes food. Crucially, DNA is inherited: half from your mother, half from your father. Over thousands of generations, tiny mutations (changes) accumulate. Scientists track these mutations to map where populations came from, how they migrated, and who they are related to. This is the basis of genetic ancestry.

🧬
Haplogroup
A family of people who share the same ancient mutation in their DNA. Think of it like a surname that goes back tens of thousands of years. If two people share a haplogroup, they share a common ancestor, even if separated by continents.
👨
Y-DNA (Paternal Line)
The Y chromosome is passed only from father to son, unchanged. By studying it, scientists can trace your direct male ancestry, your father's father's father, back thousands of years without mixing or dilution.
👩
mtDNA (Maternal Line)
Mitochondrial DNA is passed only from mother to child. It traces your direct female line, your mother's mother's mother, across millennia. This is how scientists linked modern humans back to "Mitochondrial Eve" in Africa.
🌍
Autosomal DNA
This is the DNA you inherit from all your ancestors, the full blend from both parents. Autosomal analysis reveals your total "mixture", what percentage of your genes came from which ancient population. It mixes with each generation.
🔗
Ancestral Components
Scientists identify ancient "reference populations" (like Ancestral South Indians or Iranian Farmers) and calculate how much of your DNA resembles each one. This gives a percentage breakdown of your deep ancestry.
Genetic Drift & Admixture
Over thousands of years, populations mix (admixture). Small founding groups preserve rare DNA (drift). This is why an isolated community like the Vedda retain very old genetic signatures, while larger groups show more mixing.
Important note: A haplogroup trace does not mean you are from that region, it means one of your direct-line ancestors once carried that lineage. Modern people are blends. A Sri Lankan Tamil may carry a West Eurasian mtDNA haplogroup not because they have recent Arab ancestry, but because that lineage reached South Asia thousands of years ago through ancient migration.
History of Arrival

Who arrived when, and from where

Sri Lanka has been continuously settled for at least 40,000 years. Every wave of arrival left a genetic trace still detectable today.

~40,000 – 10,000 BCE
The Balangoda People
The island's first known inhabitants, now called "Balangoda Man," arrived during the Late Pleistocene. Their mtDNA haplogroup M lineages are among the deepest found in South Asia, older than 10,000 years. The modern Vedda (Adivasi) are their direct descendants.
~5th Century BCE
Indo-Aryan Migration (Proto-Sinhalese)
Indo-Aryan speaking settlers arrived from northeastern India. They brought the Sanskrit-derived Sinhala language, Haplogroup R1a (from Steppe ancestry) and R2, and intermarried extensively with the island's existing populations, creating the Sinhalese.
~3rd Century BCE onwards
Dravidian Settlers (Proto-Tamils)
Tamil-speaking Dravidian populations from southeastern India settled northern and eastern Sri Lanka in waves over several centuries. They brought haplogroups H, L, and high frequencies of mtDNA haplogroup M. Sri Lankan Tamils share deep genetic ties with South Indian Dravidian populations.
~9th – 14th Century CE
Arab & Indian Muslim Traders (Moors)
Arab and Muslim traders from South India settled along Sri Lanka's coastal trade routes and married local Tamil and Sinhalese women. Genetic studies reveal Moors have predominantly Indian Dravidian maternal lineages, with some West Eurasian paternal lineages reflecting Arab trading heritage.
~13th Century CE
Southeast Asian Settlers (Malays)
Sri Lankan Malays descend from Javanese and Malay Archipelago peoples brought to Sri Lanka by the Dutch colonial administration. Their DNA carries distinctive East/Southeast Asian haplogroups (O-M175), a completely different origin branch from all other Sri Lankans.
16th – 19th Century CE
European Colonists (Burghers)
Portuguese (1505), Dutch (1658), and British (1796) colonists fathered children with local Sinhalese and Tamil women. The resulting Burgher community carries Western European haplogroups (R1b, I) on paternal lines, paired with South Asian mtDNA from their local mothers, a Eurasian hybrid.
19th Century CE
Indian Tamil Plantation Workers
British colonists brought Tamil-speaking labourers from Tamil Nadu's interior highlands to work in tea plantations. Genetically similar to Sri Lankan Tamils but with higher South Indian Dravidian haplogroups (H1, L1), they remained isolated due to social and geographic segregation.
The Ethnic Groups

Native and non-native peoples of Sri Lanka

Each group's DNA tells a story of where they came from, who they mixed with, and what makes them distinct, and what unites them with everyone else on the island.

How to read these cards: Each card shows a group's origin, population share, key Y-DNA haplogroups (paternal line, teal), and mtDNA haplogroups (maternal line, terracotta). Together these paint a picture of the group's deep ancestry from both parents' sides.
🌿 Indigenous / Native
Vedda (Adivasi)
Original inhabitants · Hunter-gatherer descendants
~0.1%
The Vedda are Sri Lanka's oldest surviving population, direct descendants of the Balangoda people who inhabited the island at least 40,000 years ago. They carry the highest AASI (Ancient Ancestral South Indian) ancestry, the deepest, most archaic South Asian genetic layer, preserved by cultural endogamy over millennia.
Key Haplogroups
YH1, H3 YR2a mtM (deep) mtR (45%) mtU2 (ancient)
🏝️ Long-established / Native
Sinhalese
Indo-Aryan majority · 2,500+ years on island
74.9%
A genetic blend of Indo-Aryan migrants from northeastern India who merged with the island's pre-existing Balangoda descendants. They carry both R1a (Steppe/Indo-Aryan) and H (ancient South Asian) haplogroups, the same deep South Asian Y-DNA found in Roma (Romani) people of Europe, reflecting a shared Indian origin thousands of years ago.
Key Haplogroups
YR2a (~38%) YR1a1a (~13%) YH1, H3 (~25%) YJ/J2a mtM (42%) mtU (21%)
🌊 Long-established / Native
Sri Lankan Tamils
Dravidian settlers · Northern & Eastern Sri Lanka
11.2%
Descendants of Tamil-speaking Dravidian populations who settled from southeastern India across multiple waves, some dating back over 2,000 years. Genetically distinct from Indian Tamils. Their mtDNA is dominated by haplogroup M, the same ancient "Out of Africa" clade that defines South and East Asian populations, with Dravidian-specific sub-lineages.
Key Haplogroups
YH1 (high) YL1, R1a YJ2 mtM (43–70%) mtU (15%)
🕌 Partly Non-Native
Sri Lankan Moors
Muslim community · Arab-Indian trading heritage
9.2%
Arab and South Indian Muslim merchants settled Sri Lanka's coasts from the 9th century, marrying local Tamil and Sinhalese women. Despite the Arab cultural identity, their maternal DNA is overwhelmingly Dravidian South Indian, reflecting centuries of local women entering the community. Genetically, Moors cluster closest to Sinhalese and Tamil populations, not to Arabs.
Key Haplogroups
YJ1, J2 (Arab) YH, R2 (Indian) mtM (dominant) mtU (W. Eurasian)
🍃 Non-Native (19th C.)
Indian Tamils
Plantation workers · Central highlands
4.2%
Brought by British colonists in the 19th century as indentured tea plantation workers from Tamil Nadu's interior regions. They carry higher frequencies of South Indian Dravidian-specific haplogroups (H1, L) and lower admixture with Sinhalese populations, reflecting their relatively recent arrival and social isolation in the hill country.
Key Haplogroups
YH1 (very high) YL1, R2 mtM (71–82%) mtU (12%)
🌴 Non-Native (SE Asian)
Sri Lankan Malays
Javanese/Malay origin · Dutch colonial era
~0.2%
Genetically the most distinct non-Vedda group on the island, coming from a completely different origin branch, Southeast Asia. Brought to Sri Lanka by the Dutch from Java, their DNA carries East/Southeast Asian haplogroup O-M175 and Austronesian ancestry. They represent the island's genetic "outlier."
Key Haplogroups
YO-M175 YO2, O3 mtB (Austronesian) mtF, E (E. Asian)
⚓ Non-Native (European Colonial)
Burghers
Eurasian hybrid · Portuguese/Dutch/British fathers
~0.2%
A Eurasian community descended from European male colonists who married local Sinhalese and Tamil women. Their paternal Y-DNA reflects Western European ancestry (R1b dominant). Their maternal mtDNA is predominantly South Asian, because their mothers were Sri Lankan women. Autosomal studies show gene frequencies intermediate between European and South Asian populations.
Key Haplogroups
YR1b (W. European) YI1, I2 mtM (South Asian) mtH (European)
🏮 Non-Native (East Asian)
Chinese Sri Lankans
Hakka, Hokkien & Cantonese traders · 18–19th century
<0.1%
Descendants of Hakka, Hokkien, and Cantonese traders and merchants who arrived in the 18th and 19th centuries, settling in Colombo, Matale, and Kandy. Their DNA carries distinctively East Asian haplogroups, Y-DNA O-M122 (Sino-Tibetan) and mtDNA D and B, creating a clearly distinct genetic profile from all other Sri Lankan groups.
Key Haplogroups
YO-M122 (Han) YO-M175 mtD (E. Asian) mtB, F
Genetic Deep Dive

Where each haplogroup ultimately comes from

Every haplogroup is a branch on the human family tree. Here is where the key haplogroups found in Sri Lanka originated, and what their presence means.

How to read this table: Each haplogroup is a specific mutation found in ancient human populations. When a population carries a haplogroup, it means one of their direct-line ancestors once lived where that haplogroup originated. The group then spread it across the world as they migrated.
Haplogroup Type Geographic Origin Age (approx) Found in Sri Lanka What it means
M mtDNA Eastern Africa / Arabian Peninsula ~70,000 yrs Vedda, Sinhalese, Tamil, Moor The genetic backbone of the island. All M carriers share an ancestor who left Africa ~70,000 years ago on the "southern coastal route." The single most common mt haplogroup in South and East Asia.
R (mt) mtDNA South Asia / Near East ~65,000 yrs Vedda (45%), Sinhalese (7–25%) Sister clade to M. Very ancient. Vedda's high R frequency reflects extreme antiquity, their ancestors carried this lineage before most South Asian populations formed. Also found in Australian Aboriginals.
U2 mtDNA South Asia (ancient) ~50,000 yrs Vedda, some Sinhalese One of the oldest mtDNA lineages found in Mesolithic skeletal remains in Sri Lanka. Confirms Vedda direct descent from Balangoda-era populations older than 10,000 years.
H (mt) mtDNA West Asia / Middle East ~25,000 yrs Sinhalese, Moor (small %) The most common haplogroup in Europe. Its presence in Sri Lanka arrived through ancient Near Eastern / Iranian Farmer ancestry that entered South Asia with agricultural expansion millennia ago.
H1 / H3 Y-DNA South Asia (ancient hunter-gatherers) ~40,000–50,000 yrs Vedda, Sinhalese, Tamil One of the oldest Y-DNA lineages native to South Asia, associated with Ancestral South Indians (AASI). Also found in Roma people who migrated from India to Europe, showing the shared Indian root.
R2a Y-DNA South Asia / Central Asia ~15,000–25,000 yrs Sinhalese (~38%), Vedda, Tamil Most common Y-DNA in Sinhalese men. Also high in West Bengalis and Roma people. Reflects ancient South Asian paternal ancestry shared across multiple populations on the subcontinent.
R1a1a Y-DNA Eurasian Steppe ~5,000–8,000 yrs Sinhalese (~13%) The "Aryan marker", carried by Steppe pastoralists who spread Indo-European languages into South Asia. Its presence in Sinhalese confirms the Indo-Aryan migration. Also common in Brahmin castes and Eastern Europeans.
J1 / J2 Y-DNA Middle East / Levant ~20,000–30,000 yrs Moor (J1), Sinhalese, Tamil (J2) J1 is strongly associated with Semitic/Arab populations, present in Moors' paternal lineages reflecting Arab merchant ancestry. J2 is found across the Middle East, Caucasus, and South Asia via ancient Iranian Farmers.
L1a Y-DNA South Asia / Indus Valley ~15,000–20,000 yrs Sinhalese, Vedda, Tamil Associated with ancient Indus Valley populations. Found in many South Asian populations, connecting Sri Lankans to the ancient Indus civilisation's genetic legacy.
R1b Y-DNA Western Europe / Atlantic coast ~10,000–20,000 yrs Burghers (Portuguese/Dutch fathers) The most common Y-DNA in Western Europe. Its presence in Sri Lanka is entirely due to European colonial men fathering children with local women, a clear genetic fingerprint of the colonial era.
O-M175 Y-DNA East / Southeast Asia ~35,000 yrs Malays, Chinese Sri Lankans The dominant Y-DNA haplogroup across all of East and Southeast Asia. Found in Chinese, Japanese, Malay, Javanese. Its presence in Sri Lanka is entirely traceable to Malay and Chinese arrivals, absent from all other Sri Lankan groups.
B / F (mt) mtDNA East/Southeast Asia Various Malays, Chinese Sri Lankans Austronesian/East Asian mtDNA lineages found in Malay and Chinese communities in Sri Lanka. Each marks a distinct non-South Asian ancestry thread carried by the island's Southeast and East Asian settlers.
Ancestral Composition

The ancient populations that mix in each group's DNA

Scientists compare modern DNA against ancient "reference populations" to calculate what proportion of each group's ancestry came from which ancient source. These are estimates from admixture modelling research.

The 5 key ancient source populations for Sri Lanka: AASI (Ancient Ancestral South Indians, hunter-gatherers present before farming), Iranian Farmers (Neolithic farmers from the Fertile Crescent), Steppe Ancestry (Bronze Age Eurasian pastoralists who spread Indo-European languages), Southeast Asian (Austronesian/East Asian ancestry), and West Eurasian (broad Near Eastern/European ancestry).
AASI / South Asian Hunter-Gatherer
Iranian Farmer
Steppe / Indo-Aryan
Southeast Asian
West Eurasian / Arab
Western European
Other
🌿 Vedda (Adivasi)
AASI (Hunter-Gatherer)
~70%
Iranian Farmer
~22%
Steppe
~5%
Other
~3%
🏝️ Sinhalese
AASI (Hunter-Gatherer)
~45%
Iranian Farmer
~35%
Steppe
~15%
Other
~5%
🌊 Sri Lankan Tamils
AASI (Hunter-Gatherer)
~55%
Iranian Farmer
~30%
Steppe
~10%
Other
~5%
🕌 Sri Lankan Moors
AASI (Hunter-Gatherer)
~45%
Iranian Farmer
~30%
West Eurasian / Arab
~15%
Steppe
~8%
Other
~2%
🌴 Sri Lankan Malays
Southeast Asian
~60%
East Asian
~15%
South Asian (admix)
~20%
Other
~5%
⚓ Burghers
Western European
~35%
AASI (South Asian)
~35%
Iranian Farmer
~20%
Steppe
~10%
Faith & Identity

How religion overlaps, but does not equal, ethnicity

Religion and ethnicity are deeply intertwined in Sri Lanka but they are not the same thing. The same DNA can be found across different faiths, and the same faith spans multiple ethnic groups.

☸️
Buddhism
70.2%
Predominantly Sinhalese. Brought to Sri Lanka ~3rd century BCE by Mahinda (son of Emperor Ashoka). Theravāda tradition. Also practiced by some Vedda communities and small numbers of Tamils.
🕉️
Hinduism
12.6%
Sri Lankan Tamils and Indian Tamils. Shaivism dominant. Ancient presence predating Christianity. Some Sinhalese communities retain Hindu-syncretic practices. Kovils (Hindu temples) found island-wide.
☪️
Islam
9.7%
Sri Lankan Moors, Indian Moors, and Sri Lankan Malays, three genetically distinct groups united by faith. The genetic data shows these three groups have very different DNA origins despite sharing the same religion.
✝️
Christianity
7.4%
Burghers (Catholic & Dutch Reformed), coastal Sinhalese (Catholic, converted by Portuguese), some Tamils (Catholic & Protestant). A faith spread by colonialism that crossed ethnic lines, Sinhalese, Tamil, and Burgher Christians all exist.
The key insight: Sri Lankan Moors and Sri Lankan Malays share the Islamic faith but have completely different genetic origins, South Asian Dravidian vs. Southeast Asian Austronesian. Sri Lankan Tamils (Hindu) and Indian Tamils (also largely Hindu) share religious practice but carry measurably different DNA frequencies. This shows that culture, faith, and DNA each tell a different story, and you need all three to understand Sri Lankan identity.
Cross-Reference

Who is genetically closest to whom, and why

Genetic studies have compared all Sri Lankan groups against each other. Here are the key findings about genetic similarity and the shared threads that run through seemingly "different" communities.

Vedda Sinhalese
Despite appearing very different culturally, the Vedda and Sinhalese are genetically the most closely related of all Sri Lankan groups. The Sinhalese descend partly from absorbing or intermarrying with pre-existing Balangoda descendants. The Vedda effectively carry the "pure" form of the ancestry that partially underlies all Sinhalese.
85%
High genetic similarity · Shared AASI ancestry
Sinhalese Moors
mtDNA studies show Sinhalese and Moors clustering together more than expected, reflecting centuries of Moor men marrying Sinhalese women, meaning Moors carry significant Sinhalese maternal (mtDNA) lineages. Autosomal studies also fail to find strong genetic structure between these groups.
72%
Moderate-high similarity · Shared maternal lines
SL Tamil Indian Tamil
Genetically similar but distinct. Indian Tamils have higher frequencies of South Indian Dravidian haplogroups (H1, L) and less admixture with Sinhalese populations. Their relative isolation in hill-country plantations has preserved a more "pure South Indian interior" genetic profile.
68%
High similarity · Different admixture patterns
Sinhalese SL Tamil
Autosomal STR studies find surprisingly small genetic differences between Sinhalese and Sri Lankan Tamil populations. Both share AASI and Iranian Farmer ancestry, though Sinhalese have more Steppe-related ancestry (from Indo-Aryan admixture) and Tamils have higher AASI proportions.
65%
Moderate similarity · Different paternal origins
Sinhalese Roma (Europe)
A remarkable finding: both Sinhalese men and Roma (Romani) people of Europe carry high frequencies of Y-DNA haplogroup H and R2, because Roma people are descended from Indian migrants who left the subcontinent ~1,500 years ago. A genetic thread connecting Sri Lanka to Europe's largest minority.
40%
Surprising partial similarity · Shared Indian root
Malays All others
Sri Lankan Malays are the most genetically distinct group from all other Sri Lankans. Their Southeast Asian haplogroups (O-M175) and Austronesian mtDNA are essentially absent in every other group. They represent a completely separate branch of human migration that arrived via a different route entirely.
18%
Low similarity · Separate origin branch entirely

Every person in Sri Lanka shares a common ancestor.
Science has proven it.

No matter how different Sri Lankan communities appear, in language, faith, culture, or history, their DNA tells a story of profound shared origin. All South Asian populations, including every Sri Lankan group except the Malays, carry a dominant thread of AASI ancestry, the Ancient Ancestral South Indians, hunter-gatherers who arrived in South Asia from Africa more than 50,000 years ago. The Vedda carry this lineage in its most pristine form. The Sinhalese, Tamils, and Moors all carry it too, blended with later arrivals. Even the Malays ultimately share the haplogroup O origin from East Asia, which in turn connects to the same Out-of-Africa migration that gave rise to all modern humans.

The DNA does not care about flags, languages, or holy books. It records only one truth: we are all one species, endlessly moving, mixing, and adapting across this small island and this vast planet.

mtDNA M, carried by all South Asians
AASI ancestry, in every native group
Iranian Farmer genes, Sinhalese, Tamil, Moor
Haplogroup H, Sinhalese, Tamil, Vedda, Roma
Out-of-Africa origin, every human on Earth
~40,000 years of shared island history
Research Sources

Scientific studies behind this report

Primary Genetics
  • Ranaweera et al. (2014), Journal of Human Genetics: mtDNA history of Sri Lankan ethnic peoples
  • Population histories of Adivasi and Sinhalese using whole genomes, Current Biology (2025)
  • Illeperuma et al. (2009–2010), Genetic profiles of Sri Lankan ethnic groups (STR loci)
  • Mastana & Papiha (1996), Genetic variation in Sri Lanka (5 population groups)
  • Kivisild et al. (2003), Genetic heritage of the earliest settlers in South Asia
Additional Research
  • Sirisena & Dissanayake (2019), Genetics and genomic medicine in Sri Lanka
  • Ranasinghe et al. (2015), Mitochondrial DNA polymorphisms in Sri Lankan populations
  • Perera et al. (2021), Genetic analysis of Sri Lankan Moors
  • X-STR genetic polymorphisms and demographic history of Sri Lankan ethnicities (PMC 2021)
  • Sunday Times Sri Lanka, "Race in Sri Lanka: What genetic evidence tells us" (2014)
Disclaimer: Ancestral composition percentages shown in this report are approximate estimates derived from published admixture modelling studies. Precise percentages vary between studies depending on reference populations used, sample sizes, and methodologies. All haplogroup data reflects peer-reviewed research. This report is intended for educational purposes only. It does not make claims about individual ancestry or promote any particular political or ethnic identity.