Curley Surname Genealogy
  • Home
  • Irish
  • Scottish
  • English
  • American
  • Heraldry
  • DNA Results
  • Y-DNA Data
  • DNA Test Explained
  • News

What is the DNA Project?

The genetic aspect of the Curley surname research project uses DNA testing of its members to explore the ancestry of the surname and discover relationships between members.

By far, the majority of paternal lineage testing is done through Family Tree DNA (FTDNA), a genealogical DNA testing service designed to help you discover your family roots.  Raw test data derives from genealogical tests purchased from FTDNA.  FTDNA surname projects are administered by volunteer members of the project.  The volunteer administrators perform data analysis, help to answer questions and interpret results for project members, and facilitate communication and collaboration among the members.

The intent of this webpage is to interpret Y-DNA test results from the FTDNA Curley surname project and other public sources in order to add value to the raw data through meaningful interpretation, as well as to serve as a general resource for Curley researchers.  But the owner of this website is an independent researcher who has no formal affiliation with Family Tree DNA.

One goal of this project is to encourage more Curleys to participate in the DNA project in order to grow the database of Curley test subjects.  The ultimate goal is to develop a comprehensive tree of the Curley surname and enable participants to figure out where they fit within the tree.

What is Genetic Genealogy?

Genetic genealogy is the use of DNA testing to discover your ancestry.  By comparing your DNA to that of others, it can be estimated how closely related you are to any other person who has also taken a DNA test.  Testing also provides information on where you fit within the tree of all humanity - your ethnicity, geographic origins, and the migration routes of your distant ancestors.

Genetic genealogy is especially useful when traditional research methods have hit a brick wall.  If you get a DNA match with someone who has a documented lineage to an earlier ancestor, this will tell you that you likely share some of the same ancestry.

If there is a gap in your documentation trail, DNA testing may help you cross that gap.  If you have a suspected ancestor that you can't confirm through documentation, DNA testing may be able to confirm or refute your suspicion.  Sometimes, there are multiple individuals in the documentation with similar names, birth dates, and locations.  If you know that one of them is your ancestor, but haven't been able to establish which one, DNA testing may be able to identify the right one, or at least narrow down the list of possible candidates.
Before we get too far, there is one important caveat that needs mentioning.  In order to get a DNA match with other people, there needs to be someone in the test database that you are related to.  For surnames that are uncommon, the number of test subjects in the database is relatively small.  So there is no guarantee that you'll get a close match with another member. Curley is an uncommon surname, so the number of participants in this project is relatively low.

The good news is that the database will continue to grow over time.  Even if you don't get an immediate match when you are tested, you may get additional matches later as new test subjects are added to the database.  We are actively recruiting new participants, and as the cost of DNA testing continues to decline over time it can be expected that more and more Curleys will be tested.

Also keep in mind, even if you don't get a close DNA match to another Curley member, the test results can still be enlightening.  The DNA comparison is not limited to the Curley surname only.  There will also be more distant matches to testees with different surnames.  The DNA test results sometimes reveal connections between surnames, possibly indicating a common ancestor prior to the establishment of surnames, or possibly indicating a genetic crossover between different surnames due to a marital infidelity or adoption somewhere in the lineage. Such matches between surnames can sometimes help identify clans or localities associated with a specific genetic group.  So even if you don't get a close match to another Curley, DNA testing can still provide interesting insight into your ancestral origins.

Y-DNA Testing Explanation

There are various types of DNA testing available for genealogical purposes.  The one of primary interest for surname research is the Y-DNA test, so called because it utilizes the Y chromosome.

The Y chromosome is carried by males only.  Whereas most chromosomes are inherited from both father and mother, the Y chromosome is inherited solely from the father.  For most chromosomes, the father's and mother's DNA combine in the offspring to create a new genetic pattern.  So the genetic code will change from one generation to the next.  But the Y chromosome is unique, being passed from father to son fully intact.  It does not mix with the mother's DNA at all.  So the pattern of the Y-DNA will be mostly unchanged from one generation to the next.
Because of this, it is possible to trace a specific Y-DNA pattern back through the paternal line for many generations, which makes it especially useful for surname research.

Within DNA, there are some places in which the same pattern is sequentially repeated multiple times.  These copied segments are called "short tandem repeats", or STR's.  The number of times that these patterns repeat varies from one individual to another.  The number of repeats is usually the same for father and son.  If the father happens to have 14 repeats, then the son will also have 14 repeats.

Y-DNA is usually passed from father to son intact, without being modified.  But on rare occasions, there is a copying error in the DNA that is passed onto the son, so that the son will have a slightly different STR count than the father.  For the STR markers used in genetic genealogy, such copying errors generally occur once in every few hundred times or so.  So, if we are looking at 111 different STR markers (the highest level of Y-DNA testing available through FTDNA), we can expect there to be a change to an STR count about every 2 generations, on average.

At first thought, one might think that these copying mutations might make it impossible to track the DNA pattern through the paternal lineage, as the mutations keep accumulating over multiple generations.  But it turns out that these mutations can be used to our advantage.  Studies have determined how frequently mutations occur for each marker.  This makes it possible to compare the STR markers between two related individuals, count the number of differences, and estimate how many generations worth of mutations separate the two individuals.

FTDNA has identified 111 of these repeating sequences, or "markers" that it uses for its genetic testing. By making a table of the number of repeats for each marker, one can compare the DNA of any two individuals to determine whether they share a paternal lineage. The raw STR test results for the Curley surname project may be viewed by clicking the "Y-DNA Data" tab at the top of this page.

The first row in the chart below contains the names of the STR markers used by FTDNA for the 37 marker test. The second and third rows list the numbers of times that each STR is repeated for two different test subjects.

Looking at the first 37 markers for kits numbers 176429 and 253019, one can see that the STR values are identical.  These two project members are father and son, so this is what we should expect.

However if we compare 253019 with 266661, we can see quite a few differences in the STR values, which tells us that these two individuals are not closely related to each other.

The number of STR differences between individuals is commonly referred to as the "genetic distance", or GD.  Knowing the genetic distance between two individuals makes it possible to estimate how closely related they are through their paternal lineage.  In this way, we can estimate the most recent common ancestor between two individuals, often referred to as the MRCA (most recent common ancestor).  A MRCA value of 8 means that two individuals probably share a common ancestor about 8 generations removed.  For example, two individuals with the same great grandfather will have a MRCA value of 3.  People will also sometimes refer to the "time to most recent common ancestor", or TMRCA, meaning the time span corresponding with a certain number of generations.

The table below shows the STR test results, GD, MRCA, and TMRCA for two hypothetical test results.
The TMRCA is derived by simply multiplying the number of generations by the number of years per generation.  The appropriate time to use per generation varies depending on whom you ask, but typically ranges from 25 to 33 years.  In the example above, we use 30 years.

On average, STR's mutate one time out of a few hundred.  But different STR's mutate at different rates.  FTDNA has deliberately chosen the STR markers that they use for their test. Some of the markers mutate especially quickly and others mutate relatively slowly.  This allows the MRCA estimate to be calculated more accurately for a wide spectrum of time ranges.

The MRCA value is not an exact prediction that will nail down precisely how closely the individuals are related.  It is just a statistical estimate, based on the knowledge of how likely it is that certain mutations will occur.  Because of this, MRCA estimate is usually presented in the form of a plot or a table.
The plot above shows an example of the MRCA estimate for two test subjects having a GD of 6 differences out of 111 markers.  The horizontal axis indicates the possible number of generations separating the two individuals.  The vertical axis indicates how likely it is for the common ancestor to be a certain number of generations removed.  In this example, the highest probability occurs at 10 generations, with a likelihood just under 5%.  This means that if two individuals have a GD of 6 out of 111, in 5% of the cases their common ancestor will be 10 generations removed.

The peak of the plot gives an idea of how closely related two people may be, but as the example above shows, the possible range for a MRCA is actually much greater.  It is possible for the common ancestor to be greater than or fewer than 10 generations distant.  The numbers above or below the peak are just less likely.  For example, it is possible for the common ancestor to be 5 generations removed, but this would only be the case in about 1.5% percent of all cases.  Looking at this example, one can see that there is a significant chance for the MRCA to be anywhere from about 3 generations distant up to as many as 25 generations distant.  The MRCA estimate is a useful indicator of how distant a shared ancestor is likely to be, but it does not precisely determine the exact number of generations. 

The MRCA calculation is sometimes plotted based on "transmission events" rather than generations.  Transmission events are simply the number of offspring in the lineage, each one having the opportunity for a Y-DNA mutation to occur.  For example, if a grandfather has two grandsons through two different sons, then the two grandchildren are separated by 4 transmission events - one from the grandfather to his son and then another to his grandson, and the same thing again for the other branch.  So the number of transmission events is simply double the number of generations.  The sample chart below shows the same MRCA calculation as above, plotted in terms of transmission events rather than generations.
Keep in mind though that in practice, the generations will not always be split evenly between both branches of the tree, even though the MRCA calculations usually assume this to be the case.  In the illustration below, both trees contain 10 transmission events.  In the tree on the left, the generations are split evenly between both branches.  In the tree on the right, one branch contains only 4 generations while the other contains 6.  Both situations reflect a MRCA of 10 transmission events, and both have the same number of opportunities for Y-DNA mutations to occur.  MRCA estimates the number of transmission events, but without additional data cannot determine in which branch the mutations occurred.
When plotting the estimated MRCA, it is sometimes useful to plot the cumulative probability of a common ancestor for a range of generations, rather than incremental probability displayed in the plot above.  The plot below is an example of a cumulative plot.  This plot is based on the same data as the plot above.  It just presents the data in a different form. Rather than showing the isolated probability of a common ancestor for a specific number of generations, it shows the cumulative probability of a common ancestor within a certain number of generations or fewer. This example shows that there is a 50% probability of a common ancestor within about 11 generations or fewer.  The likelihood of a common ancestor within 5 generations or fewer is less than 5%.  There is over 95% probability of a common ancestor within 20 generations or fewer.  At 25 generations or more, the probability of a common ancestor approaches 100% certainty.
The 50% line is particularly meaningful, because this is the center dividing line of the plot.  Half of the time, the common ancestor will be within the 50% mark or fewer generations.  And the other half of the time, the common ancestor will be further out than the 50% divide.  Most often, the common ancestor will be near the 50% center line.  In the absence of any additional information, the 50% line provides a good guess for where to begin the investigation for a possible common ancestor.

FTDNA presents the MRCA estimate in the form of a table.  For each Y-DNA match, there is a TiP (Time Predictor) report available.  This report presents the MRCA data as a tabulated list, with the cumulative probability listed for a range of generations.  The example below shows the MRCA estimate for a GD of 5 out of 111 markers.  In this case, the 50% line is just less than 6 generations.  There is a 51% chance that the MRCA will be at 6 generations or fewer, and a 49% chance that the MRCA will be further than 6 generations.  There is a 92% chance that the MRCA is within 11 generations or fewer.  When investigating a possible common ancestor for this Y-DNA match, it would be wise to begin the search at 6 generations, and then expand the search outward in both directions from there.
FTDNA offers various levels of Y-DNA testing.  The reason for this is that the more markers there are to compare between individuals, the more precisely the distance of a common ancestor can be estimated.  The lowest level of testing, 12 markers, will give only a rough idea of whether two individuals share a common ancestor.  The highest level of testing, 111 markers, can give a much more precise estimate of the MRCA.

The example below shows the MRCA estimate based on 37 markers and 111 markers.  Both plots have the same amount of genetic distance as a percentage of markers tested, at 5.4% each.  The 111 marker comparison narrows down the likely range for a common ancestor to roughly half the range of the 37 marker test.  If a Y-DNA test indicates a common ancestor with another person, it is often useful to upgrade to a higher level of testing in order to narrow the estimated range of the common ancestor.

DNA Lineage Corruption

One reality which is often overlooked by genetic genealogy researchers is the possibility of corrupted DNA lineages - those cases when the documented lineage does not match the genetic lineage.  This occurs when the assumed father is not actually the biological father.  Causes of such events include marital infidelities, adoptions, children out of wedlock, and so forth.  These events are generally referred to as "Non Paternity Events", or "NPE's".  Studies have shown that such mismatches occur in about 4% of offspring on average.  This means that for a lineage going back many generations and hundreds of years, as surnames do, the probability of a corrupted lineage is significant.  As many as 50% or more of all paternal lineages have been affected by an NPE sometime within the past five centuries of surname history.  Often, genealogists feel an attachment to their personal lineage and are unwilling to entertain the possibility of a corrupted genetic lineage.  But any honest genealogist must consider this a very real possibility.

Because of corrupted genetic lineages, surname groups often have multiple genetic groupings, even if the surname has only a single documented progenitor.  As a result of this fragmentation, a surname group will typically have around one half to two thirds of its members grouped into a few major lineages, with the remainder being genetic loners that don't match with any major lineage.

With the currently available test data, the Curley surname appears to be following this pattern.  About two thirds of the test subjects fall into a few major groupings. The remainder have not yet found a DNA match to other Curleys.

Various Types of DNA Testing

On the FTDNA product page, you will notice an assortment of different DNA tests.  Y-DNA testing is the main focus of this surname project.  The other tests provide different kinds of genealogy related information, but are not generally as relevant to surname research.

Here's a brief explanation of the other kinds of testing available through FTDNA:

  • MtDNA - This is a test of the maternal DNA that is passed down from the mother to her children.  MtDNA is carried in the cellular mitochondria, unlike the more familiar nuclear DNA. MtDNA is passed from mother to child without experiencing any genetic recombination with the father's DNA.  It can be used to identify the deep ancestral origins of your maternal lineage.  Since the surname is inherited through the father, mtDNA testing is of no interest for researching surnames.
  • Autosomal DNA - FTDNA's autosomal DNA test is called the "Family Finder" test.  This is the same type of test that is offered by Ancestry.com and 23andMe.  The autosomal DNA test utilizes DNA segments from multiple chromosomes, not just the male Y chromosome.  Because of this, it can find matches through both the paternal and maternal lines.  The drawback is that, because the DNA is recombined from both parents in each offspring, some of the genealogy markers get jumbled up and lost with each generation of offspring.  So this test can only reliably find matches around 5 generations distant or so.  Because of the limited time range of this test, it has minimal utility for surname research.  However, if you get an autosomal test match through your paternal lineage to someone who has taken a Y-DNA test, then you can learn something of your own paternal lineage by association.
  • SNP - FTDNA's more advanced testing options include SNP (single nucleotide polymorphism) testing.  These tests check for a very specific mutation of a single location in the DNA. These tests divide the tree of humanity into finer and finer "subclades", or subdivisions, beyond what's possible with a standard test, helping determine relationships between surnames, clans, etc.  Some SNPs are associated with specific locales or identifiable groups of people.  For example, there's an SNP which is relatively unique to the province of Leinster, Ireland. If you have already taken a Y-DNA test and are interested in additional testing, you may wish to consult the project group administrator for a recommendation.  A genetic genealogy expert may be able to predict what SNP mutations you are likely to possess, based on your pattern of STRs.
  • Big Y - This is the ultimate Y-DNA test offered by FTDNA.  This test will sequence a large fraction of your entire Y chromosome, providing a wealth of raw data.  You will discover a large number of SNPs that are unique to yourself and your own family, and possibly find some that you share with a small group of your closely related branches.  Discovering SNPs shared by small groups of people can hypothetically help you understand the origins and history of your lineage.  However, the number of people who have taken this test is quite small.  So the benefit of taking this test is currently limited, as you are not likely to identify any new genetic groups beyond what is already known from STR data. Over time as people continue to take this test and the database grows, the data from this advanced test may be very useful for understanding the history and origin of your paternal lineage.
The contents of this website are the property of the author and may not be reused without permission, except where otherwise noted.
Powered by Create your own unique website with customizable templates.