Today in Kingston, ON, a man was arrested in connection with a kidnapping and sexual assault that took place in Calgary, AB, 20 years ago. The police were able to connect the suspect to the case by comparing his DNA to samples from the Calgary cold case.
I was asked to talk with the CTV reporter covering the story to explain DNA fingerprinting. RCMP forensics scientists-the real experts who made this type of arrest possible, aren’t available on Sundays. But if you happen to be a scientist and the sister of said reporter, you tend to be more available on Sundays.
One of the big questions from today’s interview was “how does this technology work?” Now, in the interview, my answer was pretty quick, because there wasn’t too much time to really explore the details-the story wasn’t about the science, or at least not ALL about the science. So here is a chance to go into a bit more detail.
The first thing to understand about DNA fingerprinting (the preferred term is actually genetic profiling) is that all of a person’s DNA (called the “whole genome“) is NOT sequenced. Only pieces of the DNA are sequenced. Your DNA contains TERABYTES of data. (The MacBook pro that I am writing this on has only 500 gigabytes of storage-it wouldn’t even hold the information that your DNA holds.) It would take a very long time to sift through the data from a whole genome, especially since 99.9% of our DNA is the same as everyone else. And thanks to evolution, we have a whole lot of junk DNA that we just kept with us as we climbed the evolutionary ladder.
What scientists use instead to build a DNA “fingerprint” are genetic markers called short tandem repeats (STR). These are areas of your DNA that present in every human, but are highly variable (polymorphic), meaning that they differ from person to person. There are typically 13 STR loci that forensic scientists use to create a genetic profile.
1) DNA sample is collected: could be blood, hair, saliva
2) The DNA is then broken up into smaller pieces, using an enzyme that cleaves DNA at specific locations
3) The DNA markers are amplified by a technique called PCR (Polymerase Chain Reaction). This means that the original DNA sample can be quite small-maybe only 20 DNA containing cells.
4) The DNA pieces are then run through a technique called gel electrophoresis, where a high voltage current is applied to a gel that contains the DNA fragments. The fragments separate out based on size, with the smaller fragments travelling faster. The result is the band-like structure seen in the picture at the top.
The bands on a gel from the unknown sample will be compared to suspects (in the case of criminal DNA testing). If the unknown DNA is a match for a suspect, the bands on the unknown sample will exactly match. Take a look at the samples in the picture at the top. Can you identify which suspect is a match for the sample from the crime scene? (Answer at the bottom.)
Each of these STRs are independent, meaning that a particular sequence of one does not influence the other. In probability terms this means that each of these is an independent event. The result is a one in several trillion chance of two sequences from two individuals being identical. The notable exception being identical twins, who by definition have the same DNA.
What has changed since 1995?
Well, the techniques are better, we can use smaller samples of DNA. We can even now put together samples from degraded ancient DNA. We can’t quite use those samples to clone a velociraptor (a la Jurassic Park); however, we can use the sample to identify remains of those long dead. Analysis of mitochondrial DNA was how the remains of Richard III were unequivocally identified in 2013.
Better, faster, more sensitive techniques allow for identification that may not have been possible in 1995. Further, PCR was developed in 1991, meaning that 20 years ago, it was still relatively new. Today we are much more comfortable with DNA analysis, as is the legal system.
Check out the story on CTV:
*The unknown DNA sample in the top image is a match for suspect number 2