tRNA lookalikes in the Human Genome
There was a time not long ago when we knew we had the longest genome. It was an obvious assumption because we are “in fact” the most intelligent and complex species on the planet. Boy were we wrong, as genetics progressed we came to realize that we weren’t as genetically special as we thought we were. We found that we had tons of “junk” DNA. Of course we continue to be wrong and we now know that we may not be the biggest genome on the planet, but we are still very complex and new research is proving that once again.
Transfer RNAs (tRNAs) are ancient workhorse molecules and part of the cellular process that creates the proteins, critical building blocks of life that keep a cell running smoothly. A new discovery suggests that the number of human genomic loci that might be coding for tRNAs is nearly double what is currently known. Most of the newly identified loci resemble the sequences of mitochondrial tRNAs suggesting unexpected new links between the human nuclear and mitochondrial genomes, links that are not currently understood.
Now for the average layperson that this blog is directed towards this subject can get quite complex, but hopefully we can break it down a little bit. Transfer RNAs (tRNAs) represent an integral component of the translation of a messenger RNA (mRNA) into an amino acid sequence. TRNAs are non-coding RNA molecules –meaning they aren’t translated to create a protein — and can be found in all three kingdoms of life i.e., in archaea, bacteria and eukaryotes.
At the DNA level, a triplet of consecutive nucleotides known as the “codon” (which serve different functions) is used to encode an amino acid. Frequently, a given amino acid can be encoded by more than one codon: in fact, there are 61 distinct codons encoding the 20 standard human amino acids.
During translation, each of the codons contained in the coding region of the mRNA at hand is recognized by its matching tRNA and the corresponding amino acid added to the nascent amino acid sequence. It has been known for many years that each of these 61 tRNAs has multiple copies spread throughout the genome that is found in the human nucleus.
The presence of multiple genomic loci from which the same molecule can be made is a fairly standard trick of genomic organization: processing these loci in parallel can ensure that adequate amounts of each tRNA can be generated quickly enough to meet the high demand that the amino acid translation process imposes on the cell.
In addition to the 61 tRNAs that are found in the human nuclear genome, 22 more tRNAs are encoded in the genome of the cellular organelle known as the mitochondrion: typically they are referred to as the “powerhouse” of the cell, but the mitochondrion — originally a bacterium itself — uses these 22 tRNAs to make proteins out of the just-over-a-dozen mRNAs that are encoded in its genome.
Recent research efforts have shown that tRNAs can have other roles, which go beyond their involvement in protein synthesis. For example, tRNAs can affect the physiology of a cell, they can modulate the abundance of important molecules, etc. These and other unexpected findings have revived interest in looking at tRNAs, this time under a different prism. But, how many tRNAs are actually encoded by the human genome and could be potentially involved in amino acid translation and other processes?
A team of scientists set out to tackle this question and the study was interesting to say the least.
“What we found, frankly, surprised us,” said Isidore Rigoutsos, head researcher.
The team searched the 3 billion base pairs of the human genome for DNA sequences that resembled the 530 known nuclear and mitochondrial tRNAs. Even though they used very stringent criteria in their searches, they found 454 “lookalike” loci, i.e., sequences that look like tRNA, but haven’t yet been experimentally confirmed as such.
The researchers found nearly as many as the known ones with which they started: 81% of these tRNA-lookalikes had not been reported previously. Rather unexpectedly, the team found that most of these new loci resembled some of the 22 mitochondrial tRNAs.
Interestingly, the discovered tRNA lookalikes are not spread uniformly across the 24 chromosomes. Instead, they have penetrated preferentially some chromosomes and have avoided others. For example, chromosomes 1, 2, 7, 8 and 9 claim the lion’s share of the discovered tRNA-lookalikes. On the other hand, chromosome 18 contains no lookalikes. Also, some of the codons are particularly over-represented among the lookalikes whereas other codons are absent.
The surprises did not stop there. The team also discovered that in the chromosomes where the tRNA-lookalikes are found their locations are not accidental either. Instead, the lookalikes are positioned in close proximity to known nuclear tRNAs.
This of course begs the question whether the tRNA-lookalikes are transcribed, just like the known tRNAs. By examining public repositories, the team found evidence of transcription for more than 20% of the discovered tRNA-lookalikes: the transcriptional profiles appear to depend on cell type, which suggests that more of the look-alikes will be found to be transcribed as data from more cell types become available.
On several occasions, the public data revealed evidence for molecules whose endpoints matched exactly the endpoints of the tRNA-lookalikes discovered by the team.
“This is certainly exciting, but it is currently unclear whether these molecules participate in translation as tRNAs, or have entirely different roles,” said Rigoutsos.
I could probably make a joke here about it’s not the size that counts, it’s how you use it. After all the marbled lungfish has the largest genome for any vertebrate we know about right now. There is even a freshwater Amoeba with an even larger genome than the lungfish, possibly the largest on the planet. But it would be rude to do that, because the marbled lungfish and amoeba don’t blog (much).
Telonis, A., Loher, P., Kirino, Y., & Rigoutsos, I. (2014). Nuclear and mitochondrial tRNA-lookalikes in the human genome Frontiers in Genetics, 5 DOI: 10.3389/fgene.2014.00344