In partial fulfillment of the requirements for the degree of
Doctor of Philosophy in Biology
in the
School of Biological Sciences
Ling Wang
Will defend her dissertation
Evolution of Essential and Ancient Genes: Understanding and Modeling Fitness in Transfer RNAs
10, December 2024
1PM
https://gatech.zoom.us/j/93126822349?pwd=9qbAlkbjXpBEHG2vk7Zo9nuVRG9qcA.1
Thesis Advisor:
Dr. Annalise Paaby, Ph.D.
School of Biological Sciences
Georgia Institute of Technology
Committee Members:
Dr. Francesca Storici, Ph.D.
School of Biological Sciences
Georgia Institute of Technology
Dr. Joseph Lachance, Ph.D.
School of Biological Science
Georgia Institute of Technology
Dr. Matthew Torres, Ph.D.
School of Biological Sciences
Georgia Institute of Technology
Dr. Loren Williams, Ph.D.
School of Chemistry & Biochemistry
Georgia Institute of Technology
Abstract: A central question in evolutionary biology is how genes evolve over time and how mutations to those genes affect fitness. How a mutation affects fitness may depend in part upon mutations at other sites. For example, the negative impact of a bad mutation may be compensated by other mutations that restore or enhance function. Understanding this dynamic is crucial to elucidating not just the evolution of genes, but the evolution of complex traits, including interactions between genes.
This thesis is structured into three related projects that examine the following fundamental questions: 1. How do genes evolve, and what does the compensatory evolution pattern of single-copy genes look like? Specifically, I explored mutations in mitochondrial tRNAs (mt-tRNAs) in Caenorhabditis nematodes to identify patterns of compensatory evolution over short and intermediate evolutionary timescales, including interactions between mt-tRNAs and associated factors encoded in the nuclear genome. 2. To what extent can computational techniques enhance our understanding of the impact of mutations, particularly on overall fitness? In this project, I utilized experimentally derived fitness estimates for thousands of S. cerevisiae allelic variants of a nuclear-encoded arginine tRNA (tRNACCU). I assessed how well computational inferences from sequence information, such as secondary structure prediction and the minimum free energy folding score, explained fitness variation. 3. Can machine learning improve upon computational fitness prediction and address the limitations of existing prediction tools? In this project, I developed an inclusive machine learning model that integrates multiple features to estimate fitness from sequence data. Together, these studies aim to provide a more comprehensive understanding of the dynamics of tRNA evolution by combining advanced computational techniques with concepts of evolutionary biology.