In genetics, the phenotype (from grc φαίνω (phaínō) 'to appear, show', and τύπος (túpos) 'mark, type') is the set of observable characteristics or traits of an organism.[1][2] The term covers the organism's morphology (physical form and structure), its developmental processes, its biochemical and physiological properties, its behavior, and the products of behavior. An organism's phenotype results from two basic factors: the expression of an organism's genetic code (its genotype) and the influence of environmental factors. Both factors may interact, further affecting the phenotype. When two or more clearly different phenotypes exist in the same population of a species, the species is called polymorphic. A well-documented example of polymorphism is Labrador Retriever coloring; while the coat color depends on many genes, it is clearly seen in the environment as yellow, black, and brown. Richard Dawkins in 1978[3] and then again in his 1982 book The Extended Phenotype suggested that one can regard bird nests and other built structures such as caddisfly larva cases and beaver dams as "extended phenotypes".
Wilhelm Johannsen proposed the genotype–phenotype distinction in 1911 to make clear the difference between an organism's hereditary material and what that hereditary material produces.[4][5] The distinction resembles that proposed by August Weismann (1834–1914), who distinguished between germ plasm (heredity) and somatic cells (the body). More recently, in The Selfish Gene (1976), Dawkins distinguished these concepts as replicators and vehicles.
Despite its seemingly straightforward definition, the concept of the phenotype has hidden subtleties. It may seem that anything dependent on the genotype is a phenotype, including molecules such as RNA and proteins. Most molecules and structures coded by the genetic material are not visible in the appearance of an organism, yet they are observable (for example by Western blotting) and are thus part of the phenotype; human blood groups are an example. It may seem that this goes beyond the original intentions of the concept with its focus on the (living) organism in itself. Either way, the term phenotype includes inherent traits or characteristics that are observable or traits that can be made visible by some technical procedure.
The term "phenotype" has sometimes been incorrectly used as a shorthand for the phenotypic difference between a mutant and its wild type, which would lead to the false statement that a "mutation has no phenotype".[6]
Behaviors and their consequences are also phenotypes, since behaviors are observable characteristics. Behavioral phenotypes include cognitive, personality, and behavioral patterns. Some behavioral phenotypes may characterize psychiatric disorders[7] or syndromes.[8][9]
A phenome is the set of all traits expressed by a cell, tissue, organ, organism, or species. The term was first used by Davis in 1949, "We here propose the name phenome for the sum total of extragenic, non-autoreproductive portions of the cell, whether cytoplasmic or nuclear. The phenome would be the material basis of the phenotype, just as the genome is the material basis of the genotype."[10]
Although phenome has been in use for many years, the distinction between the use of phenome and phenotype is problematic. A proposed definition for both terms as the "physical totality of all traits of an organism or of one of its subsystems" was put forth by Mahner and Kary in 1997, who argue that although scientists tend to intuitively use these and related terms in a manner that does not impede research, the terms are not well defined and usage of the terms is not consistent.[11]
Some usages of the term suggest that the phenome of a given organism is best understood as a kind of matrix of data representing physical manifestation of phenotype. For example, discussions led by A. Varki among those who had used the term up to 2003 suggested the following definition: "The body of information describing an organism's phenotypes, under the influences of genetic and environmental factors".[12] Another team of researchers characterize "the human phenome [as] a multidimensional search space with several neurobiological levels, spanning the proteome, cellular systems (e.g., signaling pathways), neural systems and cognitive and behavioural phenotypes."[13]
Plant biologists have started to explore the phenome in the study of plant physiology.[14]
In 2009, a research team demonstrated the feasibility of identifying genotype–phenotype associations using electronic health records (EHRs) linked to DNA biobanks. They called this method phenome-wide association study (PheWAS).[15]
Inspired by the evolution from genotype to genome to pan-genome, a concept of exploring the relationship ultimately among pan-phenome, pan-genome, and pan-envirome was proposed in 2023.[16]
Phenotypic variation (due to underlying heritable genetic variation) is a fundamental prerequisite for evolution by natural selection. It is the living organism as a whole that contributes (or not) to the next generation, so natural selection affects the genetic structure of a population indirectly via the contribution of phenotypes. Without phenotypic variation, there would be no evolution by natural selection.[17]
The interaction between genotype and phenotype has often been conceptualized by the following relationship:
A more nuanced version of the relationship is:
Genotypes often have much flexibility in the modification and expression of phenotypes; in many organisms these phenotypes are very different under varying environmental conditions. The plant Hieracium umbellatum is found growing in two different habitats in Sweden. One habitat is rocky, sea-side cliffs, where the plants are bushy with broad leaves and expanded inflorescences; the other is among sand dunes where the plants grow prostrate with narrow leaves and compact inflorescences. These habitats alternate along the coast of Sweden and the habitat that the seeds of Hieracium umbellatum land in, determine the phenotype that grows.[18]
An example of random variation in Drosophila flies is the number of ommatidia, which may vary (randomly) between left and right eyes in a single individual as much as they do between different genotypes overall, or between clones raised in different environments.[citation needed]
The concept of phenotype can be extended to variations below the level of the gene that affect an organism's fitness. For example, silent mutations that do not change the corresponding amino acid sequence of a gene may change the frequency of guanine-cytosine base pairs (GC content). These base pairs have a higher thermal stability (melting point) than adenine-thymine, a property that might convey, among organisms living in high-temperature environments, a selective advantage on variants enriched in GC content.[citation needed]
Richard Dawkins described a phenotype that included all effects that a gene has on its surroundings, including other organisms, as an extended phenotype, arguing that "An animal's behavior tends to maximize the survival of the genes 'for' that behavior, whether or not those genes happen to be in the body of the particular animal performing it."[3] For instance, an organism such as a beaver modifies its environment by building a beaver dam; this can be considered an expression of its genes, just as its incisor teeth are—which it uses to modify its environment. Similarly, when a bird feeds a brood parasite such as a cuckoo, it is unwittingly extending its phenotype; and when genes in an orchid affect orchid bee behavior to increase pollination, or when genes in a peacock affect the copulatory decisions of peahens, again, the phenotype is being extended. Genes are, in Dawkins's view, selected by their phenotypic effects.[19]
Other biologists broadly agree that the extended phenotype concept is relevant, but consider that its role is largely explanatory, rather than assisting in the design of experimental tests.[20]
Phenotypes are determined by an interaction of genes and the environment, but the mechanism for each gene and phenotype is different. For instance, an albino phenotype may be caused by a mutation in the gene encoding tyrosinase which is a key enzyme in melanin formation. However, exposure to UV radiation can increase melanin production, hence the environment plays a role in this phenotype as well. For most complex phenotypes the precise genetic mechanism remains unknown. For instance, it is largely unclear how genes determine the shape of bones or the human ear.Lua error: Internal error: The interpreter exited with status 1.
Gene expression plays a crucial role in determining the phenotypes of organisms. The level of gene expression can affect the phenotype of an organism. For example, if a gene that codes for a particular enzyme is expressed at high levels, the organism may produce more of that enzyme and exhibit a particular trait as a result. On the other hand, if the gene is expressed at low levels, the organism may produce less of the enzyme and exhibit a different trait.[22]
Gene expression is regulated at various levels and thus each level can affect certain phenotypes, including transcriptional and post-transcriptional regulation.
Changes in the levels of gene expression can be influenced by a variety of factors, such as environmental conditions, genetic variations, and epigenetic modifications. These modifications can be influenced by environmental factors such as diet, stress, and exposure to toxins, and can have a significant impact on an individual's phenotype. Some phenotypes may be the result of changes in gene expression due to these factors, rather than changes in genotype. An experiment involving machine learning methods utilizing gene expression measured from RNA sequencing can contain enough signal to separate individuals in the context of phenotype prediction.[23]
Although a phenotype is the ensemble of observable characteristics displayed by an organism, the word phenome is sometimes used to refer to a collection of traits, while the simultaneous study of such a collection is referred to as phenomics.[24][25] Phenomics is an important field of study because it can be used to figure out which genomic variants affect phenotypes which then can be used to explain things like health, disease, and evolutionary fitness.[26] Phenomics forms a large part of the Human Genome Project.[27]
Phenomics has applications in agriculture. For instance, genomic variations such as drought and heat resistance can be identified through phenomics to create more durable GMOs.[28][29]
Phenomics may be a stepping stone towards personalized medicine, particularly drug therapy.[30] Once the phenomic database has acquired more data, a person's phenomic information can be used to select specific drugs tailored to an individual.[30]
Large-scale genetic screens can identify the genes or mutations that affect the phenotype of an organism. Analyzing the phenotypes of mutant genes can also aid in determining gene function.[31] Most genetic screens have used microorganisms, in which genes can be easily deleted. For instance, nearly all genes have been deleted in E. coli[32] and many other bacteria, but also in several eukaryotic model organisms such as baker's yeast[33] and fission yeast.[34] Among other discoveries, such studies have revealed lists of essential genes .
More recently, large-scale phenotypic screens have also been used in animals, e.g. to study lesser understood phenotypes such as behavior. In one screen, the role of mutations in mice were studied in areas such as learning and memory, circadian rhythmicity, vision, responses to stress and response to psychostimulants.
Phenotypic Domain | Assay | Notes | Software Package |
---|---|---|---|
Circadian Rhythm | Wheel running behavior | ClockLab | |
Learning and Memory | Fear conditioning | Video-image-based scoring of freezing | FreezeFrame |
Preliminary Assessment | Open field activity and elevated plus maze | Video-image-based scoring of exploration | LimeLight |
Psychostimulant response | Hyperlocomotion behavior | Video-image-based tracking of locomotion | BigBrother |
Vision | Electroretinogram and Fundus photography | L. Pinto and colleagues |
This experiment involved the progeny of mice treated with ENU, or N-ethyl-N-nitrosourea, which is a potent mutagen that causes point mutations. The mice were phenotypically screened for alterations in the different behavioral domains in order to find the number of putative mutants (see table for details). Putative mutants are then tested for heritability in order to help determine the inheritance pattern as well as map out the mutations. Once they have been mapped out, cloned, and identified, it can be determined whether a mutation represents a new gene or not.
Phenotypic domain | ENU Progeny screened | Putative mutants | Putative mutant lines with progeny | Confirmed mutants |
---|---|---|---|---|
General assessment | 29860 | 80 | 38 | 14 |
Learning and memory | 23123 | 165 | 106 | 19 |
Psychostimulant response | 20997 | 168 | 86 | 9 |
Neuroendocrine response to stress | 13118 | 126 | 54 | 2 |
Vision | 15582 | 108 | 60 | 6 |
These experiments showed that mutations in the rhodopsin gene affected vision and can even cause retinal degeneration in mice.[35] The same amino acid change causes human familial blindness, showing how phenotyping in animals can inform medical diagnostics and possibly therapy.
The RNA world is the hypothesized pre-cellular stage in the evolutionary history of life on earth, in which self-replicating RNA molecules proliferated prior to the evolution of DNA and proteins.[36] The folded three-dimensional physical structure of the first RNA molecule that possessed ribozyme activity promoting replication while avoiding destruction would have been the first phenotype, and the nucleotide sequence of the first self-replicating RNA molecule would have been the original genotype.[36]
Lua error: Internal error: The interpreter exited with status 1.
Lua error: Internal error: The interpreter exited with status 1.
Lua error: Internal error: The interpreter exited with status 1.
Lua error: Internal error: The interpreter exited with status 1.