SON protein is a protein that in humans is encoded by the SONgene.[1][2]
SON is the name that has been given to a large Ser/Arg (SR)-related protein, which is a splicing co-factor that contributes to an efficient splicing within cell cycle progression.[3] It is also known as BASS1 (Bax antagonist selected in saccharomyces 1) or NRE-binding protein (Negative regulatory element-binding protein). The most common gene name of this splicing protein- which is only found in Humans (Homo sapiens)- is SON, but C21orf50, DBP5, KIAA1019 and NREBP can also be used as synonyms.[4]
The protein encoded by SON gene binds to a specific DNA sequence upstream of the upstream regulatory sequence of the core promoter and second enhancer of human hepatitis B virus (HBV). Through this binding, it represses HBV core promoter activity, transcription of HBV genes, and production of HBV virions. The protein shows sequence similarities with other DNA-binding structural proteins such as gallin, oncoproteins of the MYC family, and the oncoprotein MOS. It may also be involved in protecting cells from apoptosis and in pre-mRNA splicing.[2] Mutation in SON gene is associated with ZTTK syndrome.[5]
The sequence length of the SON protein consists in 2426 aminoacids and its sequence status is totally completed. Its molecular weight is 263,830 Daltons (Da) and its domain contains 8 types of repeats which are distributed in 3 regions. This protein is found in the 21st chromosome and is mostly located in nuclear speckles. Its higher expression is seen in leukocyte and heart cells.[4][6]
SON protein is essential for maintaining the subnuclear organization of the factors that are processed in the nucleus highlighting its direct role in pre-mRNA splicing.[7][page needed]
Splicing is known as the process within the maturation of the pre-RNAm takes place. The pre-RNAm which has just been transcript has sequences called introns and exons. Introns are non-active nucleotide sequences that have to be removed in order the exons (active sequences) to get joined. This process must be very controlled. The splicing takes place in the spliceosome, a complex that brings together a pre-RNAm and a variety of the binding proteins. These proteins together with the splicing factors (which are not found in the spliceosome) are in charge of recognizing the intron’s branch point sequence. The SON protein is known to be one of these binding proteins.[7][page needed]
Although there is a lack of knowledge about its exact splicing control in the progression of the cell cycle and it has remained largely unexplored, it’s certain that this splicing-associated protein is necessary for the maintenance of the embryonic stem cells because it influences the splicing of pluripotency regulators.[3][8]
SON plays an important role in the mRNA processing. Nevertheless, this process is still a little uncertain and this is why in a future it will be interesting to understand how exactly this protein interacts with the spliceosomal complex, its exact molecular function in the context of splicing. Not only the SON protein interferes in the splicing but also makes complex mechanisms such as the RNA post-transcriptional to cooperate with the splicing-mRNA processing.[9]
Human embryonic stem cells are able to undergo the process of differentiation into specific and relevant cells. To maintain the pluripotency of the embryonic stem cells, transcription factors and epigenetic modifiers play an important role despite the fact that little is known about the regulation of pluripotency throughout the process of splicing. The factor SON is identified as essential for the maintenance of this pluripotency. It is confirmed that SON regulates the splicing process of transcripts (RNAm) that will encode the gens that are going to regulate the pluripotency of the embryonic human cells.[10]
On the one hand, SON protein is required to maintain the genome stability in order to ensure an efficient RNA processing of affected genes. It also facilitates the interaction of SR proteins with RNA polymerase II and is required for processing of weak constitutive splice sites, having also strong implications in cancer and other human diseases.[3][6]
On the other side, a deficiency or knockdown of SON protein causes various and severe defects in mitotic division arrangement, chromosome alignment and microtubule dynamics when spindle pole separation takes place.[3]
But as we could read in the article called “SON protein regulates GATA-2 through transcriptional control of the microRNA 23a-27-24-a clúster”, SON protein has even more functions in the organism. It has been found that these proteins may regulate the hematopoietic cells differentiation. They have a specific job in hematopoietic process, which is based on activating other proteins called GATA. As these ones are finally activated, the cell differentiation starts normally.[11]
A recent study suggested that SON may be a novel therapeutic molecular target for pancreatic cancer as the results of a recent study show that this protein is very important as far as proliferation, survival and tumorigenicity of cancer cells are concerned. Specifically, these results revealed that the serine-arginine-rich protein involved in the RNA splicing process, could suppress pancreatic cell tumorigenicity.[9]
Mattioni T, Hume CR, Konigorski S, et al. (1992). "A cDNA clone for a novel nuclear protein with DNA binding activity". Chromosoma. 101 (10): 618–24. doi:10.1007/BF00360539. PMID1424986.
Bliskovskiĭ VV, Berdichevskiĭ FB, Tkachenko AV, et al. (1992). "[Coding part of the son gene small transcript contains four areas of complete tandem repeats]". Mol. Biol. (Mosk.). 26 (4): 793–806. PMID1435773.
Bliskovskiĭ VV, Kirillov AV, Zakhar'ev VM, Chumankov IM (1992). "[The human son gene: the large and small transcripts contains various 5'-terminal sequences]". Mol. Biol. (Mosk.). 26 (4): 807–12. PMID1435774.
Chumakov IM, Berdichevskiĭ FB, Sokolova NV, et al. (1991). "[Identification of a protein product of a novel human gene SON and the biological effect upon administering a changed form of this gene into mammalian cells]". Mol. Biol. (Mosk.). 25 (3): 731–9. PMID1944255.
Berdichevskiĭ FB, Chumakov IM, Kiselev LL (1988). "[Decoding of the primary structure of the son3 region in human genome: identification of a new protein with unusual structure and homology with DNA-binding proteins]". Mol. Biol. (Mosk.). 22 (3): 794–801. PMID3054499.
Khan IM, Fisher RA, Johnson KJ, et al. (1994). "The SON gene encodes a conserved DNA binding protein mapping to human chromosome 21". Ann. Hum. Genet. 58 (Pt 1): 25–34. doi:10.1111/j.1469-1809.1994.tb00723.x. PMID8031013.
Kikuno R, Nagase T, Ishikawa K, et al. (1999). "Prediction of the coding sequences of unidentified human genes. XIV. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro". DNA Res. 6 (3): 197–205. doi:10.1093/dnares/6.3.197. PMID10470851.
Hattori M, Fujiyama A, Taylor TD, et al. (2000). "The DNA sequence of human chromosome 21". Nature. 405 (6784): 311–9. doi:10.1038/35012518. PMID10830953.
Wynn SL, Fisher RA, Pagel C, et al. (2001). "Organization and conservation of the GART/SON/DONSON locus in mouse and human genomes". Genomics. 68 (1): 57–62. doi:10.1006/geno.2000.6254. PMID10950926.
Sun CT, Lo WY, Wang IH, et al. (2001). "Transcription repression of human hepatitis B virus genes by negative regulatory element-binding protein/SON". J. Biol. Chem. 276 (26): 24059–67. doi:10.1074/jbc.M101330200. PMID11306577.
Reymond A, Friedli M, Henrichsen CN, et al. (2002). "From PREDs and open reading frames to cDNA isolation: revisiting the human chromosome 21 transcription map". Genomics. 78 (1–2): 46–54. doi:10.1006/geno.2001.6640. PMID11707072.
Yi J, Kloeker S, Jensen CC, et al. (2002). "Members of the Zyxin family of LIM proteins interact with members of the p130Cas family of signal transducers". J. Biol. Chem. 277 (11): 9580–9. doi:10.1074/jbc.M106922200. PMID11782456.
Casadei R, Strippoli P, D'Addabbo P, et al. (2004). "mRNA 5' region sequence incompleteness: a potential source of systematic errors in translation initiation codon assignment in human mRNAs". Gene. 321: 185–93. doi:10.1016/S0378-1119(03)00835-7. PMID14637006.
Ota T, Suzuki Y, Nishikawa T, et al. (2004). "Complete sequencing and characterization of 21,243 full-length human cDNAs". Nat. Genet. 36 (1): 40–5. doi:10.1038/ng1285. PMID14702039.