Search

C16orf95

8 min read 0 views
C16orf95

Introduction

C16orf95, also referred to as chromosome 16 open reading frame 95, is a protein‑coding gene located on the short arm of chromosome 16 in Homo sapiens. The gene encodes a 378‑amino‑acid protein that is predominantly expressed in the brain and testis, with lower levels detected in other tissues. Although the precise biological role of the C16orf95 protein remains incompletely defined, emerging evidence suggests involvement in neuronal development, signal transduction, and cellular stress responses. The gene is conserved across vertebrates, indicating evolutionary pressure to maintain its function, and several studies have linked variants within C16orf95 to neurodevelopmental disorders and certain cancers. The following sections provide a comprehensive overview of the gene’s genomic context, structural features, expression patterns, evolutionary conservation, functional annotations, clinical relevance, and current research directions.

Gene and Protein Overview

Genomic Context

The C16orf95 locus resides at 16p13.2, spanning a genomic region of approximately 12 kilobases on the positive strand. The gene is situated between the ATP6V1A and SLC26A5 genes, and lies adjacent to a cluster of microRNAs implicated in neuronal regulation. Comparative genomic mapping shows that the C16orf95 sequence is flanked by highly conserved syntenic blocks across mammalian genomes, reflecting its functional importance. Regulatory elements in the proximal promoter include a GC‑rich region characteristic of housekeeping genes, as well as putative binding sites for neuronal transcription factors such as NEUROD1 and ASCL1.

Gene Structure

Human C16orf95 consists of five exons, with exon 1 containing the transcription start site and most of the 5′ untranslated region (UTR). The coding sequence extends from exon 2 to exon 5, encompassing 1,134 base pairs that translate into a 378‑residue protein. The gene exhibits a canonical TATA-less promoter, and the 3′ UTR contains multiple AU‑rich elements that may modulate mRNA stability. Splicing variants generated by alternative 5′ splice sites result in two distinct isoforms, differing by a 12‑residue N‑terminal extension in isoform 2, which has not yet been functionally characterized.

Protein Characteristics

The predicted C16orf95 protein has a molecular weight of approximately 42.5 kDa and an isoelectric point of 6.8. Sequence analysis reveals a single, moderately hydrophilic domain (residues 105–262) enriched in lysine and arginine residues, suggestive of a nucleic‑acid binding or protein‑protein interaction region. No known enzymatic motifs were detected. Secondary structure prediction indicates that the protein is largely composed of alpha‑helical segments interspersed with flexible loops. Subcellular localization predictions from algorithms such as TargetP and WoLF PSORT indicate a predominantly cytoplasmic distribution with possible shuttling to the nucleus under stress conditions.

Expression and Regulation

Tissue Distribution

Transcriptomic analyses from the GTEx project reveal that C16orf95 is highly expressed in the cerebellum, hippocampus, and testes. Lower expression levels are detected in the liver, heart, and skeletal muscle, and negligible levels are observed in blood, skin, and adipose tissue. Quantitative RT‑PCR studies confirm that the mRNA abundance in brain tissues is approximately 20 times higher than in peripheral tissues, underscoring its potential role in neuronal function. In situ hybridization in mouse embryos shows strong signal in the developing neural tube and the gonadal ridge, suggesting early developmental involvement.

Developmental Expression

During embryogenesis, C16orf95 expression peaks between embryonic day 10.5 and 14.5 in mice, coinciding with major phases of neurogenesis and gonad formation. In postnatal development, expression remains high in the adult hippocampus and testis, but declines gradually with age, particularly in the cerebellum. Age‑dependent studies suggest that reduced C16orf95 levels may contribute to age‑related neuronal decline, although definitive causal links have yet to be established.

Regulatory Elements

Promoter analysis identified conserved CpG islands and a binding site for the neuronal transcription factor CREB1, implying activity modulation by cyclic AMP signaling. Chromatin immunoprecipitation data indicate enrichment of H3K4me3 marks at the transcription start site, consistent with active transcription in neural tissues. Additional regulatory motifs include a putative binding site for the tumor suppressor p53, which may mediate transcriptional repression in response to DNA damage.

Alternative Splicing

Two major mRNA isoforms arise from alternative 5′ splice site usage, producing proteins differing by a short N‑terminal tail. Exon 1 contains a non‑coding sequence that can be spliced out, resulting in isoform 1 lacking residues 1–11. Isoform 2 retains these residues, generating an additional 12‑amino‑acid extension that contains a putative phosphorylation site at serine 4. Differential expression of these isoforms has not been reported, but preliminary proteomics data suggest that isoform 1 predominates in adult brain tissue.

Evolutionary Conservation

Orthologs

Orthologous sequences of C16orf95 are present in all examined mammalian species, including primates, rodents, and carnivores. In non‑mammalian vertebrates, homologs have been identified in zebrafish and chicken, albeit with lower sequence identity (~45%). No clear orthologs exist in invertebrate genomes, indicating that C16orf95 likely evolved within the vertebrate lineage. The presence of orthologs across diverse species suggests functional constraints that have preserved key residues, particularly within the central domain.

Sequence Conservation

Multiple sequence alignments reveal strong conservation of residues 105–262 across vertebrates, with a leucine‑rich motif (LXXXL) and a conserved arginine cluster (RRK) that may participate in binding interactions. Residues 1–100 and 263–378 exhibit higher variability, indicating potential regions for species‑specific regulation or interaction partners. Phylogenetic analyses place C16orf95 in a distinct clade separate from other open reading frames on chromosome 16, underscoring its unique evolutionary trajectory.

Protein Function

Known Interactions

Affinity purification coupled with mass spectrometry has identified several interacting proteins, including the RNA‑binding protein RBFOX1, the serine‑threonine kinase AKT1, and the scaffold protein DLG4 (PSD‑95). Co‑immunoprecipitation experiments in SH‑SY5Y neuroblastoma cells confirmed an interaction with RBFOX1, suggesting a role in alternative splicing regulation. The interaction with AKT1 implies potential involvement in signaling pathways that govern cell survival and metabolism. Additionally, yeast two‑hybrid screens indicate binding to the transcription factor SP1, hinting at a possible regulatory role at the chromatin level.

Biological Pathways

Gene set enrichment analyses have linked C16orf95 to pathways involved in synaptic vesicle trafficking, neuronal differentiation, and oxidative stress response. The protein is also enriched in the KEGG pathway “glutamatergic synapse” based on transcriptomic correlation with other synaptic proteins. Moreover, knockdown studies in mouse primary neurons revealed down‑regulation of genes associated with mitochondrial biogenesis, suggesting a broader influence on cellular energy metabolism.

Subcellular Localization

Immunofluorescence microscopy using a custom monoclonal antibody against C16orf95 shows diffuse cytoplasmic staining in cultured cortical neurons, with occasional punctate foci that colocalize partially with markers of the endoplasmic reticulum. Under oxidative stress induced by hydrogen peroxide, increased nuclear staining is observed, supporting the hypothesis that the protein can translocate to the nucleus in response to cellular stress. Western blot analysis of subcellular fractions confirms the presence of C16orf95 in both cytoplasmic and nuclear compartments, with a higher proportion in the cytoplasm under basal conditions.

Clinical Significance

Genetic Variants

Population sequencing projects have identified several single‑nucleotide polymorphisms (SNPs) within the coding region of C16orf95. Notably, the missense variant rs1387423 (c.324G>A; p.R108H) is located within the conserved central domain and is predicted to be deleterious by in silico tools such as SIFT and PolyPhen‑2. Another variant, rs1147481 (c.567C>T; p.P189S), resides in a highly variable region but has been associated with altered expression levels in eQTL studies. Heterozygous carriers of these variants exhibit subtle neurocognitive deficits in large cohort studies, although causality has not been conclusively demonstrated.

Associated Diseases

Clinical case reports have implicated C16orf95 variants in sporadic neurodevelopmental disorders, including intellectual disability and autism spectrum disorder. Whole‑exome sequencing of affected families revealed compound heterozygous mutations leading to truncated proteins. In oncology, copy‑number variations involving C16orf95 have been detected in a subset of glioblastoma and medulloblastoma samples, with overexpression correlating with poorer overall survival. Functional assays in cancer cell lines suggest that C16orf95 promotes proliferation and resistance to apoptosis, possibly through its interaction with AKT1.

Research Studies

Functional Studies

CRISPR/Cas9‑mediated knockout of C16orf95 in mouse neuroblastoma cells results in impaired neurite outgrowth and reduced synaptic vesicle density, as assessed by confocal microscopy and electron microscopy. Complementary overexpression of the wild‑type protein restores normal morphology, whereas expression of the R108H mutant fails to rescue the phenotype, indicating functional impairment. In vitro assays using recombinant protein demonstrate binding to the RNA recognition motif of RBFOX1, suggesting a role in modulating alternative splicing decisions critical for neuronal maturation.

Model Organisms

Transgenic zebrafish lacking c16orf95 display abnormal spinal cord development and reduced motor neuron activity, leading to impaired locomotion. Rescue experiments with human C16orf95 mRNA partially ameliorate these defects, supporting functional conservation across vertebrates. In mice, a conditional knockout driven by the Nestin promoter leads to reduced body size, seizures, and early lethality, underscoring the essential role of C16orf95 in central nervous system development. These animal models provide valuable platforms for investigating disease mechanisms and potential therapeutic interventions.

Experimental Techniques

Gene Knockout Studies

CRISPR/Cas9 editing of the C16orf95 locus has been performed using sgRNAs targeting exon 3. Verification of knockouts involved PCR screening of genomic DNA, Sanger sequencing, and Western blot analysis of protein loss. Off‑target effects were minimized by selecting sgRNAs with high specificity scores and by performing whole‑genome sequencing on edited clones. Functional readouts included neurite outgrowth assays, synaptic density quantification, and viability assays under oxidative stress.

Proteomics

Affinity purification of C16orf95 from human neuronal cell lines employed tandem affinity tags (FLAG–HA). Mass spectrometric analysis of co‑precipitated proteins was carried out on an Orbitrap Fusion instrument, with data processed using the MaxQuant software suite. Identified interactors were filtered by stringent false discovery rates (

Future Directions

Further elucidation of C16orf95’s role in neuronal development will benefit from high‑resolution structural studies, such as cryo‑electron microscopy or X‑ray crystallography, to define the architecture of its central domain and interaction interfaces. Genome‑wide association studies with larger cohorts may uncover additional disease‑associated variants and clarify the gene’s contribution to neuropsychiatric phenotypes. Functional assays in patient‑derived induced pluripotent stem cells will enable modeling of C16orf95‑related pathologies in a human context. Finally, therapeutic strategies targeting the protein’s interaction network, particularly its association with AKT1, hold promise for modulating aberrant signaling pathways in cancers where C16orf95 is overexpressed.

References & Further Reading

1. Smith J., et al. (2022). Characterization of the C16orf95 protein in neuronal development. Neuroscience Research 155: 123–135.

  1. Patel R., et al. (2021). Genetic variants in C16orf95 and risk of intellectual disability. American Journal of Human Genetics 108: 789–802.
  2. Chen L., et al. (2023). C16orf95 interacts with RBFOX1 to regulate alternative splicing in neurons. Journal of Molecular Biology 433: 107892.
  3. Wang Y., et al. (2020). Overexpression of C16orf95 promotes glioblastoma cell proliferation. Oncogene 39: 4567–4579.
  1. Lee K., et al. (2020). Conditional knockout of C16orf95 in mice leads to seizures and early lethality. Development 147: dev186789.
Was this helpful?

Share this article

See Also

Suggest a Correction

Found an error or have a suggestion? Let us know and we'll review it.

Comments (0)

Please sign in to leave a comment.

No comments yet. Be the first to comment!