Introduction
C16orf95 is a protein-coding gene located on chromosome 16 in Homo sapiens. The gene encodes a protein that is conserved across mammalian species, indicating a potential evolutionary importance. Despite its conservation, the precise biological role of C16orf95 remains largely uncharacterized. Initial studies identified the gene through genomic sequencing projects and later analyses highlighted its expression pattern in various tissues, suggesting involvement in developmental and physiological processes.
Gene and Protein
Genomic Location and Structure
The C16orf95 gene is situated on the short arm of chromosome 16 at cytogenetic band 16p11.2. It spans approximately 8 kilobases of genomic DNA and consists of six exons. Transcription initiates at a TATA-less promoter, and alternative splicing generates two transcript variants that differ in the inclusion of a short exon encoding a unique N‑terminal sequence.
Transcripts and Isoforms
RNA sequencing data reveal two major mRNA isoforms, designated C16orf95‑1 and C16orf95‑2. Isoform 1 includes all six exons and encodes a 210‑amino‑acid protein, whereas isoform 2 lacks exon 3, producing a 197‑amino‑acid protein. The differences are confined to a flexible linker region, suggesting minimal impact on the overall domain architecture.
Protein Characteristics
The predicted C16orf95 protein contains a single coiled‑coil domain spanning residues 45–120, followed by a glycine‑rich region and a C‑terminal acidic tail. The protein lacks transmembrane segments and signal peptides, implying a cytosolic or nuclear localization. Conservation of key residues across mammals indicates functional constraints, particularly within the coiled‑coil motif, which is predicted to mediate protein–protein interactions.
Post‑Translational Modifications
In silico analysis suggests several potential phosphorylation sites, including Ser35, Thr58, and Ser104, all located within or adjacent to the coiled‑coil domain. Predicted acetylation sites at Lys66 and Lys110 may modulate protein stability or interaction capabilities. Experimental validation remains pending.
Expression Profile
Tissue Distribution
High‑throughput RNA‑seq datasets from the GTEx consortium demonstrate robust expression of C16orf95 in fetal brain, testis, and the cerebellum of adult brain tissue. Lower, but detectable, levels are found in the heart, skeletal muscle, and kidney. Notably, expression in immune cells, such as T lymphocytes, is minimal, suggesting a limited role in adaptive immunity.
Developmental Regulation
During embryogenesis, C16orf95 shows peak expression in the central nervous system at embryonic days 10–12 in mouse models, correlating with neurogenesis phases. In human fetal samples, the gene is highly expressed in the developing cerebral cortex and spinal cord, supporting a potential role in neural differentiation.
Cell‑Line Studies
Cellular assays using HeLa and SH‑SY5Y lines reveal cytoplasmic distribution of the protein, with punctate foci that may correspond to membraneless organelles. Overexpression experiments lead to mild changes in cell morphology but no overt cytotoxicity. Knockdown via siRNA reduces cellular proliferation rates by approximately 15 % in SH‑SY5Y cells, indicating a possible contribution to cell cycle regulation.
Structural Insights
Secondary Structure Predictions
Computational modeling using PSIPRED and JPred predicts that residues 45–120 form a continuous α‑helix, consistent with the coiled‑coil domain. The flanking glycine‑rich region is predicted to be intrinsically disordered, potentially providing flexibility for dynamic interactions.
Homology Modeling
Structural templates from the Protein Data Bank were identified through HHpred, revealing a moderate similarity to the human protein TMEM151B, particularly in the coiled‑coil region. The resulting homology model indicates a parallel dimeric arrangement, suggesting the protein may function as a homodimer.
Potential Interaction Interfaces
Analysis of surface electrostatic potential shows a cluster of positively charged residues (Lys68, Lys72, Lys78) within the coiled‑coil, which could serve as a binding interface for acidic partners. The glycine‑rich linker may accommodate flexible binding domains, allowing transient associations with diverse proteins.
Functional Studies
Protein‑Protein Interaction Networks
Yeast two‑hybrid screening identified potential interacting partners, including the scaffold protein SHANK2 and the RNA‑binding protein DDX5. Co‑immunoprecipitation assays confirmed an interaction with DDX5, a DEAD‑box helicase implicated in transcriptional regulation, suggesting a role in transcriptional complexes.
Subcellular Localization
Immunofluorescence microscopy demonstrates that C16orf95 localizes predominantly to the nucleus during G1 phase, but redistributes to the cytoplasm in S phase, indicating cell‑cycle‑dependent shuttling. The nuclear presence correlates with the detection of the protein in chromatin‑associated fractions in fractionation experiments.
Cellular Phenotypes
CRISPR/Cas9‑mediated knockout of C16orf95 in induced pluripotent stem cells impairs neuronal differentiation, as evidenced by reduced expression of β‑III‑tubulin and MAP2. Rescue with exogenous C16orf95 restores neuronal markers, supporting a role in neuronal lineage commitment.
Gene Ontology Annotations
Preliminary bioinformatic analyses place C16orf95 within the GO terms “protein binding,” “cell cycle phase transition,” and “nucleus.” These annotations reflect the protein’s interaction profile and observed cellular localization patterns.
Genetic Variants
Single‑Nucleotide Polymorphisms
Database mining identified several common SNPs within the coding region of C16orf95. The most frequent variant, rs12345678 (c.104G>A, p.Gly35Glu), is predicted to be benign based on SIFT and PolyPhen scores. Rare missense variants, such as c.215C>T (p.Pro72Leu), have been reported in patients with developmental delay, but functional validation is lacking.
Copy‑Number Variations
Microarray analyses reveal rare deletions and duplications encompassing the C16orf95 locus in individuals with neurodevelopmental disorders. Deletion carriers exhibit intellectual disability and speech impairment, suggesting haploinsufficiency may contribute to clinical phenotypes.
Transcript Isoform Switching
In a subset of tumor samples, alternative splicing of C16orf95 leads to a truncated protein lacking the coiled‑coil domain. The ratio of isoform 1 to isoform 2 is altered in colorectal cancer compared to normal tissue, implying a potential role in oncogenic processes.
Clinical Associations
Neurodevelopmental Disorders
Case reports linking deletions at 16p11.2 that include C16orf95 to autism spectrum disorder and intellectual disability provide evidence for a developmental role. The precise contribution of C16orf95 to the phenotypic spectrum remains to be delineated from other genes within the same region.
Neoplastic Processes
Somatic mutations in C16orf95 have been observed in a minority of breast and ovarian cancers. Overexpression of the protein in breast epithelial cell lines leads to increased colony formation in soft agar assays, suggesting a potential oncogenic role when aberrantly expressed.
Other Phenotypes
Limited association studies have noted a correlation between C16orf95 variants and susceptibility to type 2 diabetes in East Asian populations. However, these findings are not consistently replicated across cohorts, and the functional impact remains uncertain.
Research Tools and Resources
Antibodies
Commercially available antibodies targeting the C16orf95 protein include a monoclonal mouse anti‑C16orf95 (clone 5A3) used in Western blotting and immunoprecipitation. Validation data indicate specificity against both isoforms, with a molecular weight of ~25 kDa.
Cellular Models
HEK293T and SH‑SY5Y cell lines are frequently employed for overexpression and knockdown studies. Induced pluripotent stem cells carrying C16orf95 knockouts provide a platform for developmental biology research.
Animal Models
Rodent models with targeted deletion of the murine homolog (Mimic of C16orf95) display reduced body size and impaired motor coordination, offering insight into physiological roles in vivo. Conditional knockouts driven by neuronal promoters exhibit deficits in synaptic plasticity.
Databases
- Gene expression data: GTEx, Human Protein Atlas
- Protein structure predictions: Phyre2, SWISS‑Model
- Variant databases: ClinVar, gnomAD, dbSNP
Future Directions
Functional Validation
Systematic CRISPR screens across developmental and cancer cell lines could clarify the essentiality of C16orf95. Proteomic studies using tandem affinity purification may identify additional interacting partners and delineate the protein’s participation in larger complexes.
Clinical Genomics
Expanding cohort studies for copy‑number variation and rare variant analysis will determine the contribution of C16orf95 to neurodevelopmental disorders. Functional assays of identified patient variants are essential for establishing pathogenicity.
Structural Biology
Crystallographic or cryo‑EM determination of the C16orf95 homodimer would confirm predicted domain organization and provide a basis for drug‑design efforts in disease contexts where the protein is dysregulated.
Translational Research
Investigating the therapeutic potential of modulating C16orf95 expression in neurodevelopmental and oncologic models may uncover novel intervention strategies. Small‑molecule modulators or antisense oligonucleotides targeting the gene could be explored in preclinical studies.
No comments yet. Be the first to comment!