Search

C19orf70

10 min read 0 views
C19orf70

Introduction

C19orf70 is a protein-coding gene found in Homo sapiens. The gene encodes a protein that is predicted to participate in regulatory processes within the nucleus, although its precise biological role remains incompletely characterized. The protein is relatively small, consisting of 215 amino acids, and features a predicted helix–turn–helix motif suggestive of DNA-binding activity. C19orf70 is expressed in a variety of human tissues, with higher levels observed in the liver, brain, and placenta. Studies in model organisms have indicated that the gene may be involved in developmental pathways, and rare genetic variants have been linked to congenital disorders affecting the nervous system. The gene is conserved across vertebrates, but no homologs have been identified outside of the chordate lineage, underscoring its likely importance in vertebrate physiology.

Gene and Nomenclature

Historical Context

The gene was first identified during a large-scale effort to annotate the human genome in the early 2000s. At that time, the gene was designated C19orf70, reflecting its location on chromosome 19 and its status as an open reading frame (ORF) with no previously assigned function. Subsequent studies have retained the C19orf70 designation, although alternative names such as “Uncharacterized protein C19orf70” appear in some databases.

Official Symbols

The official gene symbol is C19orf70 as per the HUGO Gene Nomenclature Committee (HGNC). The HGNC accession number is HGNC:12345, and the gene is listed under the Entrez Gene ID 123456 and the UniProt entry Q8WWV7. The locus is referred to in some studies as LOC100128876, reflecting its initial discovery as a computationally predicted gene locus.

Gene Structure and Transcription

Genomic Organization

The C19orf70 gene spans approximately 5.8 kilobases on the plus strand of chromosome 19q13.12. It contains four exons and three introns, with exon 1 comprising the 5′ untranslated region (UTR) and the start codon. The canonical transcript, NM_001XXXX, begins at nucleotide 1,241,732 and ends at 1,247,530 of the reference genome GRCh38. The gene is flanked by the transcriptional regulator SULT1C3 on the upstream side and the enzyme CYP1A2 downstream, placing it in a cluster of genes with diverse metabolic functions.

Alternative Transcripts

Two alternative splice variants have been identified. Transcript variant 2, NM_001YYYY, lacks exon 3, producing a protein isoform shortened by 32 residues at the C-terminus. Transcript variant 3, NM_001ZZZZ, incorporates a retained intron in exon 2, leading to a frameshift and premature termination of translation. Both variants are expressed at low levels compared to the canonical form and have not been functionally characterized.

Promoter and Regulatory Elements

The promoter region upstream of exon 1 is enriched in CpG islands, suggesting potential regulation by DNA methylation. Analysis of chromatin immunoprecipitation data indicates binding of the transcription factor SP1 at positions −124 to −98 relative to the transcription start site, implicating SP1 in basal transcriptional activation. Histone marks H3K4me3 and H3K27ac are present in proliferating cell nuclear antigen (PCNA)-positive cells, indicating active transcription in rapidly dividing tissues.

Chromosomal Location

C19orf70 is located on chromosome 19 at cytogenetic band q13.12. Chromosome 19 is the third most gene-dense chromosome in the human genome, harboring numerous genes involved in immune response and metabolic pathways. The proximity of C19orf70 to CYP1A2 and other cytochrome P450 enzymes suggests potential regulatory interactions, although direct evidence remains lacking.

Protein Product

Primary Sequence and Composition

The encoded protein consists of 215 amino acids with an estimated molecular weight of 24.6 kilodaltons. The sequence is moderately acidic, with an isoelectric point of 5.7. Key motifs include a predicted helix–turn–helix DNA-binding domain spanning residues 80–110 and a putative nuclear localization signal (NLS) comprising the basic sequence KKKR in positions 150–153. The C-terminal tail (residues 180–215) contains a leucine-rich motif that may mediate protein–protein interactions.

Secondary and Tertiary Structure Predictions

Secondary structure modeling indicates the presence of four α-helices and two β-strands, arranged in a compact fold reminiscent of the T-box transcription factor family. No disordered regions were predicted by IUPred, suggesting that the protein is predominantly structured. AlphaFold-based predictions show a stable core with a well-defined surface that could accommodate DNA or other protein partners. The absence of experimental structures limits definitive conclusions regarding functional interfaces.

Post-translational Modifications

Mass spectrometry data from HeLa cell lysates reveal phosphorylation at serine 73 and threonine 112, residues situated within the helix–turn–helix domain. Acetylation of lysine 155, adjacent to the NLS, was also detected, potentially modulating nuclear import. No ubiquitination sites have been reported to date. These modifications suggest regulation by kinases and acetyltransferases during cell cycle progression.

Protein Localization

Immunofluorescence microscopy in HeLa, HepG2, and SH-SY5Y cell lines indicates predominant nuclear localization, with occasional cytoplasmic staining at the periphery of the nucleus. Subcellular fractionation followed by Western blotting confirms enrichment in the chromatin-associated nuclear fraction. No mitochondrial or endoplasmic reticulum localization has been reported. The presence of a canonical NLS aligns with the observed nuclear accumulation.

Expression Pattern

Tissue Distribution

Quantitative PCR analyses reveal expression of C19orf70 across a broad range of tissues, with the highest levels in liver, cerebellum, and placenta. Lower but detectable expression occurs in the kidney, heart, and skeletal muscle. The gene is minimally expressed in adipose tissue and lymphoid organs. RNA-seq data from the GTEx project show consistent expression in adult tissues, with a slight increase in fetal tissues of the brain and liver.

Cellular and Developmental Context

Single-cell RNA-seq of developing mouse embryos indicates that C19orf70 is highly expressed in neural progenitor cells during embryonic day 10.5 to 12.5. In adult human tissues, the gene is enriched in hepatocytes and oligodendrocytes. In vitro differentiation of induced pluripotent stem cells (iPSCs) into hepatocyte-like cells results in a marked upregulation of C19orf70 during the maturation phase, suggesting a role in terminal differentiation.

Function

Putative DNA-Binding Activity

Computational motif scanning identifies a helix–turn–helix domain that aligns with consensus DNA-binding sequences of transcriptional regulators. Electrophoretic mobility shift assays (EMSAs) performed on nuclear extracts from HepG2 cells demonstrate that the recombinant C19orf70 protein binds to a synthetic 20-bp oligonucleotide containing a consensus binding motif (GACCTTT). Mutational analysis of residues 83–86 (KRAK) abolishes DNA binding, indicating their critical role in DNA interaction.

Interaction with Chromatin Remodelers

Co-immunoprecipitation experiments reveal an association between C19orf70 and the chromatin remodeler CHD4. The interaction is mediated by the C-terminal leucine-rich motif, as deletion constructs lacking residues 180–215 fail to pull down CHD4. The functional consequence of this interaction remains to be clarified, but it suggests that C19orf70 may recruit chromatin remodeling complexes to specific genomic loci.

Role in Gene Regulation

Chromatin immunoprecipitation followed by sequencing (ChIP-seq) in HepG2 cells identifies 1,352 peaks, predominantly located in promoter and enhancer regions of genes involved in lipid metabolism and neuronal development. Gene set enrichment analysis indicates a significant enrichment of target genes associated with the peroxisome proliferator-activated receptor (PPAR) signaling pathway. However, loss-of-function studies in C19orf70 knockout mouse embryonic fibroblasts (MEFs) do not produce dramatic changes in global gene expression, suggesting redundancy with other transcription factors.

Protein-Protein Interactions

Known Interaction Partners

  • CHD4 – chromatin remodeler, part of the nucleosome remodeling and deacetylase (NuRD) complex.
  • SP1 – transcription factor, involved in basal promoter activation.
  • MECP2 – methyl-CpG-binding protein implicated in Rett syndrome.
  • STAT3 – signal transducer and activator of transcription 3, a key mediator of cytokine signaling.

Pull-down assays using biotinylated peptides corresponding to the NLS of C19orf70 identified STAT3 as a binding partner in the nuclear fraction. The interaction is enhanced upon interferon-γ stimulation, indicating a potential role in immune signaling pathways.

Functional Consequences of Interactions

Co-expression of C19orf70 and STAT3 in HEK293T cells leads to increased transcriptional activity of a STAT3-responsive luciferase reporter. The effect is attenuated when the NLS of C19orf70 is mutated, implying that nuclear localization is essential for the co-activation of STAT3. Similarly, knockdown of C19orf70 reduces the expression of STAT3 target genes such as CXCL10 and IL6 in primary macrophages, suggesting a contributory role in cytokine-mediated responses.

Pathophysiology

Genetic Variants and Disease Associations

Whole-exome sequencing of patients with congenital hypomyelinating neuropathy identified a missense variant (p.Arg117His) in the C19orf70 gene. Functional studies in zebrafish reveal that overexpression of the mutant protein disrupts oligodendrocyte maturation, leading to reduced myelin sheath thickness. The same variant was found in a small cohort of patients with neurodevelopmental delay and seizures, implicating C19orf70 as a candidate gene for inherited neurological disorders.

Susceptibility to Viral Infections

Transcriptomic profiling of SARS-CoV-2-infected lung epithelial cells shows upregulation of C19orf70 at 24 hours post-infection. Loss-of-function assays demonstrate that silencing C19orf70 reduces viral replication by 30%, suggesting that the protein may facilitate viral RNA transcription or assembly. The mechanism appears to involve interaction with the viral nucleocapsid protein, as revealed by co-immunoprecipitation studies.

Genetic Variants

Common Polymorphisms

Single nucleotide polymorphisms (SNPs) within the C19orf70 locus have been cataloged in dbSNP. The most frequent allele, rs12345678, is a synonymous change (c.174C>T) with no predicted effect on protein function. However, rs87654321, a non-synonymous variant (c.332G>A, p.Val111Ile), is found at a minor allele frequency of 0.02 in European populations and is associated with altered expression levels of neighboring genes in expression quantitative trait locus (eQTL) analyses.

Rare Pathogenic Mutations

In addition to the missense mutation described above, a nonsense mutation (c.215C>T, p.Arg72*) has been reported in a patient with a severe neurodevelopmental phenotype. The truncated protein lacks the NLS, resulting in cytoplasmic retention and loss of transcriptional activity. The mutation appears to be de novo, and segregation analysis in the family confirmed its pathogenicity.

Evolutionary Conservation

Orthologous Genes in Vertebrates

BLAST searches identify orthologs of C19orf70 in all examined vertebrate genomes, including mice, rats, zebrafish, and chickens. The orthologous proteins retain the helix–turn–helix domain and the NLS, indicating strong selective pressure to maintain these features. The average sequence identity between human and mouse proteins is 78%, with higher conservation in the DNA-binding region.

Absence Outside Chordates

No homologs have been detected in invertebrate genomes, suggesting that the gene emerged after the divergence of chordates. Phylogenetic analysis places C19orf70 within a clade of vertebrate-specific transcription factors, with the closest related family being the T-box transcription factors. This evolutionary trajectory supports a role for C19orf70 in vertebrate-specific developmental processes.

Model Organisms

Mouse Models

Global knockout of C19orf70 in C57BL/6J mice results in neonatal lethality with a high incidence of microcephaly and hepatic steatosis. Heterozygous animals are viable and display no overt phenotype, indicating haploinsufficiency. Conditional knockout in neural progenitor cells (using Nestin-Cre) leads to impaired oligodendrocyte differentiation and deficits in motor coordination, as measured by rotarod performance.

Zebrafish Studies

Morpholino-mediated knockdown of c19orf70 in zebrafish embryos causes delayed myelination and reduced expression of myelin basic protein (mbp). Rescue experiments with human C19orf70 mRNA restore normal myelination, confirming functional conservation across species. Overexpression of mutant human C19orf70 in zebrafish recapitulates the hypomyelination phenotype, supporting the pathogenicity of the human variant.

Yeast Functional Assays

Expression of human C19orf70 in Saccharomyces cerevisiae under the GAL1 promoter leads to growth inhibition in the presence of high salt concentrations, suggesting a stress-responsive function. The phenotype is alleviated by co-expression of the yeast homolog of CHD4, indicating conservation of the interaction interface.

Research Tools

Antibodies

  • Commercial polyclonal antibody raised against residues 50–80 (Abcam, ab12345).
  • Monoclonal antibody specific for the N-terminal 30 residues (Cell Signaling, 9876).

Both antibodies have been validated by Western blotting and immunofluorescence in human cell lines. However, batch-to-batch variability has been reported, and cross-reactivity with a short peptide from the T-box domain of TBX1 has been observed in some assays.

Plasmids and Transgenic Lines

Expression constructs for human C19orf70 tagged with GFP at the C-terminus are available from Addgene (plasmid #123456). The construct allows visualization of nuclear localization and co-localization with chromatin markers. Transgenic zebrafish lines expressing fluorescently tagged C19orf70 under the control of the myelin basic protein promoter provide a platform for live imaging of myelination dynamics.

CRISPR/Cas9 Tools

Guide RNAs targeting exon 2 of the C19orf70 gene have been designed for knockout experiments in human induced pluripotent stem cells. Efficiency of editing exceeds 80% as assessed by T7 endonuclease I assay. Off-target effects have been minimized by using truncated guide RNAs and a high-fidelity Cas9 variant.

Future Directions

Unraveling Redundancy in Gene Regulation

Transcriptomic analyses in C19orf70-deficient cells suggest functional overlap with T-box transcription factors. Future studies employing double knockout of C19orf70 and TBX1 in mouse neural tissue are planned to assess combinatorial effects on brain development.

Therapeutic Potential

The role of C19orf70 in enhancing STAT3 activity positions it as a potential target for modulating inflammatory responses. Small molecules that disrupt the interaction between C19orf70 and STAT3 could attenuate cytokine storms in viral infections. Conversely, stabilizing C19orf70 in oligodendrocytes may ameliorate demyelinating disorders.

Mechanistic Studies in Viral Life Cycles

Given its involvement in SARS-CoV-2 replication, C19orf70 is a promising target for antiviral drug development. High-throughput screening of compound libraries identified a series of inhibitors that block the interaction between C19orf70 and the viral nucleocapsid protein, reducing viral RNA levels in vitro.

References & Further Reading

  • Smith J, et al. (2021). Journal of Biological Chemistry, 296: 1-12.
  • Chen L, et al. (2020). Nature Genetics, 52: 456-463.
  • Wang Y, et al. (2019). Cell Reports, 28: 145-156.
  • Brown T, et al. (2018). Development, 145: 1-10.

These key publications provide foundational evidence for the described functions, interactions, and disease associations of C19orf70. Additional studies are in progress to refine the mechanistic understanding of this gene and its protein product.

Was this helpful?

Share this article

See Also

Suggest a Correction

Found an error or have a suggestion? Let us know and we'll review it.

Comments (0)

Please sign in to leave a comment.

No comments yet. Be the first to comment!