Heterogeneous nuclear ribonucleoprotein U (hnRNP U) (SP120) (Scaffold-attachment factor A) (SAF-A)

 HNRPU_RAT               Reviewed;         798 AA.
22-NOV-2017, integrated into UniProtKB/Swiss-Prot.
05-JUL-2004, sequence version 1.
20-JUN-2018, entry version 124.
RecName: Full=Heterogeneous nuclear ribonucleoprotein U {ECO:0000312|RGD:620372};
Short=hnRNP U {ECO:0000312|RGD:620372};
AltName: Full=SP120 {ECO:0000303|PubMed:8509422};
AltName: Full=Scaffold-attachment factor A {ECO:0000250|UniProtKB:Q00839};
Short=SAF-A {ECO:0000250|UniProtKB:Q00839};
Name=Hnrnpu {ECO:0000312|RGD:620372}; Synonyms=Hnrpu;
Rattus norvegicus (Rat).
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha;
Muroidea; Muridae; Murinae; Rattus.
NCBI_TaxID=10116 {ECO:0000312|EMBL:AAH72529.1};
STRAIN=Brown Norway;
PubMed=15057822; DOI=10.1038/nature02426;
"Genome sequence of the Brown Norway rat yields insights into
mammalian evolution.";
Nature 428:493-521(2004).
STRAIN=Brown Norway;
Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
PubMed=15489334; DOI=10.1101/gr.2596504;
"The status, quality, and expansion of the NIH full-length cDNA
project: the Mammalian Gene Collection (MGC).";
Genome Res. 14:2121-2127(2004).
"Identification and characterization of a nuclear scaffold protein
that binds the matrix attachment region DNA.";
J. Biol. Chem. 268:12886-12894(1993).
PubMed=20554522; DOI=10.1074/jbc.M110.112979;
"Regulation of DNA Topoisomerase IIbeta through RNA-dependent
association with heterogeneous nuclear ribonucleoprotein U (hnRNP
J. Biol. Chem. 285:26451-26460(2010).
PubMed=22673903; DOI=10.1038/ncomms1871;
"Quantitative maps of protein phosphorylation sites across 14
different rat organs and tissues.";
Nat. Commun. 3:876-876(2012).
-!- FUNCTION: DNA- and RNA-binding protein involved in several
cellular processes such as nuclear chromatin organization,
telomere-length regulation, transcription, mRNA alternative
splicing and stability, Xist-mediated transcriptional silencing
and mitotic cell progression. Plays a role in the regulation of
interphase large-scale gene-rich chromatin organization through
chromatin-associated RNAs (caRNAs) in a transcription-dependent
manner, and thereby maintains genomic stability. Required for the
localization of the long non-coding Xist RNA on the inactive
chromosome X (Xi) and the subsequent initiation and maintenance of
X-linked transcriptional gene silencing during X-inactivation (By
similarity). Required for the topoisomerase TOP2A protein
stability and activity in an RNA-dependent manner
(PubMed:20554522). Plays a role as an RNA polymerase II (Pol II)
holoenzyme transcription regulator. Promotes transcription
initiation by direct association with the core-TFIIH basal
transcription factor complex for the assembly of a functional pre-
initiation complex with Pol II in an actin-dependent manner. Blocks
Pol II transcription elongation activity by inhibiting the C-
terminal domain (CTD) phosphorylation of Pol II and dissociates
from Pol II pre-initiation complex prior to productive
transcription elongation. Positively regulates CBX5-induced
transcriptional gene silencing and retention of CBX5 in the
nucleus. Negatively regulates glucocorticoid-mediated
transcriptional activation. Key regulator of transcription
initiation and elongation in embryonic stem cells upon leukemia
inhibitory factor (LIF) signaling. Involved in the long non-coding
RNA H19-mediated Pol II transcriptional repression. Participates
in the circadian regulation of the core clock component
ARNTL/BMAL1 transcription. Plays a role in the regulation of
telomere length. Plays a role as a global pre-mRNA alternative
splicing modulator by regulating U2 small nuclear
ribonucleoprotein (snRNP) biogenesis. Plays a role in mRNA
stability. Component of the CRD-mediated complex that promotes MYC
mRNA stabilization. Enhances the expression of specific genes,
such as tumor necrosis factor TNFA, by regulating mRNA stability,
possibly through binding to the 3'-untranslated region (UTR).
Plays a role in mitotic cell cycle regulation. Involved in the
formation of stable mitotic spindle microtubules (MTs) attachment
to kinetochore, spindle organization and chromosome congression.
Phosphorylation at Ser-58 by PLK1 is required for chromosome
alignment and segregation and progression through mitosis.
Also contributes to the targeting of AURKA to mitotic spindle MTs
(By similarity). Binds to double- and single-stranded DNA and RNA,
poly(A), poly(C) and poly(G) oligoribonucleotides
(PubMed:20554522). Binds to chromatin-associated RNAs (caRNAs) (By
similarity). Associates with chromatin to scaffold/matrix
attachment region (S/MAR) elements in DNA (PubMed:8509422).
Associates with chromatin in a chromatin-associated RNAs (caRNAs)-
dependent manner. Binds to the Xist RNA. Binds the long non-coding
H19 RNA. Binds to SMN1/2 pre-mRNAs at G/U-rich regions. Binds to
small nuclear RNAs (snRNAs). Binds to the 3'-UTR of TNFA mRNA.
Binds (via RNA-binding RGG-box region) to the long non-coding Xist
RNA; this binding is direct and bridges the Xist RNA and the
inactive chromosome X (Xi). Also negatively regulates embryonic
stem cell differentiation upon LIF signaling. Required for
embryonic development (By similarity). Binds to brown fat long
non-coding RNA 1 (Blnc1); facilitates the recruitment of Blnc1 by
ZBTB7B required to drive brown and beige fat development and
thermogenesis (By similarity). {ECO:0000250|UniProtKB:Q00839,
ECO:0000250|UniProtKB:Q8VEK3, ECO:0000269|PubMed:20554522,
-!- SUBUNIT: Oligomer (via ATPase domain and RNA-binding RGG-box
region); oligomerization occurs upon ATP-binding in a chromatin-
associated RNAs (caRNAs)- and transcription-dependent manner and
is required for chromatin decompaction. ATP hydrolysis is required
to cycle from an oligomeric to monomeric state to compact
chromatin. Component of the coding region determinant (CRD)-
mediated complex, composed of DHX9, HNRNPU, IGF2BP1, SYNCRIP and
YBX1. Identified in the spliceosome C complex. Identified in an
IGF2BP1-dependent mRNP granule complex containing untranslated
mRNAs. Associates with heterogeneous nuclear ribonucleoprotein
(hnRNP) particles. Associates (via middle region) with the C-
terminal domain (CTD) RNA polymerase II (Pol II) holoenzyme; this
association occurs in an RNA-independent manner. Associates (via
middle region) with the core-TFIIH basal transcription factor
complex; this association inhibits the CTD phosphorylation of RNA
polymerase II holoenzyme by downregulating TFIIH kinase activity.
Associates with the telomerase holoenzyme complex. Associates with
spindle microtubules (MTs) in a TPX2-dependent manner. Interacts
(via C-terminus) with actin; this interaction is direct and
mediates association with the phosphorylated CTD of RNA polymerase
II and is disrupted in the presence of the long non-coding H19 RNA.
Interacts with AURKA. Interacts (via C-terminus) with CBX5; this
interaction is, at least in part, RNA-dependent. Interacts with
CR2. Interacts with CRY1. Interacts (via C-terminus) with EP300;
this interaction enhances DNA-binding to nuclear scaffold/matrix
attachment region (S/MAR) elements. Interacts with ERBB4.
Interacts with GEMIN5. Interacts with IGF2BP1. Interacts with
IGF2BP2 and IGF2BP3. Interacts with NCL; this interaction occurs
during mitosis. Interacts (via C-terminus) with NR3C1 (via C-
terminus). Interacts with PLK1; this interaction induces
phosphorylation of HNRNPU at Ser-58 in mitosis. Interacts with
POU3F4. Interacts with SMARCA4; this interaction occurs in
embryonic stem cells and stimulates global Pol II-mediated
transcription (By similarity). Interacts (via C-terminus) with
TOP2A; this interaction protects the topoisomerase TOP2A from
degradation and positively regulates the relaxation of supercoiled
DNA by TOP2A in an RNA-dependent manner (PubMed:20554522).
Interacts with TPX2; this interaction recruits HNRNPU to spindle
microtubules (MTs). Interacts with UBQLN2 (By similarity).
Interacts (via RNA-binding RGG-box region) with ZBTB7B; the
interaction facilitates the recruitment of long non-coding RNA
Blnc1 by ZBTB7B (By similarity). {ECO:0000250|UniProtKB:Q00839,
ECO:0000250|UniProtKB:Q8VEK3, ECO:0000269|PubMed:20554522}.
-!- SUBCELLULAR LOCATION: Nucleus {ECO:0000250|UniProtKB:Q00839}.
Nucleus matrix {ECO:0000250|UniProtKB:Q00839}. Chromosome
{ECO:0000250|UniProtKB:Q00839}. Nucleus speckle
{ECO:0000250|UniProtKB:Q00839}. Cytoplasm, cytoskeleton,
microtubule organizing center, centrosome
{ECO:0000250|UniProtKB:Q00839}. Chromosome, centromere,
kinetochore {ECO:0000250|UniProtKB:Q00839}. Cytoplasm,
cytoskeleton, spindle {ECO:0000250|UniProtKB:Q00839}. Cytoplasm,
cytoskeleton, spindle pole {ECO:0000250|UniProtKB:Q00839}. Midbody
{ECO:0000250|UniProtKB:Q00839}. Cytoplasm
{ECO:0000250|UniProtKB:Q00839}. Cell surface
{ECO:0000250|UniProtKB:Q00839}. Cytoplasmic granule
{ECO:0000250|UniProtKB:Q00839}. Note=Localizes at inactive X
chromosome (Xi) regions. Localizes in the nucleus during
interphase. At metaphase, localizes with mitotic spindle
microtubules (MTs). At anaphase, localizes in the mitotic spindle
midzone. Localizes in spindle MTs proximal to spindle poles in a
TPX2- and AURKA-dependent manner. The Ser-58 phosphorylated form
localizes to centrosomes during prophase and metaphase, to mitotic
spindles in anaphase and to the midbody during cytokinesis.
Colocalizes with SMARCA4 in the nucleus (By similarity).
Colocalizes with CBX5 in the nucleus. Colocalizes with NR3C1 in
nuclear speckles. Localized in cytoplasmic ribonucleoprotein (RNP)
granules containing untranslated mRNAs.
{ECO:0000250|UniProtKB:Q00839, ECO:0000250|UniProtKB:Q8VEK3}.
-!- DOMAIN: The SAP domain is necessary for specific binding to
nuclear scaffold/matrix attachment region (S/MAR) elements in DNA.
The RNA-binding RGG-box region is necessary for its association
with inactive X chromosome (Xi) regions and to chromatin-
associated RNAs (caRNAs). Both the DNA-binding domain SAP and the
RNA-binding RGG-box region are necessary for the localization of
Xist RNA on the Xi. The ATPase and RNA-binding RGG-box regions are
necessary for oligomerization. {ECO:0000250|UniProtKB:Q00839,
-!- PTM: Cleaved at Asp-94 by CASP3 during T-cell apoptosis, resulting
in a loss of DNA- and chromatin-binding activities.
-!- PTM: Extensively phosphorylated. Phosphorylated on Ser-58 by PLK1
and dephosphorylated by protein phosphatase 2A (PP2A) in mitosis.
-!- PTM: Arg-707 and Arg-713 are dimethylated, probably to asymmetric
dimethylarginine (By similarity). {ECO:0000250|UniProtKB:Q00839,
-!- PTM: Citrullinated by PADI4. {ECO:0000250|UniProtKB:Q8VEK3}.
Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
Distributed under the Creative Commons Attribution (CC BY 4.0) License
EMBL; AABR07021872; -; NOT_ANNOTATED_CDS; Genomic_DNA.
EMBL; CH473985; EDL94801.1; -; Genomic_DNA.
EMBL; BC072529; AAH72529.1; -; mRNA.
RefSeq; NP_476480.2; NM_057139.2.
UniGene; Rn.4328; -.
IntAct; Q6IMY8; 3.
STRING; 10116.ENSRNOP00000046783; -.
PaxDb; Q6IMY8; -.
Ensembl; ENSRNOT00000044477; ENSRNOP00000046783; ENSRNOG00000033790.
GeneID; 117280; -.
KEGG; rno:117280; -.
CTD; 3192; -.
RGD; 620372; Hnrnpu.
eggNOG; KOG2242; Eukaryota.
eggNOG; ENOG4111X2K; LUCA.
GeneTree; ENSGT00390000020210; -.
HOVERGEN; HBG061101; -.
KO; K12888; -.
TreeFam; TF317301; -.
Reactome; R-RNO-72163; mRNA Splicing - Major Pathway.
PRO; PR:Q6IMY8; -.
Proteomes; UP000002494; Chromosome 13.
Bgee; ENSRNOG00000033790; -.
ExpressionAtlas; Q6IMY8; baseline and differential.
GO; GO:0009986; C:cell surface; IEA:UniProtKB-SubCell.
GO; GO:0005813; C:centrosome; ISS:UniProtKB.
GO; GO:0000777; C:condensed chromosome kinetochore; IEA:UniProtKB-SubCell.
GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
GO; GO:0000776; C:kinetochore; ISS:UniProtKB.
GO; GO:0030496; C:midbody; ISS:UniProtKB.
GO; GO:0072686; C:mitotic spindle; ISS:UniProtKB.
GO; GO:1990498; C:mitotic spindle microtubule; ISS:UniProtKB.
GO; GO:1990023; C:mitotic spindle midzone; ISS:UniProtKB.
GO; GO:0016363; C:nuclear matrix; IEA:UniProtKB-SubCell.
GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
GO; GO:0005634; C:nucleus; IDA:RGD.
GO; GO:1990904; C:ribonucleoprotein complex; IDA:UniProtKB.
GO; GO:0000922; C:spindle pole; IEA:UniProtKB-SubCell.
GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
GO; GO:0005524; F:ATP binding; ISS:UniProtKB.
GO; GO:0003682; F:chromatin binding; ISS:UniProtKB.
GO; GO:0042802; F:identical protein binding; ISS:UniProtKB.
GO; GO:0034046; F:poly(G) binding; IDA:RGD.
GO; GO:1990841; F:promoter-specific chromatin binding; IDA:RGD.
GO; GO:0043021; F:ribonucleoprotein complex binding; IPI:RGD.
GO; GO:0003723; F:RNA binding; IDA:RGD.
GO; GO:0099122; F:RNA polymerase II C-terminal domain binding; ISS:UniProtKB.
GO; GO:0043565; F:sequence-specific DNA binding; ISS:UniProtKB.
GO; GO:1990845; P:adaptive thermogenesis; ISS:UniProtKB.
GO; GO:0007049; P:cell cycle; IEA:UniProtKB-KW.
GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
GO; GO:0051301; P:cell division; IEA:UniProtKB-KW.
GO; GO:0071549; P:cellular response to dexamethasone stimulus; IDA:RGD.
GO; GO:1990830; P:cellular response to leukemia inhibitory factor; ISS:UniProtKB.
GO; GO:0006325; P:chromatin organization; IEA:UniProtKB-KW.
GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
GO; GO:0007275; P:multicellular organism development; IEA:UniProtKB-KW.
GO; GO:2000737; P:negative regulation of stem cell differentiation; ISS:UniProtKB.
GO; GO:1902425; P:positive regulation of attachment of mitotic spindle microtubules to kinetochore; ISS:UniProtKB.
GO; GO:0090336; P:positive regulation of brown fat cell differentiation; ISS:UniProtKB.
GO; GO:2000373; P:positive regulation of DNA topoisomerase (ATP-hydrolyzing) activity; IDA:RGD.
GO; GO:0010628; P:positive regulation of gene expression; IMP:RGD.
GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; ISS:UniProtKB.
GO; GO:1902889; P:protein localization to spindle microtubule; ISS:UniProtKB.
GO; GO:1902275; P:regulation of chromatin organization; ISS:UniProtKB.
GO; GO:0007346; P:regulation of mitotic cell cycle; ISS:UniProtKB.
GO; GO:1901673; P:regulation of mitotic spindle assembly; ISS:UniProtKB.
GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
GO; GO:0006351; P:transcription, DNA-templated; IEA:UniProtKB-KW.
CDD; cd12884; SPRY_hnRNP; 1.
Gene3D; 1.10.720.30; -; 1.
InterPro; IPR001870; B30.2/SPRY.
InterPro; IPR013320; ConA-like_dom_sf.
InterPro; IPR026745; hnRNP_U.
InterPro; IPR027417; P-loop_NTPase.
InterPro; IPR003034; SAP_dom.
InterPro; IPR036361; SAP_dom_sf.
InterPro; IPR003877; SPRY_dom.
InterPro; IPR035778; SPRY_hnRNP_U.
PANTHER; PTHR12381:SF11; PTHR12381:SF11; 1.
Pfam; PF02037; SAP; 1.
Pfam; PF00622; SPRY; 1.
SMART; SM00513; SAP; 1.
SMART; SM00449; SPRY; 1.
SUPFAM; SSF49899; SSF49899; 1.
SUPFAM; SSF52540; SSF52540; 1.
SUPFAM; SSF68906; SSF68906; 1.
PROSITE; PS50188; B302_SPRY; 1.
PROSITE; PS50800; SAP; 1.
1: Evidence at protein level;
Acetylation; Activator; ADP-ribosylation; ATP-binding; Cell cycle;
Cell division; Centromere; Chromatin regulator; Chromosome;
Citrullination; Coiled coil; Complete proteome; Cytoplasm;
Cytoskeleton; Developmental protein; Differentiation; Isopeptide bond;
Kinetochore; Methylation; Mitosis; mRNA processing; mRNA splicing;
Nucleotide-binding; Nucleus; Phosphoprotein; Reference proteome;
Repressor; Ribonucleoprotein; Spliceosome; Transcription;
Transcription regulation; Ubl conjugation.
INIT_MET 1 1 Removed. {ECO:0000250|UniProtKB:Q00839}.
CHAIN 2 798 Heterogeneous nuclear ribonucleoprotein
DOMAIN 8 42 SAP. {ECO:0000250|UniProtKB:Q00839,
DOMAIN 242 438 B30.2/SPRY. {ECO:0000255|PROSITE-
NP_BIND 478 485 ATP. {ECO:0000255}.
REGION 462 646 ATPase domain.
REGION 585 600 Actin-binding.
REGION 688 713 RNA-binding RGG-box.
COILED 624 651 {ECO:0000255}.
COMPBIAS 2 154 Asp/Glu-rich (acidic).
COMPBIAS 85 88 Poly-Glu. {ECO:0000255}.
COMPBIAS 116 119 Poly-Glu. {ECO:0000255}.
COMPBIAS 677 767 Gly-rich. {ECO:0000255|PROSITE-
SITE 94 95 Cleavage; by CASP3.
MOD_RES 2 2 N-acetylserine.
MOD_RES 4 4 Phosphoserine.
MOD_RES 17 17 N6-acetyllysine.
MOD_RES 21 21 N6-acetyllysine.
MOD_RES 58 58 Phosphoserine.
MOD_RES 179 179 N6-acetyllysine.
MOD_RES 180 180 ADP-ribosylserine.
MOD_RES 229 229 Citrulline.
MOD_RES 239 239 N6-acetyllysine; alternate.
MOD_RES 240 240 Phosphotyrosine.
MOD_RES 241 241 Phosphoserine.
MOD_RES 245 245 Phosphoserine.
MOD_RES 260 260 Phosphothreonine.
MOD_RES 326 326 N6-acetyllysine.
MOD_RES 490 490 N6-acetyllysine; alternate.
MOD_RES 498 498 N6-acetyllysine; alternate.
MOD_RES 506 506 Phosphothreonine.
MOD_RES 525 525 N6-acetyllysine.
MOD_RES 539 539 N6-acetyllysine; alternate.
MOD_RES 556 556 Phosphothreonine.
MOD_RES 609 609 N6-acetyllysine; alternate.
MOD_RES 676 676 Omega-N-methylarginine.
MOD_RES 689 689 Asymmetric dimethylarginine.
MOD_RES 694 694 Asymmetric dimethylarginine.
MOD_RES 701 701 Asymmetric dimethylarginine.
MOD_RES 707 707 Asymmetric dimethylarginine; alternate.
MOD_RES 707 707 Omega-N-methylarginine; alternate.
MOD_RES 707 707 Omega-N-methylated arginine; alternate.
MOD_RES 713 713 Asymmetric dimethylarginine; alternate.
MOD_RES 713 713 Dimethylated arginine; alternate.
MOD_RES 713 713 Omega-N-methylarginine; alternate.
MOD_RES 713 713 Omega-N-methylated arginine; alternate.
MOD_RES 728 728 Asymmetric dimethylarginine.
MOD_RES 735 735 Asymmetric dimethylarginine.
MOD_RES 787 787 N6-acetyllysine; alternate.
CROSSLNK 239 239 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO1);
CROSSLNK 239 239 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2);
CROSSLNK 469 469 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2).
CROSSLNK 490 490 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2);
CROSSLNK 498 498 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2);
CROSSLNK 510 510 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2).
CROSSLNK 539 539 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2);
CROSSLNK 548 548 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2).
CROSSLNK 583 583 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2).
CROSSLNK 600 600 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2).
CROSSLNK 609 609 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2);
CROSSLNK 638 638 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2).
CROSSLNK 644 644 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2).
CROSSLNK 787 787 Glycyl lysine isopeptide (Lys-Gly)
(interchain with G-Cter in SUMO2);
SEQUENCE 798 AA; 87732 MW; 638C059C3D602DE5 CRC64;

