KIAA0895

KIAA0895 is a protein that in Homo sapiens is encoded by the KIAA0895 gene. The gene encodes a protein commonly known as the KIAA0895 protein. It's aliases include hypothetical protein LOC23366, OTTHUMP00000206979, OTTHUMP00000206980, 9530077C05Rik, and 1110003N12Rik.[5] It is located at 7p14.2.[6]

KIAA0895
Identifiers
AliasesKIAA0895, 9530077C05Rik, 1110003N12Rik, Kiaa0895, mKIAA0895
External IDsMGI: 1915533 HomoloGene: 19280 GeneCards: KIAA0895
Orthologs
SpeciesHumanMouse
Entrez

23366

68283

Ensembl

ENSG00000164542

ENSMUSG00000036411

UniProt

Q8NCT3

Q7TQE7

RefSeq (mRNA)

NM_026739
NM_183273
NM_001378982
NM_001378983

RefSeq (protein)

NP_081015
NP_001365911
NP_001365912

Location (UCSC)Chr 7: 36.32 – 36.39 MbChr 9: 22.32 – 22.36 Mb
PubMed search[3][4]
Wikidata
View/Edit HumanView/Edit Mouse

Research into the KIAA proteins has shown that they are similar to known genes with functions related to cell signaling/communication, cell structure/motility and nucleic acid management.[7]

Gene

Locus

The KIAA0895 gene is located at 7p14.2.[6] The genomic DNA is 65,976 base pairs long,[8] while the longest mRNA that it produces is 4463 bases long.

It can be transcribed into 15 transcript variants, which in turn can produce 13 different isoforms of the protein.[9]

The location of the KIAA0895 gene on chromosome 7

Gene Neighborhood

KIAA0895 is surrounded by the following genes on chromosome 7:[8]

Size of gene

The gene encoded for the KIAA0895 protein is 65,975 nucleotides long, from nucleotides 36324150 to 36390125, with seven exons.

mRNA

There are ten different isoforms for KIAA0895.[6]

  • NP_001093895.1
  • EAW94064.1
  • NP_056129.2
  • NP_001186636.1
  • NP_001186635.1
  • XP_005249746.1
  • EAW94065.1
  • XP_024302470.1
  • NP_001186637.1
  • NP_001287885.1

Protein

The longest protein isoform that is produced by the KIAA0895 gene is termed LOC23366 isoform 1 and is 520 amino acids long.[10] The predicted molecular weight is 61kDa.[11] Additionally, the theoretical isoelectric point is 10.[11]

Amino acid composition

KIAA0895 is a lysine and arginine semi-enriched protein.[12] KIAA0895 is semi-enriched in positively charged lysine and arginine groups, and positively and negatively charged lysine, arginine, glutamic acid and aspartic acid groups.[12] However, KIAA0895 is semi-depleted in non-polar alanine, glycine and proline groups.[12]

The charge distribution analysis shows that there are no negative or mixed charge clusters.[12] However, there is one positive charge cluster from amino acids 12 to 36.[12]

Domain of unknown function 1704
Identifiers
SymbolDUF1704
PfamPF08014
InterProIPR012548
Available protein structures:
Pfam  structures / ECOD  
PDBRCSB PDB; PDBe; PDBj
PDBsumstructure summary

Regions

LOC23366 contains a protein domain of unknown function called DUF1704.[13] It also contains a region of low complexity from position 120 to position 150 in the protein,[14] and an arginine-rich area from position 12 to position 51.[15]

Promoters

Using ElDorado by Genomatrix, a promoter region sequence was found.[16] The most likely promoter for KIAA0895 starts at 36389926 and goes to 36391149, with a length of 1224.[16]

The predicted serine, threonine, and tyrosine phosphorylation sites of the KIAA0895 protein

Post-translational modification

KIAA0895 is predicted to undergo phosphorylation at several serines, threonines, and tyrosines throughout its structure.[17] Phosphorylation at these sites is a form of gene regulation. Phosphorylation results in a conformational change in the structure of many enzymes and receptors. This causes them to become activated or deactivated.

Stem loops

Using a database called Mfold, a stem-loop formation for the 5' UTR region shows that there is a lack of conservation, meaning there is some precedence that these stems and loops are not used for translation regulation.[18] The ΔG value was -12.90 kcal/mol with three loops.[18] Using the same database, a stem-loop formation for the 3' UTR region shows that there is conservation, meaning there is some precedence that they are used for translation regulation.[18] The ΔG value was -636.80 kcal/mol.[18]

Tertiary structure

KIAA0895 has a tertiary structure with alpha helices and beta sheets.[19]

Proposed Tertiary Structure for KIAA0895.
Proposed Tertiary Structure for 24% of KIAA0895. Image coloured by rainbow N → C terminus.

Interacting proteins

There are three proteins likely to be interacting proteins with KIAA0895. These proteins are ELAVL1,[20] vata,[21] and glym.[21] These interactions have experimental evidence from the sources provided.

Expression

KIAA0895 is most commonly found in the testis, however it also has a strong expression in the kidneys, adrenal glands, and brain.[6]

RNA-seq tissue data is reported as mean TPM (transcripts per million).
The Mean RPKM values for KIAA0895 in 27 different tissue types.

Per Health State

Using an EST profile from NCBI, KIAA0895 has strong expression in cervical tumors and bladder carcinoma.[22]

An EST profile was created by NCBI. EST profiles show approximate gene expression patterns as inferred from EST counts and the cDNA library sources.

Interacting proteins

Using three different databases, three different interacting proteins were found. These include ELAVL1, VATA, and GLYM.[21] There was experimental evidence for all three of these interacting proteins.

Homologs and orthologs

KIAA0895 has over 228 orthologs.[6] Orthologs have been found in mammals and eukaryotes.[23] There are homologs in 9 species.[6] The full list of organisms in which homologs have been found is given below.

Paralogs

KIAA0895 has 7 paralogs in Homo sapiens:[23]

  • Unnamed protein product
  • Uncharacterized protein KIAA0895-like
  • hCG28832, isoform CRA_a
  • Hypothetical protein
  • Unnamed protein product
  • hCG38687, isoform CRA_a, partial
  • hCG38687, isoform CRA_b

References

  1. GRCh38: Ensembl release 89: ENSG00000164542 - Ensembl, May 2017
  2. GRCm38: Ensembl release 89: ENSMUSG00000036411 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "GeneCards: KIAA0895 Gene". Retrieved 2011-04-24.
  6. "KIAA0895 KIAA0895 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-02-24.
  7. Nagase T, Ishikawa K, Suyama M, Kikuno R, Hirosawa M, Miyajima N, Tanaka A, Kotani H, Nomura N, Ohara O (December 1998). "Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro". DNA Research. 5 (6): 355–64. doi:10.1093/dnares/5.6.355. PMID 10048485.
  8. "Entrez Gene: KIAA0895". Retrieved 2011-04-24.
  9. "NCBI AceView: KIAA0895". Retrieved 2011-04-24.
  10. "NCBI Protein: LOC23366 isoform 1". Retrieved 2011-05-09.
  11. "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2019-04-17.
  12. "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2019-04-17.
  13. "Wellcome Trust Sanger Institute, Pfam". Archived from the original on 2014-06-17. Retrieved 2011-05-09.
  14. "MyHits Dotlet". Retrieved 2011-05-09.
  15. "Uniprot". Retrieved 2011-05-09.
  16. "ElDorado Introduction". www.genomatix.de. Retrieved 2019-05-13.
  17. "DTU Center for Biological Sciences, NetPhos". Retrieved 2011-05-09.
  18. "The Mfold Web Server | mfold.rit.albany.edu". unafold.rna.albany.edu. Retrieved 2019-04-17.
  19. "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2019-04-20.
  20. "mentha: the interactome browser". mentha.uniroma2.it. Retrieved 2019-04-20.
  21. "2 binary interactions found for search term KIAA0895". IntAct Molecular Interaction Database. Retrieved 2019-04-20.
  22. "Tissue expression of KIAA0895 - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2019-05-13.
  23. "NCBI BLAST". Retrieved 2011-05-09.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.