CmUC03G058350.1 (mRNA) Watermelon (USVL531) v1

Overview
Name: CmUC03G058350.1
Type: mRNA
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Retrotrans_gag domain-containing protein
Location: CmU531Chr03: 9608344 .. 9618062 (-)
Sequence length: 1974
RNA-Seq Expression: CmUC03G058350.1
Synteny: CmUC03G058350.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCCCCATTACGCTTGAGACCCAAGAGCAAGTTTTGTGTAAGCATTGGGATGCTCGAAACTTCAAAGCGCAAAAATTAACATAGCGTCGCAACGCTCCCCAAGAGCATCGTGATGCTCCGACACAAAAGCCTATTTAAAGGACGTTCTATTTGATTTGGGGGGGCATTACTTCTTTACGAGACTTAAACCTAAGGCAAAGCTCCACCATCTGGAGGAAGGCACTGTTGTTGGTGACGATCCGAGGCCTGAAGATGAAGGTCAGAGGTCAAGATTGGGAGATGGCAATGGTGAAGGGAGTCTGAATTCTTGTTTTTCTCACCTTTTCCATTCTTCATTGGGCTGTCATGATGTTTTCTCATTTGAATCTATTATTTATGAAATCATGTGCAGCTAAACTTATGAATTTTAGGGTGTTGTTAGAGTAATGAGTAGTTCTTGATTATTTCTTAACGTTTGAATTATTATAATTTGGTTGAATTGTAATCTATCTGCATGACTTCTTTTGATTAATTGGTCATTAATCAACTGTTCTTGGGCTGTTTTTCTTGCTTGTAAACGAGGAATCTTGGCTAGACTTGGTAGGTTAAATTACTATTTGATGATGCATGATAATGATCAGAGAATAATAAGGATTGCTATTGAAGGGTCTTAGGGCTGCCTTAGTTTAATCAATAATAAGGAATTCAATTTGATACATTAGGGTTAGAACGGATGGAGCTTGGAAAAGGACATCCCGAATTTAGACTTCTAAACTCATTCAACATGCACAATAATCTTGTATGTTAAAGATCTATCAAAGACTCATAATGAAACAACACCCTTGAATTTAAATCCCATCGAATTATTGGTTTAATTTTATTGTTTGATTTTTAATTTTGAATACCAAATTTTATTTTACATTTGCCTAGATAAAATCGATAAGTTAGTAATAATAGCTAAGCAAACAAGTTTTACACCAATCCTTGAGGACGATACCCTAATTGAGGTTTTATTACTTCGATACGTACGCTTGTGTGTTGCCAAGTTTTTGGCACCATTGCTAGGGCTTGTCAATGAAATTGGTTTGTTTAATATTGTGAGAACGAACATGATTTTAATTTAGGTTTTATTTTACTATTGAAATTGTTATTTTCTTTTTTCTTAGGTATCATGTGTATGCGACAAAGCCAACGAGCTAAACTTCTGCCTTTCAATCCAGAAATCAAATTAACTTTAAGAAAAGTAAGCAGGAATTTGAAAGCAGAATTTGTCGCCAGGGAAGATCAAACAGAAGTCACTAAGACTATTAGAGATTACTTCCACTTGATCGTCCCTACATTGCGACCTAGAATAGTAAAGGCTCCAATTAATGCAAACAACTTTGAGCTGAAACCTAGTTTAATTCAAATGGCAAAGGACAGCATATTCAGAGGGACTTCAAACGAAGATCCACATAAACATCTTCGATCTCTTCAAGAAATTTATGGAATGGTAAAAATAAATGGAGTGACCAGTGATGCTATTCATTTAAGACTATTTTCTTTTCTATGCAGGACAGGGGAAAAAACTGGCTAGAAACACTTACTCCAAGAAGCATAACTTCCTGGGATGAATTGGCGTAGGCTTTCCTTAGCAAATTTTTCCTATCTAAGACGAGCAACTTGAGAATAAAAATTGGAACATTCAAGCAGAAAGAGGATGAGCAATTGTTCGAGGCTTGGGAGCGTTATAAAGAATTGTTGAGAAAGTGTCCCCAACACAATTATCTTGACTGGCTTCAGATTCAGTTGTTCTATAATGGGTTGTTAGGAACGACGAAGTCCATCCTAAATGTTGCAGCTGGTGGATCAATCTTTTCTAAAACTGTTGATGCTACTCGATCACTTTTAGAAGACATGGTTGTCACCAGTTACAACTGGCCATCAAAACAATCAACATCCAAAGTGGCTGAGCTCTATGAGCTTGATGAGGTAACAACTCTAAAAGCACATATGGCTTCCCTAACTAATGCTATTAG
TTAATTGCAGTAATGGCAACTTGTAACGCTGGCACATCTTGGGAGACTACTAATCTATAAGAGATTAACTATGTGCATCGATCCAATCAATGCCATGGACAATTTTAGTTTCCTCATTTTCAGCCTCAACAGGCACAACAACCTACTCAGTCCCACTCAGGTTTGCGTAATCATGAACTTTTTTCTTACGGTAACAAAAAAATATTTTGCAGCCTCCTTCAGGTTTTACAGATTCTAATCCTAATGCTGAGAAGAAATCATCTATGGAGGACTTGCTAACTGAATTTATCAAAGAGTCGAAAAATCGAACCAACTTATTAGAGAGCACTATTATGAGTCATGAGAAGACCATTCAAAATATGGAGGTACAAATCAGTCAAATGGCAACAACTATGAAGGCCATGCAAAAGGGCAAATTTCCCAATTTTACAGAGAAAAGCCCAAGGGAAGGTTGTAAAGCAGTCTCAGAAGTGGAAGGAGTGTAGCTGGTCCAGTTTTGGTAAAGGAAGGAGAGCAAGCGGTAGTGGAACCTGACTTGGACGAGAGAAGAGAAAATGAGGAGTGGAATGTTCAGGAGAAGATGGAAGAGGTGCCTTAACCTTCTGCTACAAGGAGAATTTCGGATCCCTTACCTTCTGTTCCTACTAACTGTTGTGTTTTATGCCCTAAAACTTGTAGATTGTAAATGTAATTATTTGACCATCATTAAATCATTAACAAAGAGTTATTAATGTGTTATTTTAATAAGCATTATTGATTGTTTTTGTTTGTTTTGTCTCAATAACCCTAAATCCAATAAACTAACATTCTAGGTTGTTTTATGAGTCTTGAACTGTATGTGGAGACATACGAAGATCAATGTTCAAGATACAGCCTAAAGGGTCTATAGTATAGGGATAAGGTTGGATACCTTATCCTGATGACACTAGGGATACAACCCACTTTGTATTTGATACAAACGCAATAATCTAACGCGTTCGTGTAGGTGACATACAAGTGGGGGCATCCTATGCAATGAGTTTGCATAAGACCAGACCACGAAATAGTGATTACTAGATGTAACTCTGTTGACTAGTTAGGTTTCTACTAATTTCAATAAGATGACCTAAGCAACTTAGTCTTAATCCTGAGTATATTATGAACTTCTATCCGCGAGGGATTGTCCTTTGATTTGTATAGGTGAGAATGGTCGTTTGCCGACTCCATATGCCTACCATCTTGAGGACAAGACCGAGGGGGGAGCCGACTCCATATGCCTACCATCTTGAGGACAAGACCGAATGGGGAGCTGGGGACATAATTATACAAGATGGAATTCACTTATTCTCATCTTTAGGGTAGGTAGATAAGTGTTCCCTTAAGTAGTGTCTCAGGGACTTGAGCAAAGAGCCCTACCCTCTTAATGGCCTGAGAAGGGTTTTTGTTTAATGGTTGGACCATAAACAGGTTGTTCATTAGAGAGCACTAATACTTAAGGATTTAGAAGTAACCTAGGGGTAAAATGGTAATTTGACCCAGCTAGTATTACGAACACTTGTGAAGGACTAATTTGCTGTTATTGATCTATATCCATGGACACATAAATATATCTACAGTGAGAAGAGTGCAGTTGTGGGTCTTTAGTGGAGTGTACCCACAGTTAACGAATATTGATTAATGTGGTTAATGAGTTTAGCCGATTAACCTCATATCATTGGAGCTTCTGATCTATAGGTCAATTAGGTCCCCTTCCTAGCTCATAAAGAGTGATGAGGTATTTATGTCTGAATGAAATTTGAAATGTTCAAATTTACGTATACAGTTTATTTTGAACTTTATAACATAATTTTAATTTTGGATATGACACAAAATTAATTTTTGAGAGAATTAAATATTTGAAGGAGTTCAAATATTAATATGAATTAGATTCATATTAAAACTATAAGTTAAAATTAATGTGTATTACATGCACATTAAAACTATAAGTTATGAGAGAATAACAATTGAATATGATTCAAATG
TAGGTTAAATTAAATATTAGATATTTAATTTGACAATTAATTAATTAAAGAATTAATTTTCTTAATTTATTTTAATTAAATTTGTTTTAATTAAATTAAATGAATTAAAACTATATGTTATATGGGAGATGTTCATTTAAATATTATTTAAATGAAATAATTAATTAAATTAGATTTAATTAATTATTAATTATTAATTATTTATTATTTATTTAATTTAATAAATAAACTTTTTAAAAAAAGGTAACTCTCATATGTGGGAGTTACGTTTTACGTGGTTTAATCCCACGTTTCCTAATCTCTCTCTCCATATATGCCTGCAAAATGAAAAAATGGCAAAAGAGTAATGTAATTTTTTTTCTCTTCTTGCTCTCTAGTTTTGTCACACCCTACTTTTAAAACCAATGCGTTGAAGAGCCTATAAAGGAAAGAAGTAGGACAAGAGAGGTTTGTGCTCAAGTGTTAGTTTATATAGGCCGTAGAGGAAGTTTGAATTAGAAGGATTTGATTGAAAGGTTGCGTTGAGTGATTTCTGGATCATGGAATCAAAGTAATTCTTAAAGTTAAGAATTCTACTATGCAGAATGAATAGATCAGGAGAATGCAACCATGACTTTGAGCTTGGGAGTAAGATCTAAGGAAAGAAAGGAAATCTCATTTGGTAATGGGAAGACTTAAAAAGTTTAGTTGGAAGATAAATTAAGTTGGACGCATACCTAGTAGAAAGACCATGTTGGACACATGGTCAAGACAGGAAATGCATGATAAAGGTGAAACGCATAGGAGGAAATCAAGGAAGGAGATGTAAGCTTGGTTTAAATAGGATGTATGATTAAGGATTAAGGTTGAGTAATGATGCTTTGAGATGAAATCTAAAAGAAGGATAAAATGCATAAGATAGAATCGTGAAGAGAGGCTATGATAATTTAAGGTAAACGCATAACTTCCATTAAGGAAGATTTTAAAGTGGATAAGTTAGAAGGTTGAGAAGTAAACCTAAAGGAAGGAGGCTAGAGAGGGATATACGATAGGAATTATCATTGAGAAAATAGCTCACAGAAGGAAGAGGTTGGAAGCATGACCTTTTGAAATACAGCATAAGAATGCATAAAGGAGTAATCAAGGAAACCTCAAGGATTGGAGGAAAGTCATAGACGCAGAAGGAAAAAAAAGTAAACGCAAAGAGTGCAAATGCATAAAGGAAGGAGTTGTAGTTAAGATATGGATATAAGGAAAGAGTGATGGACGCATGATAGTATGTGGTTAAGGCAAATGCATGATATTATGCGGTAAGGAACAAATGCATAGTAGTGTGCGTTTGAGACTTTGACGCCTAGTAGTATGCAATAGAGAAACACGTTCAAAGGAATTTGGCGGAGAGTAGGGAAAGAAAGTAATGAAGCTAAAACCTTAAAGGAAATTCAAAGGACCCATAGGGGCTTAATGTTGGATGAATAGCTTGTATAGTGGAATATGTTGGACGCATAAGATAAGCTTATTAAGGCTAGGTGGCCTCATATAGACTTGACACCTTAAAAGTGGGAGGTGTTAGAATGCATGTTGAGTGTTTCATGCAAGGTGACGCTTGGATAAATACATGGTTAGTTAAGCATGCAGCATGACATGTGGTAAACAAGTGGCAAAAAGAGAGGAGAAGTAAGATGACAAGCCCTCTATTCCAAAGCTATAAATAGGACCTTGAGGGAAAATGAGTGACTGATGGTCCTTTAGTTTTGAATTCTTGAGAGAAAGGGCAAAGAGAAGGCAGAAGAAAGAAATCCTTGTGTCTGTAGTAGAAGTCTGCGGCCAAGAGAAGATATTCTTGACAACTCAGTCATCTTATGTTAAAGAAAAAGTCTTGCGTCTAAGGGGTCAGCCTTACGTTCAAGATGAACTCAACATTCAAGATGAACGCCGCCGCGTCCAAGAGAGGTCGACGCATCCAAAAGAAGAAGCAAGAAGAGTTAAGCTGCACCTTGCGTCCAAGGAGCAAGGTTGCA
TCCAACCGAGCTGACAAAGGAAAATAAGTTGCCTTGCGTTCCAGACTAGAACGCTGCATCCAAGGAGCTCGAGAAAGAGAAATAGAAGGAAGAAAAGAACTATTCTTGAGCAAGTATAAGGTGAGTAACACCAACGTTTTTCAGAGTATAAATGAGTTCTTTGTGATTTATATTCTTGGATATCCATTCATGAGTTTTCAGCGTATATAGAGCATGCGTTTATATATAAAAAAAAAAACACATGAGTATGCTGGTTATTAGAAATGAGAGCATGCGTCTATATATAAAATGCACATGAGTATGCTGGTTATTAAAAATGAGTTTAGTAAGAAGTATGTTGTTGATAATACAAGATTTTCATTAAGAAGCATACTGCTAGATCTTAGATGATGTTTCTAGAACATAGTGTATTTTATTAAATTTTAGAAATGGCACCCAGGGTCAAGGACTAGAGGAAGGCAAATGGGTAGCCGGAAAATAGAAATGGGACCCCAATGCTCAAGGAGAAAAGGAAGGCATGTGGGCAGCCTAGGTTATAATATCTATGTGCACATAGAGGAATATACGACGCTAATGTGTTGAGATTACTCCACACAGCTAAGGATAATCGACGTTGGGTAGCGTTATCGCCTCAACTAAGGAAAAGATATGAAATGAATTCGACGTTGAGAAGCATAGAAGCCCTGCTCCGCCTCAACTAAGGAGAATAATTAGAAATGAATAAGATTTTTGAGTATTTTCCTGAAATAAGCATGCAAGACTTAACAATGGTACTTGAGTATGATTTAATATGAATAGAATCGATATTATAAATTATGACCAGATTTATAGTTGAAATAAGTTATTTTAAAAACTGTTACTCACTGAGCTGTTAGCTCATCCTTTTATATGTTTTCCATTTTTCAGGTAGCATAACAGTTCCCGTTGCTGGTCGAAGTTGAGGTTTGTCAAGAGATAACGGAATCACCGTCCTAGGAAGAGTGTTTTAAATGAGGGAAATAAGGTAAGCTGACAGTGGGTATCAATAGGGAAGTTGATGACTGTTGTCTTCACGCTCCGTCTCGGGCTATAGAAGGGTAGTTCGGGTTGGGGTGTGACAAGTTTTCTCCCTAGTTTTTCCCAAATCAAGAAAAGGATCCACAATCCTTTATCAAATCATCACTCAGAGAAAAAGAGAGGTTTGTGGTGGTGGTGTTCAGAGTCGACAATGAATGTAAGACTTCTAACTCTAAAGCATGCTTAACCTAGAATTAGTACAGTAGGTTTTCTGTATTTTGTTTTACAGTGTATGGGTATTTCAAAATCCCAATTAATTTCGAGTTTGGTTCCATATTTCTTCTGTTGCATTAAGGATTTCAACCCTACAATAACATCCCTTATCCCCAACAATTTAGGAGAAAGAAATTAGATGAACAATCCACTAAATTTTTAGAAATTTTCAAAAAGCTTCACATAAATATTCCTTTTGCAGATGCGCTAGAACAGATGTCGAACTATGTTAAGTTTATGAAGGATGTTTTGTCGAGAAGAAGTTTGAGAAGTACAAGACAATGAGTTTAATTGAGGTGTGTAGTGTCGTTCTGCAAAAGAAGCTACCTCAAAAATTAAAAGACCCCGGGAGCTTCACAATTCCTTGTACTATAGGTGATCTGACGGTAAATAGACCCTTATTTGATTTAGGAGCCTCGATAAATTTTATGTCGTTATCTGTGTACAGGAAGTTGAACATAGGGGAAGTGTAGCCGATAAGGAGGTCGTTCTCTTACTTACCCTCATGGTATAGTGGAAAATGTGTTGATTAAAGTAGATAAGTTCATCTTCCCGGAGGATTTTGTTGTGCTGGATATGGAAGAGTATGCGGAAGTCTCAATCATCTTGGGATGACCATTCTTAGCCACAAGTAGGGCTTTGATAGATGTCTAGAAGGGTAAGCTCACCTTAAGAGTAAATGACAAAGAGGTAATATTTAACATCTATAGATCTCTGAAATATGCCGACCA
TGATTATACGCGTCATAGTATAGATATTGTGGACAGAAATGTAGCTAAGTTTAGCGAGTTAGTTTTGTCTACAGATCAGTTGAGTCAAAGTATGTTAAACTTTGATACGGATAGGATAGACGTTTTAGACGATGATGTTACATTTTATTCTGAATGTGTGGATTATTTGTCTGTTAGCTATGCCAAACCTGATTTAATATCTCTTGTCTATGAGTGAACACAAAACAAACACACCACCAACTGTGGAATTAAAAGAGTTACCTGCACATTTAAGATGTCTTTCTAAGAGAGTCTTCTTCTTTTCCAGTCATTATTTCGTCAAAATTGTATGAAGTAGAAGAGGAAATACTGTTAAGAGTCTTGCGAAAGCATAAGTCGACAATAGGGTGGTCAATTAACGATTTGAAAGGTATCAGCCCATCTATAGTCCCTTAATTGAGCATCAGAGATGGCTTAATCTAGCAATGAAAGAGGTACTAAGGACCAAAGTCCTTAAATTGTTAAATGCAGGAATTATTTATGTTATTTCTGATAGTTCCTGGGTGGGTCCAGTGTAGGTAGTACCCAAAAAAGGGTGGGATGACAATGGTGAAAATGAGAATAATGAAAATAGCAATCTTTTGGACCTAGCAAAACCCTTGTGCAATCTTTTGGAAAAGGATTCTGAATTTAATTTTGATGATTCATGTCTTGCCACTTTCAAAACATTAAAAGAAAAATTGATTTATGCTCCTCATATCATTGTCACGCCTATTTGGAGTCAGTCTTTTGAATTAATGTGTGATGCTAATGATTTTGCTATAGGAGTTGTCTTAGGACAACATAGGGAGAAATTGTTTGGAGTTATTTACTATGCAAGTAGAACTCTCGATGTTGCTCAAGTTAATTATACTACTACAGAGAAATAGTTGCTTGCGGTAGTGTTTGCATTTGATAAGTATAGACCCTATTTGCATGGTTCTAAAGTAGTAGTTTGTACTGATCATGCTGCTCTTAAATTCTTATTTGAGAAGAAAGATGTAAAGACCATACTAATTAGGTAGGTTCTCTAGTTGATTTGATTTTACAGTTAAGGATAGGAAGGGTAGTGAGAATGTGGTGGCAAATGGTCTGTATGATTGAGTATATTGTCTGGTAACCATCAAGCTATCATGTATTGAGTGTACCGTGTATCGAGTGTGGTTTAATTTCTATCAGGGAATGTGTAACGAGTATGATTAACAAGTTTATGGTTATCGAGGAGCGAGTCAGCATGTATTGAGTAATTGAGTAATTGGGTATCGAGGATCGAATGCCCTGGACCTAGGCAGGCGCACGGATGGTTCTTAGGGGAAATGTGAACTTTCACACGTGACTAATGTAAGCAGAAGACCACTTTTCATTAGTTTTCCAGAAGTGGGCCAGTTTTCATTTGGGTCTTTAAAAGGAGGCTATCCGTACAAAATTCCCATTAATTTTCATTTGACCCAATAAAGTGGCCCTGTGTACAATTATTCCAATAGCCTTGGACAAATTCTTCTGCTACCAGGCTACCACACAGTACTATTCAACACTATAGACACTGACTGTGAAGGAAAGGTCTCCTATCAGGATCTGCTCACCCAACTTTCAAAGGAGTTAATCGAGGCGAGGGGCGCAACCGTTGTGGTTGCCTTCAATTCAAACGGAGACAAGATGTTAGGGATGGAGGAGTTTGGAAGGTTGGTGGAGGGGGAATGA

mRNA sequence

ATGCTCCCCATTACGCTTGAGACCCAAGAGCAAGTTTTGTGCACTGTTGTTGGTGACGATCCGAGGCCTGAAGATGAAGGTCAGAGGTCAAGATTGGGAGATGGCAATGGTGAAGGGAGTATCATGTGTATGCGACAAAGCCAACGAGCTAAACTTCTGCCTTTCAATCCAGAAATCAAATTAACTTTAAGAAAAGTAAGCAGGAATTTGAAAGCAGAATTTGTCGCCAGGGAAGATCAAACAGAAGTCACTAAGACTATTAGAGATTACTTCCACTTGATCGTCCCTACATTGCGACCTAGAATAGTAAAGGCTCCAATTAATGCAAACAACTTTGAGCTGAAACCTAGTTTAATTCAAATGGCAAAGGACAGCATATTCAGAGGGACTTCAAACGAAGATCCACATAAACATCTTCGATCTCTTCAAGAAATTTATGGAATGGCTTTCCTTAGCAAATTTTTCCTATCTAAGACGAGCAACTTGAGAATAAAAATTGGAACATTCAAGCAGAAAGAGGATGAGCAATTGTTCGAGGCTTGGGAGCGTTATAAAGAATTGTTGAGAAAGTGTCCCCAACACAATTATCTTGACTGGCTTCAGATTCAGTTGTTCTATAATGGGTTGTTAGGAACGACGAAGTCCATCCTAAATGTTGCAGCTGGTGGATCAATCTTTTCTAAAACTGTTGATGCTACTCGATCACTTTTAGAAGACATGGTTGTCACCAGTTACAACTGGCCATCAAAACAATCAACATCCAAAGTGGCTGAGCTCTATGAGCTTGATGAGCCTCCTTCAGGTTTTACAGATTCTAATCCTAATGCTGAGAAGAAATCATCTATGGAGGACTTGCTAACTGAATTTATCAAAGAGTCGAAAAATCGAACCAACTTATTAGAGAGCACTATTATGAGTCATGAGAAGACCATTCAAAATATGGAGGAAGGAGAGCAAGCGGTAGTGGAACCTGACTTGGACGAGAGAAGAGAAAATGAGGAGTGGAATGTTCAGGAGAAGATGGAAGAGGTGAGAATGGTCGTTTGCCGACTCCATATGCCTACCATCTTGAGGACAAGACCGAGGGGGGAGCCGACTCCATATGCCTACCATCTTGAGGACAAGACCGAATGGGGAGCTGGGGACATAATTATACAAGATGGAATTCACTTATTCTCATCTTTAGGTAGAAAGACCATGTTGGACACATGGTCAAGACAGGAAATGCATGATAAAGGTGAAACGCATAGGAGGAAATCAAGGAAGGAGATTTTTGAATTCTTGAGAGAAAGGGCAAAGAGAAGGCAGAAGAAAGAAATCCTTGTGTCTGTAGTAGAAGTCTGCGGCCAAGAGAAGATATTCTTGACAACTCAGTCATCTTATGTTAAAGAAAAAGTCTTGCGTCTAAGGGGTCAGCCTTACGTTCAAGATGAACTCAACATTCAAGATGAACGCCGCCGCGTCCAAGAGAGGTCGACGCATCCAAAAGAAGAAGCAAGAAGAGTTAAGCTGCACCTTGCGTCCAAGGAGCAAGATGCGCTAGAACAGATGTCGAACTATGTTAAGTTTATGAAGGATGTTTTGTCGAGAAGAAGTTTGAGAATGGAAAATGTGTTGATTAAAGTAGATAAGTTCATCTTCCCGGAGGATTTTGTTGTGCTGGATATGGAAGAATCTCTGAAATATGCCGACCATGATTATACGCGTCATAGTATAGATATTGTGGACAGAAATGTAGCTAAGTTTAGCGAGTTAGTTTTGTCTACAGATCAGTTGAGTCAAAGCTACCACACAGTACTATTCAACACTATAGACACTGACTGTGAAGGAAAGGTCTCCTATCAGGATCTGCTCACCCAACTTTCAAAGGAGTTAATCGAGGCGAGGGGCGCAACCGTTGTGGTTGCCTTCAATTCAAACGGAGACAAGATGTTAGGGATGGAGGAGTTTGGAAGGTTGGTGGAGGGGGAATGA

Coding sequence (CDS)

ATGCTCCCCATTACGCTTGAGACCCAAGAGCAAGTTTTGTGCACTGTTGTTGGTGACGATCCGAGGCCTGAAGATGAAGGTCAGAGGTCAAGATTGGGAGATGGCAATGGTGAAGGGAGTATCATGTGTATGCGACAAAGCCAACGAGCTAAACTTCTGCCTTTCAATCCAGAAATCAAATTAACTTTAAGAAAAGTAAGCAGGAATTTGAAAGCAGAATTTGTCGCCAGGGAAGATCAAACAGAAGTCACTAAGACTATTAGAGATTACTTCCACTTGATCGTCCCTACATTGCGACCTAGAATAGTAAAGGCTCCAATTAATGCAAACAACTTTGAGCTGAAACCTAGTTTAATTCAAATGGCAAAGGACAGCATATTCAGAGGGACTTCAAACGAAGATCCACATAAACATCTTCGATCTCTTCAAGAAATTTATGGAATGGCTTTCCTTAGCAAATTTTTCCTATCTAAGACGAGCAACTTGAGAATAAAAATTGGAACATTCAAGCAGAAAGAGGATGAGCAATTGTTCGAGGCTTGGGAGCGTTATAAAGAATTGTTGAGAAAGTGTCCCCAACACAATTATCTTGACTGGCTTCAGATTCAGTTGTTCTATAATGGGTTGTTAGGAACGACGAAGTCCATCCTAAATGTTGCAGCTGGTGGATCAATCTTTTCTAAAACTGTTGATGCTACTCGATCACTTTTAGAAGACATGGTTGTCACCAGTTACAACTGGCCATCAAAACAATCAACATCCAAAGTGGCTGAGCTCTATGAGCTTGATGAGCCTCCTTCAGGTTTTACAGATTCTAATCCTAATGCTGAGAAGAAATCATCTATGGAGGACTTGCTAACTGAATTTATCAAAGAGTCGAAAAATCGAACCAACTTATTAGAGAGCACTATTATGAGTCATGAGAAGACCATTCAAAATATGGAGGAAGGAGAGCAAGCGGTAGTGGAACCTGACTTGGACGAGAGAAGAGAAAATGAGGAGTGGAATGTTCAGGAGAAGATGGAAGAGGTGAGAATGGTCGTTTGCCGACTCCATATGCCTACCATCTTGAGGACAAGACCGAGGGGGGAGCCGACTCCATATGCCTACCATCTTGAGGACAAGACCGAATGGGGAGCTGGGGACATAATTATACAAGATGGAATTCACTTATTCTCATCTTTAGGTAGAAAGACCATGTTGGACACATGGTCAAGACAGGAAATGCATGATAAAGGTGAAACGCATAGGAGGAAATCAAGGAAGGAGATTTTTGAATTCTTGAGAGAAAGGGCAAAGAGAAGGCAGAAGAAAGAAATCCTTGTGTCTGTAGTAGAAGTCTGCGGCCAAGAGAAGATATTCTTGACAACTCAGTCATCTTATGTTAAAGAAAAAGTCTTGCGTCTAAGGGGTCAGCCTTACGTTCAAGATGAACTCAACATTCAAGATGAACGCCGCCGCGTCCAAGAGAGGTCGACGCATCCAAAAGAAGAAGCAAGAAGAGTTAAGCTGCACCTTGCGTCCAAGGAGCAAGATGCGCTAGAACAGATGTCGAACTATGTTAAGTTTATGAAGGATGTTTTGTCGAGAAGAAGTTTGAGAATGGAAAATGTGTTGATTAAAGTAGATAAGTTCATCTTCCCGGAGGATTTTGTTGTGCTGGATATGGAAGAATCTCTGAAATATGCCGACCATGATTATACGCGTCATAGTATAGATATTGTGGACAGAAATGTAGCTAAGTTTAGCGAGTTAGTTTTGTCTACAGATCAGTTGAGTCAAAGCTACCACACAGTACTATTCAACACTATAGACACTGACTGTGAAGGAAAGGTCTCCTATCAGGATCTGCTCACCCAACTTTCAAAGGAGTTAATCGAGGCGAGGGGCGCAACCGTTGTGGTTGCCTTCAATTCAAACGGAGACAAGATGTTAGGGATGGAGGAGTTTGGAAGGTTGGTGGAGGGGGAATGA

Protein sequence

MLPITLETQEQVLCTVVGDDPRPEDEGQRSRLGDGNGEGSIMCMRQSQRAKLLPFNPEIKLTLRKVSRNLKAEFVAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYGMAFLSKFFLSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLLGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQSTSKVAELYELDEPPSGFTDSNPNAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKTIQNMEEGEQAVVEPDLDERRENEEWNVQEKMEEVRMVVCRLHMPTILRTRPRGEPTPYAYHLEDKTEWGAGDIIIQDGIHLFSSLGRKTMLDTWSRQEMHDKGETHRRKSRKEIFEFLRERAKRRQKKEILVSVVEVCGQEKIFLTTQSSYVKEKVLRLRGQPYVQDELNIQDERRRVQERSTHPKEEARRVKLHLASKEQDALEQMSNYVKFMKDVLSRRSLRMENVLIKVDKFIFPEDFVVLDMEESLKYADHDYTRHSIDIVDRNVAKFSELVLSTDQLSQSYHTVLFNTIDTDCEGKVSYQDLLTQLSKELIEARGATVVVAFNSNGDKMLGMEEFGRLVEGE
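The relationship between the coding sequence and the protein sequence above can be spot-checked with a short translation sketch. It assumes the standard genetic code; the function name `translate` and the two excerpt variables are illustrative and not part of the database record. Only the first 21 and last 12 nucleotides of the CDS are used here, so this checks the ends of the protein rather than the whole sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS above (hypothetical variable names).
cds_start = "ATGCTCCCCATTACGCTTGAG"   # first 7 codons
cds_end = "GAGGGGGAATGA"             # last 3 codons plus the TGA stop

print(translate(cds_start))  # MLPITLE
print(translate(cds_end))    # EGE
```

Pasting the full CDS into `translate` should reproduce the complete protein sequence if the record is internally consistent; a 1974-nt CDS ending in a stop codon corresponds to 657 residues.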
Homology
BLAST of CmUC03G058350.1 vs. NCBI nr
Match: WP_217833153.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])

HSP 1 Score: 224.2 bits (570), Expect = 3.4e-54
Identity = 128/262 (48.85%), Positives = 156/262 (59.54%), Query Frame = 0

Query: 52  LLPFNPEIKLTLRKVSRNLKAEF-VAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINAN 111
           LLP +PEI  T R   RNL+A      E   E+ K IRDYF   +P  +P I+  PIN N
Sbjct: 24  LLPLDPEIDRTYR---RNLRALLNQTTEMAEEIPKAIRDYFQPTLPASQPGIMNVPINVN 83

Query: 112 NFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYG----------------------- 171
           NFELKP LIQMA++  FRG +NEDPHKHLRS  EI G                       
Sbjct: 84  NFELKPGLIQMARELAFRGRTNEDPHKHLRSFLEICGTVKMNGVSNDAIKLRLFPFSLQD 143

Query: 172 ---------------------MAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYK 231
                                 AFL+K+F  +K+  LR +IGTF+Q EDEQL+EAWERYK
Sbjct: 144 RAKDWLETIPPDSITTWEILAQAFLNKYFPPAKSQRLRTEIGTFRQLEDEQLYEAWERYK 203

Query: 232 ELLRKCPQHNYLDWLQIQLFYNGLLGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSY 265
           +LLR+CPQH Y DWLQIQLFYNGL  +TKSIL+  AGGSIFSK      ++LED+  TSY
Sbjct: 204 DLLRRCPQHGYPDWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTILEDLATTSY 263
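The percentages in each HSP header are the identical (or positively scoring) positions divided by the alignment length. A minimal sketch of that arithmetic, using the counts reported for the first two NCBI nr hits; the helper name `percent` is illustrative only.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP headers on this page.
print(percent(128, 262))  # identity, WP_217833153.1
print(percent(156, 262))  # positives, WP_217833153.1
print(percent(114, 308))  # identity, KAG7990634.1
```

Note that the denominator is the length of the aligned region including gaps, not the full length of the 657-residue query, so a high percentage over a short HSP does not imply similarity across the whole protein.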

BLAST of CmUC03G058350.1 vs. NCBI nr
Match: KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])

HSP 1 Score: 177.6 bits (449), Expect = 3.6e-40
Identity = 114/308 (37.01%), Positives = 170/308 (55.19%), Query Frame = 0

Query: 44  MRQSQRAKLLPFNPEIKLTLRKVSRNLKAEFVAREDQTEVTKTIRDYFHLIVPTLRPRIV 103
           MR+++   ++P +PEI+ TLR + RN K   +A ED+  + +T++DY   +V      I+
Sbjct: 1   MRRARSRDIIPVDPEIERTLRSLRRN-KILAMAEEDREVLPRTLKDYVRPVVNGNYSSIM 60

Query: 104 KAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEI------------------ 163
           + PINANNFELKP+LI M + + F G+  +DP+ HL    EI                  
Sbjct: 61  RQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRLRL 120

Query: 164 --------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLF 223
                                         FL+KFF  +KT+ LR +IG FKQ + E L+
Sbjct: 121 FPFSLRDKARGWLQSLQPGSIVSWQDMAERFLAKFFPPAKTAQLRSEIGQFKQNDFESLY 180

Query: 224 EAWERYKELLRKCPQHNYLDWLQIQLFYNGLLGTTKSILNVAAGGSIFSKTVDATRSLLE 283
           EAWERYK+L+R+CPQH   DWLQ+Q+FYNGL G T++I++ A+GG++ SKT +   +LLE
Sbjct: 181 EAWERYKDLIRRCPQHGLPDWLQVQMFYNGLNGQTRTIVDAASGGTLMSKTAEGATALLE 240

Query: 284 DMVVTSYNWPSKQS-TSKVAELYELDEPPSGFTDSNPNAEKKSSMEDLLTEFIKESKNRT 306
           +M   +Y WP++++   KVA +++L EP +    S   A     +  L T+ I +S   T
Sbjct: 241 EMASNNYQWPTERTLAKKVAGIHDL-EPIAAL--SAQVATLSHQISALTTQRIPQS---T 300

BLAST of CmUC03G058350.1 vs. NCBI nr
Match: XP_022157708.1 (uncharacterized protein LOC111024361 [Momordica charantia])

HSP 1 Score: 171.8 bits (434), Expect = 2.0e-38
Identity = 185/658 (28.12%), Positives = 281/658 (42.71%), Query Frame = 0

Query: 86  TIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEI 145
           TIRDY     P     I+  PINANN ELKP LIQM +++ FRG + EDP+ HL    ++
Sbjct: 26  TIRDYCQPNFPN-HVGIINLPINANNSELKPGLIQMVRENTFRGNATEDPNNHLTIFLDV 85

Query: 146 YG---------------------------MAFLSKFF-LSKTSNLRIKIGTFKQKEDEQL 205
            G                            AFL+ FF  +KT+ LR +I +F++ + EQL
Sbjct: 86  CGTVKMNGVIDDAIRLRLFPLSLQDKEMVQAFLTNFFPPAKTTQLRTEIRSFRKYDYEQL 145

Query: 206 FEAWERYKELLRKCPQHNYLDWLQIQLFYNGLLGTTKSILNVAAGGSIFSKTVDATRSLL 265
           FE WERYKELLRKCPQH  L+WLQIQ+FYNGL G T++IL+ AAGG++ S+T +    LL
Sbjct: 146 FEVWERYKELLRKCPQHGNLEWLQIQMFYNGLNGQTRTILDAAAGGTLLSRTPENAYILL 205

Query: 266 EDMVVTSYNWPSKQSTS-KVAELYELDEPPS------------------GFTDSNP---- 325
           +DM   S+ WPS++S + KVA +YE+DE  S                  G + SN     
Sbjct: 206 KDMADNSFQWPSERSNAKKVAGMYEIDELSSLKAQVQALTNAVSKLSGPGTSHSNELVAA 265

Query: 326 -------------------NAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKTIQNME 385
                               AEKKSS+EDLL  FI E ++R +            I+N  
Sbjct: 266 TDTYSYYEPTIEQAQFTSHPAEKKSSLEDLLGAFINECRSRAS-----------RIENQV 325

Query: 386 EGEQAVVEPDLDERRENEEWNVQEKMEEVRMVVCRLHMPTILRTRPRGEPTPYAYHLEDK 445
           EG +  +E +    +     N++ ++ ++         PT L T  +G+   +   +E K
Sbjct: 326 EGMEVKLEGNTTSIK-----NMEVQIGQI--------APT-LNTMQKGK---FPSDIEVK 385

Query: 446 TEWGAGDIIIQDGIHLFSSLGRKTMLDTWSRQEMHDKGETHRRKSRKEIFEFLRERAKRR 505
                  + ++ G  L              +++M +   T   +  KE  E ++E     
Sbjct: 386 PREHCKAVTLRSGKELQEP----------EKKKMEEPVITTEERENKE--EVVKEATPAL 445

Query: 506 QKKEILVSVVEVCGQEKIF----LTTQSSYVK--EKVLRLRGQPYVQDELNIQDERRRVQ 565
           Q  +   S+V        +    L    +YV+  + ++  + +    + +N+ +E   + 
Sbjct: 446 QADKPTSSIVSSPPNSLPYPQHALEQMPNYVRFMKDIMTGKRKLEAYETVNLTEECSAIL 505

Query: 566 ERSTHPKEEARRVKLHLASKEQDALEQMSNYVKFMKDVLSRRSLR----MENVLIKVDKF 622
           +R    K +         S         S++ K + D+ +  +L     +E+VL+KVD+ 
Sbjct: 506 QRKLPQKLK------DPGSFTIPCTISSSSFNKALCDICASINLMPLGVIEDVLVKVDRL 565

BLAST of CmUC03G058350.1 vs. NCBI nr
Match: XP_022883666.1 (uncharacterized protein LOC111400483 [Olea europaea var. sylvestris])

HSP 1 Score: 167.9 bits (424), Expect = 2.9e-37
Identity = 107/286 (37.41%), Positives = 164/286 (57.34%), Query Frame = 0

Query: 39  GSIMCMRQSQRAKLLPFNPEIKLT---LRKVSRNLKAEFVARE----DQTEVTKTIRDYF 98
           G+ + MR+++   LLP +PE + T   LR + RN +     ++    ++    K I DY 
Sbjct: 2   GNQLFMRRARYLDLLPVDPEPERTFRILRSIQRNEREAMTEQDARAANEDNQQKAIWDYI 61

Query: 99  HLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEIYGMA-- 158
             +V      I +  I ANNFELKP LI M + + F G + EDP+ HL S  EI      
Sbjct: 62  RPVVNNNYSGITRPAITANNFELKPGLIYMVQQNQFGGAAIEDPNAHLGSFLEICDTVKM 121

Query: 159 ---FLSKFFL-SKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLF 218
              FL+K+F  SK++ L  +I  FKQ + E  +EAWER+K+LLR+CPQH +  W+QI++F
Sbjct: 122 NGKFLTKYFTPSKSAQLHGEISQFKQLDFEPFYEAWERFKDLLRRCPQHGFQKWVQIEIF 181

Query: 219 YNGLLGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQS-TSKVAELYELDE 278
           YNGL G T+ +++ AAGG + +K  +A  +LL+D+   SY WPS++S   KVA L+E+D 
Sbjct: 182 YNGLNGQTRIMVDAAAGGILMAKIAEAAYALLDDIATNSYQWPSERSGVKKVAGLHEVD- 241

Query: 279 PPSGFTDSNPNAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKT 311
           P +        A+  S    ++T  +    N+  ++ ST  SH++T
Sbjct: 242 PITALA-----AQVASLTNQIVT--LTTQGNQQKVVMSTSSSHQET 279

BLAST of CmUC03G058350.1 vs. NCBI nr
Match: KAG7947748.1 (hypothetical protein I3843_14G109500 [Carya illinoinensis])

HSP 1 Score: 164.5 bits (415), Expect = 3.2e-36
Identity = 90/235 (38.30%), Positives = 134/235 (57.02%), Query Frame = 0

Query: 75  VAREDQTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNED 134
           +A ED+  + +T++DY   +V      I++ PINANNFELKP+LI M + + F G+  +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 135 PHKHLRSLQEI--------------------------------------------YGMAF 194
           P+ HL    EI                                                F
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 195 LSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGL 254
           L+KFF  +KT+ LR +IG FKQ + E L+EAWERYK+L+R+CPQH   DWLQ+Q+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 255 LGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQS-TSKVAELYELD 264
            G T++I++ A+GG++ SKT +   +LLE+M   +Y WP++++   KVA ++EL+
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKVAGIHELE 235

BLAST of CmUC03G058350.1 vs. ExPASy TrEMBL
Match: A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)

HSP 1 Score: 171.8 bits (434), Expect = 9.6e-39
Identity = 185/658 (28.12%), Positives = 281/658 (42.71%), Query Frame = 0

Query: 86  TIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSIFRGTSNEDPHKHLRSLQEI 145
           TIRDY     P     I+  PINANN ELKP LIQM +++ FRG + EDP+ HL    ++
Sbjct: 26  TIRDYCQPNFPN-HVGIINLPINANNSELKPGLIQMVRENTFRGNATEDPNNHLTIFLDV 85

Query: 146 YG---------------------------MAFLSKFF-LSKTSNLRIKIGTFKQKEDEQL 205
            G                            AFL+ FF  +KT+ LR +I +F++ + EQL
Sbjct: 86  CGTVKMNGVIDDAIRLRLFPLSLQDKEMVQAFLTNFFPPAKTTQLRTEIRSFRKYDYEQL 145

Query: 206 FEAWERYKELLRKCPQHNYLDWLQIQLFYNGLLGTTKSILNVAAGGSIFSKTVDATRSLL 265
           FE WERYKELLRKCPQH  L+WLQIQ+FYNGL G T++IL+ AAGG++ S+T +    LL
Sbjct: 146 FEVWERYKELLRKCPQHGNLEWLQIQMFYNGLNGQTRTILDAAAGGTLLSRTPENAYILL 205

Query: 266 EDMVVTSYNWPSKQSTS-KVAELYELDEPPS------------------GFTDSNP---- 325
           +DM   S+ WPS++S + KVA +YE+DE  S                  G + SN     
Sbjct: 206 KDMADNSFQWPSERSNAKKVAGMYEIDELSSLKAQVQALTNAVSKLSGPGTSHSNELVAA 265

Query: 326 -------------------NAEKKSSMEDLLTEFIKESKNRTNLLESTIMSHEKTIQNME 385
                               AEKKSS+EDLL  FI E ++R +            I+N  
Sbjct: 266 TDTYSYYEPTIEQAQFTSHPAEKKSSLEDLLGAFINECRSRAS-----------RIENQV 325

Query: 386 EGEQAVVEPDLDERRENEEWNVQEKMEEVRMVVCRLHMPTILRTRPRGEPTPYAYHLEDK 445
           EG +  +E +    +     N++ ++ ++         PT L T  +G+   +   +E K
Sbjct: 326 EGMEVKLEGNTTSIK-----NMEVQIGQI--------APT-LNTMQKGK---FPSDIEVK 385

Query: 446 TEWGAGDIIIQDGIHLFSSLGRKTMLDTWSRQEMHDKGETHRRKSRKEIFEFLRERAKRR 505
                  + ++ G  L              +++M +   T   +  KE  E ++E     
Sbjct: 386 PREHCKAVTLRSGKELQEP----------EKKKMEEPVITTEERENKE--EVVKEATPAL 445

Query: 506 QKKEILVSVVEVCGQEKIF----LTTQSSYVK--EKVLRLRGQPYVQDELNIQDERRRVQ 565
           Q  +   S+V        +    L    +YV+  + ++  + +    + +N+ +E   + 
Sbjct: 446 QADKPTSSIVSSPPNSLPYPQHALEQMPNYVRFMKDIMTGKRKLEAYETVNLTEECSAIL 505

Query: 566 ERSTHPKEEARRVKLHLASKEQDALEQMSNYVKFMKDVLSRRSLR----MENVLIKVDKF 622
           +R    K +         S         S++ K + D+ +  +L     +E+VL+KVD+ 
Sbjct: 506 QRKLPQKLK------DPGSFTIPCTISSSSFNKALCDICASINLMPLGVIEDVLVKVDRL 565

BLAST of CmUC03G058350.1 vs. ExPASy TrEMBL
Match: A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 2.7e-33
Identity = 102/285 (35.79%), Positives = 146/285 (51.23%), Query Frame = 0

Query: 44  MRQSQRAKLLPFNPEIKLTLRKVSR-NLKAEFVAREDQT-----------------EVTK 103
           M++     L+PF+P+I+ T R+  R NL+   VA  +QT                 E  +
Sbjct: 1   MQRRNNLNLVPFDPDIERTFRRHRRENLQ---VATLNQTMAEDNNNNGNNAINLVPEANR 60

Query: 104 TIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSI-FRGTSNEDPHKHLRSLQE 163
            +RDY   +V  L   I +  INANNFE+KP+ IQM + S+ F G  ++DP+ HL +  E
Sbjct: 61  ALRDYVVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLE 120

Query: 164 I--------------------------------------------YGMAFLSKFF-LSKT 223
           I                                                FL+KFF  +KT
Sbjct: 121 ICDTFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKT 180

Query: 224 SNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLLGTTKSILNV 264
           + +R  I +F Q + E L+EAWER+KELLR+CP H   DWLQ+Q FYNGL+G+ K+I++ 
Sbjct: 181 AKMRNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDA 240

BLAST of CmUC03G058350.1 vs. ExPASy TrEMBL
Match: A0A6J0ZYV0 (uncharacterized protein LOC110413413 OS=Herrania umbratica OX=108875 GN=LOC110413413 PE=4 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 3.5e-33
Identity = 102/285 (35.79%), Positives = 146/285 (51.23%), Query Frame = 0

Query: 44  MRQSQRAKLLPFNPEIKLTLRKVSR-NLKAEFVAREDQT-----------------EVTK 103
           M++     L+PF+P+I+ T R+  R NL+   VA  +QT                 E  +
Sbjct: 1   MQRRNNLNLVPFDPDIERTFRRHRRENLQ---VATLNQTMAEDNNNNGNNAINLVPEANR 60

Query: 104 TIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSI-FRGTSNEDPHKHLRSLQE 163
            +RDY   +V  L   I +  INANNFE+KP+ IQM + S+ F G  ++DP+ HL +  E
Sbjct: 61  ALRDYAVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLE 120

Query: 164 I--------------------------------------------YGMAFLSKFF-LSKT 223
           I                                                FL+KFF  +KT
Sbjct: 121 ICDTFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKT 180

Query: 224 SNLRIKIGTFKQKEDEQLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLLGTTKSILNV 264
           + +R  I +F Q + E L+EAWER+KELLR+CP H   DWLQ+Q FYNGL+G+ K+I++ 
Sbjct: 181 AKMRNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDA 240

BLAST of CmUC03G058350.1 vs. ExPASy TrEMBL
Match: A0A3S3N117 (Retrotrans_gag domain-containing protein OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_01212200 PE=4 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 5.1e-32
Identity = 90/269 (33.46%), Positives = 137/269 (50.93%), Query Frame = 0

Query: 44  MRQSQRAKLLPFNPEIKLTLRKVSRNLK--AEFVAREDQTEVTKTIRDYFHLIVPTLRPR 103
           MR++Q   L+P +PEI+ TLR++ +  K  +EF   E + +  +++ DY   +V      
Sbjct: 1   MRRNQNLNLVPLDPEIERTLRRLKKEKKQQSEFEITEMKEQANRSLGDYAVPLVTGATSS 60

Query: 104 IVKAPINANNFELKPSLIQMAKDSI-FRGTSNEDPHKHLRSLQEI--------------- 163
           I +  I ANNFE+KP++IQM   ++ F G  ++DP+ H+ +  E+               
Sbjct: 61  IRRPVIQANNFEIKPAIIQMVASTVQFSGLPDDDPNAHISNFLELCDTFKYNGVTDDAVR 120

Query: 164 -----------------------------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDE 223
                                            FL+KFF  +KT  +R  I TF Q E E
Sbjct: 121 LRLLPFSLRDKAKAWLNSLPQSTITTWDELAKKFLAKFFPPTKTVKMRNDITTFAQNEME 180

Query: 224 QLFEAWERYKELLRKCPQHNYLDWLQIQLFYNGLLGTTKSILNVAAGGSIFSKTVDATRS 264
            L+EAWERYKELLRKCP H    W+Q+Q FYNGL   T++ ++ A GG++  K+ +    
Sbjct: 181 SLYEAWERYKELLRKCPHHGLPLWIQVQTFYNGLQSATRTSIDAATGGTLMKKSPEEAYE 240

BLAST of CmUC03G058350.1 vs. ExPASy TrEMBL
Match: A0A061E3H0 (RT_RNaseH domain-containing protein OS=Theobroma cacao OX=3641 GN=TCM_008289 PE=4 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 2.5e-31
Identity = 81/193 (41.97%), Positives = 119/193 (61.66%), Query Frame = 0

Query: 80  QTEVTKTIRDYFHLIVPTLRPRIVKAPINANNFELKPSLIQMAKDSI-FRGTSNEDPHKH 139
           Q E TK +RDY    V +L   I + PI ANNFE+KPS+I+M +    F G  N+D + H
Sbjct: 5   QGEETKFLRDYVVPQVQSLHSSIRRPPIQANNFEIKPSIIKMNQTFFQFEGLPNDDLNAH 64

Query: 140 LRSLQEI------YGMAFLSKFF-LSKTSNLRIKIGTFKQKEDEQLFEAWERYKELLRKC 199
           + +  +I          F + FF L+KT+ +R  I  F Q +   L+ AWERYK+L+R+C
Sbjct: 65  IVNFLKICDTFKANAKKFFTNFFPLAKTTKIRNDITFFMQFDSISLYVAWERYKDLIRRC 124

Query: 200 PQHNYLDWLQIQLFYNGLLGTTKSILNVAAGGSIFSKTVDATRSLLEDMVVTSYNWPSKQ 259
           P H    WLQ+Q FYNGLLG  ++ ++  AGG++ SK++D T  LL++M   +Y WPS++
Sbjct: 125 PHHGLPKWLQLQTFYNGLLGPFRTTIDATAGGALMSKSIDNTYDLLKEMASNNYQWPSER 184

Query: 260 -STSKVAELYELD 264
            ST K+A ++ LD
Sbjct: 185 LSTRKIARVHGLD 197

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
WP_217833153.1  3.4e-54  48.85         retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70...
KAG7990634.1    3.6e-40  37.01         hypothetical protein I3843_02G035100 [Carya illinoinensis]
XP_022157708.1  2.0e-38  28.12         uncharacterized protein LOC111024361 [Momordica charantia]
XP_022883666.1  2.9e-37  37.41         uncharacterized protein LOC111400483 [Olea europaea var. sylvestris]
KAG7947748.1    3.2e-36  38.30         hypothetical protein I3843_14G109500 [Carya illinoinensis]

Match Name      E-value  Identity (%)  Description
A0A6J1DU19      9.6e-39  28.12         uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J0ZX64      2.7e-33  35.79         LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ...
A0A6J0ZYV0      3.5e-33  35.79         uncharacterized protein LOC110413413 OS=Herrania umbratica OX=108875 GN=LOC11041...
A0A3S3N117      5.1e-32  33.46         Retrotrans_gag domain-containing protein OS=Cinnamomum micranthum f. kanehirae O...
A0A061E3H0      2.5e-31  41.97         RT_RNaseH domain-containing protein OS=Theobroma cacao OX=3641 GN=TCM_008289 PE=...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                  Source       Source Term  Source Description   Alignment
None       No IPR available                 MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 17..39
IPR018247  EF-Hand 1, calcium-binding site  PROSITE      PS00018      EF_HAND_1            coord: 605..617
IPR011992  EF-hand domain pair              SUPERFAMILY  47473        EF-hand              coord: 591..654

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name   Unique Name    Type
CmUC03G058350  CmUC03G058350  gene


The following exon feature(s) are a part of this mRNA:

Feature Name          Unique Name                                        Type
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9608344..9608534  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9609974..9610083  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9610210..9610279  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9610523..9610591  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9612069..9612331  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9613259..9613342  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9614726..9614883  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9615474..9615557  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9615698..9615850  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9616110..9616457  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9616589..9616914  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9617763..9617840  exon
CmUC03G058350.1-exon  CmUC03G058350.1-exon-CmU531Chr03:9618023..9618062  exon
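
The exon coordinates above can be checked against the sequence length reported in the Overview. The sketch below assumes the coordinates are 1-based and inclusive, so each exon spans `end - start + 1` bases; the variable names are illustrative.

```python
# Exon intervals on CmU531Chr03, copied from the exon list (start, end).
EXONS = [
    (9608344, 9608534), (9609974, 9610083), (9610210, 9610279),
    (9610523, 9610591), (9612069, 9612331), (9613259, 9613342),
    (9614726, 9614883), (9615474, 9615557), (9615698, 9615850),
    (9616110, 9616457), (9616589, 9616914), (9617763, 9617840),
    (9618023, 9618062),
]

# Length of each exon under the 1-based inclusive assumption.
exon_lengths = [end - start + 1 for start, end in EXONS]
spliced_length = sum(exon_lengths)

# Genomic span of the feature, from the first exon start to the last exon end.
genomic_span = EXONS[-1][1] - EXONS[0][0] + 1
intron_total = genomic_span - spliced_length

print(spliced_length)  # 1974, matching the reported sequence length
print(genomic_span)    # 9719
print(intron_total)    # 7745
```

Because the CDS features listed for this mRNA use the same coordinates as the exons, the same sum applies to the coding sequence, which is consistent with the mRNA and CDS sequences on this page being identical.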


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9608344..9608534CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9609974..9610083CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9610210..9610279CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9610523..9610591CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9612069..9612331CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9613259..9613342CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9614726..9614883CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9615474..9615557CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9615698..9615850CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9616110..9616457CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9616589..9616914CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9617763..9617840CDS
CmUC03G058350.1-cdsCmUC03G058350.1-cds-CmU531Chr03:9618023..9618062CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name     Unique Name              Type
CmUC03G058350.1  CmUC03G058350.1-protein  polypeptide