CcUC03G043060 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC03G043060
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CicolChr03: 756650 .. 761632 (+)
RNA-Seq Expression: CcUC03G043060
Synteny: CcUC03G043060
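
The location string in the overview can be split into its parts to get the span of the gene. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SeqId: start .. end (strand)' into a dict.

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("CicolChr03: 756650 .. 761632 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # CicolChr03 + 4983
```

Under that assumption the feature spans 4983 bp on the forward strand of CicolChr03.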
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTTATTTTCTTTGAAAAAAAAGAATCATGAATTAATTAAATTTTGGGGTAGGGGCAGCGAGGGAATTTCACATCAAAAACATGTCCATACATTATTTCCTTTTTATTTTATTACCTTTAACGCGGTTTAGATTTCTTTTTCATTTTCTCTCTTTTCGTAAAGCTTTGTCTACGTAATGTAATACTTAACCAAATATATTGATGGTTATATAGTTTGTTTATAATATAAGATATGTTGATTTCACATCCTATTTATTCTAAATTAATTAATTGAAACTTTAGCATCAAAAACTAAAATATGAACTAAAAGTTTAAAATGTCGATGTCGACGAAAATATTGAAGTCTTTATTTATAGGAATTTTGATGAAAATATCAATAAAATATCTGGAAAGTGTAGAGTTGGTAAGTGTGAAGTTGTGAACTCCACTCTTTGTTTGGCCCAAAGAGTTTGTATGTTTCACTATTAAAATACATCAATTTTATACCTTATTAACTTCTTCCACTTCGAGCTCTTAGGAGTTGACAACTCTATGAAGTTCACAACTCTACTCCTTCCTCCAAACGTCCATATAGCTTTAACCAATTTTCTAACCGACATTATACTTAGTGGCTCAAAGATGTCTGTGAGATCCAGTTAACCATTAGCTGAAGGCATTCATGTTGAATACCCTCACATGAAGGTGATAACAACACTCACCATGTGGTTGCTTTGCTACTAGAACGATCTAGCTGTCAAATAGTTGCCTAAGCTAGTTCAAAAAAATCTACAACTTAAGAAAGCTTAAGCAACCCAACTCAAATCACAAAGGGTTTCTATTTGTAGGTGAATAAAATCAAAATTCGGTTTAACTTGAACTAATTTGATTTAGCATTTGTATTAAAAATGTGTGTATTCTTAGGGTTTCTCTTTTCCCATCCACTGCCCCTAGCCTTTTTTCTATAATAATAGTCTTTGTGTGCATCCCTTCATCCAACATTTCTTTTTTGTTTTCTCCGTAATGTAACTTCTTAATTTTTGAAAATCCAAATAATCCAACCTAACCCAAATTAAAATTGCAAGGGTTGAGCTGTCTAGCACTTGACCAACTCAGATGTTCTGGTTGGTAAAAAAAATAATGGCCTCTAACCCAACCCAACCAAAATTGAAGTTGTGAGGGGCTGGTTGCTCACTACTCAACCAACTCAAAATTTTCAGGTTGGTCAAAAAAATGTAAGATCTAACTTCACATGACCTAAATCGAAATTGTGAGGATGGAGTTTTGTGCCCCTTCTTGATCATCTCAGATTTTCGGATTGATCCAAAAAAACCATCCCTCTAATCTAACCTCATCCAACCTATGTACAACCCTTTAGGTAATAAGGTAGCCCTATACAAACAAACTGCATAGATGTTTGTAAAGCGAGTATTTCCAAGTTCTTAACATTTTCAAATCGCACATACAAAATTTACATGTATATTGTACACCTAAAGAATAGAAGCCGTCTCAACCAAGGCAGGTGCAAGCGAATTATAGCTCTTTAAATAGACTGAGCTCAGAGTCTCAACACATCTATTACATAGAATTACATTAAAATCCTCTATAAAGCATTATAACATCAACACTACTCTCCTCAATGCTCTAAAATGGAAGAAGAAACTTACAAAACTTCTACTTTGAGGCTCTTTTGTGCAGCCACCTTTTCCTCCTATATTGCCTTCCTCCTTTCATCTTGCTGCAACCAACTGTAAATGATATCACTACCAGCACGGTTGAGTGAGCACGGAGGGACTGAAGAAACAGAGGCAGCTGACTTCTCGAGGAACTGCCTCTGCTTCTCATCGGCGATGCACAGCAAGGAATCAGCACATACTTCTAAAGCCGATGGGGGCGGGGTTGGCTTGAACTCCAGAAGCTTTGAGTACTCTGTGATGAGGTGAAACATGTAAGAGTAGACTGTGTCCATGCTCAAGCTCTCCATGAAATTCTGTCCCTGTCGTCCTATAGCCTCGGCCTGT
TTATAATATACATTTTACACGTATCGTTCTTAACATAGTCTTATTTCGCTCAATACTTCCCCCAATTAAAACCAATGGAAATTCTTCAGAACATATTCTAAGGGCAGATTTGGTTCATGATTATAACTTGTGTTTGTTAACCCATGGAATTAATTAGGCTCGGTTGGTAACTATTTGGTTTTTTGTTTTTGTTACCAAACGGGACCTAAAATACTAGATTATAGTAAAAACGTGAAAGTGGCATATATCCTCCCTCTAGAGTAAACAAACCCACATATTAAATGTCAAATCAACTTCTTAATTTCAGATTACATAAACCTAACTTATGAACCAAACATATTTCAGAGGCTTCAACTTTTAGAGTTAATCTTAATTGTGGGTTTATAATTTGTAAAATTTGGTACGCAGTATGAAGGCAGGTTTCATAAGTTAGTCTAGTCTAGCATCTTACACTTTCGAGTTCAGTGTTGATTAACAAAAACAATCACCAACTATCAGACTCACAGATTCAGCCAAACCTATTCTTCTCTATACAAAACTATTGCAGGAAGAAGAAAGTTTTGTGATAAACATGAACCAAGAAAGAAGATAAGGAAGTTAGAATAATACCTCAGGCAAATGAGTATTTCCCCAGTCAACAGCATGCTTAATAGACTCGCACATGTTAGAGAAGGGGATGGGCCAATAGTTCTTCAAAGGATGAAGACCACGGCTGAAGAAATCTTCATATTGGGGTGAAATAATCAAAGACATTGAACCACATGAAAGAATGTACTTCAAGCTCACAGACCAAGCAAACCCTTCAGCATAGATTTTATACCTGCAGAAGCAGACCAACACATCGTGGGCTCAGCTAGCTTAGACAACAAAACTGACTTGTCTATGTCCCCTCTCCTCCACAAGTAAAGAAAAAAACTTCATATTCTAATGGCTGAATTTGGTTTCTTCAAATCAATATGGAAAAAGAGTGGTTTTTGTTTGCTTTGGTTTGGTTTGGTTCAGTTTTATCCAATCAATTCGGTTCAATTAAAGTCTCTAGTTTCTTTGCTTCAATATGAATTTGCATCGGTAAAAAGCACCTCACCGGTGGTTGCATTGGTTGGATAGCTTGGATTGCCCAAAACCAGCTTTTGCTTCTTGTTCCCAGTCCTGCCATCAAACAGATAGGTTTTCTCACCTCAACAATAAATTGTTTGTGTATAATACAAAAATGCAAAGAAAGAAAGAATTTAAGGGTTCCAGTTTGAACCTGACGCATGATTTGAGCACCCCACATTCTTGAGTGATTGCACTTCAACAACTCTGTACGAGCAGGGGAATCAACATCTGGATTTCCCTTCCAATAAGCTCGAGGATACTTGTTCGACCAACTTAAGTTTTTTGAGCCTTTCTTGATATCTCGAAACTCTTCCCTCCATGATTTTAGGTTCACTTCTGGCCTGCATCAGATTGCCACATCCATTGATGAAATATAACACAAGTTTTCCCATCACATAACATATATCTTTTTGAAGAAAAATGCAACCAAGACAGAGATTCCAACAGTACTATCTAGCTTCTTTTATCTTAAAAAGTTTGTAATAAGTTCCTAAACTTTCAATTTTATGTCTAAGAGAAACTCATATTAACTCTATGCACCAGCTATTCAATGTATCATTTAAGCACTGAGTTGTTTGGTAGTTTCCTTTTAGAATATGGAGAGTAGATAAATAGCTCTCTTTTACACAATATTAAAAGTTTAGGTTAAATTATAAAAACTGCCCCAAACTTTACCATTTGTTTCAAAAATACTTTTATTCTTTCAAAAGTTGAAATATTACCCTTGACCTTTCATAAACCTTTCAAAACTACTGTTGAAGTAGAAAAATTGTTAGAATGTTCGAAAAAAAGTTGATGTGTTTGACCATTGTTCTTTCAAGCTTAATCCAAGCAAGCGTCCTAGTCACGACCAAGTTGAATTTTCATTTAACATTCTAAAGAAAGACTATTTCAGAGGTTAT
TTCGAAGACTATTTCAAAGATTATGTTTATACGAAAGGAAAAGTTTAGGTGATATTTTATAATTTAGCTAAAAGTTATAGGATCAAAAGTTGAAATTCAAGGATGAGACATGTCAAATCATCAAAGCAGTGAAGGGATCCTAATCTTAAACCATACATACCATCCCCAGAAAGACCAATCAGGAAAAGGAATGTCAAAGTGAGCCTCAGTCGTGCAATACCGAAACAGAGGCAGCGGCATGTCCTTGTTCTCAGTCCGATTGATACTCGGTTTGTCCATACAATCAAACATCATGTCCACATCCGGCACCATTCCGGGGAACCTTCTGAGCAATTGAACCAAACCCCAGATCGTGAAAATCGCCCTGCTCTGCACACAAGCATAGTACATATCAACATAAAGCCTACCTTTCACGATCACAACACGAAACGCCGCAAATTTCTGAGATTCATCCAAATGGGTCATCGAGATTCGGGTCCGAGCCCAGGGATCCAGATCGTGATGAATCCACCGGAAGAATTCAGGGCATTTGGGAGTGGAAACAGCGGAATGCAGACGAGGGAATCTAGTAGAATTGTTGATGACATAGCGGCAAGCGAGATAAGAACAGTGAATGATCTTAACGGCTCTGGCATGGCGAGTCTCGTCGCTGAAGGTCTTGGGAGGGAACAAATGCCATGGGGTTGGATCCAAGTTGTGGCCAGCAACAGTTTTGGTTTGTGCAGCGAAGTCATCCACCTGAAACACAGCAATGCAATAAGAAATGTGAAATTAGTAGTAAGATGATGGTGACCCCCACCAGGTGTTGGACGAAATGCGGCAATGAAGAAATGGAGTTTGAAGACCTTGTAACAGACGAGGAAAGTGAGGGAGAGGAAGGAGAGAGCGACGACGGAGGGGAGGAGGTTGGAGGGGGAGCGGGAAGAGGGAGGTCTAGGCGGCGGAGCCATGGCGGAAATGTGAGGATTTGCATTTGTGAGTGA

mRNA sequence

ATGGCACGGTTGAGTGAGCACGGAGGGACTGAAGAAACAGAGGCAGCTGACTTCTCGAGGAACTGCCTCTGCTTCTCATCGGCGATGCACAGCAAGGAATCAGCACATACTTCTAAAGCCGATGGGGGCGGGGTTGGCTTGAACTCCAGAAGCTTTGAGTACTCTGTGATGAGACCAAGCAAACCCTTCAGCATAGATTTTATACCTGCAGAAGCAGACCAACACATCGTGGGCTCAGCTAGCTTAGACAACAAAACTGACTTGTCTATGTCCCCTCTCCTCCACAAGGATCCTAATCTTAAACCATACATACCATCCCCAGAAAGACCAATCAGGAAAAGGAATGTCAAAGTGAGCCTCAGTCGTGCAATACCGAAACAGAGGCAGCGGCATGTCCTTGTTCTCAGTCCGATTGATACTCGGTTTGTCCATACAATCAAACATCATGTCCACATCCGGCACCATTCCGGGGAACCTTCTGAGCAATTGAACCAAACCCCAGATCGTGAAAATCGCCCTGCTCTGCACACAAGCATAGTACATATCAACATAAAGCCTACCTTTCACGATCACAACACGAAACGCCGCAAATTTCTGAGATTCATCCAAATGGGTCATCGAGATTCGGGTCCGAGCCCAGGGATCCAGATCGTGATGAATCCACCGGAAGAATTCAGGGCATTTGGGAGTGGAAACAGCGGAATGCAGACGAGGGAATCTAGTAGAATTGTTGATGACATAGCGGCAAGCGAGATAAGAACAGTGAATGATCTTAACGGCTCTGGCATGGCGAGTCTCGTCGCTGAAGGTCTTGGGAGGGAACAAATGCCATGGGGTTGGATCCAAGTTGTGGCCAGCAACAGTTTTGGTTTGTGCAGCGAAGTCATCCACCTGAAACACAGCAATGCAATAAGAAATGTGAAATTAGTAATGATGGTGACCCCCACCAGGTGTTGGACGAAATGCGGCAATGAAGAAATGGAGTTTGAAGACCTTGTAACAGACGAGGAAAGTGAGGGAGAGGAAGGAGAGAGCGACGACGGAGGGGAGGAGGTTGGAGGGGGAGCGGGAAGAGGGAGGTCTAGGCGGCGGAGCCATGGCGGAAATGTGAGGATTTGCATTTGTGAGTGA

Coding sequence (CDS)

ATGGCACGGTTGAGTGAGCACGGAGGGACTGAAGAAACAGAGGCAGCTGACTTCTCGAGGAACTGCCTCTGCTTCTCATCGGCGATGCACAGCAAGGAATCAGCACATACTTCTAAAGCCGATGGGGGCGGGGTTGGCTTGAACTCCAGAAGCTTTGAGTACTCTGTGATGAGACCAAGCAAACCCTTCAGCATAGATTTTATACCTGCAGAAGCAGACCAACACATCGTGGGCTCAGCTAGCTTAGACAACAAAACTGACTTGTCTATGTCCCCTCTCCTCCACAAGGATCCTAATCTTAAACCATACATACCATCCCCAGAAAGACCAATCAGGAAAAGGAATGTCAAAGTGAGCCTCAGTCGTGCAATACCGAAACAGAGGCAGCGGCATGTCCTTGTTCTCAGTCCGATTGATACTCGGTTTGTCCATACAATCAAACATCATGTCCACATCCGGCACCATTCCGGGGAACCTTCTGAGCAATTGAACCAAACCCCAGATCGTGAAAATCGCCCTGCTCTGCACACAAGCATAGTACATATCAACATAAAGCCTACCTTTCACGATCACAACACGAAACGCCGCAAATTTCTGAGATTCATCCAAATGGGTCATCGAGATTCGGGTCCGAGCCCAGGGATCCAGATCGTGATGAATCCACCGGAAGAATTCAGGGCATTTGGGAGTGGAAACAGCGGAATGCAGACGAGGGAATCTAGTAGAATTGTTGATGACATAGCGGCAAGCGAGATAAGAACAGTGAATGATCTTAACGGCTCTGGCATGGCGAGTCTCGTCGCTGAAGGTCTTGGGAGGGAACAAATGCCATGGGGTTGGATCCAAGTTGTGGCCAGCAACAGTTTTGGTTTGTGCAGCGAAGTCATCCACCTGAAACACAGCAATGCAATAAGAAATGTGAAATTAGTAATGATGGTGACCCCCACCAGGTGTTGGACGAAATGCGGCAATGAAGAAATGGAGTTTGAAGACCTTGTAACAGACGAGGAAAGTGAGGGAGAGGAAGGAGAGAGCGACGACGGAGGGGAGGAGGTTGGAGGGGGAGCGGGAAGAGGGAGGTCTAGGCGGCGGAGCCATGGCGGAAATGTGAGGATTTGCATTTGTGAGTGA

Protein sequence

MARLSEHGGTEETEAADFSRNCLCFSSAMHSKESAHTSKADGGGVGLNSRSFEYSVMRPSKPFSIDFIPAEADQHIVGSASLDNKTDLSMSPLLHKDPNLKPYIPSPERPIRKRNVKVSLSRAIPKQRQRHVLVLSPIDTRFVHTIKHHVHIRHHSGEPSEQLNQTPDRENRPALHTSIVHINIKPTFHDHNTKRRKFLRFIQMGHRDSGPSPGIQIVMNPPEEFRAFGSGNSGMQTRESSRIVDDIAASEIRTVNDLNGSGMASLVAEGLGREQMPWGWIQVVASNSFGLCSEVIHLKHSNAIRNVKLVMMVTPTRCWTKCGNEEMEFEDLVTDEESEGEEGESDDGGEEVGGGAGRGRSRRRSHGGNVRICICE
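
The protein sequence above corresponds to a conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code (the page does not say which translation table was used), the first and last codons of the CDS can be translated and compared with the ends of the protein. The function name `translate` is illustrative, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Unknown codons (for example those containing N) become 'X';
    trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nt and last 21 nt of the CDS listed above.
print(translate("ATGGCACGGTTGAGTGAGCACGGAGGG"))  # MARLSEHGG
print(translate("AGGATTTGCATTTGTGAGTGA"))        # RICICE
```

Both fragments agree with the start (MARLSEHGG) and end (RICICE) of the listed protein sequence.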
Homology
BLAST of CcUC03G043060 vs. NCBI nr
Match: KAA0031828.1 (copia protein [Cucumis melo var. makuwa])

HSP 1 Score: 110.5 bits (275), Expect = 3.1e-20
Identity = 54/57 (94.74%), Positives = 55/57 (96.49%), Query Frame = 0

Query: 2  ARLSEHGGTEETEAADFSRNCLCFSSAMHSKESAHTSKADGGGVGLNSRSFEYSVMR 59
          ARLSEHGGT+ETEAADFSRNCLCFSSAM SKESAHTSKADGGGVGL SRSFEYSVMR
Sbjct: 6  ARLSEHGGTDETEAADFSRNCLCFSSAMQSKESAHTSKADGGGVGLKSRSFEYSVMR 62

BLAST of CcUC03G043060 vs. NCBI nr
Match: TYJ97307.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 110.5 bits (275), Expect = 3.1e-20
Identity = 54/57 (94.74%), Positives = 55/57 (96.49%), Query Frame = 0

Query: 2  ARLSEHGGTEETEAADFSRNCLCFSSAMHSKESAHTSKADGGGVGLNSRSFEYSVMR 59
          ARLSEHGGT+ETEAADFSRNCLCFSSAM SKESAHTSKADGGGVGL SRSFEYSVMR
Sbjct: 6  ARLSEHGGTDETEAADFSRNCLCFSSAMQSKESAHTSKADGGGVGLKSRSFEYSVMR 62

BLAST of CcUC03G043060 vs. NCBI nr
Match: RWW42500.1 (hypothetical protein BHE74_00051954 [Ensete ventricosum] >RZS24135.1 hypothetical protein BHM03_00057168 [Ensete ventricosum])

HSP 1 Score: 70.1 bits (170), Expect = 4.6e-08
Identity = 40/141 (28.37%), Positives = 77/141 (54.61%), Query Frame = 0

Query: 104 IPSPERPIRKRNVKVSLSRAIPKQ--RQRHVLVLSPIDTRFVHTIKHHVHIRHHSGEPSE 163
           +P+PE P+ + N+   +  A+ +Q  R+R  +V+  +  R VH +++ + +R  +G P++
Sbjct: 62  LPAPEGPVGEENIHGLIVLAVMEQRRRRRRRVVVGGLHRRHVHAVEYELEVRDAAGIPAD 121

Query: 164 QLNQTPDRENRPALHTSIVHINIKPTFHDHNTKRRKFLRFIQMGHRDSGPSPGIQIVMNP 223
           +L   P+ E+ P    + V +++ P    H+ +  +  R +    RD+G  P  Q+++NP
Sbjct: 122 ELQDAPEGEDVPRQGVAEVLLDVDPAVEHHHVEGSELARPLDHLLRDAGGLPWPQVLVNP 181

Query: 224 PEEFRAFGSGNSGMQTRESSR 243
           PE F A G G  G + + S R
Sbjct: 182 PEVFGAGGGGLGGGKVQGSRR 202

BLAST of CcUC03G043060 vs. ExPASy TrEMBL
Match: A0A5D3BDV6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001060 PE=4 SV=1)

HSP 1 Score: 110.5 bits (275), Expect = 1.5e-20
Identity = 54/57 (94.74%), Positives = 55/57 (96.49%), Query Frame = 0

Query: 2  ARLSEHGGTEETEAADFSRNCLCFSSAMHSKESAHTSKADGGGVGLNSRSFEYSVMR 59
          ARLSEHGGT+ETEAADFSRNCLCFSSAM SKESAHTSKADGGGVGL SRSFEYSVMR
Sbjct: 6  ARLSEHGGTDETEAADFSRNCLCFSSAMQSKESAHTSKADGGGVGLKSRSFEYSVMR 62

BLAST of CcUC03G043060 vs. ExPASy TrEMBL
Match: A0A5A7SMG6 (Copia protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00780 PE=4 SV=1)

HSP 1 Score: 110.5 bits (275), Expect = 1.5e-20
Identity = 54/57 (94.74%), Positives = 55/57 (96.49%), Query Frame = 0

Query: 2  ARLSEHGGTEETEAADFSRNCLCFSSAMHSKESAHTSKADGGGVGLNSRSFEYSVMR 59
          ARLSEHGGT+ETEAADFSRNCLCFSSAM SKESAHTSKADGGGVGL SRSFEYSVMR
Sbjct: 6  ARLSEHGGTDETEAADFSRNCLCFSSAMQSKESAHTSKADGGGVGLKSRSFEYSVMR 62
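
The percentages reported in each HSP header are the match count divided by the alignment length. The short sketch below recomputes them from the counts shown in the alignments above; the helper name `percent_of` is hypothetical.

```python
def percent_of(matches, aligned_length):
    """Return matches / aligned_length as a percentage string with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100.0 * matches / aligned_length:.2f}"

# Counts taken from the HSP headers above.
print(percent_of(54, 57))   # 94.74 (identical residues, Cucumis melo hits)
print(percent_of(55, 57))   # 96.49 (identical plus conservative substitutions)
print(percent_of(40, 141))  # 28.37 (identical residues, Ensete ventricosum hit)
print(percent_of(77, 141))  # 54.61 (identical plus conservative substitutions)
```

Note that the Cucumis melo hits cover only 57 aligned residues (query positions 2 to 59), so their high identity applies to a short N-terminal segment rather than to the full-length protein.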

The following BLAST results are available for this feature:

NCBI nr
Match Name    E-value   Identity (%)  Description
KAA0031828.1  3.1e-20   94.74         copia protein [Cucumis melo var. makuwa]
TYJ97307.1    3.1e-20   94.74         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
RWW42500.1    4.6e-08   28.37         hypothetical protein BHE74_00051954 [Ensete ventricosum] >RZS24135.1 hypothetica...

ExPASy TrEMBL
Match Name    E-value   Identity (%)  Description
A0A5D3BDV6    1.5e-20   94.74         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5A7SMG6    1.5e-20   94.74         Copia protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00780 ...
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 332..349
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 332..370
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 88..108
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 151..170

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CcUC03G043060.1  CcUC03G043060.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding