CmaCh00G001500.1 (mRNA) Cucurbita maxima (Rimu)

Name: CmaCh00G001500.1
Type: mRNA
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Cma_Chr00 : 9792971 .. 9794170 (+)
Sequence length: 525
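
The Location and Sequence length fields can be related with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations, not stated on this page) and that every base of the span outside the 525-nt mRNA is intronic. The names `span_length`, `genomic_span` and `non_exonic` are illustrative only.

```python
# Coordinates copied from the Location field above.
# Assumption: 1-based, inclusive coordinates (not stated on the page).
START = 9792971
END = 9794170
MRNA_LENGTH = 525  # the "Sequence length" field


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval."""
    return end - start + 1


genomic_span = span_length(START, END)

# Assumption: all bases in the span that are not part of the mRNA are intronic.
non_exonic = genomic_span - MRNA_LENGTH

print(genomic_span)
print(non_exonic)
```

Under these assumptions the feature covers 1200 bp of the chromosome, of which 675 bp would lie outside the two exons.
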
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCGGCCTCGTGTCGCCCTGGGAGCATCCTCCATTCGGAAGGTATTTGCATTAGTCAATAACAAGGTGAATAGAGAAAGTATTTATAGTAAGCGGAAGAAGGAAGTATATCAACACGTCCTATGGTCTCCACCACTAGGTTGCTCCGTGAGATTCCCATGTTCCGCCTGCATGTTGCCCTAGAGAAACTACTCATTCGGAGGGTCGTAACATGGGAGTCGAAACAATGCAAACTTCATAAATGGATAGGATTTCTTAGGTCCATTCCCAACTTTGGCTTTTTCATTCGATAGCGTTGTTGGGACCGATCTATGAGGTCTGAAATTGATGGGTCACACTTGCGAAGAATTATTAAGAGTTAGTATATTCTTGCCAAAATAGCAATAACTAAAACACTATAGGAACAAGAGTTATTCTGGGATTAGTGTTTAAGTCAAGAATGTCTTGGTTTAGTGAATGATTGACTGCCCCTCTGTAGCAGTTGCTCTAACTAATTAAAGTATCATTGCAAATTAATAATTTTTTTAGGTACATTATTACATTTGCTAAAACTGATTGGATTAAACAGGATTAAGTCAAATCATTAATAGAATTTTCTTTATATCACAGCATGAAAAACTCAATAGTACAATTACTCGCTTTTGAGAAATTAAACGACGACAATTACGCAACTTAGAAATCAAACCTAAACACAATACTGGTAATTGATGATTTAAGGTTTGTTTTAACTGAGGAATGTCCTTCAAACCCCAGCTCAAATGCAAACCGAACAGTTCAGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGATTGTACATTGTATCCAGCATATCTGATGTTTTGGCTAAGAAACACAATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACTGTCCTTCTCCCTTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAACGAAGCTGTCATTGATGAGAAAAGTCAAGTTAGTTTTATAATGGAGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACAAATGTGGTGATGAGCAAAATAGAATATAACTTGACTACTCTTCTCAAGAGCTACAGACTTATCAGTCCCCCTTAA

mRNA sequence

ATGCTCGGCCTCGTGTCGCCCTGGGAGCATCCTCCATTCGGAAGGTTTGTTTTAACTGAGGAATGTCCTTCAAACCCCAGCTCAAATGCAAACCGAACAGTTCAGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGATTGTACATTGTATCCAGCATATCTGATGTTTTGGCTAAGAAACACAATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACTGTCCTTCTCCCTTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAACGAAGCTGTCATTGATGAGAAAAGTCAAGTTAGTTTTATAATGGAGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACAAATGTGGTGATGAGCAAAATAGAATATAACTTGACTACTCTTCTCAAGAGCTACAGACTTATCAGTCCCCCTTAA

Coding sequence (CDS)

ATGCTCGGCCTCGTGTCGCCCTGGGAGCATCCTCCATTCGGAAGGTTTGTTTTAACTGAGGAATGTCCTTCAAACCCCAGCTCAAATGCAAACCGAACAGTTCAGGATGCGTATGACAGATGGATAAAGGCAAATGACAAAGCCCGATTGTACATTGTATCCAGCATATCTGATGTTTTGGCTAAGAAACACAATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACTGTCCTTCTCCCTTCGACATGAAGCCATAAAATACATTTACAACTGTCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAACGAAGCTGTCATTGATGAGAAAAGTCAAGTTAGTTTTATAATGGAGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACAAATGTGGTGATGAGCAAAATAGAATATAACTTGACTACTCTTCTCAAGAGCTACAGACTTATCAGTCCCCCTTAA

Protein sequence

MLGLVSPWEHPPFGRFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIMESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMESLPKSFFQFRTNVVMSKIEYNLTTLLKSYRLISPP
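
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the following translates the first 45 and last 12 nucleotides of the CDS shown above and compares them with the ends of the listed protein. The helper `translate` and the variable names are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated with
# bases in the order T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS above (first 45 nt and last 12 nt).
cds_start = "ATGCTCGGCCTCGTGTCGCCCTGGGAGCATCCTCCATTCGGAAGG"
cds_end = "AGTCCCCCTTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MLGLVSPWEHPPFGR` and `SPP`, matching the start and end of the protein sequence. A 525-nt CDS holds 175 codons, so 174 residues plus the terminal stop codon.
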
BLAST of CmaCh00G001500.1 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 2.4e-54
Identity = 108/151 (71.52%), Positives = 132/151 (87.42%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           RFVLTEECP  P+ NANRTV++AYDRW+KANDKAR+YI++S++DVLAKKH+ + TAK IM
Sbjct: 36  RFVLTEECPQAPALNANRTVREAYDRWVKANDKARVYILASMTDVLAKKHDSIATAKGIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           +SL+ MFGQ S+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QV
Sbjct: 96  DSLREMFGQPSWSLRHEAIKHIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQV 155

Query: 135 SFIMESLPKSFFQFRTNVVMSKIEYNLTTLL 166
           SFI++SLPKSF  F+TN  ++KIE+NLTTLL
Sbjct: 156 SFILQSLPKSFVPFQTNASLNKIEFNLTTLL 186
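
The percentages in each HSP header are the match counts divided by the alignment length. A small sketch, using the counts reported for the first hit (108 identical and 132 positive-scoring positions out of 151 aligned columns); the function name `percent` is illustrative.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns that match, as a percentage with two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the HSP header of the E2GK51_BRYDI hit above.
identity = percent(108, 151)
positives = percent(132, 151)

print(identity)
print(positives)
```

The aligned query range 15 to 166 minus one gives the same 151 columns, because this alignment contains no gaps.
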

BLAST of CmaCh00G001500.1 vs. TrEMBL
Match: E2GK52_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 168.7 bits (426), Expect = 6.3e-39
Identity = 84/127 (66.14%), Positives = 106/127 (83.46%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           RF+LTEEC   P+ NANRTV++AYDRW KANDKA +YI++S++DVLAKK++ + T K IM
Sbjct: 36  RFILTEECHQAPALNANRTVREAYDRWGKANDKACVYILASMTDVLAKKYDSIATTKGIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           +S + MFGQ S+SLRHEAIK IY  RMKEGTSVREHVLDMM+HFN+A+ +   IDE +QV
Sbjct: 96  DSFREMFGQPSWSLRHEAIKRIYTKRMKEGTSVREHVLDMMMHFNIAKVHGGPIDEANQV 155

Query: 135 SFIMESL 142
           SFI++SL
Sbjct: 156 SFILQSL 162

BLAST of CmaCh00G001500.1 vs. TrEMBL
Match: W9SH28_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004989 PE=4 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 3.0e-33
Identity = 76/151 (50.33%), Positives = 108/151 (71.52%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           +FVL +ECP  P++NA +T ++ YDRWIKAN+KA+ ++++S+SDVL KKH  M TA EIM
Sbjct: 36  KFVLVDECPPEPAANATKTAREPYDRWIKANNKAKCFMLASMSDVLCKKHEEMETAYEIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           ESL+ MFG  S   R +A++   N +MK+G+SV+ HVL+M+ H + AE N A IDE +Q+
Sbjct: 96  ESLEAMFGAPSEKARLDAVRAFMNDKMKKGSSVKAHVLNMIDHLHDAELNGARIDEATQL 155

Query: 135 SFIMESLPKSFFQFRTNVVMSKIEYNLTTLL 166
             I+ESL   F +F  N VM+K + NLT L+
Sbjct: 156 GIILESLSPDFHEFVNNFVMNKKKSNLTELM 186

BLAST of CmaCh00G001500.1 vs. TrEMBL
Match: W9RXH5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_001554 PE=4 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 3.3e-32
Identity = 75/151 (49.67%), Positives = 107/151 (70.86%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           +F+L EECP  P+ NA++T ++ YDRWIKAN+KA+ ++++S+SDVL KKH  M TA EIM
Sbjct: 36  KFLLAEECPLEPADNASKTAREPYDRWIKANNKAKCFMLASMSDVLRKKHGEMETAYEIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           ESL+ MFG  S     +A++   N +MK+G+SV+ HVL+M+ H +  E N A IDE +QV
Sbjct: 96  ESLEAMFGAPSEKACLDAVRAFMNDKMKKGSSVKAHVLNMIDHLHDTELNGARIDEATQV 155

Query: 135 SFIMESLPKSFFQFRTNVVMSKIEYNLTTLL 166
             I+ESL   F +F  N+VM+K + NLT L+
Sbjct: 156 GIILESLSPDFHEFVNNLVMNKKKSNLTELM 186

BLAST of CmaCh00G001500.1 vs. TrEMBL
Match: W9S3Q9_9ROSA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Morus notabilis GN=L484_006475 PE=4 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 4.4e-32
Identity = 77/160 (48.12%), Positives = 111/160 (69.38%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           +FVL EECP +P++NA++T ++ YDR IKAN+KA+  +++S+SDVL KKH  M TA EIM
Sbjct: 36  KFVLVEECPQDPAANASKTTREPYDRLIKANNKAKCLMLASMSDVLRKKHEEMETAYEIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           ESL+ MFG  S   R +A++   N +MK+G+SV+ HVL+M+ H +  E N   IDE +QV
Sbjct: 96  ESLEAMFGAPSKKARLDAVRAFMNDKMKKGSSVKAHVLNMIDHLHDTELNSGRIDEATQV 155

Query: 135 SFIMESLPKSFFQFRTNVVMSKIEYNLTTLLKSYRLISPP 175
             I+ESL   F +F  N+VM+K + NLT L+    L++ P
Sbjct: 156 GIILESLSPDFHEFVNNIVMNKKKSNLTKLMND--LVAEP 193

BLAST of CmaCh00G001500.1 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 219.9 bits (559), Expect = 3.4e-54
Identity = 108/151 (71.52%), Positives = 132/151 (87.42%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           RFVLTEECP  P+ NANRTV++AYDRW+KANDKAR+YI++S++DVLAKKH+ + TAK IM
Sbjct: 36  RFVLTEECPQAPALNANRTVREAYDRWVKANDKARVYILASMTDVLAKKHDSIATAKGIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           +SL+ MFGQ S+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QV
Sbjct: 96  DSLREMFGQPSWSLRHEAIKHIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQV 155

Query: 135 SFIMESLPKSFFQFRTNVVMSKIEYNLTTLL 166
           SFI++SLPKSF  F+TN  ++KIE+NLTTLL
Sbjct: 156 SFILQSLPKSFVPFQTNASLNKIEFNLTTLL 186

BLAST of CmaCh00G001500.1 vs. NCBI nr
Match: gi|778697615|ref|XP_011654359.1| (PREDICTED: uncharacterized protein LOC105435361 [Cucumis sativus])

HSP 1 Score: 206.5 bits (524), Expect = 3.9e-50
Identity = 99/134 (73.88%), Positives = 119/134 (88.81%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           RFVLTEECP NP+SNANRT ++AYDRWIKAN+KAR+YI++S+SDVLAKKH  + TAKEIM
Sbjct: 36  RFVLTEECPQNPASNANRTGREAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           +SL+GMFGQ  +SLRHEA+KYIY  RMKEGTSVREHVLDMM+HFN+A+ N  +I+E +QV
Sbjct: 96  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAQVNGGLIEEVNQV 155

Query: 135 SFIMESLPKSFFQF 149
           SFI+ESLPKSF  F
Sbjct: 156 SFILESLPKSFIPF 169

BLAST of CmaCh00G001500.1 vs. NCBI nr
Match: gi|659113933|ref|XP_008456826.1| (PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo])

HSP 1 Score: 199.5 bits (506), Expect = 4.8e-48
Identity = 98/151 (64.90%), Positives = 127/151 (84.11%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           RFVL E+CP   ++NA RTV++AY+RW KAN+KAR Y+++S+S+VLAKK+  M TA+EIM
Sbjct: 36  RFVLVEKCPQVSAANATRTVREAYERWAKANEKARAYLLASLSEVLAKKNESMLTAREIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           +SL+ MFGQ S+ ++H+A+KYIYN RM +G  VREHVL+MMV+FNVAE N AVIDE +QV
Sbjct: 96  DSLQEMFGQASYQIKHDALKYIYNARMNDGALVREHVLNMMVYFNVAEMNGAVIDEANQV 155

Query: 135 SFIMESLPKSFFQFRTNVVMSKIEYNLTTLL 166
           SFI+ESL +SF QFR+NVVM+KI Y LTTLL
Sbjct: 156 SFILESLLESFLQFRSNVVMNKIAYTLTTLL 186

BLAST of CmaCh00G001500.1 vs. NCBI nr
Match: gi|659086056|ref|XP_008443743.1| (PREDICTED: uncharacterized protein LOC103487255, partial [Cucumis melo])

HSP 1 Score: 186.4 bits (472), Expect = 4.2e-44
Identity = 90/125 (72.00%), Positives = 106/125 (84.80%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           RFVLTEECP  PSSNA++T + AYDRWIKAN+KAR+YI++S+SDVLAKKH  + TAKEIM
Sbjct: 36  RFVLTEECPQTPSSNASQTSRKAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM 95

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
            SLKGMFGQ  +SLRHE IKYIY  RMKEGTS++EHVLDMM+HFN+ E N   IDE +QV
Sbjct: 96  NSLKGMFGQPKWSLRHETIKYIYTKRMKEGTSIKEHVLDMMMHFNIFEVNGGAIDEANQV 155

Query: 135 SFIME 140
           SFI+E
Sbjct: 156 SFILE 160

BLAST of CmaCh00G001500.1 vs. NCBI nr
Match: gi|659118732|ref|XP_008459275.1| (PREDICTED: uncharacterized protein LOC103498451 [Cucumis melo])

HSP 1 Score: 176.8 bits (447), Expect = 3.3e-41
Identity = 84/151 (55.63%), Positives = 114/151 (75.50%), Query Frame = 1

Query: 15  RFVLTEECPSNPSSNANRTVQDAYDRWIKANDKARLYIVSSISDVLAKKHNVMGTAKEIM 74
           RF+L EEC    +    ++V+DAYDRW KANDKA +YI++S+SD+L+ KH +M T ++I+
Sbjct: 25  RFILMEECSLFLTQGTFKSVRDAYDRWKKANDKAHVYIMASMSDILSNKHKIMVTTRQIV 84

Query: 75  ESLKGMFGQLSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQV 134
           +SL+ MFGQLS  ++ E IKY+YN RMK+  SV++HVL+M+VHFNV E N  V DEKSQV
Sbjct: 85  DSLREMFGQLSIQIKQETIKYVYNARMKDSQSVKKHVLNMIVHFNVVEMNVVVFDEKSQV 144

Query: 135 SFIMESLPKSFFQFRTNVVMSKIEYNLTTLL 166
           SFI++ LPKS  QF  N  M+KI+YN+T  L
Sbjct: 145 SFILKYLPKSSLQFNNNAEMNKIKYNMTIFL 175

The following BLAST results are available for this feature:
Match Name    E-value  Identity (%)  Description
E2GK51_BRYDI  2.4e-54  71.52         Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
E2GK52_BRYDI  6.3e-39  66.14         Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
W9SH28_9ROSA  3.0e-33  50.33         Uncharacterized protein OS=Morus notabilis GN=L484_004989 PE=4 SV=1
W9RXH5_9ROSA  3.3e-32  49.67         Uncharacterized protein OS=Morus notabilis GN=L484_001554 PE=4 SV=1
W9S3Q9_9ROSA  4.4e-32  48.13         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Morus notabilis G...

Match Name                         E-value  Identity (%)  Description
gi|299474487|gb|ADJ18449.1|        3.4e-54  71.52         gag/pol protein [Bryonia dioica]
gi|778697615|ref|XP_011654359.1|   3.9e-50  73.88         PREDICTED: uncharacterized protein LOC105435361 [Cucumis sativus]
gi|659113933|ref|XP_008456826.1|   4.8e-48  64.90         PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo]
gi|659086056|ref|XP_008443743.1|   4.2e-44  72.00         PREDICTED: uncharacterized protein LOC103487255, partial [Cucumis melo]
gi|659118732|ref|XP_008459275.1|   3.3e-41  55.63         PREDICTED: uncharacterized protein LOC103498451 [Cucumis melo]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CmaCh00G001500  CmaCh00G001500  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CmaCh00G001500.1  CmaCh00G001500.1-protein  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CmaCh00G001500.1.exon.1  CmaCh00G001500.1.exon.1  exon
CmaCh00G001500.1.exon.2  CmaCh00G001500.1.exon.2  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CmaCh00G001500.1.CDS.1  CmaCh00G001500.1.CDS.1  CDS
CmaCh00G001500.1.CDS.2  CmaCh00G001500.1.CDS.2  CDS


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term      Source Description               Alignment
None      No IPR available  PANTHER  PTHR11439        GAG-POL-RELATED RETROTRANSPOSON  coord: 19..167; score: 1.8
None      No IPR available  PANTHER  PTHR11439:SF192  SUBFAMILY NOT NAMED              coord: 19..167; score: 1.8
None      No IPR available  PFAM     PF14223          Retrotran_gag_2                  coord: 41..166; score: 2.4