CmaCh19G003230 (gene) Cucurbita maxima (Rimu)

Name: CmaCh19G003230
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Gag/pol protein
Location: Cma_Chr19 : 3024753 .. 3030141 (-)
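
As a quick check on the coordinates above, the genomic span of the feature can be computed from its start and end positions. This is a minimal sketch that assumes the positions are 1-based and inclusive (a common convention for feature pages, but an assumption here); the function name `span_length` is illustrative and not part of any database API.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (minus strand, Cma_Chr19).
gene_length = span_length(3024753, 3030141)
print(gene_length)
```

The strand sign does not affect the length; it only indicates that the gene sequence is read from the reverse complement of the chromosome.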
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGACTATGATGGAAAGAGCAAATGTTAGAGAAGAAGAAGAAGACACCATGTCTAGATTCCTTGGAGGTTTGAATCAAGAAATTGCTCATCTTGTTGACAGAAATCCACCACCATATCTAGAAGACATGTATCATTATGCTCTCAAAATTGAAGACCAATTGAAGGAAGAAAAAGAGCATTCAAAAAGGTACACATCACGAACTAACACCTTTTCAAATTCTAAAGTTTGTGGTTGTTAAAAGAGTGGAGACTGAGAGTTCCAATGGTAAAAAGAATGAAGCTTCAAAGACGGTAAATGAGAAGTCTAGTTCTATTCAATGTTGAAAGTGCAAAGGGTTTGGACACATGAGCAAAGAGTGTGTTAATAAAAAAGTTATGGTGATAAGGAATGGTATACTTGATTCAGATTATGAATGTGAGGATCATGATTCACAGCTTGTGGAAGAAATCACAGCATATGATGATGAGTATGTTGAAGAAGCTAATTCCATATCTTTGATCACAAGAAGGGTACTCAATGTTCAAATCAAGTTAAAGAAATTTGAAGATCAAAGGGAAAACTTGTTCCACCCAAGATGTTTGATCAAAGGAAACCCATGCAGCTTGGTAATTAATAGTGGAAGTTGCACAAATGTGGTAAGCAGTTTCCTTGTAAAAAGACTTCAACTTTCTGTCCATCCCCATCCAAAACCATACAAACTTCAATGGTTGACAAACAAATAAGAAATTAAGGTGAACTCCCAAGCTCTTATTTCTTTTACTCTTGGAAGATAAAAAGATGAAGTCTGTGTGATGTCATACCAATGCATGCGGGAGACATCTTGTTCGGCTGACCTTGGCAATATGATAGAAAGGTATCATTTGATGGTTTGACGAACAAGTATACTTTTACTTTGTGTGGGAAGAAATTTACTTTACTTCCGTTGTCTCCATATGATTACATTGTGACTAATTGAAATTAGAGAAGAAAAGAAAAGAATATGAAAAACAACCGTGGAGAGAAGAAACTAAGGTAGAAGAAAGAGATAAGAGTGAAAAAAAAGAGAAAAGTGAGGGACAAAGGATGAGTGAAAACCAAGAAAGAAAAAAAAAGGTGAGGAAAAAAAGTGAGAAAAAAATATATTAGTAGAGAAAAAAAAATCAATATTCAGTTTTTGTCCGGAAAGAACAAAGAATTTGATTGAGGACAAATCCTTTTAAAGAACGGGGGATGATATGAATCCAACAACTGATCACTTCCATATACTCGAACGAGCAATTACAAGAAGCAATACAAAGAAAATTGAAATATCAAATCAAGAAAATAATGGAGTGGCTAACAATTGGAAAGTGATTAGAAGACTTGTCATTGGTAGTTGAATTACCAATTTAGACAATTTATGAATGACTTATTTTCTTCAATTTGTATTTAATAGCATTTATTACTAGTTTGTATGAATGCCTTATTTTTTTCAATTTGTATTTAATACAATTTATTAGTGATTTGTAGTTGGTGGTTGAACTTAAATATGTTATGGTTATCTTGTGGGCCTATTTAAGGCATGTAATATTCATGTTTGAAAAGGAATTGTTTGATAAACAAAACTTTTGTGTTTTCTTGATACCTTTTTGGTGTTATTCTTTTTCAAACCTTGCTTTAGGTTTGATGTTTAAACCAATTCATTGAATTAGTTTTGATCAAGCTAATTCTGGAGTGAATTTTCTTGGTATCTAAGAGTGACTATCATAGATTCTAAGTTTGATCTAAGAATCATTCATCTTCTAACCAGTTCTTGGTTGGAATCAATCCGTTAAGTTAGCTTTCTCGTGAACGTTTTGCAAGGATTCTAGAAGGGCCTCTAGCTTTGCAAATTCCAACATTATGGGCGCTTATCACTCTTTCTTCATATATACTAAATCCCTCCCAAGTTTAGTCACTTATCGGATTCCACAATCTCATTCTAAGGCTGGAGAATAGTGGAAAAGACACTAGTGGTAGTTCGAAACCGGTTCGTAA
GGAAAAAGAAACTTGAGCGAGAAAAGGTTAGTATTCAAACTCATTGAGTTTTAATGCTTTTGAACATGCTAGCTAATTTACTATAATTGATCTACTTAGAGTGCCTAAGATCCAAATGGTTTCCGCATATGTTATGTTAACACCATCAATTGGTATCAGAGCTAGTTTTAGGCACTTTATTTACATTAATTTGAGCATGGTGGGTACTATTTCATGAATGAAATTTTGTGGTTGCTAAAGTGGATGTTTTGGATTTTGGCATTTTTTTTTTTTTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCTTCCATTTGATACAATTCTTATAACAAGGCTAATGTGTATGGGCTTTAATGAGTGAAGAAAAGTCTGTAATTTTTATGGAGTCATTAGAGTCCATTTTAGGTTGCAATTGATCGAAAATGGAAGAGATAGCAAGCTGAACCGAAAAAGAAGATGGAAGAGGCAGTTCGGGTGTTCTTTCTGCCGTTGGGGTCCAGTTCAACGTTTTTAAAAGTTGGTTCTCCCAGCTCTTGATCCTCGGGAGTGGTTCAAGTTATTTTCAATTATTAATAATTTAGTTAATTGCTTGTTTTAAAATGCCAAAACTAATTCACTTAGTGTGTTTAATTAATTATGCATTATGAATGTATGTTTTAATTAAATAGAAAATGTATATGCATAAAGTATGTCATATAGATTTAAAATCCCACCATACGCTAAGCATGTTTCATACATTCCAAGTATGTTATAAGTGTTATAATGTATATAATGGATGATTATGTAATTGTTATATAAAGCATGTTAGTATGCATGTTTAGGGTTCCATGAGTGAAAAATATAAATTGTTATATTATTGCATGGAAAGGATGATTATGTAATTGTTAAATAAAGCATGTTAGATGCATGTATAGGTTATTTTCAACGTCATAAAACGAGATAGACGTTAAAATCTATAATAAACATGCTCACTTAGGGTTAAGAACCAAGCACGATCAATTTTAATAGTTAAAATAGGTCGAAGTCTTATAAAATTGTGATTACAAGAAAGCTCACCTGGCACGATCTTGTCTAAGGCTGGAGGTACTTAAGTTGACGGTTTACGGAACACCTACTACCTGGAGATCGAACCAGTACTTGAGCTTAGTTAGCCCAGTTTTATGAGCATGCATGTGTGATGTAAGTTTTAGAAAAATCACCTAGACTTAAGTTATATTTAATTAGTCGGAATATACCTAAGTAATGAGTATTTGATAAGATAAGGAATCACTACAAGGGGAATAAGACTCAGATTACTTTGTTGATCGAATATCTCAAGGCAAGAACATTGTTTGAGATTCGAATCACTCCACAAGCAAGATCGATCATGTATAGCTTGAATGATTCTTTTTGATCAAATATCTCAAACACTTGTTTGAGATTCGAATCACTCCACAAGCAAGATCGATCATGTCTAGCTTGAATGATTCTATATGCAATCTAAACTACAGAGAATTGCAAATAAACTTAGTCATTGGCTAAAGAAGCACAAATGCTAATGTTATCTATATTTTCCAAGTCCGTTTATAAATACAACATACATGGCTTTATATAACCTTAAAATGAATGTATTAAAGTCATTCCAAGAGTTGTAACATTCATACTTAATGACCATAATTAACCATTATGTAATTGTAACCTAATGTAAATAAAAAGTCTTAAAATACATCAATGAAATACAAGAACTCTAAATTGTAATCCACCCAAAATTTATCACATGAAACTTCATTCTTCTTCAATGTGGCATGAATTGAAATATCTTTTGATAATTTCAACAATATTTTCTTCACATCTTCATTGAAGCATATTGTATGATTGATGTCTCTTGGTTCATATCAGTATACCTAACCGTCTAAAATACTTAGTGGGAGAAAAAAGGTGTATGGGATACATCGTTTTCCTTTCACGCTCTTTGAAAAATTCACAC
CATGAGATTCATGCTCGGCCTCGTGTCGCCCTGGGAGCGTCCTCCATAAAGTATTTATAGTAAGTGGGAGAAGGAAGTATATCAACATGTCCTATGGTCTCTACTACTAGGTTGCACTGTGAGATTCCCATGTTGCACTTGCATGTTGCCCTAGAGCATCTACCCATTCGGAGGGTCGTAACATGAGAGTCGAAACAACGCAAACTCCAGAAATAGATGGGATTTCTTAGGTCCGTTCCCAACTTTGGCCTTTTCATTCGATAGCATTATTGGGACCGATCTCTGAGGTCTGAAAATGGCGGGTCACACTTACGAAGAATTACTAAGAGTTAGTATATTCTTGACCAAAATAGCGATAACTAAAAACACTATAGGAACAAGAGTTATTCTGGGATTAGTGTTTAAGTCAAGAATGTCTTGGTTTAGTGAATGAGTGATTGACTGCCCCTCGGTAGCAGTTGCTCTAACTCACTAAAGTATTGTTGTAAATTATTAATTTTTGGGTACATTATTACATTTGCTAAAACTGATTGGATTAAACAGGATTAAGTCAAATCACTAATAGAATTTTCTTTATATCACAGCATGACAAACTCAATAGTACAATTACTCAATTCAGAGAAATTAAACAGCGACAATTACGCAACTTGGAAATCAAACCTAAACACAATACTTGTAATTGATGATTTAAGGTTTCTTTTAACTGAGGAATGTCCTCCAAACTCCAGCTCAAATGCAAACCGAACAGTTCGGGATGTATATGAAAGATGGACAAAGGCAAATGGCAAAGCCCGAGTGTACATTTTAGCCAACATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTATTGCTAAAGAGATTATGGAATCTCTGAAAGAGATGTTTGGACAACCATCTTTCTCCCTTAAACATGATGCCATAAAATACGTTTACAATTGCCGTATGAAAGAAGGGACCTTAGTTAGAGAACATGTCTTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGGAGTCTCTTTCGAAGAGCTTCTTCCAGTTTTGCACAAATGTGATAATGAACAAAATAGAATATAACTTGACTGTTCTTCTCAATGAGCTACAGACTTATCAGTTTCTCTTAACGAACAAGGGACAAACAGGAGAAACAAATGTTGTTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAAGAAGTCTGATCCTTCAACTTCTAAATGTGTTTTGATGAATAAGAAGGGTAAAGGGAAAAATAAGATTCCTCCTAACCACAAACACAAGGTTCAAAAAACAGATAAAGGAAAATATTTTCATTGCAATGAAAACGAGCACTAG

mRNA sequence

ATGGAGACTATGATGGAAAGAGCAAATGTTAGAGAAGAAGAAGAAGACACCATGTCTAGATTCCTTGGAGGTTTGAATCAAGAAATTGCTCATCTTGTTGACAGAAATCCACCACCATATCTAGAAGACATGTATCATTATGCTCTCAAAATTGAAGACCAATTGAAGGAAGAAAAAGAGCATTCAAAAAGGTTTCTTTTAACTGAGGAATGTCCTCCAAACTCCAGCTCAAATGCAAACCGAACAGTTCGGGATGTATATGAAAGATGGACAAAGGCAAATGGCAAAGCCCGAGTGTACATTTTAGCCAACATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTATTGCTAAAGAGATTATGGAATCTCTGAAAGAGATGTTTGGACAACCATCTTTCTCCCTTAAACATGATGCCATAAAATACGTTTACAATTGCCGTATGAAAGAAGGGACCTTAGTTAGAGAACATGTCTTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGGAGTCTCTTTCGAAGAGCTTCTTCCAGTTTTGCACAAATGTGATAATGAACAAAATAGAATATAACTTGACTGTTCTTCTCAATGAGCTACAGACTTATCAGTTTCTCTTAACGAACAAGGGACAAACAGGAGAAACAAATGTTGTTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAAGAAGTCTGATCCTTCAACTTCTAAATGTGTTTTGATGAATAAGAAGGGTAAAGGGAAAAATAAGATTCCTCCTAACCACAAACACAAGGTTCAAAAAACAGATAAAGGAAAATATTTTCATTGCAATGAAAACGAGCACTAG

Coding sequence (CDS)

ATGGAGACTATGATGGAAAGAGCAAATGTTAGAGAAGAAGAAGAAGACACCATGTCTAGATTCCTTGGAGGTTTGAATCAAGAAATTGCTCATCTTGTTGACAGAAATCCACCACCATATCTAGAAGACATGTATCATTATGCTCTCAAAATTGAAGACCAATTGAAGGAAGAAAAAGAGCATTCAAAAAGGTTTCTTTTAACTGAGGAATGTCCTCCAAACTCCAGCTCAAATGCAAACCGAACAGTTCGGGATGTATATGAAAGATGGACAAAGGCAAATGGCAAAGCCCGAGTGTACATTTTAGCCAACATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTATTGCTAAAGAGATTATGGAATCTCTGAAAGAGATGTTTGGACAACCATCTTTCTCCCTTAAACATGATGCCATAAAATACGTTTACAATTGCCGTATGAAAGAAGGGACCTTAGTTAGAGAACATGTCTTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGGAGTCTCTTTCGAAGAGCTTCTTCCAGTTTTGCACAAATGTGATAATGAACAAAATAGAATATAACTTGACTGTTCTTCTCAATGAGCTACAGACTTATCAGTTTCTCTTAACGAACAAGGGACAAACAGGAGAAACAAATGTTGTTATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAAAAGAAGTCTGATCCTTCAACTTCTAAATGTGTTTTGATGAATAAGAAGGGTAAAGGGAAAAATAAGATTCCTCCTAACCACAAACACAAGGTTCAAAAAACAGATAAAGGAAAATATTTTCATTGCAATGAAAACGAGCACTAG

Protein sequence

METMMERANVREEEEDTMSRFLGGLNQEIAHLVDRNPPPYLEDMYHYALKIEDQLKEEKEHSKRFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIMESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFLLTNKGQTGETNVVISKKLLRGSSSKKKSDPSTSKCVLMNKKGKGKNKIPPNHKHKVQKTDKGKYFHCNENEH
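
The relationship between the coding sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch that assumes the standard genetic code and uses only short fragments copied from the start and end of the CDS; the names `CODON_TABLE` and `translate` are illustrative, and ambiguous bases such as `N` are not handled.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 21 nt of the CDS listed above.
print(translate("ATGGAGACTATGATGGAAAGAGCA"))  # METMMERA
print(translate("TGCAATGAAAACGAGCACTAG"))     # CNENEH
```

Both fragments agree with the first eight and last six residues of the protein sequence listed above.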
BLAST of CmaCh19G003230 vs. TrEMBL
Match: E2GK51_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 2.1e-66
Identity = 143/233 (61.37%), Positives = 176/233 (75.54%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           RF+LTEECP   + NANRTVR+ Y+RW KAN KARVYILA+++DVLAKKHD +  AK IM
Sbjct: 36  RFVLTEECPQAPALNANRTVREAYDRWVKANDKARVYILASMTDVLAKKHDSIATAKGIM 95

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           +SL+EMFGQPS+SL+H+AIK++Y  RMKEGT VREHVLDMM+HFN+AE N   IDE +QV
Sbjct: 96  DSLREMFGQPSWSLRHEAIKHIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQV 155

Query: 184 SFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFLLTNKGQTGETNVVISK-KLL 243
           SFI++SL KSF  F TN  +NKIE+NLT LLNELQ +Q L  +KG+  E NV ++K K +
Sbjct: 156 SFILQSLPKSFVPFQTNASLNKIEFNLTTLLNELQRFQNLTLSKGKEVEANVAVTKRKFI 215

Query: 244 RGSSSKKKSDPSTSKCVLMNKKGKGKNKIPPNHKHKVQKTDKGKYFHCNENEH 296
           RGSSSK K  PS ++   M KKGKGK    PN     +  DKGK FHCN++ H
Sbjct: 216 RGSSSKNKVGPSKAQ---MKKKGKGK---APNTSKVKKNADKGKCFHCNQDGH 262
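
The Identity and Positives percentages reported for each HSP are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the alignment above (the helper name `percent` is illustrative):

```python
def percent(matches: int, aligned: int) -> float:
    """Share of aligned positions, as a percentage rounded to two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned, 2)


# Counts from the E2GK51_BRYDI HSP: 143 identical and 176 positive
# positions over an alignment of 233 columns.
identity = percent(143, 233)
positives = percent(176, 233)
print(identity, positives)
```

Positives include identical residues plus conservative substitutions (the `+` positions in the middle line of the alignment), so the value is never lower than Identity.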

BLAST of CmaCh19G003230 vs. TrEMBL
Match: W9ST61_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003432 PE=4 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 1.5e-40
Identity = 106/232 (45.69%), Positives = 146/232 (62.93%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           +F+L EECP   ++N ++T R+ Y+ W KAN  A+ ++LAN+SDVL KKH+ M  A EIM
Sbjct: 11  KFVLVEECPQELAANTSKTTREPYDHWIKANNNAKCFMLANMSDVLRKKHEEMETAYEIM 70

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           ESL+ MFG PS   + DA+    N +MK+G+ V+ HVL+M+ H + AE N A IDE +QV
Sbjct: 71  ESLEAMFGTPSEKARLDAVWAFMNDKMKKGSSVKAHVLNMIDHLHDAELNGARIDEATQV 130

Query: 184 SFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFLLTNKGQTGETNVVISKKLLR 243
             I+ESLS +F QF  N +MNK + NLT L+N LQ ++   TNK + GE NV+++     
Sbjct: 131 GIILESLSPNFHQFVNNFVMNKKKSNLTELMNNLQNFE--STNKRRGGEANVLVAGGY-- 190

Query: 244 GSSSKKKSDPSTSKCVLMNKKGKGKNKIPPNHKHKVQKTDKGKYFHCNENEH 296
           G + +K  +    K        KGKNK P N K  +QK  KGK FHCN + H
Sbjct: 191 GKNKRKNQNQGKGK--------KGKNKKPRNTKGPIQK-PKGKCFHCNGDWH 229

BLAST of CmaCh19G003230 vs. TrEMBL
Match: E2GK52_BRYDI (Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1)

HSP 1 Score: 164.5 bits (415), Expect = 2.0e-37
Identity = 81/129 (62.79%), Positives = 104/129 (80.62%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           RF+LTEEC    + NANRTVR+ Y+RW KAN KA VYILA+++DVLAKK+D +   K IM
Sbjct: 36  RFILTEECHQAPALNANRTVREAYDRWGKANDKACVYILASMTDVLAKKYDSIATTKGIM 95

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           +S +EMFGQPS+SL+H+AIK +Y  RMKEGT VREHVLDMM+HFN+A+ +   IDE +QV
Sbjct: 96  DSFREMFGQPSWSLRHEAIKRIYTKRMKEGTSVREHVLDMMMHFNIAKVHGGPIDEANQV 155

Query: 184 SFIMESLSK 193
           SFI++SL +
Sbjct: 156 SFILQSLRR 164

BLAST of CmaCh19G003230 vs. TrEMBL
Match: W9RV37_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006900 PE=4 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 2.9e-36
Identity = 97/213 (45.54%), Positives = 135/213 (63.38%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           +F+L EECPP  + NA +T R+ Y+RW KAN KA+ ++LA++SDVL KKH+ M  A EIM
Sbjct: 11  KFVLVEECPPEPAVNATKTAREPYDRWIKANNKAKCFMLASMSDVLRKKHEEMETAYEIM 70

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           ESL+ MFG PS   +  A++   N +MK+G+ V+ HVL+M+ H + AE N A IDE ++V
Sbjct: 71  ESLEAMFGPPSEKARLAAVRAFMNDKMKKGSSVKVHVLNMIDHLHDAELNGARIDETTKV 130

Query: 184 SFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFLLTNKGQTGETNVVISKKLLR 243
             I+ES S  F++F  N +MNK + NLT L+N+LQ ++   TNK + GE NVV++     
Sbjct: 131 GIILESPSPVFYEFVNNFVMNKKKSNLTELMNDLQNFE--STNKRKGGEANVVVA----- 190

Query: 244 GSSSKKKSDPSTSKCVLMNKKGKGKNKIPPNHK 277
           GSS K +            K  KGKNK P   K
Sbjct: 191 GSSGKIQRTNQN-----QGKGKKGKNKRPKKAK 211

BLAST of CmaCh19G003230 vs. TrEMBL
Match: W9SH28_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004989 PE=4 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 1.9e-35
Identity = 80/166 (48.19%), Positives = 116/166 (69.88%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           +F+L +ECPP  ++NA +T R+ Y+RW KAN KA+ ++LA++SDVL KKH+ M  A EIM
Sbjct: 36  KFVLVDECPPEPAANATKTAREPYDRWIKANNKAKCFMLASMSDVLCKKHEEMETAYEIM 95

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           ESL+ MFG PS   + DA++   N +MK+G+ V+ HVL+M+ H + AE N A IDE +Q+
Sbjct: 96  ESLEAMFGAPSEKARLDAVRAFMNDKMKKGSSVKAHVLNMIDHLHDAELNGARIDEATQL 155

Query: 184 SFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFLLTNKGQ 230
             I+ESLS  F +F  N +MNK + NLT L+N+LQ ++     KG+
Sbjct: 156 GIILESLSPDFHEFVNNFVMNKKKSNLTELMNDLQNFESTNQAKGR 201

BLAST of CmaCh19G003230 vs. NCBI nr
Match: gi|299474487|gb|ADJ18449.1| (gag/pol protein [Bryonia dioica])

HSP 1 Score: 260.8 bits (665), Expect = 2.9e-66
Identity = 143/233 (61.37%), Positives = 176/233 (75.54%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           RF+LTEECP   + NANRTVR+ Y+RW KAN KARVYILA+++DVLAKKHD +  AK IM
Sbjct: 36  RFVLTEECPQAPALNANRTVREAYDRWVKANDKARVYILASMTDVLAKKHDSIATAKGIM 95

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           +SL+EMFGQPS+SL+H+AIK++Y  RMKEGT VREHVLDMM+HFN+AE N   IDE +QV
Sbjct: 96  DSLREMFGQPSWSLRHEAIKHIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQV 155

Query: 184 SFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFLLTNKGQTGETNVVISK-KLL 243
           SFI++SL KSF  F TN  +NKIE+NLT LLNELQ +Q L  +KG+  E NV ++K K +
Sbjct: 156 SFILQSLPKSFVPFQTNASLNKIEFNLTTLLNELQRFQNLTLSKGKEVEANVAVTKRKFI 215

Query: 244 RGSSSKKKSDPSTSKCVLMNKKGKGKNKIPPNHKHKVQKTDKGKYFHCNENEH 296
           RGSSSK K  PS ++   M KKGKGK    PN     +  DKGK FHCN++ H
Sbjct: 216 RGSSSKNKVGPSKAQ---MKKKGKGK---APNTSKVKKNADKGKCFHCNQDGH 262

BLAST of CmaCh19G003230 vs. NCBI nr
Match: gi|659113933|ref|XP_008456826.1| (PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo])

HSP 1 Score: 241.5 bits (615), Expect = 1.9e-60
Identity = 137/236 (58.05%), Positives = 172/236 (72.88%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           RF+L E+CP  S++NA RTVR+ YERW KAN KAR Y+LA++S+VLAKK++ M  A+EIM
Sbjct: 36  RFVLVEKCPQVSAANATRTVREAYERWAKANEKARAYLLASLSEVLAKKNESMLTAREIM 95

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           +SL+EMFGQ S+ +KHDA+KY+YN RM +G LVREHVL+MMV+FNVAE N AVIDE +QV
Sbjct: 96  DSLQEMFGQASYQIKHDALKYIYNARMNDGALVREHVLNMMVYFNVAEMNGAVIDEANQV 155

Query: 184 SFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFLLTNKGQTGETNVVIS-KKLL 243
           SFI+ESL +SF QF +NV+MNKI Y LT LLNELQT++ L+  KGQ GE NV  S +K  
Sbjct: 156 SFILESLLESFLQFRSNVVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFH 215

Query: 244 RGSSSKKKSDPSTSKCVLMNKK--GKG-KNKIPPNHKHKVQKTDKGKYFHCNENEH 296
           RGS+S  K  PS+S      KK  G+G K  +      K  K  KG  FHCN+  H
Sbjct: 216 RGSTSGTKYMPSSSGNKKWKKKKGGQGNKANLAATKTSKKAKVAKGICFHCNQEGH 271

BLAST of CmaCh19G003230 vs. NCBI nr
Match: gi|778697615|ref|XP_011654359.1| (PREDICTED: uncharacterized protein LOC105435361 [Cucumis sativus])

HSP 1 Score: 189.5 bits (480), Expect = 8.3e-45
Identity = 91/134 (67.91%), Positives = 113/134 (84.33%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           RF+LTEECP N +SNANRT R+ Y+RW KAN KARVYILA++SDVLAKKH+ +  AKEIM
Sbjct: 36  RFVLTEECPQNPASNANRTGREAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIM 95

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           +SL+ MFGQP +SL+H+A+KY+Y  RMKEGT VREHVLDMM+HFN+A+ N  +I+E +QV
Sbjct: 96  DSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAQVNGGLIEEVNQV 155

Query: 184 SFIMESLSKSFFQF 198
           SFI+ESL KSF  F
Sbjct: 156 SFILESLPKSFIPF 169

BLAST of CmaCh19G003230 vs. NCBI nr
Match: gi|659118732|ref|XP_008459275.1| (PREDICTED: uncharacterized protein LOC103498451 [Cucumis melo])

HSP 1 Score: 180.3 bits (456), Expect = 5.1e-42
Identity = 89/160 (55.62%), Positives = 118/160 (73.75%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           RF+L EEC    +    ++VRD Y+RW KAN KA VYI+A++SD+L+ KH +M   ++I+
Sbjct: 25  RFILMEECSLFLTQGTFKSVRDAYDRWKKANDKAHVYIMASMSDILSNKHKIMVTTRQIV 84

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           +SL+EMFGQ S  +K + IKYVYN RMK+   V++HVL+M+VHFNV E N  V DEKSQV
Sbjct: 85  DSLREMFGQLSIQIKQETIKYVYNARMKDSQSVKKHVLNMIVHFNVVEMNVVVFDEKSQV 144

Query: 184 SFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFL 224
           SFI++ L KS  QF  N  MNKI+YN+T+ LNELQT+Q L
Sbjct: 145 SFILKYLPKSSLQFNNNAEMNKIKYNMTIFLNELQTFQSL 184

BLAST of CmaCh19G003230 vs. NCBI nr
Match: gi|703151425|ref|XP_010110116.1| (hypothetical protein L484_003432 [Morus notabilis])

HSP 1 Score: 174.9 bits (442), Expect = 2.1e-40
Identity = 106/232 (45.69%), Positives = 146/232 (62.93%), Query Frame = 1

Query: 64  RFLLTEECPPNSSSNANRTVRDVYERWTKANGKARVYILANISDVLAKKHDVMGIAKEIM 123
           +F+L EECP   ++N ++T R+ Y+ W KAN  A+ ++LAN+SDVL KKH+ M  A EIM
Sbjct: 11  KFVLVEECPQELAANTSKTTREPYDHWIKANNNAKCFMLANMSDVLRKKHEEMETAYEIM 70

Query: 124 ESLKEMFGQPSFSLKHDAIKYVYNCRMKEGTLVREHVLDMMVHFNVAEENEAVIDEKSQV 183
           ESL+ MFG PS   + DA+    N +MK+G+ V+ HVL+M+ H + AE N A IDE +QV
Sbjct: 71  ESLEAMFGTPSEKARLDAVWAFMNDKMKKGSSVKAHVLNMIDHLHDAELNGARIDEATQV 130

Query: 184 SFIMESLSKSFFQFCTNVIMNKIEYNLTVLLNELQTYQFLLTNKGQTGETNVVISKKLLR 243
             I+ESLS +F QF  N +MNK + NLT L+N LQ ++   TNK + GE NV+++     
Sbjct: 131 GIILESLSPNFHQFVNNFVMNKKKSNLTELMNNLQNFE--STNKRRGGEANVLVAGGY-- 190

Query: 244 GSSSKKKSDPSTSKCVLMNKKGKGKNKIPPNHKHKVQKTDKGKYFHCNENEH 296
           G + +K  +    K        KGKNK P N K  +QK  KGK FHCN + H
Sbjct: 191 GKNKRKNQNQGKGK--------KGKNKKPRNTKGPIQK-PKGKCFHCNGDWH 229

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name      E-value   Identity   Description
E2GK51_BRYDI    2.1e-66   61.37      Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
W9ST61_9ROSA    1.5e-40   45.69      Uncharacterized protein OS=Morus notabilis GN=L484_003432 PE=4 SV=1
E2GK52_BRYDI    2.0e-37   62.79      Gag/pol protein (Fragment) OS=Bryonia dioica PE=4 SV=1
W9RV37_9ROSA    2.9e-36   45.54      Uncharacterized protein OS=Morus notabilis GN=L484_006900 PE=4 SV=1
W9SH28_9ROSA    1.9e-35   48.19      Uncharacterized protein OS=Morus notabilis GN=L484_004989 PE=4 SV=1

BLAST vs. NCBI nr
Match Name                          E-value   Identity   Description
gi|299474487|gb|ADJ18449.1|         2.9e-66   61.37      gag/pol protein [Bryonia dioica]
gi|659113933|ref|XP_008456826.1|    1.9e-60   58.05      PREDICTED: uncharacterized protein LOC103496664 [Cucumis melo]
gi|778697615|ref|XP_011654359.1|    8.3e-45   67.91      PREDICTED: uncharacterized protein LOC105435361 [Cucumis sativus]
gi|659118732|ref|XP_008459275.1|    5.1e-42   55.63      PREDICTED: uncharacterized protein LOC103498451 [Cucumis melo]
gi|703151425|ref|XP_010110116.1|    2.1e-40   45.69      hypothetical protein L484_003432 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh19G003230.1  CmaCh19G003230.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source  Source Term  Source Description  Alignment
None      No IPR available  PFAM    PF14223      Retrotran_gag_2     coord: 90..219; score: 7.3

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                   Block
CmaCh19G003230  Watermelon (97103) v2      cmawmbB551
CmaCh19G003230  Watermelon (97103) v2      cmawmbB560
CmaCh19G003230  Cucurbita moschata (Rifu)  cmacmoB508
CmaCh19G003230  Cucurbita moschata (Rifu)  cmacmoB509
CmaCh19G003230  Cucurbita moschata (Rifu)  cmacmoB511
CmaCh19G003230  Cucurbita pepo (Zucchini)  cmacpeB522
CmaCh19G003230  Bottle gourd (USVL1VR-Ls)  cmalsiB482