Cucsa.319790 (gene) Cucumber (Gy14) v1

Name: Cucsa.319790
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: scaffold03078 : 469776 .. 474170 (-)
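The Location field packs a scaffold name, two coordinates, and a strand into one string. The sketch below shows one way to split it and derive the span length. The function names and the regular expression are illustrative, not part of any database API. The code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Pattern for the "scaffold : start .. end (strand)" layout seen in this record.
# The layout is inferred from the single Location line, not from a documented
# format specification.
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into (scaffold, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = match.groups()
    return scaffold, int(start), int(end), strand


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end, strand = parse_location("scaffold03078 : 469776 .. 474170 (-)")
print(scaffold, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 4395 positions on the minus strand of scaffold03078.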
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACGTATATTAAAGATAGTCAATTCATTTGATGCAACATTGAATGATAAGAAGTATAAGTGAAGACCACAAGATTGAAGATCTGATGAATTAATTCAAGGGAAGCCTGAATTCATCTTCGAGGAGATTTGGGATTACCTTCAGAGGGGCCTGATTTTATTTTAAGTTGTGGTGATAACTTATCAAAGTAGCTTTAAATTTAAATTACTGACTTTTGAATCTCAATAGTATTTCTTATATCTAAGCAAAGAAACTCCCTAGACATTGGGTGAACTTGTCGAATTAGGTTACAAAAGCCTTCTGTGTTATCTGTTACTTTTGTGTTTTAACTATTCTTCTAGTTAATTACTTTAAATATAGCTAAAGATTATTTTTTATCATCTTACTTATTTTAATTGGTATTAGGACGGGTTCAGTATCTATAGAGACACTCAGAGGAGGTAGATTGACTACTAGACCACCTATACTTGATGGTAAAAACTATTCATATCGAAAAACTCGTATGACTTCGTTTATTAAGTTTATTGATATGAAGGCCTCGATGGTTGTTACTGTTGGGTGGAAGCCACTTATGATTACTGTTGATGGTAAATCTGTTCAAAAGCCTGAAAGGATTGGACAGATGCAGAAAAGCAAGCCTCATTAGGAAGGTCTCGAGCTCTAAATGCCATATACAATGGAGTAGATTTACATGTGTTCAAACTGATAAATTCTTGCAGTACTGCAAAAGACGCGTGGAAAGCACTTTAATTGGCATTTAAAGGTACTTCTAAAGTAAAGATCTCTCGTCTGCAACTTTTAACGTCTAAGTCTGAAACCTTGAGAATGATGGAAGAAGAGACCATCGCTGAATATAATGTGAGGGTTCTGGAAATCGCAAATGACTCCTTTAATTTTGGTGAGGGAACCCCAAAATCCAAACTTGTCAAAAAGGTGTTGCGATCAATGCCTAGAAAGTTTGATATGAAAGTTATGGTCATTGAAGAAGCACATGATATCACTACTCTGACACTAGATGAATTATTTGGATCTTTGTGTGCTTTCAAAATTGTTATATTTGATAGAGTAGACAATAAATGGAAATAGATTGTCTTTCAATTTGTTCATGAAGACGTATGCTTTGAGAAAGGAAAATATTCAAATGAACAAGTAAATGAGTCCATCACTATGTTGACCAAACAATATTCTAATATTATAAAAAAATTCAGAGATCTAAACAACTATGCGTCAAATAATTGAGAACAAAATAATTATAGAAGGAGAGATTTTGAAGAACAAAATGGCAGGGAGAAGAGATTTTTTAAATGCAATGAATGTGGTGGGAATGACCTTTATCAAGCTGGATGTCCAACTTATCTGAGGAGACAAAAGAAAAGTTTTGGTGTCACTCTCTCTGATGAAGAATCGAATGGAAGTGAAGAGGAAGAAGAATTTTCCAATGCATTTATCAGTATTATATCTAAAGGTGATCTTGTGTCTGATACGGAAGACATCGACTGTCAAATCGAGAATAATATGTCTTTTGATCAAATAGACAACAGTGGAAGGAGGATTCCTTAGCTCGGGCTATTCAGAAAGAAAAAATTCAAGAATTGATTGATGAAAATCAAAGATTACTAACTGTCATATCTGTTGGCTTTTTGGCCCTTAATCTCAAATTAATTTATTTTAATTATTCAATTAATTTATGTTTTGATGATAACAAATCTAATTTTTTTTATTGACAATTTTATTAATTTGAATAGACCATATCGTATAGCTTGGAAGAATGATTCTAACGCCTCTTGAATCACCCAAATCGAAGTTCAAATGAAGAAAATATTGCTAAAAGAAGCTTCACGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTAGGGTTGCCATACCAATCCAACAATATCATCATTGAAGCTGAAATTAAAAGAAGCTCAAGCAGTGTCACACCATAAACCGTATGACGAGGAAAAGCCGAACTAGTAAAAATAGGGTGCGAGTTTGTTACAACCTTGAAATTTTCTTATTGCAAAATCTTATTCTCTACTCCTTTTGAACTATTAATACTCAAAACGTAACGATGGTCAATGTAACGCCAAGCAAATTATATACAAACAAAAGCGTATACTTTATTAGTAACTGGTGTTTCATTATTCCTACTATCTAAACCTTGGAGTCAAAATTTTTCAACCTTCACAATCATCTTCAACAGGCTTTTCTTTGTTTTGAGAGTCTATCGTCACGATCTTTTGTTGGAATCGAACAAGTGTAGATTCAAAAGTCCAATACTGAATCAAGTTTTTCCCTTGATCATAAGGCAATGTTGATGGCAGAACACTTGAAGTCATCTGGACTACAACATGATTCATCTGACTAATGATTCGACTACAACATGAAGTCATTTCACGAGCCAACTTTAGAATCAGCTCCTTCAACAAGAAACACTTTTCAACATGGTGACTAATGACTTGATGATATTTATCGTAGTTAGGATCATCTACTTTTTCTGCTTATTTCGGTCGTTTGCATTCTGACAATTGAATAAATTATTTGTGTAGCAACTGCTCTAACATATTTTCAACATCAGAGTTGGGAAACTGATCAACTTTTTCATATCTTTGGAAGTGTTTGTTATATTCTTGCAAATCCAGAGTATCACAGAAAATGGGATGTCAAGTCAGATAAAAGAATCTTCTTGGGCTATTCTCCTAATAGTCGCGCTTACATAGTGTACAATACACACACTCAATTTGTTACGGAAACCATTAGTATTGTTGTCAACGATGATGAGAAACTCCCACTTAGAGATTGTGATGATGACCAGGCAGTATTGATACAAATTACAATTTGTCCTACCGAATGCATTGTACACTAAACCTTAAATACAAAATATTTAAACCTAAACCTTAAAAACAAAATTCTTTTAGATGTGCCTAAACCTAAACCGAATGCATACTAAATTTAAATTGAAGAACCTTTGATGTTGAGACTAAAACAAAGAATCTGGTGCATCTTGTTGTTTCATCATCGCATGTCACGAAGAAATCATCCAACGAGTTGTATTATTGTCAATCTCAACAATGGGATCACAACCTAAAAAAAGGATCGAATAGATTACGAAAAGTTAATGGCTAATATTTGTTATATTTCATCTCCTGAACATGTTTCTGTTGCAGATGCACTTAAAGATGAGTTCTGGATAAATGCAACGCAAGAAGAAC
TCTTGTTGTTTCAAAGAAATAATGTCTAGACCTTGGTGCCTAAACCTAAACATGTTAATGTCATTGACACAAAGTGGATATTTAAAAACAAAATTGATGAACAAGGTTGTGTTACAAGAAAAAAGGATAGATTGGTTACTCAAGGATACTCTCAAGTTGAAGGCATAGATTTTGATGAAACTTTTGCACCATTAGCTAGACTTGAAGCTATACGATTGTTGCTTGGCATAGCATGTCTGCAAAGGATCAAATTATATCAAATGGATGTCAAGAGTGCCTTCTTAAATGACTACTTACATGAAGAAGTATATGTAGCTCAACCAAAAGGGTTAATAGATCAAGTGTTTTTTCAGCATGTGTACAAATTAAATAAGGCTCTCTATGATCTTAAATAA

mRNA sequence

atgaacacactcagaggaggtagattgactactagaccacctatacttgatggtaaaaactattcatatcgaaaaactcgtatgacttcgtttattaagtttattgatatgaaggcctcgatggttgttactgttgggtggaagccacttatgattactgattggacagatgcagaaaagcaagcctcattaggaaggtctcgagctctaaatgccatatacaatggagtagatttacatgtgttcaaactgataaattcttgcagtacttctaaagtaaagatctctcgtctgcaacttttaacgtctaagtctgaaaccttgagaatgatggaagaagagaccatcgctgaatataatgtgagggttctggaaatcgcaaatgactcctttaattttggtgagggaaccccaaaatccaaacttgtcaaaaaggtgttgcgatcaatgcctagaaagtttgatatgaaagttatggtcattgaagaagcacatgatatcactactctgacactagatgaattatttggatctttgtgtgctttcaaaattgttatatttgatagaagattttttaaatgcaatgaatgtggtgggaatgacctttatcaagctggatgtccaacttatctgaggagacaaaagaaaagttttggtgtcactctctctgatgaagaatcgaatggaagtgaagaggaagaagaattttccaatgcatttatcagtattatatctaaaggtgatcttgtgtctgatacggaagacatcgactgtcaaatcgagaataatatgtcttttgatcaaatagacaacacaactgctctaacatattttcaacatcagagttgggaaactgatcaactttttcatatctttggaagtgtttgttatattcttgcaaatccagagtatcacagaaaatgggatgtcaagtcagataaaagaatcttcttgggctattctcctaatagtcgcgcttacatagtgtacaatacacacactcaatttgttacggaaaccattagtattgttgtcaacgatgatgagaaactcccacttagagattgtgatgatgaccaggcaaccttggtgcctaaacctaaacatgttaatgtcattgacacaaagtggatatttaaaaacaaaattgatgaacaaggttgtgttacaagaaaaaaggatagattggttactcaaggatactctcaagttgaaggcatagattttgatgaaacttttgcaccattagctagacttgaagctatacgattgttgcttggcatagcatgtctgcaaaggatcaaattatatcaaatggatgtcaagagtgccttcttaaatgactacttacatgaagaagtatatgtagctcaaccaaaagggttaatagatcaagtgttttttcagcatgtgtacaaattaaataaggctctctatgatcttaaataa

Coding sequence (CDS)

ATGAACACACTCAGAGGAGGTAGATTGACTACTAGACCACCTATACTTGATGGTAAAAACTATTCATATCGAAAAACTCGTATGACTTCGTTTATTAAGTTTATTGATATGAAGGCCTCGATGGTTGTTACTGTTGGGTGGAAGCCACTTATGATTACTGATTGGACAGATGCAGAAAAGCAAGCCTCATTAGGAAGGTCTCGAGCTCTAAATGCCATATACAATGGAGTAGATTTACATGTGTTCAAACTGATAAATTCTTGCAGTACTTCTAAAGTAAAGATCTCTCGTCTGCAACTTTTAACGTCTAAGTCTGAAACCTTGAGAATGATGGAAGAAGAGACCATCGCTGAATATAATGTGAGGGTTCTGGAAATCGCAAATGACTCCTTTAATTTTGGTGAGGGAACCCCAAAATCCAAACTTGTCAAAAAGGTGTTGCGATCAATGCCTAGAAAGTTTGATATGAAAGTTATGGTCATTGAAGAAGCACATGATATCACTACTCTGACACTAGATGAATTATTTGGATCTTTGTGTGCTTTCAAAATTGTTATATTTGATAGAAGATTTTTTAAATGCAATGAATGTGGTGGGAATGACCTTTATCAAGCTGGATGTCCAACTTATCTGAGGAGACAAAAGAAAAGTTTTGGTGTCACTCTCTCTGATGAAGAATCGAATGGAAGTGAAGAGGAAGAAGAATTTTCCAATGCATTTATCAGTATTATATCTAAAGGTGATCTTGTGTCTGATACGGAAGACATCGACTGTCAAATCGAGAATAATATGTCTTTTGATCAAATAGACAACACAACTGCTCTAACATATTTTCAACATCAGAGTTGGGAAACTGATCAACTTTTTCATATCTTTGGAAGTGTTTGTTATATTCTTGCAAATCCAGAGTATCACAGAAAATGGGATGTCAAGTCAGATAAAAGAATCTTCTTGGGCTATTCTCCTAATAGTCGCGCTTACATAGTGTACAATACACACACTCAATTTGTTACGGAAACCATTAGTATTGTTGTCAACGATGATGAGAAACTCCCACTTAGAGATTGTGATGATGACCAGGCAACCTTGGTGCCTAAACCTAAACATGTTAATGTCATTGACACAAAGTGGATATTTAAAAACAAAATTGATGAACAAGGTTGTGTTACAAGAAAAAAGGATAGATTGGTTACTCAAGGATACTCTCAAGTTGAAGGCATAGATTTTGATGAAACTTTTGCACCATTAGCTAGACTTGAAGCTATACGATTGTTGCTTGGCATAGCATGTCTGCAAAGGATCAAATTATATCAAATGGATGTCAAGAGTGCCTTCTTAAATGACTACTTACATGAAGAAGTATATGTAGCTCAACCAAAAGGGTTAATAGATCAAGTGTTTTTTCAGCATGTGTACAAATTAAATAAGGCTCTCTATGATCTTAAATAA

Protein sequence

MNTLRGGRLTTRPPILDGKNYSYRKTRMTSFIKFIDMKASMVVTVGWKPLMITDWTDAEKQASLGRSRALNAIYNGVDLHVFKLINSCSTSKVKISRLQLLTSKSETLRMMEEETIAEYNVRVLEIANDSFNFGEGTPKSKLVKKVLRSMPRKFDMKVMVIEEAHDITTLTLDELFGSLCAFKIVIFDRRFFKCNECGGNDLYQAGCPTYLRRQKKSFGVTLSDEESNGSEEEEEFSNAFISIISKGDLVSDTEDIDCQIENNMSFDQIDNTTALTYFQHQSWETDQLFHIFGSVCYILANPEYHRKWDVKSDKRIFLGYSPNSRAYIVYNTHTQFVTETISIVVNDDEKLPLRDCDDDQATLVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAIRLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK*
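The protein sequence above appears to be the translation of the CDS under the standard genetic code. The following is a minimal sketch of that translation, not the pipeline that produced this record. The `translate` helper is a hypothetical name, and only the first and last few codons of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered by
# base in the sequence T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        first, second, third = (BASES.index(base) for base in cds[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First ten and last six codons of the CDS listed in this record.
print(translate("ATGAACACACTCAGAGGAGGTAGATTGACT"))
print(translate("CTCTATGATCTTAAATAA"))
```

The two fragments translate to the start (MNTLRGGRLT) and end (LYDLK*) of the protein sequence shown in the record. The sketch raises a `ValueError` on ambiguous bases such as N, so it is not suitable for the gene sequence with its run of N characters.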
BLAST of Cucsa.319790 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 111.7 bits (278), Expect = 2.3e-23
Identity = 64/184 (34.78%), Positives = 104/184 (56.52%), Query Frame = 1

Query: 310  VKSDKRIFLGYSPNSRAYIVYNTHTQF--VTETISIVVNDDEKLPLRDCDDDQA------ 369
            +K+  +I      NS   +V N HT F  V  +   +   D+K    +  + +       
Sbjct: 861  LKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKIN 920

Query: 370  ---TLVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLAR 429
               T+  +P++ N++D++W+F  K +E G   R K RLV +G++Q   ID++ETFAP+AR
Sbjct: 921  NTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVAR 980

Query: 430  LEAIRLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKAL 483
            + + R +L +     +K++QMDVK+AFLN  L EE+Y+  P+G+       +V KLNKA+
Sbjct: 981  ISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAI 1040
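Each HSP header reports its identity and positive-match figures as a count over the alignment length, followed by a percentage. The arithmetic can be checked with a short sketch like the one below. The function name is illustrative, and the regular expression simply picks up every `count/length` pair on the line rather than relying on the field labels.

```python
import re


def hsp_percentages(header_line):
    """Return the percentage, to two decimals, for each count/length pair."""
    pairs = re.findall(r"(\d+)/(\d+)", header_line)
    return [f"{100 * int(count) / int(length):.2f}" for count, length in pairs]


# Header line of the COPIA_DROME hit, HSP 1.
line = "Identity = 64/184 (34.78%), Positives = 104/184 (56.52%), Query Frame = 1"
print(hsp_percentages(line))
```

For this HSP, 64 identical residues over 184 aligned columns give 34.78%, and 104 positive-scoring columns give 56.52%, in agreement with the values printed in the header.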

BLAST of Cucsa.319790 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 97.1 bits (240), Expect = 5.8e-19
Identity = 52/120 (43.33%), Positives = 75/120 (62.50%), Query Frame = 1

Query: 363 LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
           LV  PK    +  KW+FK K D    + R K RLV +G+ Q +GIDFDE F+P+ ++ +I
Sbjct: 845 LVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSI 904

Query: 423 RLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK 482
           R +L +A    +++ Q+DVK+AFL+  L EE+Y+ QP+G         V KLNK+LY LK
Sbjct: 905 RTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLK 964

BLAST of Cucsa.319790 vs. Swiss-Prot
Match: M820_ARATH (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 1.1e-09
Identity = 30/67 (44.78%), Positives = 43/67 (64.18%), Query Frame = 1

Query: 363 LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
           LVP P + N++  KW+FK K+   G + R K RLV +G+ Q EGI F ET++P+ R   I
Sbjct: 59  LVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATI 118

Query: 423 RLLLGIA 430
           R +L +A
Sbjct: 119 RTILNVA 125

BLAST of Cucsa.319790 vs. TrEMBL
Match: A2Q5V4_MEDTR (Gag-pol polyprotein, putative OS=Medicago truncatula GN=MtrDRAFT_AC169177g29v1 PE=4 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 7.5e-42
Identity = 86/120 (71.67%), Positives = 99/120 (82.50%), Query Frame = 1

Query: 363 LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
           LVP+P  +NVI TKWI+KNK DE G +TR K RLV QGY+QVE +DFDETFAP+ARLE+I
Sbjct: 16  LVPRPNDINVIGTKWIYKNKSDENGIITRNKARLVAQGYTQVERLDFDETFAPVARLESI 75

Query: 423 RLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK 482
           RLLLG+AC+ + KL+QMDVKSAFLN YLHEEV+V QPKG ID  F  HVYKL KALY LK
Sbjct: 76  RLLLGVACILKFKLFQMDVKSAFLNGYLHEEVFVEQPKGFIDPNFPDHVYKLKKALYGLK 135

BLAST of Cucsa.319790 vs. TrEMBL
Match: Q84VI0_SOYBN (Gag-pol polyprotein OS=Glycine max GN=gag-pol PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 9.9e-42
Identity = 83/120 (69.17%), Positives = 99/120 (82.50%), Query Frame = 1

Query: 363  LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
            LVP+P+  NVI TKWIFKNK +E+G +TR K RLV QGY+Q+EG+DFDETFAP+ARLE+I
Sbjct: 1099 LVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESI 1158

Query: 423  RLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK 482
            RLLLG+AC+ + KLYQMDVKSAFLN YL+EE YV QPKG +D     HVY+L KALY LK
Sbjct: 1159 RLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHLDHVYRLKKALYGLK 1218

BLAST of Cucsa.319790 vs. TrEMBL
Match: Q84VI0_SOYBN (Gag-pol polyprotein OS=Glycine max GN=gag-pol PE=4 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 4.9e-41
Identity = 103/240 (42.92%), Positives = 139/240 (57.92%), Query Frame = 1

Query: 1   MNTLRGGRLTTRPPILDGKNYSYRKTRMTSFIKFIDMKASMVVTVGWK-PLMIT------ 60
           MN  + G    RPPILDG NY Y K RM +F+K +D +    V  GW+ P M+       
Sbjct: 1   MNMEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  -------DWTDAEKQASLGRSRALNAIYNGVDLHVFKLINSCS----------------T 120
                  DWT  E + +LG S+ALNA++NGVD ++F+LIN+C+                T
Sbjct: 61  NELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACGEILKTTHEGT 120

Query: 121 SKVKISRLQLLTSKSETLRMMEEETIAEYNVRVLEIANDSFNFGEGTPKSKLVKKVLRSM 180
           SKVK+SRLQLL +K E L+M EEE I ++++ +LEIAN     GE     KLV+K+LRS+
Sbjct: 121 SKVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSL 180

Query: 181 PRKFDMKVMVIEEAHDITTLTLDELFGSLCAFKIVIFDRR-------FFKCNECGGNDLY 204
           P++FDMKV  IEEA DI  + +DEL GSL  F++ + DR         F  N+ G  D Y
Sbjct: 181 PKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRNEKKSKNLAFVSNDEGEEDEY 240


HSP 2 Score: 84.3 bits (207), Expect = 4.3e-13
Identity = 52/128 (40.62%), Positives = 74/128 (57.81%), Query Frame = 1

Query: 248  DLVSDTEDIDCQIENNMSFDQIDNTTALTYFQHQSWETDQLFHIFGSVCYILANPEYHRK 307
            +L ++  +  C I N ++  +   TT    ++ +   T + FHIFGS CYILA+ E  RK
Sbjct: 879  NLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRK-PTVKHFHIFGSPCYILADREQRRK 938

Query: 308  WDVKSDKRIFLGYSPNSRAYIVYNTHTQFVTETISIVVNDDEKLPLRDCDDDQATLVPKP 367
             D KSD  IFLGYS NSRAY V+N+ T+ V E+I++VV+D      +D ++D  T     
Sbjct: 939  MDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLTPARKKDVEEDVRT----- 998

Query: 368  KHVNVIDT 376
               NV DT
Sbjct: 999  SEDNVADT 1000


HSP 3 Score: 178.7 bits (452), Expect = 1.7e-41
Identity = 99/218 (45.41%), Positives = 133/218 (61.01%), Query Frame = 1

Query: 1   MNTLRGGRLTTRPPILDGKNYSYRKTRMTSFIKFIDMKASMVVTVGWK-PLMIT------ 60
           MN  + G    RPPILDG NY Y K RM +F+K +D +    V  GW+ P M+       
Sbjct: 1   MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  -------DWTDAEKQASLGRSRALNAIYNGVDLHVFKLINSCS---------------TS 120
                  DWT  E + +LG S+ALNA++NGVD ++F+LIN+C+               TS
Sbjct: 61  DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121 KVKISRLQLLTSKSETLRMMEEETIAEYNVRVLEIANDSFNFGEGTPKSKLVKKVLRSMP 180
           KVKISRLQLL +K E L+M EEE I ++++ +LEIAN     GE     KLV+K+LRS+P
Sbjct: 121 KVKISRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181 RKFDMKVMVIEEAHDITTLTLDELFGSLCAFKIVIFDR 190
           ++FDMKV  IEEA DI  + +DEL GSL  F++ + DR
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDR 218

BLAST of Cucsa.319790 vs. TrEMBL
Match: Q84VI4_SOYBN (Gag-pol polyprotein OS=Glycine max GN=gag-pol PE=4 SV=1)

HSP 1 Score: 177.9 bits (450), Expect = 2.9e-41
Identity = 84/120 (70.00%), Positives = 99/120 (82.50%), Query Frame = 1

Query: 363  LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
            LVP+P+  NVI TKWIFKNK +E+G +TR K RLV QGY+Q+EG+DFDETFAP+ARLE+I
Sbjct: 1097 LVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESI 1156

Query: 423  RLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK 482
            RLLLG+AC+ + KLYQMDVKSAFLN YL+EEVYV QPKG  D     HVY+L KALY LK
Sbjct: 1157 RLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGLK 1216


HSP 2 Score: 82.8 bits (203), Expect = 1.3e-12
Identity = 47/115 (40.87%), Positives = 68/115 (59.13%), Query Frame = 1

Query: 248 DLVSDTEDIDCQIENNMSFDQIDNTTALTYFQHQSWETDQLFHIFGSVCYILANPEYHRK 307
           +L ++  +  C I N ++  +   TT    ++ +       FHIFGS CYILA+ E  RK
Sbjct: 877 NLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPSVKH-FHIFGSPCYILADREQRRK 936

Query: 308 WDVKSDKRIFLGYSPNSRAYIVYNTHTQFVTETISIVVNDDEKLPLRDCDDDQAT 363
            D KSD  IFLGYS NSRAY V+N+ T+ V E+I++VV+D      +D ++D  T
Sbjct: 937 MDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLSPARKKDVEEDVRT 990


HSP 3 Score: 177.9 bits (450), Expect = 2.9e-41
Identity = 83/120 (69.17%), Positives = 99/120 (82.50%), Query Frame = 1

Query: 363  LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
            LVP+P+  NVI TKWIFKNK +E+G +TR K RLV QGY+Q+EG+DFDETFAP+ARLE+I
Sbjct: 1100 LVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESI 1159

Query: 423  RLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK 482
            RLLLG+AC+ + KLYQMDVKSAFLN YL+EE YV QPKG +D     HVY+L KALY LK
Sbjct: 1160 RLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKKALYGLK 1219

BLAST of Cucsa.319790 vs. TrEMBL
Match: Q84VH6_SOYBN (Gag-pol polyprotein OS=Glycine max GN=gag-pol PE=4 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 6.4e-41
Identity = 98/218 (44.95%), Positives = 133/218 (61.01%), Query Frame = 1

Query: 1   MNTLRGGRLTTRPPILDGKNYSYRKTRMTSFIKFIDMKASMVVTVGWK-PLMIT------ 60
           MN  + G    RPPILDG NY Y K RM +F+K +D +    V  GW+ P M+       
Sbjct: 1   MNMEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61  -------DWTDAEKQASLGRSRALNAIYNGVDLHVFKLINSCS---------------TS 120
                  DWT  E + +LG S+ALNA++NGVD ++F+LIN+C+               TS
Sbjct: 61  NELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTS 120

Query: 121 KVKISRLQLLTSKSETLRMMEEETIAEYNVRVLEIANDSFNFGEGTPKSKLVKKVLRSMP 180
           KVK+SRLQLL +K E L+M EEE I ++++ +LEIAN     GE     KLV+K+LRS+P
Sbjct: 121 KVKMSRLQLLATKFENLKMKEEECIHDFHMTILEIANACTALGERMTDEKLVRKILRSLP 180

Query: 181 RKFDMKVMVIEEAHDITTLTLDELFGSLCAFKIVIFDR 190
           ++FDMKV  IEEA DI  + +DEL GSL  F++ + DR
Sbjct: 181 KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDR 218


HSP 2 Score: 84.0 bits (206), Expect = 5.6e-13
Identity = 48/115 (41.74%), Positives = 70/115 (60.87%), Query Frame = 1

Query: 248 DLVSDTEDIDCQIENNMSFDQIDNTTALTYFQHQSWETDQLFHIFGSVCYILANPEYHRK 307
           +L ++  +  C I N ++  +   TT    ++ +   T + FHIFGS CYILA+ E  RK
Sbjct: 880 NLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRK-PTVKHFHIFGSPCYILADREQRRK 939

Query: 308 WDVKSDKRIFLGYSPNSRAYIVYNTHTQFVTETISIVVNDDEKLPLRDCDDDQAT 363
            D KSD  IFLGYS NSRAY V+N+ T+ V E+I++VV+D      +D ++D  T
Sbjct: 940 MDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVVDDLTPARKKDVEEDVRT 993


HSP 3 Score: 41.2 bits (95), Expect = 4.2e+00
Identity = 22/77 (28.57%), Positives = 40/77 (51.95%), Query Frame = 1

Query: 193 KCNECGGNDLYQAGCPTYLRRQKKSFGVTLSDEESNGSEEEEEFSNAFISIISKGDLVSD 252
           +C  C G    +A CPT+L++Q+K   V  SD+    SE+E +      ++  + +   D
Sbjct: 301 QCRGCEGYGHIKAECPTHLKKQRKGLSVCRSDDTE--SEQESDSDRDVNALTGRFESAED 360

Query: 253 TEDIDCQIENNMSFDQI 270
           + D D +I    +FD++
Sbjct: 361 SSDTDSEI----TFDEL 371


HSP 4 Score: 177.9 bits (450), Expect = 2.9e-41
Identity = 84/120 (70.00%), Positives = 99/120 (82.50%), Query Frame = 1

Query: 363  LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
            LVP+P+  NVI TKWIFKNK +E+G +TR K RLV QGY+Q+EG+DFDETFAP+ARLE+I
Sbjct: 1099 LVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESI 1158

Query: 423  RLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK 482
            RLLLG+AC+ + KLYQMDVKSAFLN YL+EEVYV QPKG  D     HVY+L KALY LK
Sbjct: 1159 RLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRLKKALYGLK 1218

BLAST of Cucsa.319790 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 94.0 bits (232), Expect = 2.8e-19
Identity = 48/120 (40.00%), Positives = 73/120 (60.83%), Query Frame = 1

Query: 367 PKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAIRLLL 426
           P +   I  KW++K K +  G + R K RLV +GY+Q EGIDF ETF+P+ +L +++L+L
Sbjct: 121 PPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLIL 180

Query: 427 GIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLI----DQVFFQHVYKLNKALYDLK 483
            I+ +    L+Q+D+ +AFLN  L EE+Y+  P G      D +    V  L K++Y LK
Sbjct: 181 AISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLK 240

BLAST of Cucsa.319790 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 66.2 bits (160), Expect = 6.2e-11
Identity = 30/67 (44.78%), Positives = 43/67 (64.18%), Query Frame = 1

Query: 363 LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
           LVP P + N++  KW+FK K+   G + R K RLV +G+ Q EGI F ET++P+ R   I
Sbjct: 59  LVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATI 118

Query: 423 RLLLGIA 430
           R +L +A
Sbjct: 119 RTILNVA 125

BLAST of Cucsa.319790 vs. NCBI nr
Match: gi|659126719|ref|XP_008463331.1| (PREDICTED: uncharacterized protein LOC103501512 [Cucumis melo])

HSP 1 Score: 183.3 bits (464), Expect = 9.8e-43
Identity = 102/190 (53.68%), Positives = 127/190 (66.84%), Query Frame = 1

Query: 1   MNTLRGGRLTTRPPILDGKNYSYRKTRMTSFIKFIDMKASMVVTVGWKPLMIT------- 60
           M  +R G   +RPP+LDGKNYSY K RM   IK +D KA   +  G++P M+T       
Sbjct: 1   MEIIREGPSASRPPVLDGKNYSYWKPRMIFSIKTLDGKAWRALVAGYEPPMVTMDEVSVL 60

Query: 61  ----DWTDAEKQASLGRSRALNAIYNGVDLHVFKLINSCST---------------SKVK 120
               DWTD E+QAS+G +RA+NAI+NGVDL+VFKLINSC+T               SKVK
Sbjct: 61  KPEVDWTDVEEQASVGNARAINAIFNGVDLNVFKLINSCTTTKEAWKILEVAYEGTSKVK 120

Query: 121 ISRLQLLTSKSETLRMMEEETIAEYNVRVLEIANDSFNFGEGTPKSKLVKKVLRSMPRKF 165
           IS LQL+TSK E L+M E+E+++EYN RVLEIANDS    E   +SK+V+KVLR + RKF
Sbjct: 121 ISSLQLITSKFEVLKMTEDESVSEYNERVLEIANDSLLLDEKISESKIVRKVLRYLLRKF 180

BLAST of Cucsa.319790 vs. NCBI nr
Match: gi|659074589|ref|XP_008437686.1| (PREDICTED: uncharacterized protein LOC103483027 [Cucumis melo])

HSP 1 Score: 180.3 bits (456), Expect = 8.3e-42
Identity = 100/197 (50.76%), Positives = 128/197 (64.97%), Query Frame = 1

Query: 1   MNTLRGGRLTTRPPILDGKNYSYRKTRMTSFIKFIDMKASMVVTVGWKPLMIT------- 60
           M  +R G   + PP+LDGKNYSY K  M  FIK +D KA  V+  G++P M+T       
Sbjct: 1   MEIIREGPSASHPPVLDGKNYSYWKPHMIFFIKTLDGKAWRVLVGGYEPPMVTVNEVLVP 60

Query: 61  ----DWTDAEKQASLGRSRALNAIYNGVDLHVFKLINSCSTSKVKISRLQLLTSKSETLR 120
               +WTDAE+QAS+G +RA+NAI+ GVDL+                   L+TSK E L+
Sbjct: 61  KPEINWTDAEEQASVGNARAINAIFKGVDLN-------------------LITSKFEALK 120

Query: 121 MMEEETIAEYNVRVLEIANDSFNFGEGTPKSKLVKKVLRSMPRKFDMKVMVIEEAHDITT 180
           M E+E+++EYN RVLEIANDS   GE   +SK+V KVLRS+PRK DMKV+ IEEA DITT
Sbjct: 121 MTEDESVSEYNERVLEIANDSLLLGEKISESKIVCKVLRSLPRKLDMKVIAIEEAQDITT 178

Query: 181 LTLDELFGSLCAFKIVI 187
           L LDELFGSL  F++ +
Sbjct: 181 LKLDELFGSLLTFEMAV 178

BLAST of Cucsa.319790 vs. NCBI nr
Match: gi|659129653|ref|XP_008464774.1| (PREDICTED: uncharacterized protein LOC103502582 [Cucumis melo])

HSP 1 Score: 180.3 bits (456), Expect = 8.3e-42
Identity = 88/121 (72.73%), Positives = 100/121 (82.64%), Query Frame = 1

Query: 362 TLVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEA 421
           TLV KP+ VNVI TKWIFKNK DE GC T+ K R V Q Y+QVEG+DFDETF+P+ARLEA
Sbjct: 323 TLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARSVAQRYTQVEGVDFDETFSPVARLEA 382

Query: 422 IRLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDL 481
           IRLLLGI+C+ + KLYQMDVKSAFLN YL+EEVYVAQ KG +D    +HVYKLNKALY L
Sbjct: 383 IRLLLGISCIHKFKLYQMDVKSAFLNGYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGL 442

Query: 482 K 483
           K
Sbjct: 443 K 443

BLAST of Cucsa.319790 vs. NCBI nr
Match: gi|659129653|ref|XP_008464774.1| (PREDICTED: uncharacterized protein LOC103502582 [Cucumis melo])

HSP 1 Score: 107.1 bits (266), Expect = 8.9e-20
Identity = 71/169 (42.01%), Positives = 94/169 (55.62%), Query Frame = 1

Query: 110 MMEEETIAEYNVRVLEIANDSFNFGEGTPKSKLVKKVLRSMPRKFDMKVMVIEEAHDITT 169
           M E+E++++YN RVLEIAN+S    E  P SK+V+KV+RS+P KFDMKV  IEEAHDITT
Sbjct: 1   MNEDESVSDYNKRVLEIANESLLLCETIPDSKIVRKVVRSLPSKFDMKVTAIEEAHDITT 60

Query: 170 LTLDELFGSLCAFKIVIFDRRFFKCNECGGNDLY---QAGCPTYLRRQK------KSFGV 229
           L LDELFGSL  F++   DR   K         +   +AGC T     +      K F  
Sbjct: 61  LKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEAGCDTEANMDESIALLTKQFTN 120

Query: 230 TL--------SDEESNGSEEEEEFSNAFISIISKGDLVSDTEDIDCQIE 262
            L        +DEES  S +++   NAF   I+  +   D+E   C +E
Sbjct: 121 ALQNLKSPNATDEESVDSRDDDGNINAFTIQITDENTDDDSE---CLVE 166


HSP 2 Score: 179.9 bits (455), Expect = 1.1e-41
Identity = 86/120 (71.67%), Positives = 99/120 (82.50%), Query Frame = 1

Query: 363 LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
           LVP+P  +NVI TKWI+KNK DE G +TR K RLV QGY+QVE +DFDETFAP+ARLE+I
Sbjct: 16  LVPRPNDINVIGTKWIYKNKSDENGIITRNKARLVAQGYTQVERLDFDETFAPVARLESI 75

Query: 423 RLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK 482
           RLLLG+AC+ + KL+QMDVKSAFLN YLHEEV+V QPKG ID  F  HVYKL KALY LK
Sbjct: 76  RLLLGVACILKFKLFQMDVKSAFLNGYLHEEVFVEQPKGFIDPNFPDHVYKLKKALYGLK 135

BLAST of Cucsa.319790 vs. NCBI nr
Match: gi|29423276|gb|AAO73525.1| (gag-pol polyprotein [Glycine max])

HSP 1 Score: 179.5 bits (454), Expect = 1.4e-41
Identity = 83/120 (69.17%), Positives = 99/120 (82.50%), Query Frame = 1

Query: 363  LVPKPKHVNVIDTKWIFKNKIDEQGCVTRKKDRLVTQGYSQVEGIDFDETFAPLARLEAI 422
            LVP+P+  NVI TKWIFKNK +E+G +TR K RLV QGY+Q+EG+DFDETFAP+ARLE+I
Sbjct: 1099 LVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESI 1158

Query: 423  RLLLGIACLQRIKLYQMDVKSAFLNDYLHEEVYVAQPKGLIDQVFFQHVYKLNKALYDLK 482
            RLLLG+AC+ + KLYQMDVKSAFLN YL+EE YV QPKG +D     HVY+L KALY LK
Sbjct: 1159 RLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHLDHVYRLKKALYGLK 1218

The following BLAST results are available for this feature:
Match Name    E-value  Identity (%)  Description
COPIA_DROME   2.3e-23  34.78         Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
POLX_TOBAC    5.8e-19  43.33         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
M820_ARATH    1.1e-09  44.78         Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana GN=AtMg0...
Match Name    E-value  Identity (%)  Description
A2Q5V4_MEDTR  7.5e-42  71.67         Gag-pol polyprotein, putative OS=Medicago truncatula GN=MtrDRAFT_AC169177g29v1 P...
Q84VI0_SOYBN  9.9e-42  69.17         Gag-pol polyprotein OS=Glycine max GN=gag-pol PE=4 SV=1
Q84VI0_SOYBN  4.9e-41  42.92         Gag-pol polyprotein OS=Glycine max GN=gag-pol PE=4 SV=1
Q84VI4_SOYBN  2.9e-41  70.00         Gag-pol polyprotein OS=Glycine max GN=gag-pol PE=4 SV=1
Q84VH6_SOYBN  6.4e-41  44.95         Gag-pol polyprotein OS=Glycine max GN=gag-pol PE=4 SV=1
Match Name    E-value  Identity (%)  Description
AT4G23160.1   2.8e-19  40.00         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1   6.2e-11  44.78         ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)
Match Name                        E-value  Identity (%)  Description
gi|659126719|ref|XP_008463331.1|  9.8e-43  53.68         PREDICTED: uncharacterized protein LOC103501512 [Cucumis melo]
gi|659074589|ref|XP_008437686.1|  8.3e-42  50.76         PREDICTED: uncharacterized protein LOC103483027 [Cucumis melo]
gi|659129653|ref|XP_008464774.1|  8.3e-42  72.73         PREDICTED: uncharacterized protein LOC103502582 [Cucumis melo]
gi|659129653|ref|XP_008464774.1|  8.9e-20  42.01         PREDICTED: uncharacterized protein LOC103502582 [Cucumis melo]
gi|29423276|gb|AAO73525.1|        1.4e-41  69.17         gag-pol polyprotein [Glycine max]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR013103  RVT_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.319790.1  Cucsa.319790.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term  Source Description               Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727      RVT_2                            coord: 362..482, score: 2.2
None       No IPR available                                     PANTHER  PTHR11439    GAG-POL-RELATED RETROTRANSPOSON  coord: 289..482, score: 3.5
None       No IPR available                                     PFAM     PF14223      Retrotran_gag_2                  coord: 68..180, score: 3.9