Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAGAGAGGTTAGAGGAAAAGAAAAAGAGAGATTGAGAGATCACAAAAAACGAAAACATTAGATCAGAATAATGGAAATGGCGACGAAGACCGATAACAGAACCGAACGAAGAAGAAGAATTATGAGCAGAGAAATGGATCGAATGGCTGTCATCACCGGCCGTTTACGTAATCTCCCTCCATCCCCACCGCCTTCTCCATCTTCACCTTCCCCATTTCTTTACCATCAAACTCACCAACGCGGCCATTCTCACACCGGAATCTCCCCTTCCTTTTTCTCCAAGGACCTCCACAACAATCCTGATTCCCTTCCTGCTCTCCCCAACATCCAAGGTATATTCATTCCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCATTCTCATCACATTTCATTTCTTATTCATTCCATTCTTAATTAGGTTTTCTTTTCCCCCCTTCCATCTAATTTTTATGATTTACAACCTAGGTTTAATTTTATTGATTCGTTTTTTTTTTTGTATGTCTTAGAGATCGATTCTTTCTGTCTACATTGTATTGTTCATGTGGGATTCAAGTCGTGTTGTATATTTGATTATCGAGGCATCATGAGATTTCATTAGATGAGCTTCATAAAATGGATGCTAACTATGATTATTAAGATTATATTTACTTTTTAAATTATGATAATCCTCCCTCATATAATTGAGATAGGGTCACAGTTGGATAAGATTTTTAGAGAGAACTATATAAACATAAGTTTGTGAGTCCTGTATAAATTGCTATATCGAATCAGTTTACTTTGCTTGGCATAAGAAGATCGAAAAGGCTTTGTATGATCTCAATGTAGTTGGAAAGAGGTTATTTTATTGCAAGTGTGTGCTAATATTCATAAGGTTTCGGTTTGCTATTGATTGTATGTTGTATATTCGACTGAAGTTGGGTAGAACAATAATAAGAGAGATCTTTTGGTAATTGTGAAATTTTAAAAAGCAAGGTATTAAACATTAGTTTCTTCTCTTCGATCAAATCAATAACAATTTAATAGCTCTATCTTTGGTTATTGGTTCTTGTTTGAATCTTTTCTAACAATGTGTTGGATTATGATTTATCGTTCGAGGAAATTGGTAAGATGAGAAAATCTTTTTGATCATAAGTGGTAACAATACGAATGGGTATAATGGATTGATGGCTAAAGATATGTGTTAGAATTATCTTGCATTGATTATAAATTGAGTTGAATGTTGGTATGTAAATGGGAATGTAAGAGCAATTTATATCACTTAAACTAACATTTTTTTTAAAGGAGGAGCCCTGTTTGGACAATATATTATATTGAAACCATACAATCTGTTTAATTTTTAGTTTATAGGAAATAGTTCAATTTAGTTTCATTTATTTGCAGCTATTCCCAAGCCTAAAGATGCAAAAGCTACCCCTATACTGAAGCGTATGTCCATGAGTGAAGTTCGAGAAGAAAAAATTGCTGCCATAGAATTCCAAATCAACCACAAAAAACTCGACCCCATAGGAGAAGCACACACTGAAACAGTATCAACTCCATCTGCCTCATCAATGGATCAAAAAATTACCTCCATTGATAACGAGATACTCTTAAAAACACACCCTTCAAAGCCTAAACTCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCACACGAGTTTTCTGTTCGCTAGTTATCGCATCTTTGGCCGTTCTATCGCACGTCAATCACCCGCTTGCCATAATTAGGAACATGGTGAGGTCGGAAAGCGTGGTGGCCTCAAAACCCCTCTACATTCTACTGCTTACCGATGCAACGATCGTTGTGGCGAGAATGTTGGCTGAAAGACAGAAAGACAGTGGAGAGGCAGCGGAAGAAAGTGAGAAAATGAAGGAAGATGGACATATTTGGGACTCAGCTGTGAAAGTTTTGGAGAGAGGTTTGGTTTTTTATCAAGCTTTTCGTGCGATTTTCATTGATTTTAGTGTTTATGCAGTGGTGGTTATTTGTGGCATCTGTTTGCTGTAGCTTTAATTTATTGTCTTTGGTTTTGATATAGTACGTGCTTGAATCATCTTCAGTTCTTTGGAGCTTAGAAGATGAAGCCCATTTTGGATGATCTATTTAATTTGTTTGTTTTGATTATGCTTAATTTCTTTCTTTGGAAAAAAGGTACATAGCATAACTATATATACATATTGCAAAAGCATCATTTTGGTGTGCTTGCCTATGCCCAATCTTAGTTGTTGTTAATTATTATTTGAAATGTTAAAGCTTTGGATCATTTGAATTGCTTATTGTTATTGTTAGGAATAAAACCCCCGTATGTTTGGTTTTATCACATGTTATTTCAAGTGTTTTGTGCACATTACACTAGTTAATTTGGTTTGCTCTATACATTCGAAATGTTTCTAAAAATGTCAAATAAATAACTAATAAATGAGATGAACTAACAACTTCAATTTTGCTTTTAAAACACAATAACCAGCGGTGAGGTGGCGGTGATTTATTGTCGGCAATAGGCGGCGGCAGTGATCGCTAATGGTAGTTTTCGAGAGCCACAACTATCGGTAAGTGAAGTAGTGTAAGGATGCTTTGGCCCCTTTTTCATATGTCATAAAATTTGGAAACAATTCCCAATTAAGTACTAAAAAAAAACTTTTCTCATACTAACTTATTTTACAAATTAATTTGTTAAAATATTATTTTAAGTTATTGTCAAATACTATAATTTTTATCGAAAATAAAATACTTGAAAAGTCAAACCAAATGTATCCTTAAACTTTCTTAAACTTTCTCAATATGTTGGATAAAAAACGATCTTTCAATTTAAAATTCTATACTAAAAATCGAAATTATTCACGTTGATTTTCTTCTATAGAACAAGTAGGAAATGGAAATTTAAAAATTATTTAAAGAGGTCGTCGTAATGTAGCTGATAGTTTGACATTGTTGAATGCATAAAATATAAAATTATTAGAATTTGTTCGAATTTGTATAAACTAATTATGATAAATTTCTGCAAGGTATAGAATCTACATCCAACCATGGTTATTGCGACAAACCTTATAGAATAGATTTTTTCCCATTTTCTCACTCTAGCCAGTAAGTATCATCTTACTTAAGCCTTTCCACCAATTTAAACTCAAGTCCTATTCTAAGTACAAGTTAGAATAACATCAATGAAATTTATTATACATTTCATACAGATTTCAAAGTAACTGAAATTAAGCATGTTAAATACCCACTAGATTTATTGTATTTCAACTCCTTTTTCTCATAGCTAGTGAGCTCTGCCCATTAATTTTGTAGGTTTGTTAGAGTTGATGGTTGCTATTGATCTTGTATGTATTCTTTCTAATGAGAGATTTTTATTATTATTGAAAGTTAGGAATAATGATCATCTACATATTTTCATAATGGGTACGATATATGTTGTCCACTTTGAGCAAAAACTCTCATGGTTTTGCTTTTGGTTTCACCTAAAATACTTTATACCGTTAGTTAATTATAATTGTCCTCACTTATAAACACATGATCATTCTTTTATATCTAACCAATATAGAACTTTTGTTGTATTTGCCCAACCATTATTTATCCTAATGGATAGAACATGATCCATCATTTTTCACCTTTGATATCTTGATAAAAGTCCTACTATACTTTTAATTTTT
mRNA sequence
TAGAGAGGTTAGAGGAAAAGAAAAAGAGAGATTGAGAGATCACAAAAAACGAAAACATTAGATCAGAATAATGGAAATGGCGACGAAGACCGATAACAGAACCGAACGAAGAAGAAGAATTATGAGCAGAGAAATGGATCGAATGGCTGTCATCACCGGCCGTTTACGTAATCTCCCTCCATCCCCACCGCCTTCTCCATCTTCACCTTCCCCATTTCTTTACCATCAAACTCACCAACGCGGCCATTCTCACACCGGAATCTCCCCTTCCTTTTTCTCCAAGGACCTCCACAACAATCCTGATTCCCTTCCTGCTCTCCCCAACATCCAAGCTATTCCCAAGCCTAAAGATGCAAAAGCTACCCCTATACTGAAGCGTATGTCCATGAGTGAAGTTCGAGAAGAAAAAATTGCTGCCATAGAATTCCAAATCAACCACAAAAAACTCGACCCCATAGGAGAAGCACACACTGAAACAGTATCAACTCCATCTGCCTCATCAATGGATCAAAAAATTACCTCCATTGATAACGAGATACTCTTAAAAACACACCCTTCAAAGCCTAAACTCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCACACGAGTTTTCTGTTCGCTAGTTATCGCATCTTTGGCCGTTCTATCGCACGTCAATCACCCGCTTGCCATAATTAGGAACATGGTGAGGTCGGAAAGCGTGGTGGCCTCAAAACCCCTCTACATTCTACTGCTTACCGATGCAACGATCGTTGTGGCGAGAATGTTGGCTGAAAGACAGAAAGACAGTGGAGAGGCAGCGGAAGAAAGTGAGAAAATGAAGGAAGATGGACATATTTGGGACTCAGCTGTGAAAGTTTTGGAGAGAGGTTTGGTTTTTTATCAAGCTTTTCGTGCGATTTTCATTGATTTTAGTGTTTATGCAGTGGTGGTTATTTGTGGCATCTGTTTGCTGTAGCTTTAATTTATTGTCTTTGGTTTTGATATAGTACGTGCTTGAATCATCTTCAGTTCTTTGGAGCTTAGAAGATGAAGCCCATTTTGGATGATCTATTTAATTTGTTTGTTTTGATTATGCTTAATTTCTTTCTTTGGAAAAAAGGTACATAGCATAACTATATATACATATTGCAAAAGCATCATTTTGGTGTGCTTGCCTATGCCCAATCTTAGTTGTTGTTAATTATTATTTGAAATGTTAAAGCTTTGGATCATTTGAATTGCTTATTGTTATTGTTAGGAATAAAACCCCCGTATGTTTGGTTTTATCACATGTTATTTCAAGTGTTTTGTGCACATTACACTAGTTAATTTGGTTTGCTCTATACATTCGAAATGTTTCTAAAAATGTCAAATAAATAACTAATAAATGAGATGAACTAACAACTTCAATTTTGCTTTTAAAACACAATAACCAGCGGTGAGGTGGCGGTGATTTATTGTCGGCAATAGGCGGCGGCAGTGATCGCTAATGGTAGTTTTCGAGAGCCACAACTATCGGTAAGTGAAGTAGTGTAAGGATGCTTTGGCCCCTTTTTCATATGTCATAAAATTTGGAAACAATTCCCAATTAAGTACTAAAAAAAAACTTTTCTCATACTAACTTATTTTACAAATTAATTTGTTAAAATATTATTTTAAGTTATTGTCAAATACTATAATTTTTATCGAAAATAAAATACTTGAAAAGTCAAACCAAATGTATCCTTAAACTTTCTTAAACTTTCTCAATATGTTGGATAAAAAACGATCTTTCAATTTAAAATTCTATACTAAAAATCGAAATTATTCACGTTGATTTTCTTCTATAGAACAAGTAGGAAATGGAAATTTAAAAATTATTTAAAGAGGTCGTCGTAATGTAGCTGATAGTTTGACATTGTTGAATGCATAAAATATAAAATTATTAGAATTTGTTCGAATTTGTATAAACTAATTATGATAAATTTCTGCAAGGTATAGAATCTACATCCAACCATGGTTATTGCGACAAACCTTATAGAATAGATTTTTTCCCATTTTCTCACTCTAGCCAGTAAGTATCATCTTACTTAAGCCTTTCCACCAATTTAAACTCAAGTCCTATTCTAAGTACAAGTTAGAATAACATCAATGAAATTTATTATACATTTCATACAGATTTCAAAGTAACTGAAATTAAGCATGTTAAATACCCACTAGATTTATTGTATTTCAACTCCTTTTTCTCATAGCTAGTGAGCTCTGCCCATTAATTTTGTAGGTTTGTTAGAGTTGATGGTTGCTATTGATCTTGTATGTATTCTTTCTAATGAGAGATTTTTATTATTATTGAAAGTTAGGAATAATGATCATCTACATATTTTCATAATGGGTACGATATATGTTGTCCACTTTGAGCAAAAACTCTCATGGTTTTGCTTTTGGTTTCACCTAAAATACTTTATACCGTTAGTTAATTATAATTGTCCTCACTTATAAACACATGATCATTCTTTTATATCTAACCAATATAGAACTTTTGTTGTATTTGCCCAACCATTATTTATCCTAATGGATAGAACATGATCCATCATTTTTCACCTTTGATATCTTGATAAAAGTCCTACTATACTTTTAATTTTT
Coding sequence (CDS)
ATGGAAATGGCGACGAAGACCGATAACAGAACCGAACGAAGAAGAAGAATTATGAGCAGAGAAATGGATCGAATGGCTGTCATCACCGGCCGTTTACGTAATCTCCCTCCATCCCCACCGCCTTCTCCATCTTCACCTTCCCCATTTCTTTACCATCAAACTCACCAACGCGGCCATTCTCACACCGGAATCTCCCCTTCCTTTTTCTCCAAGGACCTCCACAACAATCCTGATTCCCTTCCTGCTCTCCCCAACATCCAAGCTATTCCCAAGCCTAAAGATGCAAAAGCTACCCCTATACTGAAGCGTATGTCCATGAGTGAAGTTCGAGAAGAAAAAATTGCTGCCATAGAATTCCAAATCAACCACAAAAAACTCGACCCCATAGGAGAAGCACACACTGAAACAGTATCAACTCCATCTGCCTCATCAATGGATCAAAAAATTACCTCCATTGATAACGAGATACTCTTAAAAACACACCCTTCAAAGCCTAAACTCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCACACGAGTTTTCTGTTCGCTAGTTATCGCATCTTTGGCCGTTCTATCGCACGTCAATCACCCGCTTGCCATAATTAGGAACATGGTGAGGTCGGAAAGCGTGGTGGCCTCAAAACCCCTCTACATTCTACTGCTTACCGATGCAACGATCGTTGTGGCGAGAATGTTGGCTGAAAGACAGAAAGACAGTGGAGAGGCAGCGGAAGAAAGTGAGAAAATGAAGGAAGATGGACATATTTGGGACTCAGCTGTGAAAGTTTTGGAGAGAGGTTTGGTTTTTTATCAAGCTTTTCGTGCGATTTTCATTGATTTTAGTGTTTATGCAGTGGTGGTTATTTGTGGCATCTGTTTGCTGTAG
Protein sequence
MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHSHTGISPSFFSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIEFQINHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASILASQTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARMLAERQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICLL
Homology
BLAST of PI0020391 vs. ExPASy TrEMBL
Match:
A0A0A0L815 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G481250 PE=4 SV=1)
HSP 1 Score: 498.8 bits (1283), Expect = 1.6e-137
Identity = 268/300 (89.33%), Postives = 280/300 (93.33%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATK +NRT+RRRRIMSRE+DRMA+ITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS
Sbjct: 1 MEMATKIENRTDRRRRIMSREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
Query: 61 HTGISPSFFSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIEFQ 120
HTGISPSFFSKD+H NPDS P LPN Q +PKPKDAKATP+LKR+SMSE REEKIAAI FQ
Sbjct: 61 HTGISPSFFSKDIHANPDS-PPLPNAQGVPKPKDAKATPLLKRLSMSEAREEKIAAIGFQ 120
Query: 121 INHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASILAS 180
INHKKLDPIGE HTETVSTPSASSM QK+TS DNEILLK HPSKPKLFTSKRLNASILAS
Sbjct: 121 INHKKLDPIGEIHTETVSTPSASSMVQKVTSTDNEILLKAHPSKPKLFTSKRLNASILAS 180
Query: 181 QTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARMLAE 240
QTTRVFCSL+IASLAVLSHVNHPL++I MVRSE VVASKPLYILLLTDATIVVARMLA
Sbjct: 181 QTTRVFCSLIIASLAVLSHVNHPLSMIWKMVRSERVVASKPLYILLLTDATIVVARMLAA 240
Query: 241 RQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICLL 300
RQKDS EA EESEKMKEDGH WDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGI LL
Sbjct: 241 RQKDSREAEEESEKMKEDGHNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299
BLAST of PI0020391 vs. ExPASy TrEMBL
Match:
A0A1S3BH05 (uncharacterized protein LOC103489936 OS=Cucumis melo OX=3656 GN=LOC103489936 PE=4 SV=1)
HSP 1 Score: 498.8 bits (1283), Expect = 1.6e-137
Identity = 270/302 (89.40%), Postives = 282/302 (93.38%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEM TKTDNRTERRRRI+SREMDRMA+ITGRL NLPPSPPPSPSSPSPFL+HQTHQRGHS
Sbjct: 1 MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60
Query: 61 HTGISPSFFSKDL--HNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIE 120
HTGISPSFFSKDL HNNPDSLP PN Q IPKPKDAKATP+LKR+SMSE REEKIAAI
Sbjct: 61 HTGISPSFFSKDLHNHNNPDSLP-FPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAAIG 120
Query: 121 FQINHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASIL 180
FQ NHKKLDPIGE HTETVSTPSASSM QKITSID++ILLKTHPSKPKLFTSKR+NASIL
Sbjct: 121 FQFNHKKLDPIGEVHTETVSTPSASSMVQKITSIDDKILLKTHPSKPKLFTSKRINASIL 180
Query: 181 ASQTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARML 240
ASQTTRVFCSL+IASL+VLSHVNHPL+II NMVRSESVVASKPLYILLLTDATIV+ARML
Sbjct: 181 ASQTTRVFCSLIIASLSVLSHVNHPLSIIWNMVRSESVVASKPLYILLLTDATIVLARML 240
Query: 241 AERQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGIC 300
AERQKD G A EE EKMKEDG WDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGIC
Sbjct: 241 AERQKDGGVAEEEIEKMKEDGRNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGIC 300
BLAST of PI0020391 vs. ExPASy TrEMBL
Match:
A0A5A7U8R0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold323G001130 PE=4 SV=1)
HSP 1 Score: 470.7 bits (1210), Expect = 4.6e-129
Identity = 270/364 (74.18%), Postives = 282/364 (77.47%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEM TKTDNRTERRRRI+SREMDRMA+ITGRL NLPPSPPPSPSSPSPFL+HQTHQRGHS
Sbjct: 1 MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60
Query: 61 HTGISPSFFSKDL--HNNPDSLPALPN--------------------------------- 120
HTGISPSFFSKDL HNNPDSLP PN
Sbjct: 61 HTGISPSFFSKDLHNHNNPDSLP-FPNAQGIFIHSFIHSFFFFFFFFILITFLSFSDIYS 120
Query: 121 -----------------------------IQAIPKPKDAKATPILKRMSMSEVREEKIAA 180
I IPKPKDAKATP+LKR+SMSE REEKIAA
Sbjct: 121 ILSSFSLSRKSVRWFVLVNFFDEKIFFLIISGIPKPKDAKATPLLKRLSMSEAREEKIAA 180
Query: 181 IEFQINHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNAS 240
I FQ NHKKLDPIGE HTETVSTPSASSM QKITSID++ILLKTHPSKPKLFTSKR+NAS
Sbjct: 181 IGFQFNHKKLDPIGEVHTETVSTPSASSMVQKITSIDDKILLKTHPSKPKLFTSKRINAS 240
Query: 241 ILASQTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVAR 300
ILASQTTRVFCSL+IASL+VLSHVNHPL+II NMVRSESVVASKPLYILLLTDATIV+AR
Sbjct: 241 ILASQTTRVFCSLIIASLSVLSHVNHPLSIIWNMVRSESVVASKPLYILLLTDATIVLAR 300
BLAST of PI0020391 vs. ExPASy TrEMBL
Match:
A0A6J1KF67 (uncharacterized protein LOC111493320 OS=Cucurbita maxima OX=3661 GN=LOC111493320 PE=4 SV=1)
HSP 1 Score: 396.7 bits (1018), Expect = 8.4e-107
Identity = 225/300 (75.00%), Postives = 250/300 (83.33%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATKTD R ERRRRI SRE DRMA+ITGRLRNLPPSPPPSPSSP PF +H THQRGHS
Sbjct: 1 MEMATKTDVRAERRRRISSREGDRMALITGRLRNLPPSPPPSPSSP-PFFHHYTHQRGHS 60
Query: 61 HTGISPSFFSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIEFQ 120
HTGI+PSFF+KD H NPDS P LP + KPKD KA P+LK +S++EV AAIE+Q
Sbjct: 61 HTGINPSFFAKDTHKNPDSGP-LPQNHDVSKPKDEKAPPLLKHISINEVHNN--AAIEYQ 120
Query: 121 INHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASILAS 180
N KKLDPIGE TE + +PS+ +M QK IDNE L KT PSKP+L TSKRLNASILAS
Sbjct: 121 FNPKKLDPIGEGSTELILSPSSVTMVQK-ACIDNEPLPKTKPSKPRLITSKRLNASILAS 180
Query: 181 QTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARMLAE 240
QTTRVFCSL+IASLA+LS V+ PL IIRN VRSE+V+ASKPLYILLLT+ATIVVARMLAE
Sbjct: 181 QTTRVFCSLIIASLAILSQVDIPLTIIRNTVRSETVMASKPLYILLLTNATIVVARMLAE 240
Query: 241 RQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICLL 300
+QKD GEA EE EKMKED WDSAVKVLERGLVFYQAFRA+FIDFSVYAVVVICG+ +L
Sbjct: 241 KQKDRGEAEEECEKMKEDAQNWDSAVKVLERGLVFYQAFRAVFIDFSVYAVVVICGLSVL 295
BLAST of PI0020391 vs. ExPASy TrEMBL
Match:
A0A6J1GB34 (uncharacterized protein LOC111452396 OS=Cucurbita moschata OX=3662 GN=LOC111452396 PE=4 SV=1)
HSP 1 Score: 396.4 bits (1017), Expect = 1.1e-106
Identity = 225/300 (75.00%), Postives = 250/300 (83.33%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATKTD R ERRRRI SRE DRMA+ITGRLRNLPPSPPPSPSSP PF +H THQRGHS
Sbjct: 1 MEMATKTDVRAERRRRISSREGDRMALITGRLRNLPPSPPPSPSSP-PFFHHYTHQRGHS 60
Query: 61 HTGISPSFFSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIEFQ 120
HTGI+PSFF+KD H NPDS P LP + KPKD KA P+LK +S++EV AAIE+Q
Sbjct: 61 HTGINPSFFAKDTHKNPDSGP-LPQNHDVSKPKDEKAPPLLKHISINEVHNN--AAIEYQ 120
Query: 121 INHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASILAS 180
N KKLDPIGE TE + +PS+ +M QK IDNE L KT PSKP+L TSKRLNASILAS
Sbjct: 121 FNPKKLDPIGEGSTELILSPSSVTMVQK-ACIDNEPLPKTKPSKPRLITSKRLNASILAS 180
Query: 181 QTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARMLAE 240
QTTRVFCSL+IASLA+LS V+ PL IIRN+VRSE+V+ASKPLYILLLT+ATIVVARMLAE
Sbjct: 181 QTTRVFCSLIIASLAILSQVDIPLTIIRNIVRSETVMASKPLYILLLTNATIVVARMLAE 240
Query: 241 RQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICLL 300
+QKD GEA EE EKMKED WDSAVKVLERGLVFYQAFRA FIDFSVYAVVVICG+ +L
Sbjct: 241 KQKDRGEAEEECEKMKEDAQNWDSAVKVLERGLVFYQAFRAFFIDFSVYAVVVICGLSVL 295
BLAST of PI0020391 vs. NCBI nr
Match:
XP_004150355.1 (uncharacterized protein LOC101203675 [Cucumis sativus] >KGN58065.1 hypothetical protein Csa_023413 [Cucumis sativus])
HSP 1 Score: 498.8 bits (1283), Expect = 3.2e-137
Identity = 268/300 (89.33%), Postives = 280/300 (93.33%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATK +NRT+RRRRIMSRE+DRMA+ITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS
Sbjct: 1 MEMATKIENRTDRRRRIMSREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
Query: 61 HTGISPSFFSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIEFQ 120
HTGISPSFFSKD+H NPDS P LPN Q +PKPKDAKATP+LKR+SMSE REEKIAAI FQ
Sbjct: 61 HTGISPSFFSKDIHANPDS-PPLPNAQGVPKPKDAKATPLLKRLSMSEAREEKIAAIGFQ 120
Query: 121 INHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASILAS 180
INHKKLDPIGE HTETVSTPSASSM QK+TS DNEILLK HPSKPKLFTSKRLNASILAS
Sbjct: 121 INHKKLDPIGEIHTETVSTPSASSMVQKVTSTDNEILLKAHPSKPKLFTSKRLNASILAS 180
Query: 181 QTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARMLAE 240
QTTRVFCSL+IASLAVLSHVNHPL++I MVRSE VVASKPLYILLLTDATIVVARMLA
Sbjct: 181 QTTRVFCSLIIASLAVLSHVNHPLSMIWKMVRSERVVASKPLYILLLTDATIVVARMLAA 240
Query: 241 RQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICLL 300
RQKDS EA EESEKMKEDGH WDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGI LL
Sbjct: 241 RQKDSREAEEESEKMKEDGHNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299
BLAST of PI0020391 vs. NCBI nr
Match:
XP_008447503.1 (PREDICTED: uncharacterized protein LOC103489936 [Cucumis melo])
HSP 1 Score: 498.8 bits (1283), Expect = 3.2e-137
Identity = 270/302 (89.40%), Postives = 282/302 (93.38%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEM TKTDNRTERRRRI+SREMDRMA+ITGRL NLPPSPPPSPSSPSPFL+HQTHQRGHS
Sbjct: 1 MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60
Query: 61 HTGISPSFFSKDL--HNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIE 120
HTGISPSFFSKDL HNNPDSLP PN Q IPKPKDAKATP+LKR+SMSE REEKIAAI
Sbjct: 61 HTGISPSFFSKDLHNHNNPDSLP-FPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAAIG 120
Query: 121 FQINHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASIL 180
FQ NHKKLDPIGE HTETVSTPSASSM QKITSID++ILLKTHPSKPKLFTSKR+NASIL
Sbjct: 121 FQFNHKKLDPIGEVHTETVSTPSASSMVQKITSIDDKILLKTHPSKPKLFTSKRINASIL 180
Query: 181 ASQTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARML 240
ASQTTRVFCSL+IASL+VLSHVNHPL+II NMVRSESVVASKPLYILLLTDATIV+ARML
Sbjct: 181 ASQTTRVFCSLIIASLSVLSHVNHPLSIIWNMVRSESVVASKPLYILLLTDATIVLARML 240
Query: 241 AERQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGIC 300
AERQKD G A EE EKMKEDG WDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGIC
Sbjct: 241 AERQKDGGVAEEEIEKMKEDGRNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGIC 300
BLAST of PI0020391 vs. NCBI nr
Match:
KAA0050777.1 (uncharacterized protein E6C27_scaffold404G00270 [Cucumis melo var. makuwa] >TYK08569.1 uncharacterized protein E5676_scaffold323G001130 [Cucumis melo var. makuwa])
HSP 1 Score: 470.7 bits (1210), Expect = 9.4e-129
Identity = 270/364 (74.18%), Postives = 282/364 (77.47%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEM TKTDNRTERRRRI+SREMDRMA+ITGRL NLPPSPPPSPSSPSPFL+HQTHQRGHS
Sbjct: 1 MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60
Query: 61 HTGISPSFFSKDL--HNNPDSLPALPN--------------------------------- 120
HTGISPSFFSKDL HNNPDSLP PN
Sbjct: 61 HTGISPSFFSKDLHNHNNPDSLP-FPNAQGIFIHSFIHSFFFFFFFFILITFLSFSDIYS 120
Query: 121 -----------------------------IQAIPKPKDAKATPILKRMSMSEVREEKIAA 180
I IPKPKDAKATP+LKR+SMSE REEKIAA
Sbjct: 121 ILSSFSLSRKSVRWFVLVNFFDEKIFFLIISGIPKPKDAKATPLLKRLSMSEAREEKIAA 180
Query: 181 IEFQINHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNAS 240
I FQ NHKKLDPIGE HTETVSTPSASSM QKITSID++ILLKTHPSKPKLFTSKR+NAS
Sbjct: 181 IGFQFNHKKLDPIGEVHTETVSTPSASSMVQKITSIDDKILLKTHPSKPKLFTSKRINAS 240
Query: 241 ILASQTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVAR 300
ILASQTTRVFCSL+IASL+VLSHVNHPL+II NMVRSESVVASKPLYILLLTDATIV+AR
Sbjct: 241 ILASQTTRVFCSLIIASLSVLSHVNHPLSIIWNMVRSESVVASKPLYILLLTDATIVLAR 300
BLAST of PI0020391 vs. NCBI nr
Match:
XP_038890203.1 (uncharacterized protein LOC120079844 [Benincasa hispida])
HSP 1 Score: 463.0 bits (1190), Expect = 2.0e-126
Identity = 251/300 (83.67%), Postives = 275/300 (91.67%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATKTD+R ERRRRI+SRE+DRMA+ITGRLRNLPPSPPPSPSSPSPFL+HQTHQRG+S
Sbjct: 1 MEMATKTDSRKERRRRIVSREVDRMALITGRLRNLPPSPPPSPSSPSPFLFHQTHQRGYS 60
Query: 61 HTGISPSFFSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIEFQ 120
HTGISPSFFSK+LH NPDS+P L +I AIPKP+D KATP+LK MSM EV+EEKI+AI +Q
Sbjct: 61 HTGISPSFFSKELHKNPDSIP-LSHIHAIPKPEDGKATPLLKHMSMKEVQEEKISAIGYQ 120
Query: 121 INHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASILAS 180
++HKKLDPIGE HTE VSTPSA SM QK+ SIDNE KT PSKPKLFTSKRLNA ILAS
Sbjct: 121 MSHKKLDPIGEVHTEIVSTPSALSMVQKV-SIDNETRSKTQPSKPKLFTSKRLNACILAS 180
Query: 181 QTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARMLAE 240
QTTRVFCSL++ASLA+LS V+HPL IIRN+VRSESVVASKPLYILLLT+ATIVVARMLAE
Sbjct: 181 QTTRVFCSLILASLAILSQVDHPLFIIRNIVRSESVVASKPLYILLLTNATIVVARMLAE 240
Query: 241 RQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICLL 300
+QKDSGEA EE EKMKEDGH WDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGI LL
Sbjct: 241 KQKDSGEAEEELEKMKEDGHNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 298
BLAST of PI0020391 vs. NCBI nr
Match:
XP_022998749.1 (uncharacterized protein LOC111493320 [Cucurbita maxima])
HSP 1 Score: 396.7 bits (1018), Expect = 1.7e-106
Identity = 225/300 (75.00%), Postives = 250/300 (83.33%), Query Frame = 0
Query: 1 MEMATKTDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATKTD R ERRRRI SRE DRMA+ITGRLRNLPPSPPPSPSSP PF +H THQRGHS
Sbjct: 1 MEMATKTDVRAERRRRISSREGDRMALITGRLRNLPPSPPPSPSSP-PFFHHYTHQRGHS 60
Query: 61 HTGISPSFFSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIEFQ 120
HTGI+PSFF+KD H NPDS P LP + KPKD KA P+LK +S++EV AAIE+Q
Sbjct: 61 HTGINPSFFAKDTHKNPDSGP-LPQNHDVSKPKDEKAPPLLKHISINEVHNN--AAIEYQ 120
Query: 121 INHKKLDPIGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASILAS 180
N KKLDPIGE TE + +PS+ +M QK IDNE L KT PSKP+L TSKRLNASILAS
Sbjct: 121 FNPKKLDPIGEGSTELILSPSSVTMVQK-ACIDNEPLPKTKPSKPRLITSKRLNASILAS 180
Query: 181 QTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARMLAE 240
QTTRVFCSL+IASLA+LS V+ PL IIRN VRSE+V+ASKPLYILLLT+ATIVVARMLAE
Sbjct: 181 QTTRVFCSLIIASLAILSQVDIPLTIIRNTVRSETVMASKPLYILLLTNATIVVARMLAE 240
Query: 241 RQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICLL 300
+QKD GEA EE EKMKED WDSAVKVLERGLVFYQAFRA+FIDFSVYAVVVICG+ +L
Sbjct: 241 KQKDRGEAEEECEKMKEDAQNWDSAVKVLERGLVFYQAFRAVFIDFSVYAVVVICGLSVL 295
BLAST of PI0020391 vs. TAIR 10
Match:
AT1G52343.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32680.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 94.4 bits (233), Expect = 1.7e-19
Identity = 96/289 (33.22%), Postives = 138/289 (47.75%), Query Frame = 0
Query: 9 NRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHSHTGISPSF 68
+R ERRRRIM R DR+A+ITG+L NL PS P S SS S +H R +S + + +
Sbjct: 4 DREERRRRIMERGSDRLALITGQLHNLDPSSPSSSSSSS-----ASHNRTYSESFMPQT- 63
Query: 69 FSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVREEKIAAIEFQINHKKLDP 128
D H +S P LK EV+ + HK L
Sbjct: 64 -KSDHHQILES-------------------PSLKYQFKEEVKARSEEPKLSTVLHKPLKI 123
Query: 129 IGEAHTETVSTPSASSMDQKITSIDNEILLKTHPSKPKLFTSKRLNASILASQTTRVFCS 188
E + +T S S +Q+ F+SK+LNASI++S+ TR S
Sbjct: 124 --EPTKQEEATRSQKSQNQRPIC---------------FFSSKKLNASIISSERTRSLSS 183
Query: 189 LVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILLLTDATIVVARMLAERQKDS-GE 248
L IA+ V L N+ S +++A +PL++L+LTD IV++ + E
Sbjct: 184 LTIAAFVV-------LLPRLNITSSNTILALRPLWLLILTDCAIVMSHLTTEASGGGLSH 242
Query: 249 AAEESEKMKE--DGHIWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVI 295
EE K ++ +G W A ++LERG+V YQA R +FID S+Y VVV+
Sbjct: 244 EMEEDGKGRDGNNGENWSDAERLLERGVVVYQALRGMFIDCSLYMVVVV 242
BLAST of PI0020391 vs. TAIR 10
Match:
AT4G32680.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G52343.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 79.3 bits (194), Expect = 5.7e-15
Identity = 89/311 (28.62%), Postives = 145/311 (46.62%), Query Frame = 0
Query: 7 TDNRTERRRRIMSREMDRMAVITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHSHTGISP 66
+++R RRR+I+ R DR+A ITG++ + PSPPPS S+ S +S
Sbjct: 3 SNSREARRRKILDRGSDRLAFITGQINGV-PSPPPSDSTSS----------------LSQ 62
Query: 67 SFFSKDLHNNPDSLPALPNIQAIPKPKDAKATPILKRMSMSEVRE----EKIAAIEFQIN 126
S D + PD++P P+ + KA I + + E + I Q
Sbjct: 63 SDLQTD-QSLPDTIP--------PRDQILKAQEIAFTSHQDNISDAAMLENVDHIIHQSR 122
Query: 127 HKKLDPIGEAHTETVSTPSASSMDQKIT--------SIDNEILLKTHPSKP--------K 186
+ L P + H ET++ SAS T S+ N ++ S+
Sbjct: 123 EEPLQP--QRHAETLAEASASDPRDTTTIQPPPTTSSVQNPSVVDLGASQAFIPVVSFVN 182
Query: 187 LFTSKRLNASILASQTTRVFCSLVIASLAVLSHVNHPLAIIRNMVRSESVVASKPLYILL 246
T K + A+I AS+ R+F +L IA + +LSH+ ++V+ +P+++L+
Sbjct: 183 AITPKHIGAAIDASEYARMFTALAIALVVILSHL--------GFSSLGNIVSFRPVFLLV 242
Query: 247 LTDATIVVARMLAERQKDSGEAAEESEKMKEDGHIWDSAVKVLERGLVFYQAFRAIFIDF 298
LTDATIV+ R+L + DS A S + I D LE ++ + A+ +DF
Sbjct: 243 LTDATIVLGRVLLSHRGDSSSA---SGTVMSGQGIVDQVGNALETVMMVKKIMDALLMDF 274
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0L815 | 1.6e-137 | 89.33 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G481250 PE=4 SV=1 | [more] |
A0A1S3BH05 | 1.6e-137 | 89.40 | uncharacterized protein LOC103489936 OS=Cucumis melo OX=3656 GN=LOC103489936 PE=... | [more] |
A0A5A7U8R0 | 4.6e-129 | 74.18 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A6J1KF67 | 8.4e-107 | 75.00 | uncharacterized protein LOC111493320 OS=Cucurbita maxima OX=3661 GN=LOC111493320... | [more] |
A0A6J1GB34 | 1.1e-106 | 75.00 | uncharacterized protein LOC111452396 OS=Cucurbita moschata OX=3662 GN=LOC1114523... | [more] |
Match Name | E-value | Identity | Description | |
XP_004150355.1 | 3.2e-137 | 89.33 | uncharacterized protein LOC101203675 [Cucumis sativus] >KGN58065.1 hypothetical ... | [more] |
XP_008447503.1 | 3.2e-137 | 89.40 | PREDICTED: uncharacterized protein LOC103489936 [Cucumis melo] | [more] |
KAA0050777.1 | 9.4e-129 | 74.18 | uncharacterized protein E6C27_scaffold404G00270 [Cucumis melo var. makuwa] >TYK0... | [more] |
XP_038890203.1 | 2.0e-126 | 83.67 | uncharacterized protein LOC120079844 [Benincasa hispida] | [more] |
XP_022998749.1 | 1.7e-106 | 75.00 | uncharacterized protein LOC111493320 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
AT1G52343.1 | 1.7e-19 | 33.22 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT4G32680.1 | 5.7e-15 | 28.62 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |