Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGATGTTGATGAGGTCTATCTTGATCTCCTTGCACTGAGGGCATTATATATCCTCCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACGTGTAAGCCTCTGTTAATTATTCTTCTCGTTCTCATTCTTGTTGTTTTCAACTTTATTAAAGATTGGATAATTATTCCTAACTTGTGGGATTCACACTTAAATTTGGAGTCATCACCATTCATTGATTGAATGGGGAAATGTCATTCCTATTCCCTCTTTTGGTGGGTTTTGCAACTTTCTTTTATTTATTCTTTTGTTTCATTTCTAGTCTTTTTCATCTCACCTCAATTTAAATTTGATTCTAATTTTAAACAATAGTAAACAATATCAATGCCATTCTATGTCATTCAAAGTTTAGTCCAAATGAAATTTTAATCAATATAATTATAGAAATTTTGTTCTAATTCCACAAATGCTATCATGAGAACTAGAGCTGTTCAATCATTATACTCATAGTCTGATTGTCTCAAGTGACTGTATTATGATTTGACTCCATCCTTATCTTATTTGGAAGGGGAGTGTTACCCCACCCAACTATTTTGGATACATGAAACTGATGTACCAAATTTGCTGCAGAGATTAGTAGTTTTTTGCTGCTCTGAAATATTACGGCCCCCTCCCCTGTGGTTTTTATGTGGATTTAGCAGAATAGAGAATGTGAGATAATGACAGTCTGAGTGAACTTTCTACTTGAATATTTCACTATTGTAGTTTGGCTTTATTGTCAAAGCAGCTGGATGAAAGGGCACAGATTTTGTTGAAGAATTTGCTCGATGATGCTACTGCAGGAGTTCTTGAGTTACACTCAAAGGTTTGTATTTCCTTCTGCATGAAAACCTAATGTTTTTTTGACAAAAAAAACTATTGGACAACGAAAGTTTACACTTTATGGATGTTTGGCTAATATACATTCCCATTTGAGTTTGTACAGATCTTGGCAACAGACTCTGGCTTTTTTAACAACTTTCGGCATAAAGAGGGATCAAGCTTTCTTACTGGCATTGATGCTAAACAGACGAAGCCACTGGACAAGAAAGTTGCTGAATGGATGGAACATAATCAAAGTGCAAGAAAGATGGGAAATCTGGAGACTGAAGACAATCCCAGAATGGCCAGATCTTCAGCTTTAAATGTCGCCACTAATCACTTATCAAATGGTATTAGTTTAGCTCTCAGAAGAATTGAACTTCACATTTTATCTCTGCAACGTTGTACAAGTCAAAGTAGGAGGAATACAAGAAGCCATATCAATGGAGCTAAATTAGCTAACTATCTTCAAGGGAATGAGATATTGAGCCAGCAGAAAGTTCAGTCAAGGACAGATCACTCAACTTTGAAGGCCAGAATTACTGAGCCGATTAGAGGTAGTCATAACTTGCGCAGTCATATAAGTCGTCATCTTCTTGGTGGACAGAATGTTAAGCCAGTAGTGAGGGGCGTTGAGTCACTAACTAGAGCGTGTCAGATGAACCATTGTTCTGAGTTCGTTCATGGGTTCAGAATACCTCTGAGTCAAGACAATGATGAGGTCAGGAAACCTCCAACCGTTGAAACCCAGATATCTAAAGAACACAAACTTATAAATCCAATGATTCTGATAGATAAATCTGGATGTTCAGTGGGATCCAAGGCTACCGTCAGGTCCGGTAGGAAACTGCTCAATCAACCTCGGATACAAGAAAGGAGGTGCCAGAATTCACCTGGTCGTATGATCATGAGGCCAACTTTGCTGGATCATATCTCCAGAGGAGTAGAAAGAGAAAAGGAAAACCATAAGAAGACCCATGTGGCTACTCAGCAAGAATCTGAAAACACAAACTCAGAATCAGAATCAGCTTCTTCTTCGAGTTGGGAAACTCAGCAGACCAGTGAAAGTGAAACCACTGATTACCCTTCTTCGCCAACTCACCAAAAGGGTCCACCGGCAACCGGTTCTGAAGCAAGTAGCCGGTACAGAAGCAGCAG
CATTTCAACAAAAACATTCAGATTCAGCCATGGGAAAAAGGGGTCCAAGAAAGCAATCGGACGGTTCAAGAGACTCAAGAACAAGTTAGGCCTTATCTTCCACCACCATCACCACCACCACCACCACCATAACACCAACACCTTCATGTGGAAGCATCTAAGAAAGATCTTCCATCTCCATCGCACAGATAACAAAAAACTAACAAGTGAAGGAGGATATGGGAAGCTAAAGAAATCAGCAATCAGAAGTGTGTCTCGCAAGAACCAAGTTGGGAAGTTTCAGGCTCTTGCTGAAGGGCTTCGGAGCCATGTTTGGAAATCGAAAGCCATGAAGAAGAAAGAGCTTAGGAGGCTGGGTGGTGGGAGGAAGAAGGGTGTGAAGAAGTTGCAGTGGTGGCAGATGTTTCGTCGCCGCCGTGGAGTGAAGTTACCCAAAAAAGGGCGTGTTAAGATAGGGTATGTAAACAGAAAACCACAGCTTAAGGTAGTTTAG
mRNA sequence
ATGGATGTTGATGAGGTCTATCTTGATCTCCTTGCACTGAGGGCATTATATATCCTCCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACGTCTGGATGAAAGGGCACAGATTTTGTTGAAGAATTTGCTCGATGATGCTACTGCAGGAGTTCTTGAGTTACACTCAAAGATCTTGGCAACAGACTCTGGCTTTTTTAACAACTTTCGGCATAAAGAGGGATCAAGCTTTCTTACTGGCATTGATGCTAAACAGACGAAGCCACTGGACAAGAAAGTTGCTGAATGGATGGAACATAATCAAAGTGCAAGAAAGATGGGAAATCTGGAGACTGAAGACAATCCCAGAATGGCCAGATCTTCAGCTTTAAATGTCGCCACTAATCACTTATCAAATGGTATTAGTTTAGCTCTCAGAAGAATTGAACTTCACATTTTATCTCTGCAACGTTGTACAAGTCAAAGTAGGAGGAATACAAGAAGCCATATCAATGGAGCTAAATTAGCTAACTATCTTCAAGGGAATGAGATATTGAGCCAGCAGAAAGTTCAGTCAAGGACAGATCACTCAACTTTGAAGGCCAGAATTACTGAGCCGATTAGAGGTAGTCATAACTTGCGCAGTCATATAAGTCGTCATCTTCTTGGTGGACAGAATGTTAAGCCAGTAGTGAGGGGCGTTGAGTCACTAACTAGAGCGTGTCAGATGAACCATTGTTCTGAGTTCGTTCATGGGTTCAGAATACCTCTGAGTCAAGACAATGATGAGGTCAGGAAACCTCCAACCGTTGAAACCCAGATATCTAAAGAACACAAACTTATAAATCCAATGATTCTGATAGATAAATCTGGATGTTCAGTGGGATCCAAGGCTACCGTCAGGTCCGGTAGGAAACTGCTCAATCAACCTCGGATACAAGAAAGGAGGTGCCAGAATTCACCTGGTCGTATGATCATGAGGCCAACTTTGCTGGATCATATCTCCAGAGGAGTAGAAAGAGAAAAGGAAAACCATAAGAAGACCCATGTGGCTACTCAGCAAGAATCTGAAAACACAAACTCAGAATCAGAATCAGCTTCTTCTTCGAGTTGGGAAACTCAGCAGACCAGTGAAAGTGAAACCACTGATTACCCTTCTTCGCCAACTCACCAAAAGGGTCCACCGGCAACCGGTTCTGAAGCAAGTAGCCGGTACAGAAGCAGCAGCATTTCAACAAAAACATTCAGATTCAGCCATGGGAAAAAGGGGTCCAAGAAAGCAATCGGACGGTTCAAGAGACTCAAGAACAAGTTAGGCCTTATCTTCCACCACCATCACCACCACCACCACCACCATAACACCAACACCTTCATGTGGAAGCATCTAAGAAAGATCTTCCATCTCCATCGCACAGATAACAAAAAACTAACAAGTGAAGGAGGATATGGGAAGCTAAAGAAATCAGCAATCAGAAGTGTGTCTCGCAAGAACCAAGTTGGGAAGTTTCAGGCTCTTGCTGAAGGGCTTCGGAGCCATGTTTGGAAATCGAAAGCCATGAAGAAGAAAGAGCTTAGGAGGCTGGGTGGTGGGAGGAAGAAGGGTGTGAAGAAGTTGCAGTGGTGGCAGATGTTTCGTCGCCGCCGTGGAGTGAAGTTACCCAAAAAAGGGCGTGTTAAGATAGGGTATGTAAACAGAAAACCACAGCTTAAGGTAGTTTAG
Coding sequence (CDS)
ATGGATGTTGATGAGGTCTATCTTGATCTCCTTGCACTGAGGGCATTATATATCCTCCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACGTCTGGATGAAAGGGCACAGATTTTGTTGAAGAATTTGCTCGATGATGCTACTGCAGGAGTTCTTGAGTTACACTCAAAGATCTTGGCAACAGACTCTGGCTTTTTTAACAACTTTCGGCATAAAGAGGGATCAAGCTTTCTTACTGGCATTGATGCTAAACAGACGAAGCCACTGGACAAGAAAGTTGCTGAATGGATGGAACATAATCAAAGTGCAAGAAAGATGGGAAATCTGGAGACTGAAGACAATCCCAGAATGGCCAGATCTTCAGCTTTAAATGTCGCCACTAATCACTTATCAAATGGTATTAGTTTAGCTCTCAGAAGAATTGAACTTCACATTTTATCTCTGCAACGTTGTACAAGTCAAAGTAGGAGGAATACAAGAAGCCATATCAATGGAGCTAAATTAGCTAACTATCTTCAAGGGAATGAGATATTGAGCCAGCAGAAAGTTCAGTCAAGGACAGATCACTCAACTTTGAAGGCCAGAATTACTGAGCCGATTAGAGGTAGTCATAACTTGCGCAGTCATATAAGTCGTCATCTTCTTGGTGGACAGAATGTTAAGCCAGTAGTGAGGGGCGTTGAGTCACTAACTAGAGCGTGTCAGATGAACCATTGTTCTGAGTTCGTTCATGGGTTCAGAATACCTCTGAGTCAAGACAATGATGAGGTCAGGAAACCTCCAACCGTTGAAACCCAGATATCTAAAGAACACAAACTTATAAATCCAATGATTCTGATAGATAAATCTGGATGTTCAGTGGGATCCAAGGCTACCGTCAGGTCCGGTAGGAAACTGCTCAATCAACCTCGGATACAAGAAAGGAGGTGCCAGAATTCACCTGGTCGTATGATCATGAGGCCAACTTTGCTGGATCATATCTCCAGAGGAGTAGAAAGAGAAAAGGAAAACCATAAGAAGACCCATGTGGCTACTCAGCAAGAATCTGAAAACACAAACTCAGAATCAGAATCAGCTTCTTCTTCGAGTTGGGAAACTCAGCAGACCAGTGAAAGTGAAACCACTGATTACCCTTCTTCGCCAACTCACCAAAAGGGTCCACCGGCAACCGGTTCTGAAGCAAGTAGCCGGTACAGAAGCAGCAGCATTTCAACAAAAACATTCAGATTCAGCCATGGGAAAAAGGGGTCCAAGAAAGCAATCGGACGGTTCAAGAGACTCAAGAACAAGTTAGGCCTTATCTTCCACCACCATCACCACCACCACCACCACCATAACACCAACACCTTCATGTGGAAGCATCTAAGAAAGATCTTCCATCTCCATCGCACAGATAACAAAAAACTAACAAGTGAAGGAGGATATGGGAAGCTAAAGAAATCAGCAATCAGAAGTGTGTCTCGCAAGAACCAAGTTGGGAAGTTTCAGGCTCTTGCTGAAGGGCTTCGGAGCCATGTTTGGAAATCGAAAGCCATGAAGAAGAAAGAGCTTAGGAGGCTGGGTGGTGGGAGGAAGAAGGGTGTGAAGAAGTTGCAGTGGTGGCAGATGTTTCGTCGCCGCCGTGGAGTGAAGTTACCCAAAAAAGGGCGTGTTAAGATAGGGTATGTAAACAGAAAACCACAGCTTAAGGTAGTTTAG
Protein sequence
MDVDEVYLDLLALRALYILLLKSCLRDANSERLDERAQILLKNLLDDATAGVLELHSKILATDSGFFNNFRHKEGSSFLTGIDAKQTKPLDKKVAEWMEHNQSARKMGNLETEDNPRMARSSALNVATNHLSNGISLALRRIELHILSLQRCTSQSRRNTRSHINGAKLANYLQGNEILSQQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLGGQNVKPVVRGVESLTRACQMNHCSEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINPMILIDKSGCSVGSKATVRSGRKLLNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHKKTHVATQQESENTNSESESASSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASSRYRSSSISTKTFRFSHGKKGSKKAIGRFKRLKNKLGLIFHHHHHHHHHHNTNTFMWKHLRKIFHLHRTDNKKLTSEGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELRRLGGGRKKGVKKLQWWQMFRRRRGVKLPKKGRVKIGYVNRKPQLKVV
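The protein sequence above is the conceptual translation of the coding sequence. The relationship can be checked with a minimal sketch such as the one below, which assumes the standard genetic code (NCBI translation table 1); the page itself does not state which table was used, and the helper name `translate` is illustrative rather than part of any tool behind this page.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above
cds_start = "ATGGATGTTGATGAGGTCTATCTTGATCTC"
print(translate(cds_start))  # MDVDEVYLDL
```

Passing the full CDS string to `translate` should give the complete polypeptide, ending just before the terminal TAG stop codon.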
Homology
BLAST of Sgr019020 vs. NCBI nr
Match:
XP_022154939.1 (protein KOKOPELLI isoform X2 [Momordica charantia])
HSP 1 Score: 605.1 bits (1559), Expect = 6.0e-169
Identity = 367/572 (64.16%), Positives = 405/572 (70.80%), Query Frame = 0
Query: 1 MDVDEVYLDLLALRALYILLLKSCLRDANSERLDERAQILLKNLLDDATAGVLELHSKIL 60
M+V+E+YLDLLALR LYILLLKSCLRDANSE LDERAQILLK+LLDDATA +++ HSK
Sbjct: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSK-- 60
Query: 61 ATDSGFFNNFRHKEGSSFLTGIDAKQTKPLDKKVAEWMEHNQSARKMGNLETEDNPRMAR 120
TKP+++KVAEWME+NQS RK G
Sbjct: 61 --------------------------TKPVEEKVAEWMEYNQSTRKTG------------ 120
Query: 121 SSALNVATNHLSNGISLALRRIELHILSLQRCTSQSRRNTRSHINGAKLANYLQGNEILS 180
NVA N LSNGI LALRRIE HILSLQ TSQS RNTRSHINGAKL+ N L
Sbjct: 121 ----NVAANDLSNGIGLALRRIEFHILSLQHYTSQS-RNTRSHINGAKLS-----NSPLD 180
Query: 181 QQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLGGQNVKPVVRGVESLTRACQMNHC 240
QQKVQSR DHS LKAR+ EPI G HC
Sbjct: 181 QQKVQSRMDHSNLKARVAEPING-----------------------------------HC 240
Query: 241 SEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINPMILIDKSGCSVGSKATVRSGRKL 300
SEFVHGFR+PLSQDN E KPP V TQ+SK++K+INP+ILIDKS CSVGSKATVRS
Sbjct: 241 SEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS---- 300
Query: 301 LNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHKKTHVATQQESENTNSESESA 360
+N+ +I ERRCQN PG MIMRPTLL NH KT + TQQESE TNSESES
Sbjct: 301 VNRTQIHERRCQNLPGHMIMRPTLL------------NHMKTRMPTQQESEFTNSESESV 360
Query: 361 SSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASSRYRSSSISTKTFRFSHGKKGSKK 420
SSSSW TQQTSE+ETTDYPSS +HQ+ PATGSE SSRYRSS IS+K FR SHGKKGSKK
Sbjct: 361 SSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKK 420
Query: 421 AIGRFKRLKNKLGLIFHHHHHHHHHHNTNT----FMWKHLRKIFHLHRTDNKKLTSEGGY 480
AIGRFKRL+NKLGLIFHHHHHHHHHH+ N+ FMWK LRKIF H TD K++TS+G +
Sbjct: 421 AIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIF--HGTDKKRVTSKGRH 466
Query: 481 GKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELR--RLGGGRKKGVKKLQW 540
LKK+AIRSVSRKNQVG+FQALAEGLRSHVWK AMKKKELR RLG KKGVKKL W
Sbjct: 481 ETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLG---KKGVKKLHW 466
Query: 541 WQMFRRRRGVKLPKKGRVKIGYVNRKPQLKVV 567
W+MF RRRGVKLP KGRVKIGYVNRKPQ K+V
Sbjct: 541 WRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 466
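The identity and positives percentages reported for an HSP are the match counts divided by the alignment length (which includes gap columns). A minimal sketch of that arithmetic, using the counts from the HSP above; the function name `percent` is illustrative only.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Return matches as a percentage of the alignment length, to two decimals."""
    return f"{100 * matches / aligned_length:.2f}"


# Counts taken from the HSP against XP_022154939.1
print(percent(367, 572))  # identity: 64.16
print(percent(405, 572))  # positives: 70.80
```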
BLAST of Sgr019020 vs. NCBI nr
Match:
XP_022154937.1 (protein KOKOPELLI isoform X1 [Momordica charantia] >XP_022154938.1 protein KOKOPELLI isoform X1 [Momordica charantia])
HSP 1 Score: 601.7 bits (1550), Expect = 6.7e-168
Identity = 367/573 (64.05%), Positives = 406/573 (70.86%), Query Frame = 0
Query: 1 MDVDEVYLDLLALRALYILLLKSCLRDANSE-RLDERAQILLKNLLDDATAGVLELHSKI 60
M+V+E+YLDLLALR LYILLLKSCLRDANSE +LDERAQILLK+LLDDATA +++ HSK
Sbjct: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELQLDERAQILLKHLLDDATAEIVQFHSK- 60
Query: 61 LATDSGFFNNFRHKEGSSFLTGIDAKQTKPLDKKVAEWMEHNQSARKMGNLETEDNPRMA 120
TKP+++KVAEWME+NQS RK G
Sbjct: 61 ---------------------------TKPVEEKVAEWMEYNQSTRKTG----------- 120
Query: 121 RSSALNVATNHLSNGISLALRRIELHILSLQRCTSQSRRNTRSHINGAKLANYLQGNEIL 180
NVA N LSNGI LALRRIE HILSLQ TSQS RNTRSHINGAKL+ N L
Sbjct: 121 -----NVAANDLSNGIGLALRRIEFHILSLQHYTSQS-RNTRSHINGAKLS-----NSPL 180
Query: 181 SQQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLGGQNVKPVVRGVESLTRACQMNH 240
QQKVQSR DHS LKAR+ EPI G H
Sbjct: 181 DQQKVQSRMDHSNLKARVAEPING-----------------------------------H 240
Query: 241 CSEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINPMILIDKSGCSVGSKATVRSGRK 300
CSEFVHGFR+PLSQDN E KPP V TQ+SK++K+INP+ILIDKS CSVGSKATVRS
Sbjct: 241 CSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS--- 300
Query: 301 LLNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHKKTHVATQQESENTNSESES 360
+N+ +I ERRCQN PG MIMRPTLL NH KT + TQQESE TNSESES
Sbjct: 301 -VNRTQIHERRCQNLPGHMIMRPTLL------------NHMKTRMPTQQESEFTNSESES 360
Query: 361 ASSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASSRYRSSSISTKTFRFSHGKKGSK 420
SSSSW TQQTSE+ETTDYPSS +HQ+ PATGSE SSRYRSS IS+K FR SHGKKGSK
Sbjct: 361 VSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSK 420
Query: 421 KAIGRFKRLKNKLGLIFHHHHHHHHHHNTNT----FMWKHLRKIFHLHRTDNKKLTSEGG 480
KAIGRFKRL+NKLGLIFHHHHHHHHHH+ N+ FMWK LRKIF H TD K++TS+G
Sbjct: 421 KAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIF--HGTDKKRVTSKGR 467
Query: 481 YGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELR--RLGGGRKKGVKKLQ 540
+ LKK+AIRSVSRKNQVG+FQALAEGLRSHVWK AMKKKELR RLG KKGVKKL
Sbjct: 481 HETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLG---KKGVKKLH 467
Query: 541 WWQMFRRRRGVKLPKKGRVKIGYVNRKPQLKVV 567
WW+MF RRRGVKLP KGRVKIGYVNRKPQ K+V
Sbjct: 541 WWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 467
BLAST of Sgr019020 vs. NCBI nr
Match:
XP_022154940.1 (uncharacterized protein LOC111022084 isoform X3 [Momordica charantia])
HSP 1 Score: 558.9 bits (1439), Expect = 5.0e-155
Identity = 341/543 (62.80%), Positives = 378/543 (69.61%), Query Frame = 0
Query: 30 SERLDERAQILLKNLLDDATAGVLELHSKILATDSGFFNNFRHKEGSSFLTGIDAKQTKP 89
S++LDERAQILLK+LLDDATA +++ HSK TKP
Sbjct: 11 SKQLDERAQILLKHLLDDATAEIVQFHSK----------------------------TKP 70
Query: 90 LDKKVAEWMEHNQSARKMGNLETEDNPRMARSSALNVATNHLSNGISLALRRIELHILSL 149
+++KVAEWME+NQS RK G NVA N LSNGI LALRRIE HILSL
Sbjct: 71 VEEKVAEWMEYNQSTRKTG----------------NVAANDLSNGIGLALRRIEFHILSL 130
Query: 150 QRCTSQSRRNTRSHINGAKLANYLQGNEILSQQKVQSRTDHSTLKARITEPIRGSHNLRS 209
Q TSQS RNTRSHINGAKL+ N L QQKVQSR DHS LKAR+ EPI G
Sbjct: 131 QHYTSQS-RNTRSHINGAKLS-----NSPLDQQKVQSRMDHSNLKARVAEPING------ 190
Query: 210 HISRHLLGGQNVKPVVRGVESLTRACQMNHCSEFVHGFRIPLSQDNDEVRKPPTVETQIS 269
HCSEFVHGFR+PLSQDN E KPP V TQ+S
Sbjct: 191 -----------------------------HCSEFVHGFRVPLSQDNVEAMKPPNVGTQVS 250
Query: 270 KEHKLINPMILIDKSGCSVGSKATVRSGRKLLNQPRIQERRCQNSPGRMIMRPTLLDHIS 329
K++K+INP+ILIDKS CSVGSKATVRS +N+ +I ERRCQN PG MIMRPTLL
Sbjct: 251 KQNKVINPVILIDKSRCSVGSKATVRS----VNRTQIHERRCQNLPGHMIMRPTLL---- 310
Query: 330 RGVEREKENHKKTHVATQQESENTNSESESASSSSWETQQTSESETTDYPSSPTHQKGPP 389
NH KT + TQQESE TNSESES SSSSW TQQTSE+ETTDYPSS +HQ+ P
Sbjct: 311 --------NHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQP 370
Query: 390 ATGSEASSRYRSSSISTKTFRFSHGKKGSKKAIGRFKRLKNKLGLIFHHHHHHHHHHNTN 449
ATGSE SSRYRSS IS+K FR SHGKKGSKKAIGRFKRL+NKLGLIFHHHHHHHHHH+ N
Sbjct: 371 ATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHN 430
Query: 450 T----FMWKHLRKIFHLHRTDNKKLTSEGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRS 509
+ FMWK LRKIF H TD K++TS+G + LKK+AIRSVSRKNQVG+FQALAEGLRS
Sbjct: 431 SHNNFFMWKQLRKIF--HGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRS 447
Query: 510 HVWKSKAMKKKELR--RLGGGRKKGVKKLQWWQMFRRRRGVKLPKKGRVKIGYVNRKPQL 567
HVWK AMKKKELR RLG KKGVKKL WW+MF RRRGVKLP KGRVKIGYVNRKPQ
Sbjct: 491 HVWKPTAMKKKELRKPRLG---KKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQH 447
BLAST of Sgr019020 vs. NCBI nr
Match:
XP_038877121.1 (protein KOKOPELLI-like isoform X1 [Benincasa hispida])
HSP 1 Score: 536.2 bits (1380), Expect = 3.4e-148
Identity = 345/577 (59.79%), Positives = 400/577 (69.32%), Query Frame = 0
Query: 1 MDVDEVYLDLLALRALYILLLKSCLRDANSERLDERAQILLKNLLDDATAGVLELHSKIL 60
MDVD++YLDLLALR LYILLLKSCL DANSE LDERAQILLK+LLDDATAGVLE S L
Sbjct: 31 MDVDKLYLDLLALRELYILLLKSCLGDANSELLDERAQILLKHLLDDATAGVLEFLSNDL 90
Query: 61 ATDSGFFNNFRHKEGSSFLTGIDAKQTKPLDKKVAEWMEHNQSARKMGNLETEDNPRMAR 120
AT+S F+NF HK D KQ KPL KV EWM+HNQ+ RKMGN E D R
Sbjct: 91 ATNSNIFDNFLHK---------DDKQVKPLADKVPEWMKHNQTRRKMGNPEIRD-----R 150
Query: 121 SSALNVATNHLSNGISLALRRIELHILSLQRCTSQSRRNTRSHINGAKLANYLQGNEILS 180
+SA NVA N+LS+ IS ALRRIELHILSLQ CTSQ RR TR H + LQ NE L+
Sbjct: 151 ASASNVAINNLSHSISSALRRIELHILSLQHCTSQ-RRKTRCH-----WQSVLQWNESLN 210
Query: 181 QQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLGGQ-NVKPVVRGVESLTRACQMNH 240
QQ V RT STL++R T+PI+G H +G Q VKP NH
Sbjct: 211 QQNVHPRTGPSTLRSRFTKPIKG--------RGHFVGEQKKVKPKT-----------ANH 270
Query: 241 CSEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINPMILIDKSG-CSVGSKATVRSGR 300
CSE+VHGFRIPLSQ NDE KP T+ET I+K+HK++NPM LIDKSG SVGSKAT R
Sbjct: 271 CSEYVHGFRIPLSQTNDEAMKPLTIETHITKQHKVVNPMTLIDKSGYTSVGSKATFRPAM 330
Query: 301 KLLNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHKKTHV-ATQQESENTNSE- 360
KL + Q +R QNS G+M+M PTLLDH R + + KTH+ ATQQESE T+SE
Sbjct: 331 KLNQTSKQQAKRNQNSYGQMVMGPTLLDHHPSKETRNERINSKTHLAATQQESEFTSSEF 390
Query: 361 -SESASSSSWETQQTSESETT-----DYPSSPTHQKGPPATGSEASSRYRSSSISTKTFR 420
S S+SSSSW TQ+TS SET PSSP+HQ P +T S++SS TKTF
Sbjct: 391 QSASSSSSSWTTQETSVSETVANDGDSNPSSPSHQDDPLSTDSKSSS-------LTKTFY 450
Query: 421 FSHGKKGSKKAIGRFKRLKNKLGLIF-HHHHHHHHHHNTNTFMWK-HLRKIFHLHRTDNK 480
GK SKK +GRFKRLKNKLG++F HHHHHHHHHHN+N FMWK LRKIF H DNK
Sbjct: 451 IKQGKTESKKVLGRFKRLKNKLGVVFHHHHHHHHHHHNSNNFMWKQQLRKIF--HSRDNK 510
Query: 481 KL--TSEGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELRRLGGGRK 540
+L + E G K+KK AIR+V KNQVGKFQALAEGLRSHVW+SKAMK+K ++ + G K
Sbjct: 511 RLLVSKEDGNEKVKKRAIRNVCYKNQVGKFQALAEGLRSHVWRSKAMKRKGVKGMKCG-K 558
Query: 541 KGVKKLQWWQMFRRRRGVKLPKKGRVKIGYVNRKPQL 564
KGVKKL WW+MFR RRGV+LP KG +KIGYVN+K +L
Sbjct: 571 KGVKKLHWWKMFRNRRGVRLPNKGHMKIGYVNKKAKL 558
BLAST of Sgr019020 vs. NCBI nr
Match:
XP_038877123.1 (protein KOKOPELLI-like isoform X3 [Benincasa hispida] >XP_038877124.1 protein KOKOPELLI-like isoform X3 [Benincasa hispida])
HSP 1 Score: 536.2 bits (1380), Expect = 3.4e-148
Identity = 345/577 (59.79%), Positives = 400/577 (69.32%), Query Frame = 0
Query: 1 MDVDEVYLDLLALRALYILLLKSCLRDANSERLDERAQILLKNLLDDATAGVLELHSKIL 60
MDVD++YLDLLALR LYILLLKSCL DANSE LDERAQILLK+LLDDATAGVLE S L
Sbjct: 1 MDVDKLYLDLLALRELYILLLKSCLGDANSELLDERAQILLKHLLDDATAGVLEFLSNDL 60
Query: 61 ATDSGFFNNFRHKEGSSFLTGIDAKQTKPLDKKVAEWMEHNQSARKMGNLETEDNPRMAR 120
AT+S F+NF HK D KQ KPL KV EWM+HNQ+ RKMGN E D R
Sbjct: 61 ATNSNIFDNFLHK---------DDKQVKPLADKVPEWMKHNQTRRKMGNPEIRD-----R 120
Query: 121 SSALNVATNHLSNGISLALRRIELHILSLQRCTSQSRRNTRSHINGAKLANYLQGNEILS 180
+SA NVA N+LS+ IS ALRRIELHILSLQ CTSQ RR TR H + LQ NE L+
Sbjct: 121 ASASNVAINNLSHSISSALRRIELHILSLQHCTSQ-RRKTRCH-----WQSVLQWNESLN 180
Query: 181 QQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLGGQ-NVKPVVRGVESLTRACQMNH 240
QQ V RT STL++R T+PI+G H +G Q VKP NH
Sbjct: 181 QQNVHPRTGPSTLRSRFTKPIKG--------RGHFVGEQKKVKPKT-----------ANH 240
Query: 241 CSEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINPMILIDKSG-CSVGSKATVRSGR 300
CSE+VHGFRIPLSQ NDE KP T+ET I+K+HK++NPM LIDKSG SVGSKAT R
Sbjct: 241 CSEYVHGFRIPLSQTNDEAMKPLTIETHITKQHKVVNPMTLIDKSGYTSVGSKATFRPAM 300
Query: 301 KLLNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHKKTHV-ATQQESENTNSE- 360
KL + Q +R QNS G+M+M PTLLDH R + + KTH+ ATQQESE T+SE
Sbjct: 301 KLNQTSKQQAKRNQNSYGQMVMGPTLLDHHPSKETRNERINSKTHLAATQQESEFTSSEF 360
Query: 361 -SESASSSSWETQQTSESETT-----DYPSSPTHQKGPPATGSEASSRYRSSSISTKTFR 420
S S+SSSSW TQ+TS SET PSSP+HQ P +T S++SS TKTF
Sbjct: 361 QSASSSSSSWTTQETSVSETVANDGDSNPSSPSHQDDPLSTDSKSSS-------LTKTFY 420
Query: 421 FSHGKKGSKKAIGRFKRLKNKLGLIF-HHHHHHHHHHNTNTFMWK-HLRKIFHLHRTDNK 480
GK SKK +GRFKRLKNKLG++F HHHHHHHHHHN+N FMWK LRKIF H DNK
Sbjct: 421 IKQGKTESKKVLGRFKRLKNKLGVVFHHHHHHHHHHHNSNNFMWKQQLRKIF--HSRDNK 480
Query: 481 KL--TSEGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELRRLGGGRK 540
+L + E G K+KK AIR+V KNQVGKFQALAEGLRSHVW+SKAMK+K ++ + G K
Sbjct: 481 RLLVSKEDGNEKVKKRAIRNVCYKNQVGKFQALAEGLRSHVWRSKAMKRKGVKGMKCG-K 528
Query: 541 KGVKKLQWWQMFRRRRGVKLPKKGRVKIGYVNRKPQL 564
KGVKKL WW+MFR RRGV+LP KG +KIGYVN+K +L
Sbjct: 541 KGVKKLHWWKMFRNRRGVRLPNKGHMKIGYVNKKAKL 528
BLAST of Sgr019020 vs. ExPASy Swiss-Prot
Match:
Q9FFP2 (Protein KOKOPELLI OS=Arabidopsis thaliana OX=3702 GN=KPL PE=2 SV=1)
HSP 1 Score: 87.4 bits (215), Expect = 5.5e-16
Identity = 88/269 (32.71%), Positives = 131/269 (48.70%), Query Frame = 0
Query: 303 QPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHK--KTHVATQQESENT------- 362
+P R Q P IM+PTL+D + + + + +T AT ESE+
Sbjct: 235 KPNQSNRASQKMP---IMKPTLMDQETETFDDDSSETEADQTPSATGSESEDEEVSTSQE 294
Query: 363 -NSESESASSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASSRYRSSSISTKTFRFS 422
+ E+ S+S S WETQ +++E S + PP S S + +
Sbjct: 295 YSGETGSSSGSEWETQAENDTE------SKSESSYPPQNDDSVSEVSTSPPHTDRDTSRE 354
Query: 423 HGKKGSKKAIGRFKRLKNKLGLIFHHHHHHHHHHN----TNTFMWKHLRKIFHLHRTDNK 482
GK+ + +GRFKR+KNK+G IFHHHHHHHHHH+ W L+ FH H+ K
Sbjct: 355 PGKQ-RRNVMGRFKRIKNKIGQIFHHHHHHHHHHHHHDKEKPSAWNKLQSKFH-HKHQEK 414
Query: 483 KLTSEGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELRRLGGGRKKG 542
+ E + + + +++Q G F AL EGL H SK K + K
Sbjct: 415 --SKERKRPMSESKGLTTHKQQHQGGHFHALVEGLVRHRKHSKKQKHQ--------LKSD 474
Query: 543 VKKLQWWQMFRRRR--GVKLPKKGRVKIG 556
KK +WW++ ++R+ GVK+PK+GRVK+G
Sbjct: 475 AKKTEWWKLLKKRQGGGVKIPKRGRVKLG 482
BLAST of Sgr019020 vs. ExPASy TrEMBL
Match:
A0A6J1DNR3 (protein KOKOPELLI isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4 SV=1)
HSP 1 Score: 605.1 bits (1559), Expect = 2.9e-169
Identity = 367/572 (64.16%), Positives = 405/572 (70.80%), Query Frame = 0
Query: 1 MDVDEVYLDLLALRALYILLLKSCLRDANSERLDERAQILLKNLLDDATAGVLELHSKIL 60
M+V+E+YLDLLALR LYILLLKSCLRDANSE LDERAQILLK+LLDDATA +++ HSK
Sbjct: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSK-- 60
Query: 61 ATDSGFFNNFRHKEGSSFLTGIDAKQTKPLDKKVAEWMEHNQSARKMGNLETEDNPRMAR 120
TKP+++KVAEWME+NQS RK G
Sbjct: 61 --------------------------TKPVEEKVAEWMEYNQSTRKTG------------ 120
Query: 121 SSALNVATNHLSNGISLALRRIELHILSLQRCTSQSRRNTRSHINGAKLANYLQGNEILS 180
NVA N LSNGI LALRRIE HILSLQ TSQS RNTRSHINGAKL+ N L
Sbjct: 121 ----NVAANDLSNGIGLALRRIEFHILSLQHYTSQS-RNTRSHINGAKLS-----NSPLD 180
Query: 181 QQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLGGQNVKPVVRGVESLTRACQMNHC 240
QQKVQSR DHS LKAR+ EPI G HC
Sbjct: 181 QQKVQSRMDHSNLKARVAEPING-----------------------------------HC 240
Query: 241 SEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINPMILIDKSGCSVGSKATVRSGRKL 300
SEFVHGFR+PLSQDN E KPP V TQ+SK++K+INP+ILIDKS CSVGSKATVRS
Sbjct: 241 SEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS---- 300
Query: 301 LNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHKKTHVATQQESENTNSESESA 360
+N+ +I ERRCQN PG MIMRPTLL NH KT + TQQESE TNSESES
Sbjct: 301 VNRTQIHERRCQNLPGHMIMRPTLL------------NHMKTRMPTQQESEFTNSESESV 360
Query: 361 SSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASSRYRSSSISTKTFRFSHGKKGSKK 420
SSSSW TQQTSE+ETTDYPSS +HQ+ PATGSE SSRYRSS IS+K FR SHGKKGSKK
Sbjct: 361 SSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKK 420
Query: 421 AIGRFKRLKNKLGLIFHHHHHHHHHHNTNT----FMWKHLRKIFHLHRTDNKKLTSEGGY 480
AIGRFKRL+NKLGLIFHHHHHHHHHH+ N+ FMWK LRKIF H TD K++TS+G +
Sbjct: 421 AIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIF--HGTDKKRVTSKGRH 466
Query: 481 GKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELR--RLGGGRKKGVKKLQW 540
LKK+AIRSVSRKNQVG+FQALAEGLRSHVWK AMKKKELR RLG KKGVKKL W
Sbjct: 481 ETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLG---KKGVKKLHW 466
Query: 541 WQMFRRRRGVKLPKKGRVKIGYVNRKPQLKVV 567
W+MF RRRGVKLP KGRVKIGYVNRKPQ K+V
Sbjct: 541 WRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 466
BLAST of Sgr019020 vs. ExPASy TrEMBL
Match:
A0A6J1DLN1 (protein KOKOPELLI isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4 SV=1)
HSP 1 Score: 601.7 bits (1550), Expect = 3.2e-168
Identity = 367/573 (64.05%), Positives = 406/573 (70.86%), Query Frame = 0
Query: 1 MDVDEVYLDLLALRALYILLLKSCLRDANSE-RLDERAQILLKNLLDDATAGVLELHSKI 60
M+V+E+YLDLLALR LYILLLKSCLRDANSE +LDERAQILLK+LLDDATA +++ HSK
Sbjct: 1 MEVNELYLDLLALRELYILLLKSCLRDANSELQLDERAQILLKHLLDDATAEIVQFHSK- 60
Query: 61 LATDSGFFNNFRHKEGSSFLTGIDAKQTKPLDKKVAEWMEHNQSARKMGNLETEDNPRMA 120
TKP+++KVAEWME+NQS RK G
Sbjct: 61 ---------------------------TKPVEEKVAEWMEYNQSTRKTG----------- 120
Query: 121 RSSALNVATNHLSNGISLALRRIELHILSLQRCTSQSRRNTRSHINGAKLANYLQGNEIL 180
NVA N LSNGI LALRRIE HILSLQ TSQS RNTRSHINGAKL+ N L
Sbjct: 121 -----NVAANDLSNGIGLALRRIEFHILSLQHYTSQS-RNTRSHINGAKLS-----NSPL 180
Query: 181 SQQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLGGQNVKPVVRGVESLTRACQMNH 240
QQKVQSR DHS LKAR+ EPI G H
Sbjct: 181 DQQKVQSRMDHSNLKARVAEPING-----------------------------------H 240
Query: 241 CSEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINPMILIDKSGCSVGSKATVRSGRK 300
CSEFVHGFR+PLSQDN E KPP V TQ+SK++K+INP+ILIDKS CSVGSKATVRS
Sbjct: 241 CSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS--- 300
Query: 301 LLNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHKKTHVATQQESENTNSESES 360
+N+ +I ERRCQN PG MIMRPTLL NH KT + TQQESE TNSESES
Sbjct: 301 -VNRTQIHERRCQNLPGHMIMRPTLL------------NHMKTRMPTQQESEFTNSESES 360
Query: 361 ASSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASSRYRSSSISTKTFRFSHGKKGSK 420
SSSSW TQQTSE+ETTDYPSS +HQ+ PATGSE SSRYRSS IS+K FR SHGKKGSK
Sbjct: 361 VSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSK 420
Query: 421 KAIGRFKRLKNKLGLIFHHHHHHHHHHNTNT----FMWKHLRKIFHLHRTDNKKLTSEGG 480
KAIGRFKRL+NKLGLIFHHHHHHHHHH+ N+ FMWK LRKIF H TD K++TS+G
Sbjct: 421 KAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIF--HGTDKKRVTSKGR 467
Query: 481 YGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELR--RLGGGRKKGVKKLQ 540
+ LKK+AIRSVSRKNQVG+FQALAEGLRSHVWK AMKKKELR RLG KKGVKKL
Sbjct: 481 HETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLG---KKGVKKLH 467
Query: 541 WWQMFRRRRGVKLPKKGRVKIGYVNRKPQLKVV 567
WW+MF RRRGVKLP KGRVKIGYVNRKPQ K+V
Sbjct: 541 WWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 467
BLAST of Sgr019020 vs. ExPASy TrEMBL
Match:
A0A6J1DL21 (uncharacterized protein LOC111022084 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4 SV=1)
HSP 1 Score: 558.9 bits (1439), Expect = 2.4e-155
Identity = 341/543 (62.80%), Positives = 378/543 (69.61%), Query Frame = 0
Query: 30 SERLDERAQILLKNLLDDATAGVLELHSKILATDSGFFNNFRHKEGSSFLTGIDAKQTKP 89
S++LDERAQILLK+LLDDATA +++ HSK TKP
Sbjct: 11 SKQLDERAQILLKHLLDDATAEIVQFHSK----------------------------TKP 70
Query: 90 LDKKVAEWMEHNQSARKMGNLETEDNPRMARSSALNVATNHLSNGISLALRRIELHILSL 149
+++KVAEWME+NQS RK G NVA N LSNGI LALRRIE HILSL
Sbjct: 71 VEEKVAEWMEYNQSTRKTG----------------NVAANDLSNGIGLALRRIEFHILSL 130
Query: 150 QRCTSQSRRNTRSHINGAKLANYLQGNEILSQQKVQSRTDHSTLKARITEPIRGSHNLRS 209
Q TSQS RNTRSHINGAKL+ N L QQKVQSR DHS LKAR+ EPI G
Sbjct: 131 QHYTSQS-RNTRSHINGAKLS-----NSPLDQQKVQSRMDHSNLKARVAEPING------ 190
Query: 210 HISRHLLGGQNVKPVVRGVESLTRACQMNHCSEFVHGFRIPLSQDNDEVRKPPTVETQIS 269
HCSEFVHGFR+PLSQDN E KPP V TQ+S
Sbjct: 191 -----------------------------HCSEFVHGFRVPLSQDNVEAMKPPNVGTQVS 250
Query: 270 KEHKLINPMILIDKSGCSVGSKATVRSGRKLLNQPRIQERRCQNSPGRMIMRPTLLDHIS 329
K++K+INP+ILIDKS CSVGSKATVRS +N+ +I ERRCQN PG MIMRPTLL
Sbjct: 251 KQNKVINPVILIDKSRCSVGSKATVRS----VNRTQIHERRCQNLPGHMIMRPTLL---- 310
Query: 330 RGVEREKENHKKTHVATQQESENTNSESESASSSSWETQQTSESETTDYPSSPTHQKGPP 389
NH KT + TQQESE TNSESES SSSSW TQQTSE+ETTDYPSS +HQ+ P
Sbjct: 311 --------NHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQP 370
Query: 390 ATGSEASSRYRSSSISTKTFRFSHGKKGSKKAIGRFKRLKNKLGLIFHHHHHHHHHHNTN 449
ATGSE SSRYRSS IS+K FR SHGKKGSKKAIGRFKRL+NKLGLIFHHHHHHHHHH+ N
Sbjct: 371 ATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHN 430
Query: 450 T----FMWKHLRKIFHLHRTDNKKLTSEGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRS 509
+ FMWK LRKIF H TD K++TS+G + LKK+AIRSVSRKNQVG+FQALAEGLRS
Sbjct: 431 SHNNFFMWKQLRKIF--HGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRS 447
Query: 510 HVWKSKAMKKKELR--RLGGGRKKGVKKLQWWQMFRRRRGVKLPKKGRVKIGYVNRKPQL 567
HVWK AMKKKELR RLG KKGVKKL WW+MF RRRGVKLP KGRVKIGYVNRKPQ
Sbjct: 491 HVWKPTAMKKKELRKPRLG---KKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQH 447
BLAST of Sgr019020 vs. ExPASy TrEMBL
Match:
A0A6J1DQ76 (protein KOKOPELLI isoform X4 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4 SV=1)
HSP 1 Score: 513.8 bits (1322), Expect = 8.9e-142
Identity = 312/475 (65.68%), Positives = 340/475 (71.58%), Query Frame = 0
Query: 98 MEHNQSARKMGNLETEDNPRMARSSALNVATNHLSNGISLALRRIELHILSLQRCTSQSR 157
ME+NQS RK G NVA N LSNGI LALRRIE HILSLQ TSQS
Sbjct: 1 MEYNQSTRKTG----------------NVAANDLSNGIGLALRRIEFHILSLQHYTSQS- 60
Query: 158 RNTRSHINGAKLANYLQGNEILSQQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLG 217
RNTRSHINGAKL+ N L QQKVQSR DHS LKAR+ EPI G
Sbjct: 61 RNTRSHINGAKLS-----NSPLDQQKVQSRMDHSNLKARVAEPING-------------- 120
Query: 218 GQNVKPVVRGVESLTRACQMNHCSEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINP 277
HCSEFVHGFR+PLSQDN E KPP V TQ+SK++K+INP
Sbjct: 121 ---------------------HCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINP 180
Query: 278 MILIDKSGCSVGSKATVRSGRKLLNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKE 337
+ILIDKS CSVGSKATVRS +N+ +I ERRCQN PG MIMRPTLL
Sbjct: 181 VILIDKSRCSVGSKATVRS----VNRTQIHERRCQNLPGHMIMRPTLL------------ 240
Query: 338 NHKKTHVATQQESENTNSESESASSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASS 397
NH KT + TQQESE TNSESES SSSSW TQQTSE+ETTDYPSS +HQ+ PATGSE SS
Sbjct: 241 NHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSS 300
Query: 398 RYRSSSISTKTFRFSHGKKGSKKAIGRFKRLKNKLGLIFHHHHHHHHHHNTNT----FMW 457
RYRSS IS+K FR SHGKKGSKKAIGRFKRL+NKLGLIFHHHHHHHHHH+ N+ FMW
Sbjct: 301 RYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMW 360
Query: 458 KHLRKIFHLHRTDNKKLTSEGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAM 517
K LRKIF H TD K++TS+G + LKK+AIRSVSRKNQVG+FQALAEGLRSHVWK AM
Sbjct: 361 KQLRKIF--HGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAM 397
Query: 518 KKKELR--RLGGGRKKGVKKLQWWQMFRRRRGVKLPKKGRVKIGYVNRKPQLKVV 567
KKKELR RLG KKGVKKL WW+MF RRRGVKLP KGRVKIGYVNRKPQ K+V
Sbjct: 421 KKKELRKPRLG---KKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV 397
BLAST of Sgr019020 vs. ExPASy TrEMBL
Match:
A0A6J1K5J4 (uncharacterized protein LOC111491355 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111491355 PE=4 SV=1)
HSP 1 Score: 496.1 bits (1276), Expect = 1.9e-136
Identity = 327/579 (56.48%), Positives = 379/579 (65.46%), Query Frame = 0
Query: 1 MDVDEVYLDLLALRALYILLLKSCLRDANSER-LDERAQILLKNLLDDATAGVLELHSKI 60
M+ DE+YLDLLALR LY LLK CLRDANSE + RA+ILLK+LLDDAT G+LE HSK
Sbjct: 45 MEADELYLDLLALRQLYFFLLKCCLRDANSELVVGARAKILLKHLLDDATTGLLEFHSKT 104
Query: 61 LATDSGFFNNFRHKEGSSFLTGIDAKQTKPLDKKVAEWMEHNQSARKMGNLE-TEDNPRM 120
LA F NF K D KQTKPLD+KVAEWMEHNQ+AR+M N E E PR
Sbjct: 105 LA-----FYNFLRK---------DDKQTKPLDEKVAEWMEHNQTARRMANPEKIEHKPRR 164
Query: 121 ARSSALNVATNHLSNGISLALRRIELHILSLQRCTSQSRRNTRSHINGAKLANY----LQ 180
R+SA NVA N LS+GI+ ALRRIELHILSLQ R TRSHI+ KLA Y Q
Sbjct: 165 DRASASNVAANDLSSGINSALRRIELHILSLQ-------RYTRSHISETKLAYYGQSVNQ 224
Query: 181 GNEILSQQKVQSRTDHSTLKARITEPIRGSHNLRSHISRHLLGGQNVKPVVRGVESLTRA 240
GNE +QQK VKP+V
Sbjct: 225 GNESFNQQK-------------------------------------VKPMV--------- 284
Query: 241 CQMNHCSEFVHGFRIPLSQDNDEVRKPPTVETQISKEHKLINPMILIDKSGCSVGSKATV 300
NHCS+FV+GFRIPL+QD DE K+H+L+ P L+DKSGC GSKAT
Sbjct: 285 --ANHCSKFVNGFRIPLTQDKDEA----------MKQHELVLPPTLMDKSGCPEGSKATA 344
Query: 301 RSGRKLLNQPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHKKTHVATQQESENTN 360
R K LN+ IQE+R +NS GR++M+PTL H SR V +E+ +H + H+A QQESE TN
Sbjct: 345 RRAMK-LNRTWIQEKRSKNSRGRIVMKPTLWHHPSREVRKEQTHHNRRHLAAQQESEFTN 404
Query: 361 SESESASSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASSRY--RSSSISTKTFRFS 420
SES S SS + T QTSESETTD SSP +Q P ATGSEASS+Y SS+I+ K F+FS
Sbjct: 405 SESASCSSPA--TLQTSESETTDDSSSPDNQSSPTATGSEASSQYGNSSSNITRKAFKFS 464
Query: 421 HGKKGSKKAIGRFKRLKNKLGLIFHHH----HHHHHHHNTNTFMWKHLRKIFHLHRTDNK 480
HGKK S A+GRFK L+NKLGLIFHHH HHHHHHH+ + MWK +R +F HRTD K
Sbjct: 465 HGKKESNGAVGRFKSLRNKLGLIFHHHQHHQHHHHHHHHGHNSMWKQVRTVF--HRTDKK 524
Query: 481 KLTS-EGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELRRLGGGRKK 540
+LTS E GKL+K+ IRSVSR NQVGKFQAL EGLRSHVWKSKAMKKKE R L G
Sbjct: 525 ELTSKEEKTGKLRKTTIRSVSRNNQVGKFQALPEGLRSHVWKSKAMKKKEQRGLNCG--- 534
Query: 541 GVKKLQWWQMFRRRRGVKLPKKGRVKIGYVNRKPQLKVV 567
KKL WW+M RRRRGVK P KGRVKIGYVNRKP +K++
Sbjct: 585 --KKLHWWKMIRRRRGVKFPNKGRVKIGYVNRKPDVKLI 534
BLAST of Sgr019020 vs. TAIR 10
Match:
AT5G63720.1 (kokopelli)
HSP 1 Score: 87.4 bits (215), Expect = 3.9e-17
Identity = 88/269 (32.71%), Positives = 131/269 (48.70%), Query Frame = 0
Query: 303 QPRIQERRCQNSPGRMIMRPTLLDHISRGVEREKENHK--KTHVATQQESENT------- 362
+P R Q P IM+PTL+D + + + + +T AT ESE+
Sbjct: 235 KPNQSNRASQKMP---IMKPTLMDQETETFDDDSSETEADQTPSATGSESEDEEVSTSQE 294
Query: 363 -NSESESASSSSWETQQTSESETTDYPSSPTHQKGPPATGSEASSRYRSSSISTKTFRFS 422
+ E+ S+S S WETQ +++E S + PP S S + +
Sbjct: 295 YSGETGSSSGSEWETQAENDTE------SKSESSYPPQNDDSVSEVSTSPPHTDRDTSRE 354
Query: 423 HGKKGSKKAIGRFKRLKNKLGLIFHHHHHHHHHHN----TNTFMWKHLRKIFHLHRTDNK 482
GK+ + +GRFKR+KNK+G IFHHHHHHHHHH+ W L+ FH H+ K
Sbjct: 355 PGKQ-RRNVMGRFKRIKNKIGQIFHHHHHHHHHHHHHDKEKPSAWNKLQSKFH-HKHQEK 414
Query: 483 KLTSEGGYGKLKKSAIRSVSRKNQVGKFQALAEGLRSHVWKSKAMKKKELRRLGGGRKKG 542
+ E + + + +++Q G F AL EGL H SK K + K
Sbjct: 415 --SKERKRPMSESKGLTTHKQQHQGGHFHALVEGLVRHRKHSKKQKHQ--------LKSD 474
Query: 543 VKKLQWWQMFRRRR--GVKLPKKGRVKIG 556
KK +WW++ ++R+ GVK+PK+GRVK+G
Sbjct: 475 AKKTEWWKLLKKRQGGGVKIPKRGRVKLG 482
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022154939.1 | 6.0e-169 | 64.16 | protein KOKOPELLI isoform X2 [Momordica charantia]
XP_022154937.1 | 6.7e-168 | 64.05 | protein KOKOPELLI isoform X1 [Momordica charantia] >XP_022154938.1 protein KOKOP...
XP_022154940.1 | 5.0e-155 | 62.80 | uncharacterized protein LOC111022084 isoform X3 [Momordica charantia]
XP_038877121.1 | 3.4e-148 | 59.79 | protein KOKOPELLI-like isoform X1 [Benincasa hispida]
XP_038877123.1 | 3.4e-148 | 59.79 | protein KOKOPELLI-like isoform X3 [Benincasa hispida] >XP_038877124.1 protein KO...
ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9FFP2 | 5.5e-16 | 32.71 | Protein KOKOPELLI OS=Arabidopsis thaliana OX=3702 GN=KPL PE=2 SV=1
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DNR3 | 2.9e-169 | 64.16 | protein KOKOPELLI isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4...
A0A6J1DLN1 | 3.2e-168 | 64.05 | protein KOKOPELLI isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4...
A0A6J1DL21 | 2.4e-155 | 62.80 | uncharacterized protein LOC111022084 isoform X3 OS=Momordica charantia OX=3673 G...
A0A6J1DQ76 | 8.9e-142 | 65.68 | protein KOKOPELLI isoform X4 OS=Momordica charantia OX=3673 GN=LOC111022084 PE=4...
A0A6J1K5J4 | 1.9e-136 | 56.48 | uncharacterized protein LOC111491355 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...