Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
ATTGTAATTTGTATGAAGAACAAAAGAAGAGGAAAAAGGGGTCCAAAGATTTGCGGAGAAGGTGAGTGCGTGTGCGACAAGGCGAAGTCGAGGCGGTCGTTGCGATCCCTAAGAACAAGAAGAGGAAAAAAAACTTGGAGAGGAGCTTTTCAATCTCTCTCTCTTGAATCCCGTTTGTGTTTTCAAATCCTCTTTTTTTATGAAGGACGGGGATGAGTCTCCACAATTCAAATCCATTCTGTAAAATGGAACTTTGTGTTCTCTCTACAAACTTTCCGACTCCTCTCCGATCCTAGGGCTTTTCCCCCGCCTTCCTCTCCTTTTCCAGGTACTTTCCCTTCTGGGATTTTTCTTTTGTTCGATTTTGAATGGGGGATATGGAGAATTCCTGTTCTGGGGTTGTTTGTTTTTGGTTTTTAGAGGTTGGGTGTGCTCGTAATTTGAGTATCTTTGATGGGTTGTGTTGGGATTTCTTTATTTTCGTTTAATTTTGAGTTTTTTGTATGATTTTGTTGCTGGGTTATCTTTGTTTTGCAACATTTCTCATGATTAGTTGGAGGTTGAGCTTTGTGTGTGTGTGTTTTTCTTCTTTCGTACGATTTATCTGTTTTGAATTTTGTTTGTTGCTTCGTTTTTGGTGTAATTTGTTTGTGGAAATTTGGAATAGCTAGTTTTTGTGATGTTTCTTATGACTGACCATCCATGGTTTTTTATTTGGATTCTGTTGTGCTTTATATTACCATGGTTTTGGTTTATGCTTCATTTTTTGAGATTTGGCTGGATGGAGCTTTTAGCTTTCAGCAATGTTTTGTTTTAAATTTGTGCAATTTGATGGAGGTTTTGGGGATTATTGTATATTCACTGGGGTTGCTGCACCTTTTGAAGGCAGAATCATGCCAACGTTTACTACGATTGCGTTGGATAGGTTGTTAGAACCTGGAACTTCCAAATCTGTCGATAAGCCCCTTCCTAAAGTGAAGCCTGCTTTGACCTCTAATCGTGCGCCCACCACGAAGTTGGAGAGGAGAAATAGCGCATCACTTGCTGATAGAAAGGTTCAGCGGCCTCAAATTAAGCCAGAACTATATACCACTCCAGAGGCAACTCCTCTCCCCGACTCACCGCCTTCGTTTTCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGCCCTCGTCTGTTGAAGAGTTTCTCTCAGGATGGTGTCTTCTTTAGTCAAAAGACAAATGATAAGGATACAGGAAATGGAAGCGCGAATGGTTCAGATGGCAAAGTTGTAAAATTGACTGAGGGTGCTTCTGTTACTGTTGACATACCTATCCCAAACAAAGATGGACACAGAAATGGTCTAGACCATGTTAGTAATACTAATGTTGCTCAAAATGGGATTGTTGATGGTGATCATGGTGCTGTTGGGAGTAATCTTAGTAATCATGAAAGTAGTACAATGTTGAACAATGGTGTTGCTCTGGATAAGGATTCATTGAAGGTTGTTGTGACAAATTTACAAAGTGTTGGAGATACTGATGACTTCTATGACCCACAGGATTTTTTGAGTGCCAAGAGTAACACAGATGGAGAAGATAACGGATTTGAACGTTCAGCCAAGACTTATACTCCTATGGGGGTATTTTATGATGCTTGGGAAGGTAAACATTATTCTATGCATTGAAATTGTGTTCTTTGACTTCTCTTGTGCCTAGTTATTCAATTTAGTTTTTCGTAGTTGGTACTTCATTTTTCATTTTTAAATAACATTTTCACCTTTTCTTTTCTCTTTCAATTTTTTTGTATTCAAAGTATAAATGGATCAGAAATTGCATAACCTATGCATCTTTTGTTGTTAGGTTGGATATGTTATAATGAATAGTTGCGACCTGAGTGGAAAGCTCATGGTCCATTTGATAAACATTTGGTTTTTGAAATTTAAACTTATTTCTACTCAAATTTCCTACCAAGTGTTCTATCTTTCCTACAATGTATCCATCTTTCCTTTGGA
AAGTATGCAAATACTAACCAAATTTCAAAAACAAAAATTTCGAAAAAACATATGGTAACCAAATTTCCAAAAGCTATTTTTTCTCTTCCAATTATTTGGATTTTCTCTAATAATATAGGTAAGATGCATATATTCAAGTAAGAAAAAACATATGGTAAAGCAGTTGTTTTAGGCTTAAATTTCGAAAACAAAAAACTAAAAAAAAAATGGTTATCAAACGACCTTAGATATTGGATTGGATGTTCAATTCTATTTTAATCTATCAGTTAATGTTCTCAATAACTTCCCTAAACTATGATTCTCAACTTGAAGTGACCTTTAATACAATATAACGTTATTTTTTTTCTGCAGAGCTTTCGTCTGAGGGGTTTTCACATCCTTCTTTTCCTGATATTGAAGCTGAGTTGCGTGAAATGAAACTATCCCTATTGATGGAACTTGAGAAACGAAAGCTTTCTGAGGAAGCACTGAAAAAATTGCAGGGTCAGTGGCAGAGGCTTAGTGAACAGCTACTGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCATGGAAGGGAAACAGTTAGCTTCTGACCCTGTTGAAGAATTGTGTCAACAAGTTAATCTTGCTAGGTTTATATCGGGTTCAATTGGAAGGGGTATAGTGAGGGCCGAAGTGGAGACTGAGATGGAGGCACAGCTTGAAGTGAAGAATTTTGAGATTGCTCGATTGTTGGACCGGCTTCGTTACTATGAGGCTGTGAATCATGAAATGTCTCAGAGGAATCAAGAAGCTGTAGGTAAGGACCTGTTAAACCTTGACTAGTATTTGGATCACTCAATGTTGATGATTTTGATCTTCCCCTTCTCTTTCATTTTTCTTGAGCGTGGTGGGTGTTGAGGGCGTTTCTTATATTGTGTGTTTAATTTCTGTCCTGTATCAGTTTCATTCTTTGTACCCCGGATTTCTTTTTACTCATTTGCTAGTTGCCACTGCATCAGATTTTTATCACTTCTTTCTAATCTTTAGAGGTAAACAAATTATGATGGGTTGGTGTAGGGGTCGAGACTCAAGAGGGCTTTGAGGAAACTAAGAGGTTATGAAGTTCAATCCATAGTAACCACCATCTACTTAGGAATTAATTTTTTACGAGTTTCCTTATCAACCAAATTTTTATTGTCACATGAGAATAGTGAAGGTGTGCATAAGCTTGTCCAATGTCCGGAAACTCATCGCTATTAAAATATGTATAGAGAGAGAGAGGGAAACAGTGAGTGGTTAGGTGATTGTAAATTGTAACGACTAGATTATAATTTTTGTGCCTATAAATTAAGGCCTCTGATTAAGAAGGGCGTGATGGGAATACTTATTAGTTATTAGCTAGGAACTTAAGGAGGCGTGGCTTTCCGACAAAGAGGCGTATGGTCGTTATAATTATTAGTTAGGATTTAGGAACATCGGAGGCGTGGTTTTTCAAAGTGTCATTCTTTCATGTGTAGTAAGGGAGGAATGGCTAGGTGAGCCATTGGGTTCGTGTAGTTGACATAATCATGGGTCCAAATCTCAATGGGGTCTTACCCACGTAGAATATGCAACTATTTCAATAGTCTTCAATAGTGGAGCTTATATGATGAATGCAGCAGACAGATGCAATGATCATTAAGGCAAATTTACTCTGATCATTATGAATACTCTCCAATATTTGTTTTTGAGTGTTGCCTCACTGTTCCTGATAAAAAGAAAGTATTTGTATGTATTTCTTTTATGTCCGTGAGTGTTCTGGACAGCTTCCATTCTTATGAGTTATGACATAAATTATATGTAGGTAAATCTTCATTCTTTTTCGTTTTTTTCATGGGTCTGAGTGCCCAGTTTATTGTTCCAATTAGTTATGGACCTTTGGTTGTCAAAAGTGTGCAAGTTGCCTTGCCCCAAGCTCTGTAGCTCGTTTTTTTTACTTGGAGCTCCACTGGGTCAAGGTTTAGAACAGGATTCCTTTGTAACCACTGGTCC
ATAGGCATCCCAGTCCTACCCGACCTGGCCTTGGTGCTTTTTTCTTTCACAAGTTGTCATGTCAAGGATAAAACAACATTCAGACAATGATATAAAATTTTATTTTTTGTATCTTATTCCTGAACCCTATCATCGTCTTCTGTTTCAGTTCTCTGTTGTTGATCATGCTTCTAATCTTTCTCTGTTGTTGATTATGCTTCTAATCTATGTACTATTGGCTAAGCTTTTCCTGTGTTTGTTTGGTTTCCAGATTTGGCGCGACGCGAGAGGGTGAAAAGAAAAAGGAGGCAAAGGTGGATGTGGGGTTCGGTTGCCGCTGTGATCACACTCGGCACTGCAGTCTTAGCTTGGTCGTTCCTTCCACCGGGAAAGGATTCGTCATCCATGGCCACTGATCGTGATGATGGAACAGATGGATGACCACTGACAAAAGAAGTAGCTTATGTACATGTGGTATAATGTTGATATCAAATGTTTGTTGCTTTTGTCAGAAGTATTTGCATGCCCGAAGGTATAGTTCTTAAAAATCTTCGGCCCTTATAATTAAAGGTGAAAGAAAAGGTTGGTTCTTGATATTTCTTTTTCCCTCAGATCTAACAGGGTTATAACACCTGAAATTTTGGTAATTCATGACAAACCATACACTGTCTATTTACTGTGCTAACTGTAGTTGAAATATTGTGCCTGACTTAGAGACGGGCGTGATTGCCCTTGTTTCGAAAAAAAAAAGAACTGTAGTTGAAATATTGCGTTGAGCATTCAAATAATTATTGGCAGAGTGGAGTTGAACTCTCTATGGTAATGCCTGTTTCATCCTTCAACTATTGTATATAGCAC
mRNA sequence
ATTGTAATTTGTATGAAGAACAAAAGAAGAGGAAAAAGGGGTCCAAAGATTTGCGGAGAAGGTGAGTGCGTGTGCGACAAGGCGAAGTCGAGGCGGTCGTTGCGATCCCTAAGAACAAGAAGAGGAAAAAAAACTTGGAGAGGAGCTTTTCAATCTCTCTCTCTTGAATCCCGTTTGTGTTTTCAAATCCTCTTTTTTTATGAAGGACGGGGATGAGTCTCCACAATTCAAATCCATTCTGTAAAATGGAACTTTGTGTTCTCTCTACAAACTTTCCGACTCCTCTCCGATCCTAGGGCTTTTCCCCCGCCTTCCTCTCCTTTTCCAGGCAGAATCATGCCAACGTTTACTACGATTGCGTTGGATAGGTTGTTAGAACCTGGAACTTCCAAATCTGTCGATAAGCCCCTTCCTAAAGTGAAGCCTGCTTTGACCTCTAATCGTGCGCCCACCACGAAGTTGGAGAGGAGAAATAGCGCATCACTTGCTGATAGAAAGGTTCAGCGGCCTCAAATTAAGCCAGAACTATATACCACTCCAGAGGCAACTCCTCTCCCCGACTCACCGCCTTCGTTTTCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGCCCTCGTCTGTTGAAGAGTTTCTCTCAGGATGGTGTCTTCTTTAGTCAAAAGACAAATGATAAGGATACAGGAAATGGAAGCGCGAATGGTTCAGATGGCAAAGTTGTAAAATTGACTGAGGGTGCTTCTGTTACTGTTGACATACCTATCCCAAACAAAGATGGACACAGAAATGGTCTAGACCATGTTAGTAATACTAATGTTGCTCAAAATGGGATTGTTGATGGTGATCATGGTGCTGTTGGGAGTAATCTTAGTAATCATGAAAGTAGTACAATGTTGAACAATGGTGTTGCTCTGGATAAGGATTCATTGAAGGTTGTTGTGACAAATTTACAAAGTGTTGGAGATACTGATGACTTCTATGACCCACAGGATTTTTTGAGTGCCAAGAGTAACACAGATGGAGAAGATAACGGATTTGAACGTTCAGCCAAGACTTATACTCCTATGGGGGTATTTTATGATGCTTGGGAAGAGCTTTCGTCTGAGGGGTTTTCACATCCTTCTTTTCCTGATATTGAAGCTGAGTTGCGTGAAATGAAACTATCCCTATTGATGGAACTTGAGAAACGAAAGCTTTCTGAGGAAGCACTGAAAAAATTGCAGGGTCAGTGGCAGAGGCTTAGTGAACAGCTACTGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCATGGAAGGGAAACAGTTAGCTTCTGACCCTGTTGAAGAATTGTGTCAACAAGTTAATCTTGCTAGGTTTATATCGGGTTCAATTGGAAGGGGTATAGTGAGGGCCGAAGTGGAGACTGAGATGGAGGCACAGCTTGAAGTGAAGAATTTTGAGATTGCTCGATTGTTGGACCGGCTTCGTTACTATGAGGCTGTGAATCATGAAATGTCTCAGAGGAATCAAGAAGCTGTAGATTTGGCGCGACGCGAGAGGGTGAAAAGAAAAAGGAGGCAAAGGTGGATGTGGGGTTCGGTTGCCGCTGTGATCACACTCGGCACTGCAGTCTTAGCTTGGTCGTTCCTTCCACCGGGAAAGGATTCGTCATCCATGGCCACTGATCGTGATGATGGAACAGATGGATGACCACTGACAAAAGAAGTAGCTTATGTACATGTGGTATAATGTTGATATCAAATGTTTGTTGCTTTTGTCAGAAGTATTTGCATGCCCGAAGGTATAGTTCTTAAAAATCTTCGGCCCTTATAATTAAAGGTGAAAGAAAAGGTTGGTTCTTGATATTTCTTTTTCCCTCAGATCTAACAGGGTTATAACACCTGAAATTTTGGTAATTCATGACAAACCATACACTGTCTATTTACTGTGCTAACTGTAGTTGAAATATTGTGCCTGACTTAGAGACGGGCGTGATTGCCCTTGTTTCGAAA
AAAAAAAGAACTGTAGTTGAAATATTGCGTTGAGCATTCAAATAATTATTGGCAGAGTGGAGTTGAACTCTCTATGGTAATGCCTGTTTCATCCTTCAACTATTGTATATAGCAC
Coding sequence (CDS)
ATGCCAACGTTTACTACGATTGCGTTGGATAGGTTGTTAGAACCTGGAACTTCCAAATCTGTCGATAAGCCCCTTCCTAAAGTGAAGCCTGCTTTGACCTCTAATCGTGCGCCCACCACGAAGTTGGAGAGGAGAAATAGCGCATCACTTGCTGATAGAAAGGTTCAGCGGCCTCAAATTAAGCCAGAACTATATACCACTCCAGAGGCAACTCCTCTCCCCGACTCACCGCCTTCGTTTTCTCCTTCCCCTTATATTGTCAATCACAAGCGGCGTGGCCCTCGTCTGTTGAAGAGTTTCTCTCAGGATGGTGTCTTCTTTAGTCAAAAGACAAATGATAAGGATACAGGAAATGGAAGCGCGAATGGTTCAGATGGCAAAGTTGTAAAATTGACTGAGGGTGCTTCTGTTACTGTTGACATACCTATCCCAAACAAAGATGGACACAGAAATGGTCTAGACCATGTTAGTAATACTAATGTTGCTCAAAATGGGATTGTTGATGGTGATCATGGTGCTGTTGGGAGTAATCTTAGTAATCATGAAAGTAGTACAATGTTGAACAATGGTGTTGCTCTGGATAAGGATTCATTGAAGGTTGTTGTGACAAATTTACAAAGTGTTGGAGATACTGATGACTTCTATGACCCACAGGATTTTTTGAGTGCCAAGAGTAACACAGATGGAGAAGATAACGGATTTGAACGTTCAGCCAAGACTTATACTCCTATGGGGGTATTTTATGATGCTTGGGAAGAGCTTTCGTCTGAGGGGTTTTCACATCCTTCTTTTCCTGATATTGAAGCTGAGTTGCGTGAAATGAAACTATCCCTATTGATGGAACTTGAGAAACGAAAGCTTTCTGAGGAAGCACTGAAAAAATTGCAGGGTCAGTGGCAGAGGCTTAGTGAACAGCTACTGCTTGTAGGATTGACCCTTCCTTCAGATCCCACAGTAGCCATGGAAGGGAAACAGTTAGCTTCTGACCCTGTTGAAGAATTGTGTCAACAAGTTAATCTTGCTAGGTTTATATCGGGTTCAATTGGAAGGGGTATAGTGAGGGCCGAAGTGGAGACTGAGATGGAGGCACAGCTTGAAGTGAAGAATTTTGAGATTGCTCGATTGTTGGACCGGCTTCGTTACTATGAGGCTGTGAATCATGAAATGTCTCAGAGGAATCAAGAAGCTGTAGATTTGGCGCGACGCGAGAGGGTGAAAAGAAAAAGGAGGCAAAGGTGGATGTGGGGTTCGGTTGCCGCTGTGATCACACTCGGCACTGCAGTCTTAGCTTGGTCGTTCCTTCCACCGGGAAAGGATTCGTCATCCATGGCCACTGATCGTGATGATGGAACAGATGGATGA
Protein sequence
MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQIKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVGSNLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSMATDRDDGTDG
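The protein sequence above can be reproduced from the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the name `translate` is illustrative and is not part of this database.

```python
# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. This choice of table is an assumption; the source
# page does not state which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 11 codons of the CDS listed above.
print(translate("ATGCCAACGTTTACTACGATTGCGTTGGATAGG"))  # MPTFTTIALDR
```

Passing the full CDS string should give the protein sequence shown above, with the terminal TGA stop codon left untranslated.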
Homology
BLAST of Sed0002223 vs. NCBI nr
Match:
XP_022962776.1 (uncharacterized protein LOC111463164 [Cucurbita moschata] >XP_022962777.1 uncharacterized protein LOC111463164 [Cucurbita moschata] >XP_022962778.1 uncharacterized protein LOC111463164 [Cucurbita moschata])
HSP 1 Score: 669.8 bits (1727), Expect = 1.6e-188
Identity = 357/460 (77.61%), Positives = 386/460 (83.91%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGTSKSVDKPLPK PALT NRAPTT LERRNSAS A+RKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGS 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V QK ND D GN +
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDVVSSRQKMNDNDIGNVN 120
Query: 121 ANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GS 180
NGSD VKL+EGASVTVD+PIPNKDGHRNGLD +++NV QNG VDGDHGA GS
Sbjct: 121 VNGSDSNDVKLSEGASVTVDLPIPNKDGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGS 180
Query: 181 NLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFER 240
N +N+ S+ M++N VA +KDSLKVVV L S+GD +DF+DPQD LS SNTDGEDNG+ER
Sbjct: 181 NHTNNGSTIMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYER 240
Query: 241 SAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ 300
SAK TPMG FYDAWEE+SS+G HPS IEAELREM+LSLLMELEKRK +EEAL L+
Sbjct: 241 SAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEALDNLR 300
Query: 301 GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAE 360
GQWQRL E L LVGLTLPSDPTVA G L SDP EELCQQVN+ARF+SGSIGRGI RAE
Sbjct: 301 GQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAE 360
Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWG 420
VETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRR RWMWG
Sbjct: 361 VETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLRWMWG 420
Query: 421 SVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD 453
SVA ITLGTAVLAWS+LP GKDSSSM AT+ DD TD
Sbjct: 421 SVATAITLGTAVLAWSYLPSGKDSSSMNDSKATEHDDATD 460
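The Identity figure in each HSP header is the number of alignment columns in which query and subject carry the same residue, divided by the alignment length (gap columns included). The sketch below shows that arithmetic under those assumptions; `count_identities` and `percent` are hypothetical helper names. The Positives count also depends on a substitution matrix, which this page does not name, so it is not recomputed here.

```python
def count_identities(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned strings.

    Both strings must be the gapped rows of the same alignment, with '-'
    marking gaps. A column counts as identical only if neither row has a gap.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


def percent(count: int, length: int) -> float:
    """Express count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Header values of the first HSP: 357 identities over 460 columns.
print(percent(357, 460))  # 77.61

# First 60-column block of the same alignment.
query_row = "MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI"
sbjct_row = "MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQI"
print(count_identities(query_row, sbjct_row))
```

Summing the per-block counts over every block of an HSP gives the numerator reported in its header.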
BLAST of Sed0002223 vs. NCBI nr
Match:
KAG6594816.1 (hypothetical protein SDJN03_11369, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 663.7 bits (1711), Expect = 1.1e-186
Identity = 357/464 (76.94%), Positives = 385/464 (82.97%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGTSKSVDKPLPK PALT NRAPTT LERRNSAS A+RKVQRPQI
Sbjct: 41 MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQI 100
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTG--- 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V QK ND D G
Sbjct: 101 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDVVSSHQKMNDNDIGNVN 160
Query: 121 -NGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV-- 180
N + NGSD VKL+EGASVTVD+PIPNKDGHRNGLD +N+NV QNG VDGDHGA
Sbjct: 161 VNVNVNGSDSNDVKLSEGASVTVDLPIPNKDGHRNGLDCATNSNVGQNGSVDGDHGATAV 220
Query: 181 --GSNLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDN 240
GSN +N+ S+ ++N VA +KDSLKVVV L S+GD +DF+DPQD LS SNTDGEDN
Sbjct: 221 QHGSNHTNNGSTMTVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDN 280
Query: 241 GFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEAL 300
G+ERSAK TPMG FYDAWEE+SS+G HPS IEAELREM+LSLLMELEKRK +EEAL
Sbjct: 281 GYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL 340
Query: 301 KKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGI 360
L+GQWQRL E L LVGLTLPSDPTVA G L SDP EELCQQVN+ARF+SGSIGRGI
Sbjct: 341 DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGI 400
Query: 361 VRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQR 420
RAEVETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRR R
Sbjct: 401 ARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLR 460
Query: 421 WMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD 453
WMWGSVA ITLGTAVLAWS+LP GKDSSSM AT+ DD TD
Sbjct: 461 WMWGSVATAITLGTAVLAWSYLPSGKDSSSMNDSKATEHDDATD 504
BLAST of Sed0002223 vs. NCBI nr
Match:
XP_038882592.1 (uncharacterized protein LOC120073808 [Benincasa hispida])
HSP 1 Score: 663.3 bits (1710), Expect = 1.5e-186
Identity = 357/459 (77.78%), Positives = 386/459 (84.10%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGTSKSVDK LPK KPALT NRAP+TKLERRNSAS+ADRKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTSKSVDKSLPKPKPALTLNRAPSTKLERRNSASVADRKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGS 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V +K ND D GNGS
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV-SRKKMNDNDVGNGS 120
Query: 121 ANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----S 180
GSD VK TEG+SVTVD+PIP KDG RNG D S++NV QNG VDGDHGA +
Sbjct: 121 VKGSDSNDVKSTEGSSVTVDMPIPEKDGDRNGPDCASSSNVRQNGSVDGDHGATAVQLVN 180
Query: 181 NLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFER 240
N SNHES +++NGVA +K+SLKVVV+N +S+GDT+DF+DP D LS SNTDGEDNGFER
Sbjct: 181 NHSNHESRIVVSNGVAREKNSLKVVVSNSESIGDTEDFFDPHDSLSVTSNTDGEDNGFER 240
Query: 241 SAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ 300
SAK TPMG FYDAWEELSSEG PS DIEAELREMKL+LLMELEKRK +EEAL KLQ
Sbjct: 241 SAKFGTPMGEFYDAWEELSSEGLPQPSISDIEAELREMKLTLLMELEKRKQAEEALNKLQ 300
Query: 301 GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAE 360
GQW RL EQLLLVGLTLPSDP VA EG QL SDP EELCQQV LARF+S SIGRGI RAE
Sbjct: 301 GQWWRLREQLLLVGLTLPSDPPVATEGNQLDSDPAEELCQQVYLARFVSDSIGRGIARAE 360
Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWG 420
VETEMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRRQRW+WG
Sbjct: 361 VETEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420
Query: 421 SVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD 453
SVA ITLGTAVLAWS+LP GKD S++ + DD TD
Sbjct: 421 SVATAITLGTAVLAWSYLPSGKDLPSSNNTKAEHDDVTD 458
BLAST of Sed0002223 vs. NCBI nr
Match:
KAG7026781.1 (hypothetical protein SDJN02_10788 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 660.6 bits (1703), Expect = 9.7e-186
Identity = 355/464 (76.51%), Positives = 386/464 (83.19%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGTSKSVDKPLPK PALT NRAPTT LERRNSAS A+RKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTG--- 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V QK ND D G
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDVVSSRQKMNDNDIGNVN 120
Query: 121 -NGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV-- 180
N + NGSD VKL+EGASVTVD+PIPNK+GHRNGLD +++NV QNG VDGDHGA
Sbjct: 121 VNVNVNGSDSNDVKLSEGASVTVDLPIPNKNGHRNGLDCATSSNVGQNGSVDGDHGATAV 180
Query: 181 --GSNLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDN 240
GSN +N+ S+ M++N VA +KDSLKVVV L S+GD +DF+DPQD LS SNTDGEDN
Sbjct: 181 QHGSNHTNNGSTMMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDN 240
Query: 241 GFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEAL 300
G+ERSAK TPMG FYDAWEE+SS+G HPS IEAELREM+LSLLMELEKRK +EEAL
Sbjct: 241 GYERSAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEAL 300
Query: 301 KKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGI 360
L+GQWQRL E L LVGLTLPSDPTVA G L SDP EELCQQVN+ARF+SGSIGRGI
Sbjct: 301 DNLRGQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGI 360
Query: 361 VRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQR 420
RAEVETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRR R
Sbjct: 361 ARAEVETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLR 420
Query: 421 WMWGSVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD 453
WMWGSVA ITLGTAVLAWS+LP GKDSSS+ AT+ DD TD
Sbjct: 421 WMWGSVATAITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD 464
BLAST of Sed0002223 vs. NCBI nr
Match:
XP_023003998.1 (uncharacterized protein LOC111497447 [Cucurbita maxima] >XP_023003999.1 uncharacterized protein LOC111497447 [Cucurbita maxima] >XP_023004000.1 uncharacterized protein LOC111497447 [Cucurbita maxima])
HSP 1 Score: 660.2 bits (1702), Expect = 1.3e-185
Identity = 352/460 (76.52%), Positives = 384/460 (83.48%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGTSKSVDKPLPK PALT NRAP+T LERRNSAS A+RKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPSTNLERRNSASAAERKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGS 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V QK ND D GN +
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDVVSSCQKINDNDIGNVN 120
Query: 121 ANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GS 180
NGSD VKL+EGASVTVD+PIPNKDG RNG D +++NV QNG VDGDHGA GS
Sbjct: 121 VNGSDSNDVKLSEGASVTVDLPIPNKDGQRNGQDCATSSNVGQNGSVDGDHGATAVQHGS 180
Query: 181 NLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFER 240
N +N+ SS M++N VA +KDSLKVVV L S+GD +DF+DPQD LS SNTDGEDNG+ER
Sbjct: 181 NHTNNGSSMMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYER 240
Query: 241 SAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ 300
SAK TPMG FYDAWEE+SS+G HPS IEAELREM+LSLLMELEKRK +EEAL L+
Sbjct: 241 SAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEALDNLR 300
Query: 301 GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAE 360
GQWQRL E L LVGLTLPSDPTV+ G + SDP EELCQQVN+ARF+SGSIGRGI RAE
Sbjct: 301 GQWQRLREHLSLVGLTLPSDPTVSTNGNLVYSDPAEELCQQVNIARFVSGSIGRGIARAE 360
Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWG 420
VETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRR RWMWG
Sbjct: 361 VETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLRWMWG 420
Query: 421 SVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD 453
SVA ITLGTAVLAWS+LP GKDSSS+ AT+ DD TD
Sbjct: 421 SVATTITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD 460
BLAST of Sed0002223 vs. ExPASy TrEMBL
Match:
A0A6J1HDH8 (uncharacterized protein LOC111463164 OS=Cucurbita moschata OX=3662 GN=LOC111463164 PE=4 SV=1)
HSP 1 Score: 669.8 bits (1727), Expect = 7.7e-189
Identity = 357/460 (77.61%), Positives = 386/460 (83.91%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGTSKSVDKPLPK PALT NRAPTT LERRNSAS A+RKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPTTNLERRNSASAAERKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGS 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V QK ND D GN +
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDVVSSRQKMNDNDIGNVN 120
Query: 121 ANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GS 180
NGSD VKL+EGASVTVD+PIPNKDGHRNGLD +++NV QNG VDGDHGA GS
Sbjct: 121 VNGSDSNDVKLSEGASVTVDLPIPNKDGHRNGLDCATSSNVGQNGSVDGDHGATAVQHGS 180
Query: 181 NLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFER 240
N +N+ S+ M++N VA +KDSLKVVV L S+GD +DF+DPQD LS SNTDGEDNG+ER
Sbjct: 181 NHTNNGSTIMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYER 240
Query: 241 SAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ 300
SAK TPMG FYDAWEE+SS+G HPS IEAELREM+LSLLMELEKRK +EEAL L+
Sbjct: 241 SAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEALDNLR 300
Query: 301 GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAE 360
GQWQRL E L LVGLTLPSDPTVA G L SDP EELCQQVN+ARF+SGSIGRGI RAE
Sbjct: 301 GQWQRLREHLSLVGLTLPSDPTVATNGNLLYSDPAEELCQQVNIARFVSGSIGRGIARAE 360
Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWG 420
VETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRR RWMWG
Sbjct: 361 VETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLRWMWG 420
Query: 421 SVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD 453
SVA ITLGTAVLAWS+LP GKDSSSM AT+ DD TD
Sbjct: 421 SVATAITLGTAVLAWSYLPSGKDSSSMNDSKATEHDDATD 460
BLAST of Sed0002223 vs. ExPASy TrEMBL
Match:
A0A6J1KTC8 (uncharacterized protein LOC111497447 OS=Cucurbita maxima OX=3661 GN=LOC111497447 PE=4 SV=1)
HSP 1 Score: 660.2 bits (1702), Expect = 6.1e-186
Identity = 352/460 (76.52%), Positives = 384/460 (83.48%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGTSKSVDKPLPK PALT NRAP+T LERRNSAS A+RKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKHMPALTFNRAPSTNLERRNSASAAERKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGS 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V QK ND D GN +
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDVVSSCQKINDNDIGNVN 120
Query: 121 ANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAV----GS 180
NGSD VKL+EGASVTVD+PIPNKDG RNG D +++NV QNG VDGDHGA GS
Sbjct: 121 VNGSDSNDVKLSEGASVTVDLPIPNKDGQRNGQDCATSSNVGQNGSVDGDHGATAVQHGS 180
Query: 181 NLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFER 240
N +N+ SS M++N VA +KDSLKVVV L S+GD +DF+DPQD LS SNTDGEDNG+ER
Sbjct: 181 NHTNNGSSMMVSNDVAREKDSLKVVVPTLDSLGDAEDFFDPQDSLSVASNTDGEDNGYER 240
Query: 241 SAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ 300
SAK TPMG FYDAWEE+SS+G HPS IEAELREM+LSLLMELEKRK +EEAL L+
Sbjct: 241 SAKFCTPMGEFYDAWEEMSSDGLPHPSISVIEAELREMRLSLLMELEKRKQAEEALDNLR 300
Query: 301 GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAE 360
GQWQRL E L LVGLTLPSDPTV+ G + SDP EELCQQVN+ARF+SGSIGRGI RAE
Sbjct: 301 GQWQRLREHLSLVGLTLPSDPTVSTNGNLVYSDPAEELCQQVNIARFVSGSIGRGIARAE 360
Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWG 420
VETEMEAQLE KNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRR RWMWG
Sbjct: 361 VETEMEAQLEAKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRLRWMWG 420
Query: 421 SVAAVITLGTAVLAWSFLPPGKDSSSM----ATDRDDGTD 453
SVA ITLGTAVLAWS+LP GKDSSS+ AT+ DD TD
Sbjct: 421 SVATTITLGTAVLAWSYLPSGKDSSSINDSKATEHDDATD 460
BLAST of Sed0002223 vs. ExPASy TrEMBL
Match:
A0A5D3CMF0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G001960 PE=4 SV=1)
HSP 1 Score: 641.0 bits (1652), Expect = 3.8e-180
Identity = 347/459 (75.60%), Positives = 382/459 (83.22%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGT+KS+DK LPK KPALT NRAP+TKLERRNSAS+ADRKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGS 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V +K NDKD GNGS
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV-SHKKMNDKDVGNGS 120
Query: 121 ANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----S 180
SDG VKLTEGASVTV PIP+K G RNGLD S++N+ +NG VDGDHGA S
Sbjct: 121 VERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVS 180
Query: 181 NLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFER 240
+ +NHESS + ++G+A +KDSLK VV+N +S GD +DF+DP D LS SNTDGEDNGFER
Sbjct: 181 SHNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFER 240
Query: 241 SAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ 300
SAK TPMG FYDAWEELSSEG PS DIE + REM+ LLME+EKRK +EEAL KLQ
Sbjct: 241 SAKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKRKQAEEALNKLQ 300
Query: 301 GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAE 360
QWQRL EQLLLVGLTLPSDPTVA EGKQL SDP EELCQQVNLARF+S SIG+GI RAE
Sbjct: 301 CQWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAE 360
Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWG 420
VE EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRRQRW+WG
Sbjct: 361 VEAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420
Query: 421 SVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD 453
SVA ITLGTAVLAWS+LP GKD S++ + DD TD
Sbjct: 421 SVATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455
BLAST of Sed0002223 vs. ExPASy TrEMBL
Match:
A0A5A7T005 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold18G00100 PE=4 SV=1)
HSP 1 Score: 639.4 bits (1648), Expect = 1.1e-179
Identity = 346/459 (75.38%), Positives = 382/459 (83.22%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGT+KS+DK LPK KPALT NRAP+TKLERRNSAS+ADRKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGS 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V +K NDKD GNGS
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV-SHKKMNDKDVGNGS 120
Query: 121 ANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----S 180
SDG VKLTEGASVTV PIP+K G RNGLD S++N+ +NG VDGDHGA S
Sbjct: 121 VERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVS 180
Query: 181 NLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFER 240
+ +NHESS + ++G+A +KDSLK VV+N +S GD +DF+DP D LS SNTDGEDNGFER
Sbjct: 181 SHNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFER 240
Query: 241 SAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ 300
SAK TPMG FYDAWEELSSEG PS DIE + REM+ LLME+EK+K +EEAL KLQ
Sbjct: 241 SAKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKQKQAEEALNKLQ 300
Query: 301 GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAE 360
QWQRL EQLLLVGLTLPSDPTVA EGKQL SDP EELCQQVNLARF+S SIG+GI RAE
Sbjct: 301 CQWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAE 360
Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWG 420
VE EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRRQRW+WG
Sbjct: 361 VEAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420
Query: 421 SVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD 453
SVA ITLGTAVLAWS+LP GKD S++ + DD TD
Sbjct: 421 SVATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455
BLAST of Sed0002223 vs. ExPASy TrEMBL
Match:
A0A1S3B1E0 (uncharacterized protein LOC103485065 OS=Cucumis melo OX=3656 GN=LOC103485065 PE=4 SV=1)
HSP 1 Score: 639.4 bits (1648), Expect = 1.1e-179
Identity = 346/459 (75.38%), Positives = 382/459 (83.22%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTFTTIALDRLLEPGT+KS+DK LPK KPALT NRAP+TKLERRNSAS+ADRKVQRPQI
Sbjct: 1 MPTFTTIALDRLLEPGTTKSIDKSLPKPKPALTFNRAPSTKLERRNSASVADRKVQRPQI 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQDGVFFSQKTNDKDTGNGS 120
KP LYTTPEATPLPDSP SF PSPYIVNHKRRGPRLLKSFS+D V +K NDKD GNGS
Sbjct: 61 KPALYTTPEATPLPDSPSSFPPSPYIVNHKRRGPRLLKSFSEDDV-SHKKMNDKDVGNGS 120
Query: 121 ANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHGAVG----S 180
SDG VKLTEGASVTV PIP+K G RNGLD S++N+ +NG VDGDHGA S
Sbjct: 121 VERSDGNDVKLTEGASVTVTTPIPDKHGDRNGLDCASSSNIGENGCVDGDHGATAVQLVS 180
Query: 181 NLSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGEDNGFER 240
+ +NHESS + ++G+A +KDSLK VV+N +S GD +DF+DP D LS SNTDGEDNGFER
Sbjct: 181 SHNNHESSILTSSGIAQEKDSLK-VVSNSESTGDNEDFFDPHDSLSVASNTDGEDNGFER 240
Query: 241 SAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEALKKLQ 300
SAK TPMG FYDAWEELSSEG PS DIE + REM+ LLME+EK+K +EEAL KLQ
Sbjct: 241 SAKFGTPMGEFYDAWEELSSEGVPQPSISDIEPDQREMR--LLMEIEKQKQAEEALNKLQ 300
Query: 301 GQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGIVRAE 360
QWQRL EQLLLVGLTLPSDPTVA EGKQL SDP EELCQQVNLARF+S SIG+GI RAE
Sbjct: 301 CQWQRLREQLLLVGLTLPSDPTVATEGKQLDSDPAEELCQQVNLARFVSESIGKGIARAE 360
Query: 361 VETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQRWMWG 420
VE EMEAQLEVKNFEIARLLDRL YYEAVNHEMSQRNQEAVDLARRER++RKRRQRW+WG
Sbjct: 361 VEAEMEAQLEVKNFEIARLLDRLHYYEAVNHEMSQRNQEAVDLARRERLRRKRRQRWIWG 420
Query: 421 SVAAVITLGTAVLAWSFLPPGKD---SSSMATDRDDGTD 453
SVA ITLGTAVLAWS+LP GKD S++ + DD TD
Sbjct: 421 SVATAITLGTAVLAWSYLPSGKDLPSSNNSKAEHDDVTD 455
BLAST of Sed0002223 vs. TAIR 10
Match:
AT3G50910.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G66480.1); Has 76 Blast hits to 75 proteins in 28 species: Archae - 0; Bacteria - 10; Metazoa - 7; Fungi - 2; Plants - 49; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )
HSP 1 Score: 348.2 bits (892), Expect = 9.9e-96
Identity = 220/452 (48.67%), Positives = 289/452 (63.94%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLPKVKPALTSNRAPTTKLERRNSASLADRKVQRPQI 60
MPTF+ IALDR+LEPG S SV+ +P L ++ P +KLE+ +R V RP +
Sbjct: 1 MPTFSAIALDRMLEPGASTSVES-VPST-TNLFYSKPPISKLEKGKGKLPNERTVTRPLM 60
Query: 61 KPELYTTPEATPLPDSPPSFSPSPYIVNHKRRG-PRLLKSFSQDGVFFS--QKTNDKDTG 120
P LY TP+A PLP+SP SF PSPYI+NHK RG PRLLKS S+ V S QKT +++T
Sbjct: 61 SPALYATPDAIPLPNSPSSFPPSPYIINHKSRGPPRLLKSSSEANVVSSSHQKTLEEETI 120
Query: 121 NGSANGSDGKVVKLTEGASVTVDIPIPNKDGHRNGLDHVSNTNVAQNGIVDGDHG--AVG 180
+D KV S + I +D + NG+ + N +GIVDG G +
Sbjct: 121 TAE---TDVKVSPRRRSTSFSFPITEVTEDDYSNGVHARTVGNYNFDGIVDGPVGNWSPL 180
Query: 181 SNLSNHESSTMLN--NGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSNTDGE-DN 240
S + S + N NG+ + V +++DFYDP + S SNTD E D
Sbjct: 181 DGKSGNGKSELDNAANGLERVNGLSEPVPIKTDKESESEDFYDPGESASFTSNTDVEGDA 240
Query: 241 GFERSAKTYTPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELEKRKLSEEAL 300
G E S + TP+G FYDAW+ELS++ S +IE+EL E++LSLLME+EKRK +EEAL
Sbjct: 241 GDESSQRLATPVGEFYDAWDELSTDSGMQSSVNNIESELSEIRLSLLMEIEKRKQTEEAL 300
Query: 301 KKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARFISGSIGRGI 360
+++Q WQRL EQ+ VGL +P DPT + L+ EEL Q+ +ARF+S S+GRG+
Sbjct: 301 EQMQIHWQRLREQMAQVGLFVPIDPTASTNNMNLS----EELRCQLEIARFVSDSLGRGM 360
Query: 361 VRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRERVKRKRRQR 420
+AEVE EME+ LE KNFEI RL DRL YYEAVN EMSQRNQEA+++ARRER KRK+RQR
Sbjct: 361 AKAEVEMEMESMLETKNFEITRLSDRLHYYEAVNREMSQRNQEAIEVARRERQKRKKRQR 420
Query: 421 WMWGSVAAVITLGTAVLAWSFLPPGKDSSSMA 445
W+WGS+AA ITLG+A LAWS++P K SS ++
Sbjct: 421 WIWGSIAATITLGSAALAWSYIPAAKPSSEVS 443
BLAST of Sed0002223 vs. TAIR 10
Match:
AT5G66480.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G50910.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 270.0 bits (689), Expect = 3.4e-72
Identity = 187/461 (40.56%), Positives = 263/461 (57.05%), Query Frame = 0
Query: 1 MPTFTTIALDRLLEPGTSKSVDKPLP-KVKPALTSNRAPTTKLERRNSASLADRKVQRPQ 60
MPTF+ AL R L GTS S P + KP++ ++ + K ++ RPQ
Sbjct: 1 MPTFSAAALGRSLNSGTSLSSKFPSTLQSKPSILNDESKQPK----------EKTFTRPQ 60
Query: 61 IKPELYTTPEATPLPDSPPSFSPSPYIVNHKRRGPRLLKSFSQ-DGVFFS-QKTNDKDTG 120
+ P LY T + P P+SP S+ PSPYI+NHK RGP L S+ DG +K +G
Sbjct: 61 MSPSLYATTKEIPHPNSPSSYPPSPYIINHKARGPVLFNRDSEVDGPSHPITSGEEKISG 120
Query: 121 NGSANGSDGKVVKLTEGASVTVDIPIPNKDG-HRNGLDH--------VSNTNVAQNGIVD 180
N + + +T I + + +G H G+ T + + D
Sbjct: 121 NVDVEATASLSKSTSLSFPITEAIAVDHTNGVHTQGIHERPVWDCSPPLGTFLNEKSGRD 180
Query: 181 GDHGAVGSN--LSNHESSTMLNNGVALDKDSLKVVVTNLQSVGDTDDFYDPQDFLSAKSN 240
+G +GSN SN E + L V + D + ++FY+P + +S SN
Sbjct: 181 ISNGGIGSNNATSNLEWQSYLLEPVRIKADKEL----------EPENFYNPGELVSFTSN 240
Query: 241 TDGEDNGFERSAKTY---TPMGVFYDAWEELSSEGFSHPSFPDIEAELREMKLSLLMELE 300
T+ ED FER+ ++ T +G FYDA +ELS++ S +IE+E+REM+L LLME+E
Sbjct: 241 TEVED--FERAESSHSLATHVGEFYDACDELSTDSGMQSSANNIESEVREMRLGLLMEIE 300
Query: 301 KRKLSEEALKKLQGQWQRLSEQLLLVGLTLPSDPTVAMEGKQLASDPVEELCQQVNLARF 360
+R+ +E L+++Q W+RL +QL VG+ LP DPT + LA +EL Q+ + RF
Sbjct: 301 RRRQAEATLEQMQVHWRRLRDQLADVGMFLPLDPTRSQYSMNLA----DELRCQLEVTRF 360
Query: 361 ISGSIGRGIVRAEVETEMEAQLEVKNFEIARLLDRLRYYEAVNHEMSQRNQEAVDLARRE 420
+S ++G + + EVE EMEA+LE KNFEI RL DRL YYE VN EMSQRNQEA+++ARR+
Sbjct: 361 VSDTLGSDLAKTEVEMEMEAELEAKNFEITRLSDRLHYYETVNQEMSQRNQEAIEVARRD 420
Query: 421 RVKRKRRQRWMWGSVAAVITLGTAVLAWSFLPPGKDSSSMA 445
KRKRRQRW+WGS+AA ITLG+ VLAWS+LPPG SS A
Sbjct: 421 GQKRKRRQRWIWGSIAATITLGSGVLAWSYLPPGMLSSDEA 435
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022962776.1 | 1.6e-188 | 77.61 | uncharacterized protein LOC111463164 [Cucurbita moschata] >XP_022962777.1 unchar...
KAG6594816.1 | 1.1e-186 | 76.94 | hypothetical protein SDJN03_11369, partial [Cucurbita argyrosperma subsp. sorori...
XP_038882592.1 | 1.5e-186 | 77.78 | uncharacterized protein LOC120073808 [Benincasa hispida]
KAG7026781.1 | 9.7e-186 | 76.51 | hypothetical protein SDJN02_10788 [Cucurbita argyrosperma subsp. argyrosperma]
XP_023003998.1 | 1.3e-185 | 76.52 | uncharacterized protein LOC111497447 [Cucurbita maxima] >XP_023003999.1 uncharac...
Match Name | E-value | Identity (%) | Description
A0A6J1HDH8 | 7.7e-189 | 77.61 | uncharacterized protein LOC111463164 OS=Cucurbita moschata OX=3662 GN=LOC1114631...
A0A6J1KTC8 | 6.1e-186 | 76.52 | uncharacterized protein LOC111497447 OS=Cucurbita maxima OX=3661 GN=LOC111497447...
A0A5D3CMF0 | 3.8e-180 | 75.60 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A5A7T005 | 1.1e-179 | 75.38 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3B1E0 | 1.1e-179 | 75.38 | uncharacterized protein LOC103485065 OS=Cucumis melo OX=3656 GN=LOC103485065 PE=...
Match Name | E-value | Identity (%) | Description
AT3G50910.1 | 9.9e-96 | 48.67 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G66480.1 | 3.4e-72 | 40.56 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
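The pipe-delimited summary rows above can be loaded into records for sorting or filtering. This is a minimal sketch that assumes the first four fields are always match name, E-value, percent identity, and description, and that any further fields (such as a trailing link column) can be ignored; `parse_row` and `parse_table` are hypothetical names.

```python
def parse_row(line: str) -> dict:
    """Parse one 'name | e-value | identity | description' summary row."""
    fields = [field.strip() for field in line.split("|")]
    if len(fields) < 4:
        raise ValueError(f"expected at least 4 fields, got {len(fields)}")
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),      # e.g. 1.6e-188 is representable as a float
        "identity": float(identity),  # percent identity
        "description": description,   # may be truncated with '...' in the source
    }


def parse_table(lines: list[str]) -> list[dict]:
    """Parse data rows, skipping blank lines and 'Match Name' header rows."""
    rows = []
    for line in lines:
        if not line.strip() or line.lstrip().startswith("Match Name"):
            continue
        rows.append(parse_row(line))
    return rows


table = [
    "Match Name | E-value | Identity | Description",
    "AT3G50910.1 | 9.9e-96 | 48.67 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...",
    "AT5G66480.1 | 3.4e-72 | 40.56 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...",
]
for hit in sorted(parse_table(table), key=lambda row: row["evalue"]):
    print(hit["match"], hit["evalue"], hit["identity"])
```

Sorting by E-value puts the most significant hit first; descriptions are kept exactly as listed, including any truncation.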