Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCAAACCCTAGAAAAGAAGAATCGATCGCCAGCAACGTTAACGATGGCGCCGATCGTGATAACGTTGAAGAATTCGGGGACTCATCTTGCGTTGGTGGTGTCTCTTCGAATGCTGTTGAGGTTTCTGGAGGTTCGCATGCTTCGACGAGGGAGATTAACCTTACGGAGAGGCTGACTGATATTCTTGTCGATGAAGGAGATGGCGATCTGTTGCTTCAGCAGAGCGATCGGGAAGATAGGGTTATTCGGTGGCTTCAAGCGCTAGATATGCAAGTCATGGGCGCTTGTCGGGCTGACGAAAGGTTGAAGCCGTTATTGAAGATGACTACGTCTAGCGGCATAGCGGAAGATCGTCTTCTTGCTCAATTGAGTCAGGTTTGTGCTTAAATATGTATTTATTTTTCTAAGTTTAAATATATGGCGGATTTTTAATTGTCAACGATAATTCGCTGCTTTATATTCCATGCTCGTTCTGCGGTTCTACATAGGACTCCTGGTGCCGATGGAGATGGCATGGAATCTGTATTTGGCGTGAGTAAAAAAGATTGAACGATTTTTTTTTTCAATTTCATTTTCGTCGTTTCAATTTGGTGGACTTCCCCCCGTCGTTCTAGATTCTGTCCCATTCTGTAGTACTGATGGTTATTAGGTTTTGCAGTGTCTAGTTCTGCGTCTTTCGTTTTTCCTTTTCGCTGATGTGATCCACCCCCATTTTTGTTACTCAGCATTTCGAGCCGGTTGAAGTTGGCATTCTAGCGAGGTGTTTCTGTATACCTCTCGTCTCTATTCGCGTTGGAAAAATTGAGAAGCAAGGAAGCCTCCTTTGCCCTACGACCACTAGGTAATCATGTCATGCGATTGTTTGTGCTCTTGCCTATTCTATTATCCTTGAATAATATGAATGTGTGCCACTCTGATCAGAAGCAAATAACTAAGAAAGGAATGCATATATTCATTTGGTTGTTTCTATCTCTATTTTTTGATGCGTTTATTATGTTAGAAGAACCCTGTAATAGCCTAATAGGGTTAGAGAATGAGATTTTTTTTTTTTTTTTTTTTGATGCCACTTGACTCTGCCAATGTTTAAATTCGTTTGATTCAGTAGAAGGTCTATTGTCTTTCATTTTCTAAATTGCAACTTAGAATGCTCTAATTTGCAATATCCATTTATAATCCAATCTCTTCATATGATATATTAGAAGACCTGGCACCATCATTTCTATTTTGTTAGTCTGTAGGATTTGCGTGTTTAATGTAAGCGTGGACTCTGAGTTCCTTGTCCTGTTTCTGTGTAAAATATATTTGTTTCTTGCTTGACTTTGTATGACTTCAAATTGTTGCTTTACTAATATAACTTGACTAGAAGGTGACTGTTAGGACAGTTCTTATGCAGTTTTCCCCCACCTTTTCTTTTCTTTTTTTTTTTTTCTTTTTTATTTTTTTTTTTTAATAAGAAATGTCGATTGTCTTAACATTGTGGTCATTGCCCACTTTCACTATTGGCTCGACAGCACCTGATGAATCAAATATGATAACAACAACCACACCAGCACATGTGGGGTGGGTTCCTATGCTTCCCTGGGGTAGCTTCTATGCTTTTTTTTCCTTCCCATGTGTACTCTTGTGTCTTCTCGTGACTATATTTCCACATCTGGTCTATTGATCATTTTTAGGGGGGTTGGGTCTAATTAGGATTCTAACTAGGAGTGCTTTTACTTCTATGGTATAACCATATAATGTTACTTGAAAGTGTGTAATTAGTGGTAATTTATAGATTTAGGACTCTACATGTTAGCCTACTAATAGCAACAATGTTCATCCTTCCAAATAAATATATCTTTTGGTTTGTAGGTAAAGTGGGCGTTTTCTATTATGATCTCTGAATTATGAGGAAATAAAGGTAGTTTCATGCTCTTATACTATAATAATAGCTTGCTCCTGTTTTTTAGTTGTGTAGGGTTGCAACTTATAATTAAAATGGGATTGTTGGAATGAGTGCAATATATGCTGTATTTTTGTAATCACCTAAGTCCTAACTGTTCAGAACAACTCGTTACTTTCCCTGCTCTATTTATCTTATTGTTTTTTCAATGTATGCTGAATCCTTTTGTATTGACCAGAGTGTTGGGAGCCTAACTGTTTGTTATGAGAAAATGTAATGATGTAGTTCTTTCTTGAAGAAATTATATTTTGTTTTTTAATTATTGGATGCTGATTTATATTTGTACTTAGTATAGTGTCATATAAACTTCATGAAATGATGATGTCATATAAACTTCATGAAATGATGATGTAGAAAGATTTGTGTGATATTTTTAATTTTTAAAGTTAAATCAAATTTAAGGTTTTGAATCTTTTGATTTAAAGTGAATCATTTATCAATTAAGATTTAACAATGGAACATCTTAACCTTTTAGTGTTGAACCTTGTTCTGGATGTTGAGTCTTCTTAGTATCTTTCTACATTTATTTTTTTGTGTTTAGTTGCTTTTCTTTGGTAGTCAAGTTGAAAATTTCCACAGTGCTTTCTATAGGAACTTTGCTTAATTGATTTTAGATGAGATGGAGACTTTAGTGGTGTACTTAGCAGTGAAGTAGAAGTTATTTCAATTGATTTAATTGATTTTAGATGGTGAGGAAATTTCAAATGTTGTTGAGGTTCTGGTTGTAAATAAATGAGATAGATTTTGTATTCAAGTCTACTATCTGATTCTTTCATCTCATAAATTTATGAATGTTTTAGTTTTTGTATTCCACAAACTTTTTTTCTCCTCATTTTTATTTCTTTTATCTCCCTCAGGAATTTAGATTCTACCCTTAAGATTTTGATTGATTATTTGAAATTGGATTTTTTTATGCTAAAAGATCTATTTTGTCAATCATCACAGGGGAAACTTAAATCTAATGGTTGTTCCATCATCCGACTTTCGACTCTCATTCATTGGGGATAATGGCCAGGTAGAGAGACTATTCACTCTGAGTAGCAGATCGTCAAGTGCAATTACAATCGATGAGATTGCATCTGATAATTCTGGCCGTTCATTTGTTATCAAAGCAAATGATCAAAATATCTATTTTTGGTGCTCAGAGAAATCAAAGCTCTTGGGAACAGAACTACTTGTGAAGGTATTTGTACTTCTGTATTGAGGCTAATGATGAGTAGTTTTTTGGTTCAAATTACAGCTTTAATAAGCAATGTATATAATAGACATTTTCTAACGTTCCTCACTCATGAAAAGATCATGTTGGAAGGGCAGACATGTGAAGCTTTTATGCGATTTATTTCTTCATGAATTGTGGTAGTGTTATGAATTGACACTATTTATTTTGATGGTTTCCCAATGTTATGACTGGATATATATTTAGAACAATTTTGTGTTTTGGCTGGTTGCATATTTCCTTATGTCTCTTCTCCTGTATCACAGATGAAAGATTTACTACAGAGGAGGCCCTCTATTTCTGAATTAACTGGAATCAGTGGATCACGTCTTGGTTGCTTTGCAACACGCCTTCGTGCCTATCTTGTGGAGTCAACTGTTGCTAACCACCATCCAGCAAGTTCTGCTGACTCAAATTCTTCAGCAGACACCACTAGAGAACTATCTCATTCATCTCATTTTGGACAATCATCTGCATCATCAAAATCTATGCGGTCAAGAAATTCTGGTAGTCCAGCAACTAAAGCAAATTCTGCACACCAGGGTAGTCTTAGCCCCAGGTTGAATTCCTTTAAAGAAGGCCTGCCTAAGACATTGCTTTCTCTGAGAGATGCCGCTAGGGAAAAATTCAGGAGGCGTGGAGAGAACTTGGCTTTAGACAACCATATTGTGGCATCATCGATTTCCACTGATGCATTTTGTGTTAACTCTGAAACACAAGTTGCTGATTCAAATTGCCCATTATCTCCATCAAACTTTTTGGAATCATTGGGAAAATTAGCTGCCCCAATTCCTGCAAGTTCATCTCTTCCCTGTGTGGTTTCACCTCTCTTTACTCCTTACTATTGCTGGTGTCCGGGTGCATCCTCAATTCTGCAGCGAAGGGAAGAACCTTCTCAACTTCCCATCCCATCCATCAGTGCATCTTCTCTTCCGCCATTTCCTTCACTGTTACCAGCTTCTACACCTTCAAACTTATCGGTCCCAATATCACCTTTAAATTTAGTTGATTCTCCGTCAGTGGATTTTCCTGCTCTATTTCCAGAGCCACTGGTCCGTTTGCCTCTGAAAACCTCTCAGCAGATCCCGACCTTCACTCCTTTGTTCTGCGATCCTATTGTCCATGTTCCTGTAATTGATGTTTGCTCTTCGGGTCCAGGCTACCTTGTTAGTGCTGGCCCTACCATTTCAACCTCCATTCCCCCACTGCATCCTAAACTCGTGAATCCAATGATACCTGCTACTGATGTGGAAAAGGATGCTAGAGAGACTCTGCGTCTGCTCATCAGCGGTTCAAGCCCGGGTAACTCTCAATTGATGAATGTACTCCCTGTTGTTCTAACAGATTCCGAAGCAAATCAAAGTTTATTTTTGACTGGAAGCCGTGGTCTGTACAGTAATGCTCGAGACATTGATGCTATTGCGAACAGCATTGCTTCTCTAGGCATTGTGTCACTTTCAGGGCAATCCACAAGTGAGCATGTAGGGAAGAGATTTAATGTTGACGGTTCGAGTTGCCATTCTGACGGCAGTATTGATCCAGAAAGCTCTTATTTGGATGGCGATGATGTTCTTTCCCCATCTCACTCCAAGGAAAGGAAGTCTGGTTGA
mRNA sequence
ATGTCAAACCCTAGAAAAGAAGAATCGATCGCCAGCAACGTTAACGATGGCGCCGATCGTGATAACGTTGAAGAATTCGGGGACTCATCTTGCGTTGGTGGTGTCTCTTCGAATGCTGTTGAGGTTTCTGGAGGTTCGCATGCTTCGACGAGGGAGATTAACCTTACGGAGAGGCTGACTGATATTCTTGTCGATGAAGGAGATGGCGATCTGTTGCTTCAGCAGAGCGATCGGGAAGATAGGGTTATTCGGTGGCTTCAAGCGCTAGATATGCAAGTCATGGGCGCTTGTCGGGCTGACGAAAGGTTGAAGCCGTTATTGAAGATGACTACGTCTAGCGGCATAGCGGAAGATCGTCTTCTTGCTCAATTGAGTCAGCATTTCGAGCCGGTTGAAGTTGGCATTCTAGCGAGGTGTTTCTGTATACCTCTCGTCTCTATTCGCGTTGGAAAAATTGAGAAGCAAGGAAGCCTCCTTTGCCCTACGACCACTAGGGGAAACTTAAATCTAATGGTTGTTCCATCATCCGACTTTCGACTCTCATTCATTGGGGATAATGGCCAGGTAGAGAGACTATTCACTCTGAGTAGCAGATCGTCAAGTGCAATTACAATCGATGAGATTGCATCTGATAATTCTGGCCGTTCATTTGTTATCAAAGCAAATGATCAAAATATCTATTTTTGGTGCTCAGAGAAATCAAAGCTCTTGGGAACAGAACTACTTGTGAAGATGAAAGATTTACTACAGAGGAGGCCCTCTATTTCTGAATTAACTGGAATCAGTGGATCACGTCTTGGTTGCTTTGCAACACGCCTTCGTGCCTATCTTGTGGAGTCAACTGTTGCTAACCACCATCCAGCAAGTTCTGCTGACTCAAATTCTTCAGCAGACACCACTAGAGAACTATCTCATTCATCTCATTTTGGACAATCATCTGCATCATCAAAATCTATGCGGTCAAGAAATTCTGGTAGTCCAGCAACTAAAGCAAATTCTGCACACCAGGGTAGTCTTAGCCCCAGGTTGAATTCCTTTAAAGAAGGCCTGCCTAAGACATTGCTTTCTCTGAGAGATGCCGCTAGGGAAAAATTCAGGAGGCGTGGAGAGAACTTGGCTTTAGACAACCATATTGTGGCATCATCGATTTCCACTGATGCATTTTGTGTTAACTCTGAAACACAAGTTGCTGATTCAAATTGCCCATTATCTCCATCAAACTTTTTGGAATCATTGGGAAAATTAGCTGCCCCAATTCCTGCAAGTTCATCTCTTCCCTGTGTGGTTTCACCTCTCTTTACTCCTTACTATTGCTGGTGTCCGGGTGCATCCTCAATTCTGCAGCGAAGGGAAGAACCTTCTCAACTTCCCATCCCATCCATCAGTGCATCTTCTCTTCCGCCATTTCCTTCACTGTTACCAGCTTCTACACCTTCAAACTTATCGGTCCCAATATCACCTTTAAATTTAGTTGATTCTCCGTCAGTGGATTTTCCTGCTCTATTTCCAGAGCCACTGGTCCGTTTGCCTCTGAAAACCTCTCAGCAGATCCCGACCTTCACTCCTTTGTTCTGCGATCCTATTGTCCATGTTCCTGTAATTGATGTTTGCTCTTCGGGTCCAGGCTACCTTGTTAGTGCTGGCCCTACCATTTCAACCTCCATTCCCCCACTGCATCCTAAACTCGTGAATCCAATGATACCTGCTACTGATGTGGAAAAGGATGCTAGAGAGACTCTGCGTCTGCTCATCAGCGGTTCAAGCCCGGGTAACTCTCAATTGATGAATGTACTCCCTGTTGTTCTAACAGATTCCGAAGCAAATCAAAGTTTATTTTTGACTGGAAGCCGTGGTCTGTACAGTAATGCTCGAGACATTGATGCTATTGCGAACAGCATTGCTTCTCTAGGCATTGTGTCACTTTCAGGGCAATCCACAAGTGAGCATGTAGGGAAGAGATTTAATGTTGACGGTTCGAGTTGCCATTCTGACGGCAGTATTGATCCAGAAAGCTCTTATTTGGATGGCGATGATGTTCTTTCCCCATCTCACTCCAAGGAAAGGAAGTCTGGTTGA
Coding sequence (CDS)
ATGTCAAACCCTAGAAAAGAAGAATCGATCGCCAGCAACGTTAACGATGGCGCCGATCGTGATAACGTTGAAGAATTCGGGGACTCATCTTGCGTTGGTGGTGTCTCTTCGAATGCTGTTGAGGTTTCTGGAGGTTCGCATGCTTCGACGAGGGAGATTAACCTTACGGAGAGGCTGACTGATATTCTTGTCGATGAAGGAGATGGCGATCTGTTGCTTCAGCAGAGCGATCGGGAAGATAGGGTTATTCGGTGGCTTCAAGCGCTAGATATGCAAGTCATGGGCGCTTGTCGGGCTGACGAAAGGTTGAAGCCGTTATTGAAGATGACTACGTCTAGCGGCATAGCGGAAGATCGTCTTCTTGCTCAATTGAGTCAGCATTTCGAGCCGGTTGAAGTTGGCATTCTAGCGAGGTGTTTCTGTATACCTCTCGTCTCTATTCGCGTTGGAAAAATTGAGAAGCAAGGAAGCCTCCTTTGCCCTACGACCACTAGGGGAAACTTAAATCTAATGGTTGTTCCATCATCCGACTTTCGACTCTCATTCATTGGGGATAATGGCCAGGTAGAGAGACTATTCACTCTGAGTAGCAGATCGTCAAGTGCAATTACAATCGATGAGATTGCATCTGATAATTCTGGCCGTTCATTTGTTATCAAAGCAAATGATCAAAATATCTATTTTTGGTGCTCAGAGAAATCAAAGCTCTTGGGAACAGAACTACTTGTGAAGATGAAAGATTTACTACAGAGGAGGCCCTCTATTTCTGAATTAACTGGAATCAGTGGATCACGTCTTGGTTGCTTTGCAACACGCCTTCGTGCCTATCTTGTGGAGTCAACTGTTGCTAACCACCATCCAGCAAGTTCTGCTGACTCAAATTCTTCAGCAGACACCACTAGAGAACTATCTCATTCATCTCATTTTGGACAATCATCTGCATCATCAAAATCTATGCGGTCAAGAAATTCTGGTAGTCCAGCAACTAAAGCAAATTCTGCACACCAGGGTAGTCTTAGCCCCAGGTTGAATTCCTTTAAAGAAGGCCTGCCTAAGACATTGCTTTCTCTGAGAGATGCCGCTAGGGAAAAATTCAGGAGGCGTGGAGAGAACTTGGCTTTAGACAACCATATTGTGGCATCATCGATTTCCACTGATGCATTTTGTGTTAACTCTGAAACACAAGTTGCTGATTCAAATTGCCCATTATCTCCATCAAACTTTTTGGAATCATTGGGAAAATTAGCTGCCCCAATTCCTGCAAGTTCATCTCTTCCCTGTGTGGTTTCACCTCTCTTTACTCCTTACTATTGCTGGTGTCCGGGTGCATCCTCAATTCTGCAGCGAAGGGAAGAACCTTCTCAACTTCCCATCCCATCCATCAGTGCATCTTCTCTTCCGCCATTTCCTTCACTGTTACCAGCTTCTACACCTTCAAACTTATCGGTCCCAATATCACCTTTAAATTTAGTTGATTCTCCGTCAGTGGATTTTCCTGCTCTATTTCCAGAGCCACTGGTCCGTTTGCCTCTGAAAACCTCTCAGCAGATCCCGACCTTCACTCCTTTGTTCTGCGATCCTATTGTCCATGTTCCTGTAATTGATGTTTGCTCTTCGGGTCCAGGCTACCTTGTTAGTGCTGGCCCTACCATTTCAACCTCCATTCCCCCACTGCATCCTAAACTCGTGAATCCAATGATACCTGCTACTGATGTGGAAAAGGATGCTAGAGAGACTCTGCGTCTGCTCATCAGCGGTTCAAGCCCGGGTAACTCTCAATTGATGAATGTACTCCCTGTTGTTCTAACAGATTCCGAAGCAAATCAAAGTTTATTTTTGACTGGAAGCCGTGGTCTGTACAGTAATGCTCGAGACATTGATGCTATTGCGAACAGCATTGCTTCTCTAGGCATTGTGTCACTTTCAGGGCAATCCACAAGTGAGCATGTAGGGAAGAGATTTAATGTTGACGGTTCGAGTTGCCATTCTGACGGCAGTATTGATCCAGAAAGCTCTTATTTGGATGGCGATGATGTTCTTTCCCCATCTCACTCCAAGGAAAGGAAGTCTGGTTGA
Protein sequence
MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLTDILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRLLAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRLSFIGDNGQVERLFTLSSRSSSAITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGTELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADTTRELSHSSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRDAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAPIPASSSLPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPASTPSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCSSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNVLPVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVDGSSCHSDGSIDPESSYLDGDDVLSPSHSKERKSG
Homology
BLAST of HG10018431 vs. NCBI nr
Match:
XP_038886408.1 (uncharacterized protein LOC120076604 isoform X1 [Benincasa hispida])
HSP 1 Score: 1258.8 bits (3256), Expect = 0.0e+00
Identity = 663/692 (95.81%), Postives = 675/692 (97.54%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKEESIASNVNDGADRDNVEEFGDSS VGGVSSNAVEVSGGSHASTREINLTERLT
Sbjct: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSRVGGVSSNAVEVSGGSHASTREINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL
Sbjct: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKI+KQGSLLCPTTTRGNLNLMVVPSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSSA-ITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNGQVERLFTLS+RSSSA ITIDEI SDNSGRSFVIKANDQNIYFWCSEKSKLLGT
Sbjct: 181 SFIGDNGQVERLFTLSNRSSSASITIDEIESDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
EL++KMKDLLQRRPSISELTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SSADT
Sbjct: 241 ELILKMKDLLQRRPSISELTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSADT 300
Query: 301 TRELSHSSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
TRE SHSSH GQSS SSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 301 TRESSHSSHCGQSSVSSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
Query: 361 AAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAPI 420
AAREKFRRRGENL LDNHIVASSISTDAFC+NSETQ ADS+CPLSPSNFLESLGKLAAPI
Sbjct: 361 AAREKFRRRGENLGLDNHIVASSISTDAFCLNSETQTADSSCPLSPSNFLESLGKLAAPI 420
Query: 421 PASSSLPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPASTPS 480
PASSSLPCVVSPLFTPYYCWCPGASSILQRREE +QLPIPSISASSLPPFPS+LPASTPS
Sbjct: 421 PASSSLPCVVSPLFTPYYCWCPGASSILQRREESNQLPIPSISASSLPPFPSMLPASTPS 480
Query: 481 NLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCSS 540
NLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCSS
Sbjct: 481 NLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCSS 540
Query: 541 GPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNVLP 600
GPGYLVSAGPTISTSIPPLHPKLVNPMIP TDVEKDARETLRLLISGSSPGNSQLMNVLP
Sbjct: 541 GPGYLVSAGPTISTSIPPLHPKLVNPMIPTTDVEKDARETLRLLISGSSPGNSQLMNVLP 600
Query: 601 VVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVDGS 660
VVLTDSEANQSLFLTGSRGLYSNARDID IANSIASLGIVSLSGQSTSEHVGKRFN+DG
Sbjct: 601 VVLTDSEANQSLFLTGSRGLYSNARDIDVIANSIASLGIVSLSGQSTSEHVGKRFNIDGL 660
Query: 661 SCHSDGSIDPESSYLDGDDVLSPSHSKERKSG 692
+ HSD S D ESSYLDGDD+LSPSHSKERKSG
Sbjct: 661 NGHSDDSCDSESSYLDGDDMLSPSHSKERKSG 692
BLAST of HG10018431 vs. NCBI nr
Match:
XP_008441435.1 (PREDICTED: uncharacterized protein LOC103485553 isoform X1 [Cucumis melo] >KAA0055072.1 uncharacterized protein E6C27_scaffold43052G002480 [Cucumis melo var. makuwa] >TYK24413.1 uncharacterized protein E5676_scaffold205G002030 [Cucumis melo var. makuwa])
HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 645/694 (92.94%), Postives = 661/694 (95.24%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKEESIA NVNDGADRDNVEEFGDSS VGG S N +EVSGGSHASTREINLTERLT
Sbjct: 1 MSNPRKEESIARNVNDGADRDNVEEFGDSSRVGGASPNVIEVSGGSHASTREINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DI+VDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTS GIAEDRL
Sbjct: 61 DIIVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSCGIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPT++RGNLNLMVVPSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTSSRGNLNLMVVPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSSA-ITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNGQV+RLFTLSSRSSSA ITI+EIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT
Sbjct: 181 SFIGDNGQVKRLFTLSSRSSSASITIEEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELLVKMKDLLQRRPSISELTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SSAD
Sbjct: 241 ELLVKMKDLLQRRPSISELTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSADN 300
Query: 301 TRELSH-SSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLR 360
TRE SH SSHFGQSSASSKSMRSR S SPA KANSAHQGSLSPRLNSFKEGLPKTLLSLR
Sbjct: 301 TREPSHSSSHFGQSSASSKSMRSRYSSSPAIKANSAHQGSLSPRLNSFKEGLPKTLLSLR 360
Query: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAP 420
DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQ ADSNCP SP++FLESLGKLA P
Sbjct: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQTADSNCPSSPTSFLESLGKLATP 420
Query: 421 IPASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPAST 480
I SSS PCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPS++ASSLPPFPSLLPAST
Sbjct: 421 ITGSSSHAPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSVTASSLPPFPSLLPAST 480
Query: 481 PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 540
PSNLSVPISPLNLVDSPSVDFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 481 PSNLSVPISPLNLVDSPSVDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 540
Query: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNV 600
SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLIS SS GNSQLMNV
Sbjct: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISSSSQGNSQLMNV 600
Query: 601 LPVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVD 660
LPVVLTDSEANQSLFLTGSRGLYS+ARDIDAIA+SIASLGIVSLSGQSTSEHVGKRFNVD
Sbjct: 601 LPVVLTDSEANQSLFLTGSRGLYSSARDIDAIASSIASLGIVSLSGQSTSEHVGKRFNVD 660
Query: 661 GSSCHSDGSIDPESSYLDGDDVLSPSHSKERKSG 692
G + HSD S + ESS DDVLSPSHS ERKSG
Sbjct: 661 GLNGHSDNSSESESSSC-LDDVLSPSHSDERKSG 693
BLAST of HG10018431 vs. NCBI nr
Match:
XP_004138433.1 (uncharacterized protein LOC101206438 isoform X1 [Cucumis sativus])
HSP 1 Score: 1197.2 bits (3096), Expect = 0.0e+00
Identity = 640/695 (92.09%), Postives = 655/695 (94.24%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKEESIA NVND ADRDNVEEF DSS VGG SSN VEVSGGSHASTREINLTERLT
Sbjct: 1 MSNPRKEESIARNVNDAADRDNVEEFADSSRVGGASSNVVEVSGGSHASTREINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DI+VDEGDGDLLLQ SDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL
Sbjct: 61 DIIVDEGDGDLLLQHSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKI+KQGSLLCPT++RGNLNLMVVPSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGSLLCPTSSRGNLNLMVVPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSSA-ITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNGQVERLFTLSSRSSSA +TI+EI SDNSGRSFVIKANDQNIYFWCSEKSKLLGT
Sbjct: 181 SFIGDNGQVERLFTLSSRSSSASVTIEEIGSDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELLVKMKDLLQRRPSISELTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SSAD
Sbjct: 241 ELLVKMKDLLQRRPSISELTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSADN 300
Query: 301 TRELSHS-SHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLR 360
RE SHS SHFGQ SASSKSMRSR S SPA KANS HQGSLSPRLNSFKEGLPKTLLSLR
Sbjct: 301 IREPSHSLSHFGQPSASSKSMRSRYSSSPAIKANSTHQGSLSPRLNSFKEGLPKTLLSLR 360
Query: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAP 420
DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQ DSNCP SP++FLESLGKLA P
Sbjct: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQTVDSNCPSSPTSFLESLGKLATP 420
Query: 421 IPASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPAST 480
IP SSS PCVVSPLFTPYYCWCP ASS+LQRREEPSQLPIPS++ASSLPPFPSLLPAST
Sbjct: 421 IPGSSSHAPCVVSPLFTPYYCWCPSASSLLQRREEPSQLPIPSVTASSLPPFPSLLPAST 480
Query: 481 PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 540
PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPL TSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 481 PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLNTSQQIPTFTPLFCDPIVHVPVIDVC 540
Query: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNV 600
SSGPGYLVSAGPTISTSIPPLHPKLVNPMIP TDVEKDARETLRLLIS SS GNSQLMNV
Sbjct: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPTTDVEKDARETLRLLISSSSQGNSQLMNV 600
Query: 601 LPVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVD 660
LPVVLTDSEANQSLFLTGSRGLYS+ARDIDAIA+SIASLGIVSLSGQSTSEHVGKRFNVD
Sbjct: 601 LPVVLTDSEANQSLFLTGSRGLYSSARDIDAIASSIASLGIVSLSGQSTSEHVGKRFNVD 660
Query: 661 GSSCHSDGSIDPESSYL-DGDDVLSPSHSKERKSG 692
G + HSD S D ESS DGDDVLSPSHS ERKSG
Sbjct: 661 GLNDHSDDSSDSESSSCSDGDDVLSPSHSNERKSG 695
BLAST of HG10018431 vs. NCBI nr
Match:
XP_022993058.1 (uncharacterized protein LOC111489188 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1147.9 bits (2968), Expect = 0.0e+00
Identity = 610/693 (88.02%), Postives = 640/693 (92.35%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKE+SIASN N A RDNVEEFG+SS VGGVSSN VEVSGG H STR+INLTERLT
Sbjct: 1 MSNPRKEDSIASNANGDAHRDNVEEFGESSRVGGVSSNVVEVSGGPHPSTRDINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTS+ IAEDRL
Sbjct: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSNDIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKI+KQG+LLCPTTTRGNLNLMV+PSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTTRGNLNLMVLPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSS-AITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNG VERLFTLS+RSSS AITIDEIASD+SGRSFVIKANDQN YFWCSEKSKLLGT
Sbjct: 181 SFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELL+KMKDLLQRRPSI+ LTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SS DT
Sbjct: 241 ELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVDT 300
Query: 301 TRELSHSSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
TRELSHSSHFGQ SSKSMRSRN GSPA KANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 301 TRELSHSSHFGQ---SSKSMRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
Query: 361 AAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAPI 420
+AREKFRRRG+NLALDNHI SSIS D VNSETQ D +CPLSPSNFL+SLGKLAAP
Sbjct: 361 SAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGKLAAPT 420
Query: 421 PASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPASTP 480
PA+SS PCVVSPLFTPYYCWCPG+SSILQRREEPSQLPIPS SASSLPPFPSL PAS P
Sbjct: 421 PANSSHAPCVVSPLFTPYYCWCPGSSSILQRREEPSQLPIPSFSASSLPPFPSLFPASAP 480
Query: 481 SNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS 540
SNLSVP+SPLNLVDSPS+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS
Sbjct: 481 SNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS 540
Query: 541 SGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNVL 600
SGPGYLVSAGPTI+TSIPPLHPKLVNPM+PATDVEKDARETLRLLISGSS GN QLMNVL
Sbjct: 541 SGPGYLVSAGPTITTSIPPLHPKLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMNVL 600
Query: 601 PVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVDG 660
PVVLTDSEAN+SLFLTGS GLYSN RDIDAIANSIASLGI SLSG+STSEHVGKRFN+DG
Sbjct: 601 PVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNLDG 660
Query: 661 SSCHSDGSIDPESSYLDGDDVLSPSHSKERKSG 692
+ H D S D E S +G+DV S SH +ERK G
Sbjct: 661 LNGHPDDSSDSECSCSEGEDVFSQSHFEERKFG 687
BLAST of HG10018431 vs. NCBI nr
Match:
XP_022939303.1 (uncharacterized protein LOC111445260 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1147.1 bits (2966), Expect = 0.0e+00
Identity = 610/693 (88.02%), Postives = 640/693 (92.35%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKE+SIASN N ADRDNVEEFG+SS VGGVSSN EVSGG HASTR+INLTERLT
Sbjct: 1 MSNPRKEDSIASNANGDADRDNVEEFGESSRVGGVSSNVGEVSGGPHASTRDINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTS+ IAEDRL
Sbjct: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSNDIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKI+KQG+LLCPTT RGNLNLMV+PSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTARGNLNLMVLPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSS-AITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNG VERLFTLS+RSSS AITIDEIASD+SGRSFVIKANDQN YFWCSEKSKLLGT
Sbjct: 181 SFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELL+KMKDLLQRRPSI+ LTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SS DT
Sbjct: 241 ELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVDT 300
Query: 301 TRELSHSSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
TRELSHSSHFGQ SSKS+RSRN GSPA KANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 301 TRELSHSSHFGQ---SSKSIRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
Query: 361 AAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAPI 420
AAREKFRRRG+NLALDNHI SSIS D VNSETQ D +CPLSPSNFL+SLGKLAAP
Sbjct: 361 AAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGKLAAPT 420
Query: 421 PASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPASTP 480
PA+SS PCVVSPLFTPYYCWCPG+SSILQRREEPSQLPIPS SASSLPPFPSL PAS P
Sbjct: 421 PANSSHAPCVVSPLFTPYYCWCPGSSSILQRREEPSQLPIPSFSASSLPPFPSLFPASAP 480
Query: 481 SNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS 540
SNLSVP+SPLNLVDSPS+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS
Sbjct: 481 SNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS 540
Query: 541 SGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNVL 600
SGPGYLVSAGPTI+TSIPPLHPKLVNPM+PATDVEKDARETLRLLISGSS GN QLMNVL
Sbjct: 541 SGPGYLVSAGPTITTSIPPLHPKLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMNVL 600
Query: 601 PVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVDG 660
PVVLTDSEAN+SLFLTGS GLYSN RDIDAIANSIASLGI SLSG+STSEHVGKRFN+DG
Sbjct: 601 PVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNLDG 660
Query: 661 SSCHSDGSIDPESSYLDGDDVLSPSHSKERKSG 692
+ H D S D ESS +G+DV S SH +E K G
Sbjct: 661 LNGHPDDSSDSESSCSEGEDVFSQSHFEESKFG 687
BLAST of HG10018431 vs. ExPASy TrEMBL
Match:
A0A5A7UGW8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold205G002030 PE=4 SV=1)
HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 645/694 (92.94%), Postives = 661/694 (95.24%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKEESIA NVNDGADRDNVEEFGDSS VGG S N +EVSGGSHASTREINLTERLT
Sbjct: 1 MSNPRKEESIARNVNDGADRDNVEEFGDSSRVGGASPNVIEVSGGSHASTREINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DI+VDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTS GIAEDRL
Sbjct: 61 DIIVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSCGIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPT++RGNLNLMVVPSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTSSRGNLNLMVVPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSSA-ITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNGQV+RLFTLSSRSSSA ITI+EIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT
Sbjct: 181 SFIGDNGQVKRLFTLSSRSSSASITIEEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELLVKMKDLLQRRPSISELTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SSAD
Sbjct: 241 ELLVKMKDLLQRRPSISELTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSADN 300
Query: 301 TRELSH-SSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLR 360
TRE SH SSHFGQSSASSKSMRSR S SPA KANSAHQGSLSPRLNSFKEGLPKTLLSLR
Sbjct: 301 TREPSHSSSHFGQSSASSKSMRSRYSSSPAIKANSAHQGSLSPRLNSFKEGLPKTLLSLR 360
Query: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAP 420
DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQ ADSNCP SP++FLESLGKLA P
Sbjct: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQTADSNCPSSPTSFLESLGKLATP 420
Query: 421 IPASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPAST 480
I SSS PCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPS++ASSLPPFPSLLPAST
Sbjct: 421 ITGSSSHAPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSVTASSLPPFPSLLPAST 480
Query: 481 PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 540
PSNLSVPISPLNLVDSPSVDFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 481 PSNLSVPISPLNLVDSPSVDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 540
Query: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNV 600
SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLIS SS GNSQLMNV
Sbjct: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISSSSQGNSQLMNV 600
Query: 601 LPVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVD 660
LPVVLTDSEANQSLFLTGSRGLYS+ARDIDAIA+SIASLGIVSLSGQSTSEHVGKRFNVD
Sbjct: 601 LPVVLTDSEANQSLFLTGSRGLYSSARDIDAIASSIASLGIVSLSGQSTSEHVGKRFNVD 660
Query: 661 GSSCHSDGSIDPESSYLDGDDVLSPSHSKERKSG 692
G + HSD S + ESS DDVLSPSHS ERKSG
Sbjct: 661 GLNGHSDNSSESESSSC-LDDVLSPSHSDERKSG 693
BLAST of HG10018431 vs. ExPASy TrEMBL
Match:
A0A1S3B2Z5 (uncharacterized protein LOC103485553 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485553 PE=4 SV=1)
HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 645/694 (92.94%), Postives = 661/694 (95.24%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKEESIA NVNDGADRDNVEEFGDSS VGG S N +EVSGGSHASTREINLTERLT
Sbjct: 1 MSNPRKEESIARNVNDGADRDNVEEFGDSSRVGGASPNVIEVSGGSHASTREINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DI+VDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTS GIAEDRL
Sbjct: 61 DIIVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSCGIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPT++RGNLNLMVVPSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTSSRGNLNLMVVPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSSA-ITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNGQV+RLFTLSSRSSSA ITI+EIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT
Sbjct: 181 SFIGDNGQVKRLFTLSSRSSSASITIEEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELLVKMKDLLQRRPSISELTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SSAD
Sbjct: 241 ELLVKMKDLLQRRPSISELTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSADN 300
Query: 301 TRELSH-SSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLR 360
TRE SH SSHFGQSSASSKSMRSR S SPA KANSAHQGSLSPRLNSFKEGLPKTLLSLR
Sbjct: 301 TREPSHSSSHFGQSSASSKSMRSRYSSSPAIKANSAHQGSLSPRLNSFKEGLPKTLLSLR 360
Query: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAP 420
DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQ ADSNCP SP++FLESLGKLA P
Sbjct: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQTADSNCPSSPTSFLESLGKLATP 420
Query: 421 IPASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPAST 480
I SSS PCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPS++ASSLPPFPSLLPAST
Sbjct: 421 ITGSSSHAPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSVTASSLPPFPSLLPAST 480
Query: 481 PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 540
PSNLSVPISPLNLVDSPSVDFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 481 PSNLSVPISPLNLVDSPSVDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 540
Query: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNV 600
SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLIS SS GNSQLMNV
Sbjct: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISSSSQGNSQLMNV 600
Query: 601 LPVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVD 660
LPVVLTDSEANQSLFLTGSRGLYS+ARDIDAIA+SIASLGIVSLSGQSTSEHVGKRFNVD
Sbjct: 601 LPVVLTDSEANQSLFLTGSRGLYSSARDIDAIASSIASLGIVSLSGQSTSEHVGKRFNVD 660
Query: 661 GSSCHSDGSIDPESSYLDGDDVLSPSHSKERKSG 692
G + HSD S + ESS DDVLSPSHS ERKSG
Sbjct: 661 GLNGHSDNSSESESSSC-LDDVLSPSHSDERKSG 693
BLAST of HG10018431 vs. ExPASy TrEMBL
Match:
A0A0A0KDA9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G009500 PE=4 SV=1)
HSP 1 Score: 1197.2 bits (3096), Expect = 0.0e+00
Identity = 640/695 (92.09%), Postives = 655/695 (94.24%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKEESIA NVND ADRDNVEEF DSS VGG SSN VEVSGGSHASTREINLTERLT
Sbjct: 1 MSNPRKEESIARNVNDAADRDNVEEFADSSRVGGASSNVVEVSGGSHASTREINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DI+VDEGDGDLLLQ SDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL
Sbjct: 61 DIIVDEGDGDLLLQHSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKI+KQGSLLCPT++RGNLNLMVVPSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGSLLCPTSSRGNLNLMVVPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSSA-ITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNGQVERLFTLSSRSSSA +TI+EI SDNSGRSFVIKANDQNIYFWCSEKSKLLGT
Sbjct: 181 SFIGDNGQVERLFTLSSRSSSASVTIEEIGSDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELLVKMKDLLQRRPSISELTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SSAD
Sbjct: 241 ELLVKMKDLLQRRPSISELTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSADN 300
Query: 301 TRELSHS-SHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLR 360
RE SHS SHFGQ SASSKSMRSR S SPA KANS HQGSLSPRLNSFKEGLPKTLLSLR
Sbjct: 301 IREPSHSLSHFGQPSASSKSMRSRYSSSPAIKANSTHQGSLSPRLNSFKEGLPKTLLSLR 360
Query: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAP 420
DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQ DSNCP SP++FLESLGKLA P
Sbjct: 361 DAAREKFRRRGENLALDNHIVASSISTDAFCVNSETQTVDSNCPSSPTSFLESLGKLATP 420
Query: 421 IPASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPAST 480
IP SSS PCVVSPLFTPYYCWCP ASS+LQRREEPSQLPIPS++ASSLPPFPSLLPAST
Sbjct: 421 IPGSSSHAPCVVSPLFTPYYCWCPSASSLLQRREEPSQLPIPSVTASSLPPFPSLLPAST 480
Query: 481 PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVC 540
PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPL TSQQIPTFTPLFCDPIVHVPVIDVC
Sbjct: 481 PSNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLNTSQQIPTFTPLFCDPIVHVPVIDVC 540
Query: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNV 600
SSGPGYLVSAGPTISTSIPPLHPKLVNPMIP TDVEKDARETLRLLIS SS GNSQLMNV
Sbjct: 541 SSGPGYLVSAGPTISTSIPPLHPKLVNPMIPTTDVEKDARETLRLLISSSSQGNSQLMNV 600
Query: 601 LPVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVD 660
LPVVLTDSEANQSLFLTGSRGLYS+ARDIDAIA+SIASLGIVSLSGQSTSEHVGKRFNVD
Sbjct: 601 LPVVLTDSEANQSLFLTGSRGLYSSARDIDAIASSIASLGIVSLSGQSTSEHVGKRFNVD 660
Query: 661 GSSCHSDGSIDPESSYL-DGDDVLSPSHSKERKSG 692
G + HSD S D ESS DGDDVLSPSHS ERKSG
Sbjct: 661 GLNDHSDDSSDSESSSCSDGDDVLSPSHSNERKSG 695
BLAST of HG10018431 vs. ExPASy TrEMBL
Match:
A0A6J1K125 (uncharacterized protein LOC111489188 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489188 PE=4 SV=1)
HSP 1 Score: 1147.9 bits (2968), Expect = 0.0e+00
Identity = 610/693 (88.02%), Postives = 640/693 (92.35%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKE+SIASN N A RDNVEEFG+SS VGGVSSN VEVSGG H STR+INLTERLT
Sbjct: 1 MSNPRKEDSIASNANGDAHRDNVEEFGESSRVGGVSSNVVEVSGGPHPSTRDINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTS+ IAEDRL
Sbjct: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSNDIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKI+KQG+LLCPTTTRGNLNLMV+PSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTTRGNLNLMVLPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSS-AITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNG VERLFTLS+RSSS AITIDEIASD+SGRSFVIKANDQN YFWCSEKSKLLGT
Sbjct: 181 SFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELL+KMKDLLQRRPSI+ LTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SS DT
Sbjct: 241 ELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVDT 300
Query: 301 TRELSHSSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
TRELSHSSHFGQ SSKSMRSRN GSPA KANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 301 TRELSHSSHFGQ---SSKSMRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
Query: 361 AAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAPI 420
+AREKFRRRG+NLALDNHI SSIS D VNSETQ D +CPLSPSNFL+SLGKLAAP
Sbjct: 361 SAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGKLAAPT 420
Query: 421 PASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPASTP 480
PA+SS PCVVSPLFTPYYCWCPG+SSILQRREEPSQLPIPS SASSLPPFPSL PAS P
Sbjct: 421 PANSSHAPCVVSPLFTPYYCWCPGSSSILQRREEPSQLPIPSFSASSLPPFPSLFPASAP 480
Query: 481 SNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS 540
SNLSVP+SPLNLVDSPS+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS
Sbjct: 481 SNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS 540
Query: 541 SGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNVL 600
SGPGYLVSAGPTI+TSIPPLHPKLVNPM+PATDVEKDARETLRLLISGSS GN QLMNVL
Sbjct: 541 SGPGYLVSAGPTITTSIPPLHPKLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMNVL 600
Query: 601 PVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVDG 660
PVVLTDSEAN+SLFLTGS GLYSN RDIDAIANSIASLGI SLSG+STSEHVGKRFN+DG
Sbjct: 601 PVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNLDG 660
Query: 661 SSCHSDGSIDPESSYLDGDDVLSPSHSKERKSG 692
+ H D S D E S +G+DV S SH +ERK G
Sbjct: 661 LNGHPDDSSDSECSCSEGEDVFSQSHFEERKFG 687
BLAST of HG10018431 vs. ExPASy TrEMBL
Match:
A0A6J1FLA2 (uncharacterized protein LOC111445260 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445260 PE=4 SV=1)
HSP 1 Score: 1147.1 bits (2966), Expect = 0.0e+00
Identity = 610/693 (88.02%), Postives = 640/693 (92.35%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSGGSHASTREINLTERLT 60
MSNPRKE+SIASN N ADRDNVEEFG+SS VGGVSSN EVSGG HASTR+INLTERLT
Sbjct: 1 MSNPRKEDSIASNANGDADRDNVEEFGESSRVGGVSSNVGEVSGGPHASTRDINLTERLT 60
Query: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAEDRL 120
DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTS+ IAEDRL
Sbjct: 61 DILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSNDIAEDRL 120
Query: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDFRL 180
LAQLSQHFEPVEVGILARCFCIPLVSIRVGKI+KQG+LLCPTT RGNLNLMV+PSSDFRL
Sbjct: 121 LAQLSQHFEPVEVGILARCFCIPLVSIRVGKIDKQGTLLCPTTARGNLNLMVLPSSDFRL 180
Query: 181 SFIGDNGQVERLFTLSSRSSS-AITIDEIASDNSGRSFVIKANDQNIYFWCSEKSKLLGT 240
SFIGDNG VERLFTLS+RSSS AITIDEIASD+SGRSFVIKANDQN YFWCSEKSKLLGT
Sbjct: 181 SFIGDNGHVERLFTLSNRSSSAAITIDEIASDSSGRSFVIKANDQNTYFWCSEKSKLLGT 240
Query: 241 ELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHHPASSADSNSSADT 300
ELL+KMKDLLQRRPSI+ LTGIS SRLGCFATRLRAYLVESTVANHHPASSADS+SS DT
Sbjct: 241 ELLLKMKDLLQRRPSIAGLTGISESRLGCFATRLRAYLVESTVANHHPASSADSHSSVDT 300
Query: 301 TRELSHSSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
TRELSHSSHFGQ SSKS+RSRN GSPA KANSAHQGSLSPRLNSFKEGLPKTLLSLRD
Sbjct: 301 TRELSHSSHFGQ---SSKSIRSRNYGSPAVKANSAHQGSLSPRLNSFKEGLPKTLLSLRD 360
Query: 361 AAREKFRRRGENLALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESLGKLAAPI 420
AAREKFRRRG+NLALDNHI SSIS D VNSETQ D +CPLSPSNFL+SLGKLAAP
Sbjct: 361 AAREKFRRRGDNLALDNHIATSSISND---VNSETQTGDLSCPLSPSNFLKSLGKLAAPT 420
Query: 421 PASSS-LPCVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLPPFPSLLPASTP 480
PA+SS PCVVSPLFTPYYCWCPG+SSILQRREEPSQLPIPS SASSLPPFPSL PAS P
Sbjct: 421 PANSSHAPCVVSPLFTPYYCWCPGSSSILQRREEPSQLPIPSFSASSLPPFPSLFPASAP 480
Query: 481 SNLSVPISPLNLVDSPSVDFPALFPEPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS 540
SNLSVP+SPLNLVDSPS+DFPALFP+PLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS
Sbjct: 481 SNLSVPVSPLNLVDSPSLDFPALFPDPLVRLPLKTSQQIPTFTPLFCDPIVHVPVIDVCS 540
Query: 541 SGPGYLVSAGPTISTSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNVL 600
SGPGYLVSAGPTI+TSIPPLHPKLVNPM+PATDVEKDARETLRLLISGSS GN QLMNVL
Sbjct: 541 SGPGYLVSAGPTITTSIPPLHPKLVNPMLPATDVEKDARETLRLLISGSSQGNPQLMNVL 600
Query: 601 PVVLTDSEANQSLFLTGSRGLYSNARDIDAIANSIASLGIVSLSGQSTSEHVGKRFNVDG 660
PVVLTDSEAN+SLFLTGS GLYSN RDIDAIANSIASLGI SLSG+STSEHVGKRFN+DG
Sbjct: 601 PVVLTDSEANRSLFLTGSHGLYSNTRDIDAIANSIASLGIASLSGKSTSEHVGKRFNLDG 660
Query: 661 SSCHSDGSIDPESSYLDGDDVLSPSHSKERKSG 692
+ H D S D ESS +G+DV S SH +E K G
Sbjct: 661 LNGHPDDSSDSESSCSEGEDVFSQSHFEESKFG 687
BLAST of HG10018431 vs. TAIR 10
Match:
AT2G39950.1 (unknown protein; Has 978 Blast hits to 254 proteins in 81 species: Archae - 0; Bacteria - 8; Metazoa - 109; Fungi - 53; Plants - 41; Viruses - 0; Other Eukaryotes - 767 (source: NCBI BLink). )
HSP 1 Score: 438.0 bits (1125), Expect = 1.5e-122
Identity = 306/650 (47.08%), Postives = 398/650 (61.23%), Query Frame = 0
Query: 1 MSNPRKEESIASNVNDGADRDNVEEFGDSSCVGGVSSNAVEVSG--GSHASTREINLTER 60
M++ RK ++ +D RD+ GD + +++ + +G G + TR + R
Sbjct: 1 MADSRKRDT----GDDHQHRDDQPNDGDDLSISSSTTDDSQFNGTEGENELTR---IESR 60
Query: 61 LTDILVDEGDGDLLLQQSDREDRVIRWLQALDMQVMGACRADERLKPLLKMTTSSGIAED 120
++D L D GD L+ EDRV+RWLQALDMQVMGACR DERLKPLLK+ S+G+AED
Sbjct: 61 VSDPLTDATGGDFLV----GEDRVLRWLQALDMQVMGACRGDERLKPLLKLNVSNGMAED 120
Query: 121 RLLAQLSQHFEPVEVGILARCFCIPLVSIRVGKIEKQGSLLCPTTTRGNLNLMVVPSSDF 180
RLLA LSQHFEP E+G+LARCFCIPLVS+RVGKI K+G L+ PT RGNL+LMV+P+SD
Sbjct: 121 RLLAHLSQHFEPAEIGMLARCFCIPLVSVRVGKIIKEGILMRPTPIRGNLSLMVLPTSDL 180
Query: 181 RLSFIGDNGQVERLFTLSSRSS-SAITIDEIASDNSGRSFVIK-ANDQNIYFWCSEKSKL 240
RLSFIGDNG E+LFT +S+S SA++I+EI D+SGRSFVI+ AN Y+WCSEKSKL
Sbjct: 181 RLSFIGDNGHSEQLFTYTSKSQCSAVSIEEITVDSSGRSFVIRIANGNAFYYWCSEKSKL 240
Query: 241 LGTELLVKMKDLLQRRPSISELTGISGSRLGCFATRLRAYLVESTVANHH--PASSADSN 300
LGTEL KMKDL++++PSISELTGI SRLG A+ LR YL+ S V N S DS+
Sbjct: 241 LGTELRRKMKDLIKKKPSISELTGIEESRLGSVASHLRLYLMGSVVPNIKGCQVPSPDSS 300
Query: 301 SSADTTRELSHSSHFGQSSASSKSMRSRNSGSPATKANSAHQGSLSPRLNSFKEGLPKTL 360
SS+ + E + SS SSASSKS+R+R+ G+ TK QGSLSPR +SFKE +
Sbjct: 301 SSSGFS-ETADSS----SSASSKSLRARHCGTQQTKT----QGSLSPRASSFKENTLRN- 360
Query: 361 LSLRDAAREKFRRRGEN--LALDNHIVASSISTDAFCVNSETQVADSNCPLSPSNFLESL 420
SLR ++R+K + E DN + S + + SE +V ++ + + +
Sbjct: 361 ASLRISSRDKSKGHSEGHFSIFDNSSITSIPTNVEGFIQSEGEVEEATENYNGIRQIIAF 420
Query: 421 GKLAAPIPASSSLP-----CVVSPLFTPYYCWCPGASSILQRREEPSQLPIPSISASSLP 480
+ A P++ + P + P+F+PYYCWCP +S L Q P SI SLP
Sbjct: 421 EE-AESTPSTMTGPPPFPLKMGPPVFSPYYCWCPPTTSSLHAPSASYQFPPLSIELPSLP 480
Query: 481 PFPSLLPASTPSNLSVPISPLNLVDSPSVDFPALFPEPLV-RLPL----KTSQQIPTFTP 540
P SLLPAS +P SPL+L D P P PLV +P+ +S Q P
Sbjct: 481 PLSSLLPASGSDGFLIPSSPLDLSDIP--------PLPLVHHIPIPGSSSSSSQQQMMIP 540
Query: 541 LFCDPIVHVPVIDVCSSGPGYLVSAGPT--ISTSIPPLHPKLVNPMIPATDVEKDARETL 600
+ CDPIVH+PVID+ SSG YLVSAGPT IST IPPL P+ + VEK ARETL
Sbjct: 541 IMCDPIVHIPVIDIFSSGQSYLVSAGPTGIISTGIPPL------PVENDSLVEKGARETL 598
Query: 601 RLLISGSSPGNSQLMNVLPVVLTDSEANQSLFLTGSRGLYSNARDIDAIA 631
RLLISG++ S +N GSRGLYS +RD+ ++
Sbjct: 601 RLLISGANATTSTPLN----------------HHGSRGLYSVSRDVSGVS 598
BLAST of HG10018431 vs. TAIR 10
Match:
AT2G39950.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 941 Blast hits to 229 proteins in 79 species: Archae - 0; Bacteria - 8; Metazoa - 89; Fungi - 54; Plants - 41; Viruses - 0; Other Eukaryotes - 749 (source: NCBI BLink). )
HSP 1 Score: 408.7 bits (1049), Expect = 9.4e-114
Identity = 276/558 (49.46%), Postives = 350/558 (62.72%), Query Frame = 0
Query: 91 MQVMGACRADERLKPLLKMTTSSGIAEDRLLAQLSQHFEPVEVGILARCFCIPLVSIRVG 150
MQVMGACR DERLKPLLK+ S+G+AEDRLLA LSQHFEP E+G+LARCFCIPLVS+RVG
Sbjct: 1 MQVMGACRGDERLKPLLKLNVSNGMAEDRLLAHLSQHFEPAEIGMLARCFCIPLVSVRVG 60
Query: 151 KIEKQGSLLCPTTTRGNLNLMVVPSSDFRLSFIGDNGQVERLFTLSSRSS-SAITIDEIA 210
KI K+G L+ PT RGNL+LMV+P+SD RLSFIGDNG E+LFT +S+S SA++I+EI
Sbjct: 61 KIIKEGILMRPTPIRGNLSLMVLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEIT 120
Query: 211 SDNSGRSFVIK-ANDQNIYFWCSEKSKLLGTELLVKMKDLLQRRPSISELTGISGSRLGC 270
D+SGRSFVI+ AN Y+WCSEKSKLLGTEL KMKDL++++PSISELTGI SRLG
Sbjct: 121 VDSSGRSFVIRIANGNAFYYWCSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGS 180
Query: 271 FATRLRAYLVESTVANHH--PASSADSNSSADTTRELSHSSHFGQSSASSKSMRSRNSGS 330
A+ LR YL+ S V N S DS+SS+ + E + SS SSASSKS+R+R+ G+
Sbjct: 181 VASHLRLYLMGSVVPNIKGCQVPSPDSSSSSGFS-ETADSS----SSASSKSLRARHCGT 240
Query: 331 PATKANSAHQGSLSPRLNSFKEGLPKTLLSLRDAAREKFRRRGEN--LALDNHIVASSIS 390
TK QGSLSPR +SFKE + SLR ++R+K + E DN + S +
Sbjct: 241 QQTKT----QGSLSPRASSFKENTLRN-ASLRISSRDKSKGHSEGHFSIFDNSSITSIPT 300
Query: 391 TDAFCVNSETQVADSNCPLSPSNFLESLGKLAAPIPASSSLP-----CVVSPLFTPYYCW 450
+ SE +V ++ + + + + A P++ + P + P+F+PYYCW
Sbjct: 301 NVEGFIQSEGEVEEATENYNGIRQIIAFEE-AESTPSTMTGPPPFPLKMGPPVFSPYYCW 360
Query: 451 CPGASSILQRREEPSQLPIPSISASSLPPFPSLLPASTPSNLSVPISPLNLVDSPSVDFP 510
CP +S L Q P SI SLPP SLLPAS +P SPL+L D P
Sbjct: 361 CPPTTSSLHAPSASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPLDLSDIP----- 420
Query: 511 ALFPEPLV-RLPL----KTSQQIPTFTPLFCDPIVHVPVIDVCSSGPGYLVSAGPT--IS 570
P PLV +P+ +S Q P+ CDPIVH+PVID+ SSG YLVSAGPT IS
Sbjct: 421 ---PLPLVHHIPIPGSSSSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSYLVSAGPTGIIS 480
Query: 571 TSIPPLHPKLVNPMIPATDVEKDARETLRLLISGSSPGNSQLMNVLPVVLTDSEANQSLF 630
T IPPL P+ + VEK ARETLRLLISG++ S +N
Sbjct: 481 TGIPPL------PVENDSLVEKGARETLRLLISGANATTSTPLN---------------- 517
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038886408.1 | 0.0e+00 | 95.81 | uncharacterized protein LOC120076604 isoform X1 [Benincasa hispida] | [more] |
XP_008441435.1 | 0.0e+00 | 92.94 | PREDICTED: uncharacterized protein LOC103485553 isoform X1 [Cucumis melo] >KAA00... | [more] |
XP_004138433.1 | 0.0e+00 | 92.09 | uncharacterized protein LOC101206438 isoform X1 [Cucumis sativus] | [more] |
XP_022993058.1 | 0.0e+00 | 88.02 | uncharacterized protein LOC111489188 isoform X1 [Cucurbita maxima] | [more] |
XP_022939303.1 | 0.0e+00 | 88.02 | uncharacterized protein LOC111445260 isoform X1 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7UGW8 | 0.0e+00 | 92.94 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3B2Z5 | 0.0e+00 | 92.94 | uncharacterized protein LOC103485553 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A0A0KDA9 | 0.0e+00 | 92.09 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G009500 PE=4 SV=1 | [more] |
A0A6J1K125 | 0.0e+00 | 88.02 | uncharacterized protein LOC111489188 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1FLA2 | 0.0e+00 | 88.02 | uncharacterized protein LOC111445260 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT2G39950.1 | 1.5e-122 | 47.08 | unknown protein; Has 978 Blast hits to 254 proteins in 81 species: Archae - 0; B... | [more] |
AT2G39950.2 | 9.4e-114 | 49.46 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |