Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGGCATGGTGGATCCAGGAGGAAGAGATCCTCATCCTTTGTGCGATATGTCGTCGTTCTATGTGCTGTCGGTGCTGCAATTGGATTTTTTATGCTCAATGTTCTTATGAGGCTGGAATCTCGAGAATCAGAATCGAGCTCTGATCAGTTTGGTAATGGCGACGACGTTGAGGAAACTTCGGCTCAGACTGGAATGGAGGGAAGGCAGAGCTCCTGCGCGACGGTGGAGCAGATGGGAGAACCCTTTAAAGATGGTATCTGGAAGGAAAGCCTGAGAGTAAGAACAATTATTCAAAATCACTTTTATTTGAATGGTAATATTTCCTTATATGCTTTTCTGATAATTGTTTGCACTCCCTTTGTTGATCGTCTTATAGTTATATGAATAATTAATAGTTACTGGAAATCAGTTCTTCTTTTGATTCTTAGGCCAGGTTAGTGAATTGATGATTTTATTGATGATCCATGATGCCAGGGTTTTTTTTTTTTTTTTTTTTTAGTCTCGCGTTCGATCTTTTGCCATTCTAATGGTCTTTTGATGCTCTATAGTGAGTGTATTAGGAGAACCAATTTAATTTAAAATACTGATGATTTGGGGGTATTGAGAAAAATGTCCAAGATAAAAAACTAAGGTATTTTGTCTAGCCTCACTAATTTTATTAAGAATGTCATCTCTTTTATGAAAATACCTTATGAATCTGCTTGATTTGTTCCATCTTCTTTTAACCCCATTAGCTTCCTCCATCGCCACTCTCAAGTCTCACCATTATATTAATCTTTCGTCACGAAGATGGTATCCTGCATGTTATGAAGGCTTGATTTCATATTTGTATTGACTGTGAAAATAGACGAAGTAGAATCAAAATGGAGGAATGTGGAAGAAAGAGGGAGTAAGCGGGGGAAAGTAGTTGAAAAAGGGTGGTGGATGAAACTAGAGAAGACCTTGGGCTTCCCAAATGTAATTTGTGGAAGGTCAAATTTCCGGAAGAGACTCAATAGTTTTTTCACCTGGCTAGAACTTTATAGGAAGATAGATGTCAATGGACTCGTGCAAAAGAAGGTTTTTATTTAACTTTTCCCTGTATCCCAAGCGCTGGGCTGTTTTTTGTTTCTTTTGCTTATTGATAGATCCAGCTTCTCACTAAATATTTAAAAAAAAAAAAAAAAATTAGCAGAAGGGCGGAATATCATTATCATTCTTTCTCTGGCAGTTCAAAGAGACCATTTCACTTGGTCGGTGGGGGGTAATAATCCCATAACGATAGTAAGATTATTACGTGGTTAAAAATAATCATGATTGCATTATGGTGCCATATGAGCAGCAGCATGGTCTCCTTGAGCTGCATTAAAATTGAAATTGAGGACTCAGTGGAGTAAATTGAAAATGAATTGCTGGTTGGTTTGGTTTAGTTCAATGCTTAGTAGTTAAATGCTGCCGCTAGTTTTCTATTTTAGTTTATCAGCTAGTAAGTTAGTAGTAATTAATAGTTAATTTACTGGATTCTTAAGTAGTTCACTGTGTTAGTAAAAGAGACTGGCAGCCTTTAAACAAGTGAAATGGTTTTGGAAACGCTACCTTTGAATTGAGTAAACATTTCACACGAGGACTTTTTGTGAAGCCAAAGCAGTTGTACCGAAAACCTCAAAGTGGAACTTCTTGAATCTTACAAGTTGCGGTGTACCCTCTGACGAAATAAATGATACACAAGGGAGGAAACCCATGTAATTCGTTCTTTAGAATTAGAATAGGTCATAGGAACATTAGTACTGGTTTTTCTTTGATGGCGTGGATACTTGCTTGAGTCTATCCTCTTCTTTATTCAAATACAATTTTAAAGATAATCTTCTTGATTTCCATCTCTTGACTTTTGGCATCTTGAACTTCATAATGATGTCATCTTCTATTTGTTTTCGCGGATAATTATGTGATGAACCATATTGGCTTTTCCTTCACTTTCAAACCTTTTGTCACAATGGTTTTTCTGGTGGAGAACTTGTTAGCTTTTTGAGGTCTATAATTGCTCACATAGCTTTCATTTTAGGTGCTTCAAGAGTGCGACAACTTCCTCCTGAGCAGTTTTGCAAACATGGTTTTGTTATGGGCAAATCCTCCGAGGCAGGCTTTGGGAATGAGATGTACAAGATTCTAACTGCTGGAGCTTTAAGTATAATGCTGAACAGATCCTTGATTATTGGGCAAACCAGGCATGTTCGTTGTATATCAAGCCTTTTTATTTCTGTTTAAATATTGGTTTTGAATACGATAATTGCATCGAGTTGTTAGAACTTAGAAGCATTTAGGGTCTATGATAGTTCCTATTTCTCAGTGACATTTTAAGTTATAGGCAAACTAGTAGTGCATCTTGTACAGTATATAAAAATTCGTTAACTTTCCTCAGTTAAAATGTTATTAAATGGATAGCATAGCTCCAAAGAAGCCATCAGAAAAATATCAGAAGAAATTATGGTATATCTATGAAATAGGTTTGTAGTCGTGTATTGATGGAATGCATATGAACTTGACCTATTCTCTTTATGTTTTTTTTTCTCCTTATTTTCCGATATTTTTTTTATCTATATTTCCAGTAGTTGTGTTATTACTTCTATTTTCAGGGACAAGTTTCCTTTCGGGGACTACATTTCTTATTCTGATATCTCGTTTACCTTGAAAGAAATCAAGCATTTATGGAGACTTAACGGTTGTGTTAAGAAATTCAATAGGCATTTGATTATGCGAATTGACGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGCAGTAATTGGAAGGAATGGGAGCATCCGATCATATGGTATCGTAAATTTTCAGTTTACATTAAACTGGTACTCAATCTGTACTCTGTTTATGCTGACTGAAGTGCATCAGGTTCCAAGGTACAACTGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGATATGAGGGCTGCTGCTTCTAATTTATTTGGATGGCCAGAGGTTTTAGAATCTAGACCTAATGTATTTGGAGAGCTGATGAGAGTTCTTATATCTCCTTCAAAGGATGTTGAAGAAGCAGTATTCTCCGCCCTTAAAAGTGGGGCTGATCCTGATATTTCCTTGCACATGCGGATGCTTATGAATAGGTGGTCTTACTAAACGACTGTTATTTCCTTCTCTCTCTACTGCCACTAGCTATGAATTCAATCTGCCATGTCTTCTTCCAATAAAATTCCATTAAAATTGACAGAAATCCTAGATTGATCACTTTTGGGGAAATTTTTTAGTGCTCCCATTGTACCTACTCTTTAATCTTTATGTTCACACTATGATTTTTTCTTTCATTTAAGAATTATTCAATGTTGCTAGACCCCACATCATTCTTAAGGAATGCTGACGGGGTAATTAATTAATGGGAGTTGTATTTGTATCCATACTACTCCGCAGTATAGGAAAGTTTCTCTCACTTTTGTAATCCTTTAACTAAGTAATGTGCATCCATATTTCAGATCTGTCAGAGGTTTACAGGCCGCAGTGCTGTGCATCAGAAGAGCCATGCTTAATCTAACCACAGTCTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAGATTTTGTAAAAAGTATCATGCCTAGCTTAGGCGAATTTGCAGAGGTAACTGCCTCATTGCGCAATTTCATTTCTTCAAATGAAGTCATTTCTGGACAACTGTTGTTAGCCCCACTTTCAGAATAAATGCCCAATGGAATAATTGATCGGTCATAGCCTCCCAGATCTTTTGTACGCTTGTAAAAAAAATATTGGTCTAATTTCATTAGTAATTTATTTTTCAGGTTATTCATTTTGATTATGAACAGTTCAGAGGAAACATCTCTGGAACACACGATGAATTCCATAAATTGGACTTCAGAGTGAAGGACTGGGGCCCGTCACCAAGATGGGTTGCCTTTGTGGATTTCTTTCTTGCATCCCGTGCCAAGCATGCTGTTATATCTGGTGCTCACCGGCGTGTAGGTACTACCTATGCTCAGCTAATTGCGGCATTGGCTGCAGCACACAATCTTGACAATCACGGTATTGTGAAAATCTCCCTCAATCATTTTACACGTTGGATAATGTAGGAAAAGAACAATTTTCTTTGAGGAACATCTATTCAACCGTTTCTTACATTGACATTTCAAATGTGCTCAGGGGACAACTCTACTGGTTCAAACTTTTCATTCTTGAGTAGCTTCCAAAGTAATTTGCTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGGCATATCTGGAACAGATTTGCAGGTCCTTTAAGCTGTCCGAGCCAGCCTAATCAGTGTGCCTTAACCCCTCTTCTCCCTCCATCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGACATTAAAAGAATGGAGAATTATGGAGTTCATTTATCGAGCTTGGGCATTGTTGATGAAGATAGTCTACGATCATTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATCCTATAG
mRNA sequence
ATGAGGCATGGTGGATCCAGGAGGAAGAGATCCTCATCCTTTGTGCGATATGTCGTCGTTCTATGTGCTGTCGGTGCTGCAATTGGATTTTTTATGCTCAATGTTCTTATGAGGCTGGAATCTCGAGAATCAGAATCGAGCTCTGATCAGTTTGGTAATGGCGACGACGTTGAGGAAACTTCGGCTCAGACTGGAATGGAGGGAAGGCAGAGCTCCTGCGCGACGGTGGAGCAGATGGGAGAACCCTTTAAAGATGGTATCTGGAAGGAAAGCCTGAGAGCCAGGGACAAGTTTCCTTTCGGGGACTACATTTCTTATTCTGATATCTCGTTTACCTTGAAAGAAATCAAGCATTTATGGAGACTTAACGGTTGTGTTAAGAAATTCAATAGGCATTTGATTATGCGAATTGACGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGCAGTAATTGGAAGGAATGGGAGCATCCGATCATATGGTTCCAAGGTACAACTGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGATATGAGGGCTGCTGCTTCTAATTTATTTGGATGGCCAGAGGTTTTAGAATCTAGACCTAATGTATTTGGAGAGCTGATGAGAGTTCTTATATCTCCTTCAAAGGATGTTGAAGAAGCAGTATTCTCCGCCCTTAAAAGTGGGGCTGATCCTGATATTTCCTTGCACATGCGGATGCTTATGAATAGATCTGTCAGAGGTTTACAGGCCGCAGTGCTGTGCATCAGAAGAGCCATGCTTAATCTAACCACAGTCTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAGATTTTGTAAAAAGTATCATGCCTAGCTTAGGCGAATTTGCAGAGGTTATTCATTTTGATTATGAACAGTTCAGAGGAAACATCTCTGGAACACACGATGAATTCCATAAATTGGACTTCAGAGTGAAGGACTGGGGCCCGTCACCAAGATGGGTTGCCTTTGTGGATTTCTTTCTTGCATCCCGTGCCAAGCATGCTGTTATATCTGGTGCTCACCGGCGTGTAGGTACTACCTATGCTCAGCTAATTGCGGCATTGGCTGCAGCACACAATCTTGACAATCACGGGGACAACTCTACTGGTTCAAACTTTTCATTCTTGAGTAGCTTCCAAAGTAATTTGCTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGGCATATCTGGAACAGATTTGCAGGTCCTTTAAGCTGTCCGAGCCAGCCTAATCAGTGTGCCTTAACCCCTCTTCTCCCTCCATCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGACATTAAAAGAATGGAGAATTATGGAGTTCATTTATCGAGCTTGGGCATTGTTGATGAAGATAGTCTACGATCATTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATCCTATAG
Coding sequence (CDS)
ATGAGGCATGGTGGATCCAGGAGGAAGAGATCCTCATCCTTTGTGCGATATGTCGTCGTTCTATGTGCTGTCGGTGCTGCAATTGGATTTTTTATGCTCAATGTTCTTATGAGGCTGGAATCTCGAGAATCAGAATCGAGCTCTGATCAGTTTGGTAATGGCGACGACGTTGAGGAAACTTCGGCTCAGACTGGAATGGAGGGAAGGCAGAGCTCCTGCGCGACGGTGGAGCAGATGGGAGAACCCTTTAAAGATGGTATCTGGAAGGAAAGCCTGAGAGCCAGGGACAAGTTTCCTTTCGGGGACTACATTTCTTATTCTGATATCTCGTTTACCTTGAAAGAAATCAAGCATTTATGGAGACTTAACGGTTGTGTTAAGAAATTCAATAGGCATTTGATTATGCGAATTGACGATTTTGAAAAGCCTGCACAGACAAATGTTCTATGCAGTAATTGGAAGGAATGGGAGCATCCGATCATATGGTTCCAAGGTACAACTGATGCTGTGGCTGCTCAATTTTTCTTGAAGAATGTACATCCCGATATGAGGGCTGCTGCTTCTAATTTATTTGGATGGCCAGAGGTTTTAGAATCTAGACCTAATGTATTTGGAGAGCTGATGAGAGTTCTTATATCTCCTTCAAAGGATGTTGAAGAAGCAGTATTCTCCGCCCTTAAAAGTGGGGCTGATCCTGATATTTCCTTGCACATGCGGATGCTTATGAATAGATCTGTCAGAGGTTTACAGGCCGCAGTGCTGTGCATCAGAAGAGCCATGCTTAATCTAACCACAGTCTCGAAACCCAGATTGGTTTTAGTATCAGATACCCCAGATTTTGTAAAAAGTATCATGCCTAGCTTAGGCGAATTTGCAGAGGTTATTCATTTTGATTATGAACAGTTCAGAGGAAACATCTCTGGAACACACGATGAATTCCATAAATTGGACTTCAGAGTGAAGGACTGGGGCCCGTCACCAAGATGGGTTGCCTTTGTGGATTTCTTTCTTGCATCCCGTGCCAAGCATGCTGTTATATCTGGTGCTCACCGGCGTGTAGGTACTACCTATGCTCAGCTAATTGCGGCATTGGCTGCAGCACACAATCTTGACAATCACGGGGACAACTCTACTGGTTCAAACTTTTCATTCTTGAGTAGCTTCCAAAGTAATTTGCTGAGAGAAGGTTTAAAGAACCAGGTTGGCTGGGGGCATATCTGGAACAGATTTGCAGGTCCTTTAAGCTGTCCGAGCCAGCCTAATCAGTGTGCCTTAACCCCTCTTCTCCCTCCATCATGGTGGGATGGACTTTGGCAATCTCCCATTCCACGAGACATTAAAAGAATGGAGAATTATGGAGTTCATTTATCGAGCTTGGGCATTGTTGATGAAGATAGTCTACGATCATTCTGTAATGCAAAGAAGAATGTTGTGAGGACTATCCCTTTCATCCTATAG
Protein sequence
MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEETSAQTGMEGRQSSCATVEQMGEPFKDGIWKESLRARDKFPFGDYISYSDISFTLKEIKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFFLKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDISLHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVIHFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGTTYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLSCPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKKNVVRTIPFIL
Homology
BLAST of HG10009797 vs. NCBI nr
Match:
XP_038906660.1 (uncharacterized protein LOC120092597 isoform X1 [Benincasa hispida])
HSP 1 Score: 909.8 bits (2350), Expect = 9.8e-261
Identity = 454/550 (82.55%), Postives = 470/550 (85.45%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGGSRRKRSSSFVRYVVVLCAVGAAIGF MLN+LMRLE+RESES+SDQFGNGDDVEET
Sbjct: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFLMLNILMRLEARESESTSDQFGNGDDVEET 60
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
AQ+GMEG +SSCATVEQMGE FKDG+WKESLR
Sbjct: 61 PAQSGMEGSRSSCATVEQMGESFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCK 120
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDI+FTLKE
Sbjct: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDITFTLKE 180
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRLNGCV+KFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 181 IKHLWRLNGCVRKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAV S LKSGADPDIS
Sbjct: 241 LKNVHPAMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVISVLKSGADPDIS 300
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRSVRGLQAAV CIR+AMLNLTTVSKPRLVLVSDTP+FVKSIM LGEFAEVI
Sbjct: 301 LHMRMLMNRSVRGLQAAVQCIRKAMLNLTTVSKPRLVLVSDTPNFVKSIMLILGEFAEVI 360
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT
Sbjct: 361 HFDYEHFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 480
TYAQLIAALAAAHNLDN G++STGS+FSFLSS+QSNLLREGLKNQVGWGHIWNRFAGPLS
Sbjct: 421 TYAQLIAALAAAHNLDNLGNSSTGSDFSFLSSYQSNLLREGLKNQVGWGHIWNRFAGPLS 480
Query: 481 CPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK 486
CPSQPNQCA TP+LPP+WWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK
Sbjct: 481 CPSQPNQCAFTPVLPPAWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK 540
BLAST of HG10009797 vs. NCBI nr
Match:
XP_022938779.1 (uncharacterized protein LOC111444894 isoform X1 [Cucurbita moschata])
HSP 1 Score: 885.6 bits (2287), Expect = 2.0e-253
Identity = 442/550 (80.36%), Postives = 459/550 (83.45%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGGS+RKRSSS VRYVVVLCAVGAAIGF MLNVL RLESR SE SSDQFGNGDDVEE+
Sbjct: 1 MRHGGSKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 60
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
A++G+EGR+ SCATVE+MGE F DG+WKESLR
Sbjct: 61 FARSGIEGRRGSCATVERMGEVFNDGVWKESLRVRTIIQNHFCLNGASRVRHLPPEQFCK 120
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDISFTLKE
Sbjct: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE 180
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GCV+KF RHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 181 IKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV S LKSGADPDIS
Sbjct: 241 LKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADPDIS 300
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRS+RGLQAAV CIR+AMLNLTTV KPRLVLVSDTPDFVKSIMP LGEFAEVI
Sbjct: 301 LHMRMLMNRSIRGLQAAVQCIRKAMLNLTTVPKPRLVLVSDTPDFVKSIMPILGEFAEVI 360
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNIS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GT
Sbjct: 361 HFDYEHFRGNISATHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRIGT 420
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 480
TYAQLIAALAAAHNLDN G+NSTGS+FSFLSSFQSNLL EGLKNQVGWGHIWNRFAGPLS
Sbjct: 421 TYAQLIAALAAAHNLDNPGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAGPLS 480
Query: 481 CPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK 486
CP QPNQCALTPLLPP+WWDGLWQSPIPRDIKRMENYGVHLSS GI+DEDSLRSFCNAKK
Sbjct: 481 CPGQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSSSGIIDEDSLRSFCNAKK 540
BLAST of HG10009797 vs. NCBI nr
Match:
XP_023549723.1 (uncharacterized protein LOC111808143 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 882.9 bits (2280), Expect = 1.3e-252
Identity = 442/550 (80.36%), Postives = 457/550 (83.09%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGGSRRKRSSS VRYVVVLCAVGAAIGF MLNVL RLESR SE SDQFGNGDDVEE+
Sbjct: 1 MRHGGSRRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELRSDQFGNGDDVEES 60
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
A++G+EGR+ SCATVE+MGE F DG+WKESLR
Sbjct: 61 FARSGIEGRRGSCATVEKMGEVFNDGVWKESLRVRTIIQNHFYLNGASRVRHLPPEQFCK 120
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDISFTLKE
Sbjct: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE 180
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GCV+KF RHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 181 IKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV S LKSGADPDIS
Sbjct: 241 LKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADPDIS 300
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRS+RGLQAAV CIR+AMLNLTT KPRLVLVSDTPDFVKSIMP LGEFAEVI
Sbjct: 301 LHMRMLMNRSIRGLQAAVQCIRKAMLNLTTAPKPRLVLVSDTPDFVKSIMPILGEFAEVI 360
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GT
Sbjct: 361 HFDYEHFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRIGT 420
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 480
TYAQLIAALAAAHNLDN G+NSTGS+FSFLSSFQSNLL EGLKNQVGWGHIWNRFAGPLS
Sbjct: 421 TYAQLIAALAAAHNLDNLGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAGPLS 480
Query: 481 CPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK 486
CP QPNQCALTPLLPP+WWDG WQSPIPRDIKRMENYGVHLSS GIVDEDSLRSFCNAKK
Sbjct: 481 CPGQPNQCALTPLLPPAWWDGPWQSPIPRDIKRMENYGVHLSSSGIVDEDSLRSFCNAKK 540
BLAST of HG10009797 vs. NCBI nr
Match:
XP_022992741.1 (uncharacterized protein LOC111488989 isoform X1 [Cucurbita maxima])
HSP 1 Score: 880.9 bits (2275), Expect = 4.9e-252
Identity = 441/550 (80.18%), Postives = 457/550 (83.09%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGG +RKRSSS VRYVVVLCAVGAAIGF MLNVL RLESR SE SSDQFGNGDDVEE+
Sbjct: 19 MRHGGLKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 78
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
A++G+EGR+ SCATVEQMGE F DG+WKESLR
Sbjct: 79 FARSGIEGRRGSCATVEQMGEVFNDGVWKESLRVRTIIQNHFYLNGASRVRHLPPEQFCK 138
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDISFTLKE
Sbjct: 139 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE 198
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GCV+KF RHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 199 IKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 258
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV S LKSGADPDIS
Sbjct: 259 LKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADPDIS 318
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRS+RGLQAAV CIR+A+LNLTTV KPRLVLVSDTPDFV SIMP LGEFAEVI
Sbjct: 319 LHMRMLMNRSIRGLQAAVQCIRKAILNLTTVPKPRLVLVSDTPDFVTSIMPILGEFAEVI 378
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNIS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GT
Sbjct: 379 HFDYEHFRGNISRTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRIGT 438
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 480
TYAQLIAALAAAHNLDN G+NSTGS+FSFLSSFQSNLL EGLKNQVGWGHIWNRFAGPLS
Sbjct: 439 TYAQLIAALAAAHNLDNFGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAGPLS 498
Query: 481 CPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK 486
CP QPNQCALTPLLPP+WWDGLWQSPIPRDIKRMENYGVHLSS GIVDEDSLRSFCNAKK
Sbjct: 499 CPGQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSSSGIVDEDSLRSFCNAKK 558
BLAST of HG10009797 vs. NCBI nr
Match:
XP_022938780.1 (uncharacterized protein LOC111444894 isoform X2 [Cucurbita moschata])
HSP 1 Score: 870.5 bits (2248), Expect = 6.6e-249
Identity = 434/542 (80.07%), Postives = 451/542 (83.21%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGGS+RKRSSS VRYVVVLCAVGAAIGF MLNVL RLESR SE SSDQFGNGDDVEE+
Sbjct: 1 MRHGGSKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 60
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
A++G+EGR+ SCATVE+MGE F DG+WKESLR
Sbjct: 61 FARSGIEGRRGSCATVERMGEVFNDGVWKESLRVRTIIQNHFCLNGASRVRHLPPEQFCK 120
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDISFTLKE
Sbjct: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE 180
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GCV+KF RHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 181 IKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV S LKSGADPDIS
Sbjct: 241 LKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADPDIS 300
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRS+RGLQAAV CIR+AMLNLTTV KPRLVLVSDTPDFVKSIMP LGEFAEVI
Sbjct: 301 LHMRMLMNRSIRGLQAAVQCIRKAMLNLTTVPKPRLVLVSDTPDFVKSIMPILGEFAEVI 360
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNIS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GT
Sbjct: 361 HFDYEHFRGNISATHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRIGT 420
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 478
TYAQLIAALAAAHNLDN G+NSTGS+FSFLSSFQSNLL EGLKNQVGWGHIWNRFAGPLS
Sbjct: 421 TYAQLIAALAAAHNLDNPGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAGPLS 480
BLAST of HG10009797 vs. ExPASy TrEMBL
Match:
A0A6J1FF37 (uncharacterized protein LOC111444894 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444894 PE=4 SV=1)
HSP 1 Score: 885.6 bits (2287), Expect = 9.6e-254
Identity = 442/550 (80.36%), Postives = 459/550 (83.45%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGGS+RKRSSS VRYVVVLCAVGAAIGF MLNVL RLESR SE SSDQFGNGDDVEE+
Sbjct: 1 MRHGGSKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 60
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
A++G+EGR+ SCATVE+MGE F DG+WKESLR
Sbjct: 61 FARSGIEGRRGSCATVERMGEVFNDGVWKESLRVRTIIQNHFCLNGASRVRHLPPEQFCK 120
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDISFTLKE
Sbjct: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE 180
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GCV+KF RHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 181 IKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV S LKSGADPDIS
Sbjct: 241 LKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADPDIS 300
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRS+RGLQAAV CIR+AMLNLTTV KPRLVLVSDTPDFVKSIMP LGEFAEVI
Sbjct: 301 LHMRMLMNRSIRGLQAAVQCIRKAMLNLTTVPKPRLVLVSDTPDFVKSIMPILGEFAEVI 360
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNIS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GT
Sbjct: 361 HFDYEHFRGNISATHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRIGT 420
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 480
TYAQLIAALAAAHNLDN G+NSTGS+FSFLSSFQSNLL EGLKNQVGWGHIWNRFAGPLS
Sbjct: 421 TYAQLIAALAAAHNLDNPGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAGPLS 480
Query: 481 CPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK 486
CP QPNQCALTPLLPP+WWDGLWQSPIPRDIKRMENYGVHLSS GI+DEDSLRSFCNAKK
Sbjct: 481 CPGQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSSSGIIDEDSLRSFCNAKK 540
BLAST of HG10009797 vs. ExPASy TrEMBL
Match:
A0A6J1JUE3 (uncharacterized protein LOC111488989 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488989 PE=4 SV=1)
HSP 1 Score: 880.9 bits (2275), Expect = 2.4e-252
Identity = 441/550 (80.18%), Postives = 457/550 (83.09%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGG +RKRSSS VRYVVVLCAVGAAIGF MLNVL RLESR SE SSDQFGNGDDVEE+
Sbjct: 19 MRHGGLKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 78
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
A++G+EGR+ SCATVEQMGE F DG+WKESLR
Sbjct: 79 FARSGIEGRRGSCATVEQMGEVFNDGVWKESLRVRTIIQNHFYLNGASRVRHLPPEQFCK 138
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDISFTLKE
Sbjct: 139 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE 198
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GCV+KF RHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 199 IKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 258
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV S LKSGADPDIS
Sbjct: 259 LKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADPDIS 318
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRS+RGLQAAV CIR+A+LNLTTV KPRLVLVSDTPDFV SIMP LGEFAEVI
Sbjct: 319 LHMRMLMNRSIRGLQAAVQCIRKAILNLTTVPKPRLVLVSDTPDFVTSIMPILGEFAEVI 378
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNIS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GT
Sbjct: 379 HFDYEHFRGNISRTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRIGT 438
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 480
TYAQLIAALAAAHNLDN G+NSTGS+FSFLSSFQSNLL EGLKNQVGWGHIWNRFAGPLS
Sbjct: 439 TYAQLIAALAAAHNLDNFGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAGPLS 498
Query: 481 CPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK 486
CP QPNQCALTPLLPP+WWDGLWQSPIPRDIKRMENYGVHLSS GIVDEDSLRSFCNAKK
Sbjct: 499 CPGQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSSSGIVDEDSLRSFCNAKK 558
BLAST of HG10009797 vs. ExPASy TrEMBL
Match:
A0A6J1FKR5 (uncharacterized protein LOC111444894 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444894 PE=4 SV=1)
HSP 1 Score: 870.5 bits (2248), Expect = 3.2e-249
Identity = 434/542 (80.07%), Postives = 451/542 (83.21%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGGS+RKRSSS VRYVVVLCAVGAAIGF MLNVL RLESR SE SSDQFGNGDDVEE+
Sbjct: 1 MRHGGSKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 60
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
A++G+EGR+ SCATVE+MGE F DG+WKESLR
Sbjct: 61 FARSGIEGRRGSCATVERMGEVFNDGVWKESLRVRTIIQNHFCLNGASRVRHLPPEQFCK 120
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDISFTLKE
Sbjct: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE 180
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GCV+KF RHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 181 IKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV S LKSGADPDIS
Sbjct: 241 LKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADPDIS 300
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRS+RGLQAAV CIR+AMLNLTTV KPRLVLVSDTPDFVKSIMP LGEFAEVI
Sbjct: 301 LHMRMLMNRSIRGLQAAVQCIRKAMLNLTTVPKPRLVLVSDTPDFVKSIMPILGEFAEVI 360
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNIS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GT
Sbjct: 361 HFDYEHFRGNISATHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRIGT 420
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 478
TYAQLIAALAAAHNLDN G+NSTGS+FSFLSSFQSNLL EGLKNQVGWGHIWNRFAGPLS
Sbjct: 421 TYAQLIAALAAAHNLDNPGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAGPLS 480
BLAST of HG10009797 vs. ExPASy TrEMBL
Match:
A0A6J1JQR8 (uncharacterized protein LOC111488989 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111488989 PE=4 SV=1)
HSP 1 Score: 865.9 bits (2236), Expect = 7.9e-248
Identity = 433/542 (79.89%), Postives = 449/542 (82.84%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGG +RKRSSS VRYVVVLCAVGAAIGF MLNVL RLESR SE SSDQFGNGDDVEE+
Sbjct: 19 MRHGGLKRKRSSSLVRYVVVLCAVGAAIGFLMLNVLFRLESRGSELSSDQFGNGDDVEES 78
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
A++G+EGR+ SCATVEQMGE F DG+WKESLR
Sbjct: 79 FARSGIEGRRGSCATVEQMGEVFNDGVWKESLRVRTIIQNHFYLNGASRVRHLPPEQFCK 138
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYSDISFTLKE
Sbjct: 139 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSDISFTLKE 198
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GCV+KF RHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 199 IKHLWRLKGCVRKFKRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 258
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMR+LISPSKDVEEAV S LKSGADPDIS
Sbjct: 259 LKNVHPAMRAAASNLFGQPEVLESRPNVFGELMRILISPSKDVEEAVLSVLKSGADPDIS 318
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRS+RGLQAAV CIR+A+LNLTTV KPRLVLVSDTPDFV SIMP LGEFAEVI
Sbjct: 319 LHMRMLMNRSIRGLQAAVQCIRKAILNLTTVPKPRLVLVSDTPDFVTSIMPILGEFAEVI 378
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRGNIS THDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRR+GT
Sbjct: 379 HFDYEHFRGNISRTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRIGT 438
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 478
TYAQLIAALAAAHNLDN G+NSTGS+FSFLSSFQSNLL EGLKNQVGWGHIWNRFAGPLS
Sbjct: 439 TYAQLIAALAAAHNLDNFGNNSTGSDFSFLSSFQSNLLTEGLKNQVGWGHIWNRFAGPLS 498
BLAST of HG10009797 vs. ExPASy TrEMBL
Match:
A0A6J1E7F2 (uncharacterized protein LOC111430593 OS=Cucurbita moschata OX=3662 GN=LOC111430593 PE=4 SV=1)
HSP 1 Score: 859.8 bits (2220), Expect = 5.6e-246
Identity = 429/550 (78.00%), Postives = 454/550 (82.55%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
MRHGGSR+KR SSF RYVVVLCAVGA+IGF MLN LMR+E++ESESSSDQ GNGDDVEE+
Sbjct: 1 MRHGGSRKKRWSSFARYVVVLCAVGASIGFLMLNFLMRMEAQESESSSDQLGNGDDVEES 60
Query: 61 SAQTGMEGRQSSCATVEQMGEPFKDGIWKESLR--------------------------- 120
+ M+GR+ SCATVEQMGE FKDG+WKESLR
Sbjct: 61 RVLSEMDGRR-SCATVEQMGEAFKDGVWKESLRVRTIIQNHFYLNGASRVRQLPPEQFCK 120
Query: 121 --------------------------------------ARDKFPFGDYISYSDISFTLKE 180
R KFPFGDYISYS+++FT+KE
Sbjct: 121 HGFVMGKSSEAGFGNEMYKILTAGALSIMLNRSLIIGQTRGKFPFGDYISYSNVTFTMKE 180
Query: 181 IKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
IKHLWRL GC++KFNRHLIMR DDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF
Sbjct: 181 IKHLWRLKGCIRKFNRHLIMRTDDFEKPAQTNVLCSNWKEWEHPIIWFQGTTDAVAAQFF 240
Query: 241 LKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFSALKSGADPDIS 300
LKNVHP MRAAASNLFG PEVLESRPNVFGELMRVLISPSKDVEEAVFS LKSG DPDIS
Sbjct: 241 LKNVHPAMRAAASNLFGHPEVLESRPNVFGELMRVLISPSKDVEEAVFSVLKSGDDPDIS 300
Query: 301 LHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSIMPSLGEFAEVI 360
LHMRMLMNRSVRGLQAA+ CIR+ + NLTT SKPRLVLVSDTP+FVKSI+P LGEFAEVI
Sbjct: 301 LHMRMLMNRSVRGLQAALQCIRKGIRNLTTDSKPRLVLVSDTPNFVKSIVPMLGEFAEVI 360
Query: 361 HFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
HFDYE FRG ISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT
Sbjct: 361 HFDYEHFRGIISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHAVISGAHRRVGT 420
Query: 421 TYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 480
TYAQLIAALAAAHNLDN G+NSTGS+F FLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS
Sbjct: 421 TYAQLIAALAAAHNLDNLGNNSTGSDFFFLSSFQSNLLREGLKNQVGWGHIWNRFAGPLS 480
Query: 481 CPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDEDSLRSFCNAKK 486
CPSQPNQCALTPLLPP+WWDGLWQSPIPRDIKRMENYGVHLS G +DEDSLRSFCNAKK
Sbjct: 481 CPSQPNQCALTPLLPPAWWDGLWQSPIPRDIKRMENYGVHLSGFGTIDEDSLRSFCNAKK 540
BLAST of HG10009797 vs. TAIR 10
Match:
AT3G26950.1 (unknown protein; Has 27 Blast hits to 27 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 27; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 552.7 bits (1423), Expect = 2.8e-157
Identity = 293/560 (52.32%), Postives = 360/560 (64.29%), Query Frame = 0
Query: 1 MRHGGSRRKRSSSFVRYVVVLCAVGAAIGFFMLNVLMRLESRESESSSDQFGNGDDVEET 60
M+ GG+RRKR ++L +V IGF +L L+ L S + SS F + DD E
Sbjct: 1 MKRGGTRRKR---LFGKTILLSSVVFFIGFGLL--LLTLRSVDPNSS---FIDDDDDESE 60
Query: 61 SAQTGMEGRQSS-----------CATVEQMGEPFKDGIWKESLRARD------------- 120
S + SS CATVE+MG F G +SLR RD
Sbjct: 61 SEEASRWSNSSSIGEAMVDGAKLCATVEEMGSEFDGGFVDQSLRVRDVIHRHFQINGASA 120
Query: 121 ----------------------------------------------------KFPFGDYI 180
K+PFGDYI
Sbjct: 121 IRELPPEQFCRHGYVLGKTAEAGFGNEMYKILTSAALSIMLNRSLIIGQTRGKYPFGDYI 180
Query: 181 SYSDISFTLKEIKHLWRLNGCVKKFNRHLIMRIDDFEKPAQTNVLCSNWKEWEHPIIWFQ 240
+YS+ +FT+ E+KHLWR NGCVKK+ R L+MR+DDFEKPA++NVLCSNWK+WE IIWFQ
Sbjct: 181 AYSNATFTMSEVKHLWRQNGCVKKYKRRLVMRLDDFEKPAKSNVLCSNWKKWEEAIIWFQ 240
Query: 241 GTTDAVAAQFFLKNVHPDMRAAASNLFGWPEVLESRPNVFGELMRVLISPSKDVEEAVFS 300
GTTDAVAAQFFLKNVHP+MRAAA LFG R NVFGELM LISP+KDV+EAV
Sbjct: 241 GTTDAVAAQFFLKNVHPEMRAAAFELFGEQGNSAPRGNVFGELMMSLISPTKDVKEAVDW 300
Query: 301 ALKSGADPDISLHMRMLMNRSVRGLQAAVLCIRRAMLNLTTVSKPRLVLVSDTPDFVKSI 360
L DPDIS+HMRMLM++SVR ++AA+ C+ +A +N + PR+V+VSDTP VK I
Sbjct: 301 VLHETGDPDISVHMRMLMSKSVRPMRAAINCLGKA-INRLGIPNPRVVIVSDTPSVVKII 360
Query: 361 MPSLGEFAEVIHFDYEQFRGNISGTHDEFHKLDFRVKDWGPSPRWVAFVDFFLASRAKHA 420
++ AEV+HFDY+ FRG+I+ LDFR+KDWGP+PRWVAFVDFFLA RAKHA
Sbjct: 361 KTNISTIAEVLHFDYKLFRGDIAQRGRGLPMLDFRIKDWGPAPRWVAFVDFFLACRAKHA 420
Query: 421 VISGAHRRVGTTYAQLIAALAAAHNLDNHGDNSTGSNFSFLSSFQSNLLREGLKNQVGWG 480
VISGA+RRVGTTYAQL+AALAAA++L D S+ S+F+FLSSFQSNLL +GLKNQVGWG
Sbjct: 421 VISGANRRVGTTYAQLVAALAAANSLK---DGSSNSSFAFLSSFQSNLLADGLKNQVGWG 480
Query: 481 HIWNRFAGPLSCPSQPNQCALTPLLPPSWWDGLWQSPIPRDIKRMENYGVHLSSLGIVDE 485
H+WNR+AGPLSCP QPNQCA TPL PP WWDG+WQSPIPRD +R+ +G+ LS G V+E
Sbjct: 481 HVWNRYAGPLSCPKQPNQCAFTPLAPPGWWDGIWQSPIPRDTRRLAAFGIELSGFGTVNE 540
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038906660.1 | 9.8e-261 | 82.55 | uncharacterized protein LOC120092597 isoform X1 [Benincasa hispida] | [more] |
XP_022938779.1 | 2.0e-253 | 80.36 | uncharacterized protein LOC111444894 isoform X1 [Cucurbita moschata] | [more] |
XP_023549723.1 | 1.3e-252 | 80.36 | uncharacterized protein LOC111808143 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022992741.1 | 4.9e-252 | 80.18 | uncharacterized protein LOC111488989 isoform X1 [Cucurbita maxima] | [more] |
XP_022938780.1 | 6.6e-249 | 80.07 | uncharacterized protein LOC111444894 isoform X2 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FF37 | 9.6e-254 | 80.36 | uncharacterized protein LOC111444894 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JUE3 | 2.4e-252 | 80.18 | uncharacterized protein LOC111488989 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1FKR5 | 3.2e-249 | 80.07 | uncharacterized protein LOC111444894 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JQR8 | 7.9e-248 | 79.89 | uncharacterized protein LOC111488989 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1E7F2 | 5.6e-246 | 78.00 | uncharacterized protein LOC111430593 OS=Cucurbita moschata OX=3662 GN=LOC1114305... | [more] |
Match Name | E-value | Identity | Description | |
AT3G26950.1 | 2.8e-157 | 52.32 | unknown protein; Has 27 Blast hits to 27 proteins in 8 species: Archae - 0; Bact... | [more] |