Spg038883 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg038883
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Late embryogenesis abundant protein, group 2
Location: scaffold12: 5494029 .. 5497586 (-)
RNA-Seq Expression: Spg038883
Synteny: Spg038883
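The Location value in the overview packs the scaffold name, the start and end coordinates, and the strand into one string. As a minimal sketch, it can be split into its parts as shown here; the regular expression and the function name `parse_location` are illustrative, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Hypothetical pattern for strings like "scaffold12: 5494029 .. 5497586 (-)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (seqid, start, end, strand) parsed from a Location string."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("scaffold12: 5494029 .. 5497586 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

The minus strand means the gene is read from the reverse complement of the scaffold sequence over this interval.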
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CATCTCTCCGCCACTCTCTCCGCCGCTGATGGAGGCCGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCCTCTCCCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCGTCGGCCTGCCACTCGCCTCTCCGCTCCGACACCTTCCCCAACGCCCACCACCACCACCGCAACCCCACTCAGGAAGCCTCTCGCTTCACTCTCTCCCACTACTCCTCCTCCCGTGGCTCGAACCACGGGACCGGGACCGACAATGGCGAGGCTCGCCTGATTGTCGGTCGGGGCGATGGCCGTGACCGGGACGAGGAGCGGGAGGAAGACGGGGGCGGAGACGGAGACGAGGACGGATATTATGGGAGGAAGAGGAGAGGTTGTTGGAAGACTTATTGTACGTATAGGCATTCGGATTCTAATGCATGGATTTGCTTGCAATTGAGTTGGAGGGCAATTTTCAGCATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAAGCCTCCCTCACCAATAATTTCTGTTAAGGTAATTTTTCAATTTTTTTTTCTCCAAATTTTATGTTATTTTTTTTTTAATTAATATTGGTACAAATATTGAAATATATTTATAAGGATATATATTTTCTTGTATTTGGGTTGTATGAATGAGTATATAATTTGACCATGTTTAACCATACTTTGATATATACTATTGGAAATTTATATGTTACATTTGTTTGGTTTGGTTTCATATAGATTTGTTTTGTTGATCTTCCATGAGAATAAGCTTAGTGAAATGTTTAGCATTAAAATATTTATCATTAAATTTATGAAATGTAAGACATTAAATTGTTTCAAAGAAAAGTAGGACTACAATGTTAGATGTATATTGTTATATTTTTTCCCAAAATAAATGGATGTCTTAACCTTTTCTTTTCTTTAACAAATTTATTGCTACCATTCAAAAATAGGATGAATTGGTGCACTTTTCAATTATACCATTTTCATTGTTGTGTGTATATGCATATGTATATTGAGTAACAACAATTGAGGGTGGAGAATTGAACCATTGTTCTTTTAAAATGGTAATTGGTGTATTGTCTACTGAACTATTGCATGGATTGACATTTTTGTTGGGTAAATATTAAAATTGACTAATGATAAAGATTGCATATATCATGCAATGTAAGCTTATAATCAACTAATTTTAACATTCATCGACAATGAGGAGATAATTGAAAAATGTTAAAACACAAGGAACTAAAAATGGAAAGATTAAAATAATGAAGGCTAAATTTAAAAAAATTTAAATATTAGAGACTAAAACATATATCAGACCTTTTCCTCGACACAACATATTGTGTAAATGTTGGAAATACACCTGCTAAAAACCTTTTTTTTTTTTTTAATAAAAAGAATCATAAAAGATTATTATTATTATTATTTTTATTTCTTCTTCCTTTTCATAAGACTATATAATGGTGTATTCAATCTTGCCCTATAAACTTTCAAACTTTTAATTTTAACTTTTTTACCACATCAAAATGACTAACATGAGAACAGTAGTTAAGACATTTACTCATTTCTTAAAAGTCAAATGTTCAAATCTCCACTTTTCACGATGGTTCGGAAAAAAAATTTCCTAATGGAATTATATAATTCAACCATCAAATCAATAATTCATGTAATTCACAAAGACGAAACCATTTCTTCATGTATAGTTTTAGGGCAAAAGTAAAATCTTATAGGTCATACACATGTCTCCGATCGTCTCACTTCGTGCTTTCATAAATAGTTATACCAAATAGTCGAATCACGACACAATTTTCAAACCCTCCAAAAATTGAAGGACACATCTAAATTTTTTTTTAGAAGTCTATAATTTAGTCAAATTTAAAAATGATGCAGGTGGGAGAAGTAGAAGAATTTATGCTTGGGGAAGGAGTGGATAAAACAGGGGTTGGAACTAAGATCTTAACATGCAA
TTGCACAATGAATCTAATTGTGGACAATCACTCTAAGCTCTTTGGCCTTCACATTCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCCATTGCTACTTCACAAGTAAGCCCTCTATCATTTTCTTTCTTAAATTCCTTTACTATATTGTGTTTTTTAGTTTTTTCCCGGTTCAATTACAAGGTTAGTCTCTGAACTTTTAGAGTTCTGTCAATTTAGTCCCTAAACCTTAAAATGGTGTCAAATAGATCTTCAAACTTTCAATTTTGTGTCTAATGGGTCCTTGAACTTTAAAAAGTATCCAGGGTCCTTAAACTTTTACTTTTGTATCAAATAGATCCCTCAACTTTTAATTTTGTGTCTATAAGTCATTGACCTATTTGACATTTTTTAAAATTCACAAGTCTACTAGATACAAAATTGGCAAAGTACTGTAGGGTCCCAATAGGCATAAAATTCAATTTTATATCTAATAGATTGGTTAATTTTTAAAAATTTCGAATATTCGAGGGATCTATTAGACACAAAATTGAAAGTTTAAGGGCTAAATTGACATAACATTGAAAGTTCAGGAACTAAACTTGTAATTTAATATATTTTTTTTCAGTTATCTTTATTAATTAAGATTTTAAGTATTTTATACTTTTTTTTTTTTGTTAAGGGTATGAGGACCAAAGTTCTCTGTTGTGGTTAGATTTTTTTTTTTTTTTTTTTTTAAAAAGATTGTGAATTTATAAGAAAGAACAATATTAATGCCCCAATAAATAGCAAGATTATTTTTCAATTATCTTGAATATAATGCTGAGCATAGCTCATCGGTAGTTGGCATGTCCTCTTTTCTTTAAGGATGGAAAACCTCCATTTCTCACACAGTTTTTTTAAGAAACTTTTTGATTAGAATTCAAGAGTAAACCCAAAGTATGAACTAAGATTAAAATAGACTATATCATACCCATATCAAGATGCCCCAATAATCTGTTGCATAAGAATGAATTTTTTATTTTTTTTTAAATCAACTAAACTTTATTAGTTGAGTTGTTATTACTTATTAAGTTCGATGTAAAATTTTTTTTTGACAGTTTTAGTAACTAGTAAGTCAGCAAGCAATTTAAACTTTTTATTTTTATTTTTATTTGACAATATGTTGGAGGTTGGGGGATTCAAAAACCACAAACCTCGTGATCATTAGTAACAAAAATTCAATTCTATATTTAATGTGGGTTTTTGAAATAGGGTCCAAGAATGTATGCTGAGAGCGGAACGACCACGTTTCAATTAAGTGTAGGCATTAGCAATAAGCCAATGTATGGTGCAGGGAGGGACATGGAAGACATGCTTGAATCAGGAACAGGATTGGAGCTTAGAATTCAACTCAATTTCATTTCCAACTATAGGGTAGTTTGGAAAATCATAAGGCCCCGCTTTCATCGCCGTGTCGAATGCACATTGGTCCTCGGAAAAGCGTACGATAGGAAGCGTCACACCCGATCATTCAATAGCACCTGCCTAACTTCTTGATCATTGTCAACCAACAAATTTTGCTTCTAAATGATT

mRNA sequence

ATGGAGGCCGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCCTCTCCCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCGTCGGCCTGCCACTCGCCTCTCCGCTCCGACACCTTCCCCAACGCCCACCACCACCACCGCAACCCCACTCAGGAAGCCTCTCGCTTCACTCTCTCCCACTACTCCTCCTCCCGTGGCTCGAACCACGGGACCGGGACCGACAATGGCGAGGCTCGCCTGATTGTCGGTCGGGGCGATGGCCGTGACCGGGACGAGGAGCGGGAGGAAGACGGGGGCGGAGACGGAGACGAGGACGGATATTATGGGAGGAAGAGGAGAGGTTGTTGGAAGACTTATTGTACGTATAGGCATTCGGATTCTAATGCATGGATTTGCTTGCAATTGAGTTGGAGGGCAATTTTCAGCATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAAGCCTCCCTCACCAATAATTTCTGTTAAGGTGGGAGAAGTAGAAGAATTTATGCTTGGGGAAGGAGTGGATAAAACAGGGGTTGGAACTAAGATCTTAACATGCAATTGCACAATGAATCTAATTGTGGACAATCACTCTAAGCTCTTTGGCCTTCACATTCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCCATTGCTACTTCACAAGGTCCAAGAATGTATGCTGAGAGCGGAACGACCACGTTTCAATTAAGTGTAGGCATTAGCAATAAGCCAATGTATGGTGCAGGGAGGGACATGGAAGACATGCTTGAATCAGGAACAGGATTGGAGCTTAGAATTCAACTCAATTTCATTTCCAACTATAGGGTAGTTTGGAAAATCATAAGGCCCCGCTTTCATCGCCGTGTCGAATGCACATTGGTCCTCGGAAAAGCGTACGATAGGAAGCGTCACACCCGATCATTCAATAGCACCTGCCTAACTTCTTGA

Coding sequence (CDS)

ATGGAGGCCGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCCTCTCCCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCGTCGGCCTGCCACTCGCCTCTCCGCTCCGACACCTTCCCCAACGCCCACCACCACCACCGCAACCCCACTCAGGAAGCCTCTCGCTTCACTCTCTCCCACTACTCCTCCTCCCGTGGCTCGAACCACGGGACCGGGACCGACAATGGCGAGGCTCGCCTGATTGTCGGTCGGGGCGATGGCCGTGACCGGGACGAGGAGCGGGAGGAAGACGGGGGCGGAGACGGAGACGAGGACGGATATTATGGGAGGAAGAGGAGAGGTTGTTGGAAGACTTATTGTACGTATAGGCATTCGGATTCTAATGCATGGATTTGCTTGCAATTGAGTTGGAGGGCAATTTTCAGCATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAAGCCTCCCTCACCAATAATTTCTGTTAAGGTGGGAGAAGTAGAAGAATTTATGCTTGGGGAAGGAGTGGATAAAACAGGGGTTGGAACTAAGATCTTAACATGCAATTGCACAATGAATCTAATTGTGGACAATCACTCTAAGCTCTTTGGCCTTCACATTCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCCATTGCTACTTCACAAGGTCCAAGAATGTATGCTGAGAGCGGAACGACCACGTTTCAATTAAGTGTAGGCATTAGCAATAAGCCAATGTATGGTGCAGGGAGGGACATGGAAGACATGCTTGAATCAGGAACAGGATTGGAGCTTAGAATTCAACTCAATTTCATTTCCAACTATAGGGTAGTTTGGAAAATCATAAGGCCCCGCTTTCATCGCCGTGTCGAATGCACATTGGTCCTCGGAAAAGCGTACGATAGGAAGCGTCACACCCGATCATTCAATAGCACCTGCCTAACTTCTTGA

Protein sequence

MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNAHHHHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDEDGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKVGEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKIIRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS
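The protein sequence corresponds to the coding sequence read three nucleotides at a time. As a small illustration, the sketch below translates only the first 18 nucleotides of the CDS; the codon table is deliberately partial (just the standard-genetic-code entries needed here) and the function name `translate` is hypothetical.

```python
# Partial codon table: only the codons that occur in the 18-nt prefix below.
CODON_TABLE = {"ATG": "M", "GAG": "E", "GCC": "A", "GAA": "E"}


def translate(cds):
    """Translate a coding sequence codon by codon using CODON_TABLE."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    return "".join(CODON_TABLE[codon] for codon in codons)


cds_prefix = "ATGGAGGCCGCCGAAGAG"  # first 18 nt of the CDS listed above
protein_prefix = translate(cds_prefix)
print(protein_prefix)  # MEAAEE, the first six residues of the protein
```

Translating the full CDS would need the complete 64-codon table, for example from an established bioinformatics library.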
Homology
BLAST of Spg038883 vs. NCBI nr
Match: XP_038905771.1 (uncharacterized protein LOC120091726 [Benincasa hispida])

HSP 1 Score: 598.6 bits (1542), Expect = 3.3e-167
Identity = 299/337 (88.72%), Positives = 312/337 (92.58%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNA---H 60
           MEAAEEQEAVLFHSYPC+YYVQSPSTLSHANSSDIRNPAESSACHSPL SDTFPN    H
Sbjct: 15  MEAAEEQEAVLFHSYPCSYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGRHHH 74

Query: 61  HHHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGD 120
           HHHRNPTQEASRFTLSHYSSSRGSNHG GTDNGE RLIVGRG+GRD +EE+E D   DGD
Sbjct: 75  HHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGETRLIVGRGNGRDCNEEQEND--EDGD 134

Query: 121 EDGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISV 180
           E+GYYG+K+RGCWK Y TYR+SDSNAWICLQLSWRAIFSMGIALLVFYIVT PPSPIISV
Sbjct: 135 EEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISV 194

Query: 181 KVGEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIA 240
           KVGE+EEFMLGEGVDKTGVGTKILTCNCTM++IVDNHSKLFGLHILPPSLHMSFGPLPIA
Sbjct: 195 KVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIA 254

Query: 241 TSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVW 300
           TSQGPR+YAESGTTTF LSVG SNKPMYGAGRDMED LESG GLEL I+LNFISNYRVVW
Sbjct: 255 TSQGPRLYAESGTTTFHLSVGTSNKPMYGAGRDMEDKLESGMGLELTIRLNFISNYRVVW 314

Query: 301 KIIRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
           K IRP FHR VEC LVLGKAYDRKRHTRSFNSTCL S
Sbjct: 315 KFIRPHFHRHVECLLVLGKAYDRKRHTRSFNSTCLPS 349

BLAST of Spg038883 vs. NCBI nr
Match: XP_008461795.2 (PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo])

HSP 1 Score: 575.5 bits (1482), Expect = 3.0e-160
Identity = 285/336 (84.82%), Positives = 305/336 (90.77%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNA--HH 60
           ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPL SDTFPNA  HH
Sbjct: 12  MEGSEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAEVSTCHSPLPSDTFPNAHHHH 71

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDE 120
           HHRNPTQEASRFTLSHYSSSRGSNHG GTDNGEARLIVGRGDGRD +EE E+   G+G+E
Sbjct: 72  HHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGDGRDCEEEEED---GEGNE 131

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           +GYYG+++RGCWK Y TYR SDSNAWICLQLSWRAIFSMGIALLVFY+VT PPSPIISVK
Sbjct: 132 EGYYGKRKRGCWKRYFTYRSSDSNAWICLQLSWRAIFSMGIALLVFYVVTNPPSPIISVK 191

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           VGE++EFMLGEGVDKTGVGTKILTCNCTM++IVDNHSKLFGLHILPPSLHMSFGPLPIAT
Sbjct: 192 VGEIQEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 251

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESG T F LSVG SNK MYGAGR+MED L+SG GLEL I+LNFISNYRVVWK
Sbjct: 252 SQGPRLYAESGRTRFGLSVGTSNKAMYGAGREMEDKLDSGMGLELTIRLNFISNYRVVWK 311

Query: 301 IIRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
            I P FHR V+C L+LGKAYDRKRHT SFNSTC TS
Sbjct: 312 FISPHFHRHVQCLLLLGKAYDRKRHTPSFNSTCFTS 344

BLAST of Spg038883 vs. NCBI nr
Match: XP_004149613.1 (uncharacterized protein LOC101209149 [Cucumis sativus] >KGN58592.1 hypothetical protein Csa_002328 [Cucumis sativus])

HSP 1 Score: 568.5 bits (1464), Expect = 3.7e-158
Identity = 283/335 (84.48%), Positives = 302/335 (90.15%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNA-HHH 60
           ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPL SDTFPNA HHH
Sbjct: 16  METAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAEVSTCHSPLPSDTFPNAHHHH 75

Query: 61  HRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDED 120
           HRNPTQEASRFTLSHYSSSRGSNHG GTDNGEARLIVGRG+G D +EE EE   G+G+E+
Sbjct: 76  HRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGGDCEEEEEE---GEGNEE 135

Query: 121 GYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKV 180
           GYYG+++RGCWK Y TYR+SDSNAWICLQLSWRAIFSMGIALLVFYIVT PPSPII+VKV
Sbjct: 136 GYYGKRKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIITVKV 195

Query: 181 GEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATS 240
           GE+EEFMLGEGVDKTGVGTKILTCNCTM++IVDNHSKLFGLHILPPSLHMSFGPLPIA S
Sbjct: 196 GEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIAAS 255

Query: 241 QGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKI 300
           QGPR+YAESG T F+LSVG SNK MYGAGRDMED L+SG GLEL I+LNFISNYRVVWK 
Sbjct: 256 QGPRLYAESGRTRFRLSVGTSNKAMYGAGRDMEDKLDSGIGLELTIRLNFISNYRVVWKF 315

Query: 301 IRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
           I P FHR V+C L+L K YDR  HTRSFNSTC TS
Sbjct: 316 ISPHFHRHVQCLLLLRKPYDRNPHTRSFNSTCFTS 347

BLAST of Spg038883 vs. NCBI nr
Match: XP_022152674.1 (uncharacterized protein LOC111020336 [Momordica charantia])

HSP 1 Score: 565.5 bits (1456), Expect = 3.1e-157
Identity = 288/335 (85.97%), Positives = 304/335 (90.75%), Query Frame = 0

Query: 1   MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNAHHH 60
           MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPLRSDTFP  HHH
Sbjct: 1   MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHH 60

Query: 61  HRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDED 120
           H N TQEASR TLS YSSSR SNHG GTDNGEARLIVGRG+GR+ DEEREEDG   GDE+
Sbjct: 61  HHNATQEASRVTLSRYSSSRESNHGAGTDNGEARLIVGRGNGREGDEEREEDGA--GDEE 120

Query: 121 GYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKV 180
           GYYG+KRRGCWKTY TYR+SDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK+
Sbjct: 121 GYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTMPPSPNISVKM 180

Query: 181 GEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATS 240
           G VEEFMLGEGVDKTGVGTKILTCN TM++ VDN+SKLFGLHILPPSLH+SFGPLPIATS
Sbjct: 181 GGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATS 240

Query: 241 QGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKI 300
           QG R+YAESGTTTFQLSVG SN+ MYGAGR MEDMLESG GLEL I+LNFISNYRVVWKI
Sbjct: 241 QGARLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI 300

Query: 301 IRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
           IRP F  RVEC+LVLGK YDRKRHTRSFNSTCLTS
Sbjct: 301 IRPHFRHRVECSLVLGKGYDRKRHTRSFNSTCLTS 333

BLAST of Spg038883 vs. NCBI nr
Match: XP_022934269.1 (uncharacterized protein LOC111441481 [Cucurbita moschata])

HSP 1 Score: 558.5 bits (1438), Expect = 3.8e-155
Identity = 277/336 (82.44%), Positives = 303/336 (90.18%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNA--HH 60
           M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPL SDTFPN   HH
Sbjct: 2   MDAAEDQEPVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLPSDTFPNGRRHH 61

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDE 120
           HHRNPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG GDG +  +E+ E+     +E
Sbjct: 62  HHRNPTQEASRFTLSHYSSSCGSNHGGGTDNGEARLMVGGGDGAEEKQEKAEE-----EE 121

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           + YYGRKRRGCWKTY TYR+SDSNAWICLQLSWRA+FSMG+ALLVFYIVT PP P+ISVK
Sbjct: 122 EWYYGRKRRGCWKTYFTYRNSDSNAWICLQLSWRAVFSMGMALLVFYIVTNPPRPVISVK 181

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           V EV+EFMLGEGVDKTGVGTKILTCNCTM++IVDN+SKLF LHILPPSLHMSFGPLPIAT
Sbjct: 182 VREVDEFMLGEGVDKTGVGTKILTCNCTMDVIVDNYSKLFALHILPPSLHMSFGPLPIAT 241

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESGTTTF+L+VGIS KPMYGAGR++ED LESG GLEL I+LNFISNYRVVWK
Sbjct: 242 SQGPRLYAESGTTTFRLNVGISKKPMYGAGREIEDKLESGAGLELTIRLNFISNYRVVWK 301

Query: 301 IIRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
           II+PRFHRRV+C LV+   YDRKRHTR FNSTCLTS
Sbjct: 302 IIKPRFHRRVDCLLVVQNTYDRKRHTRIFNSTCLTS 332

BLAST of Spg038883 vs. ExPASy TrEMBL
Match: A0A1S3CFE9 (uncharacterized protein LOC103500312 OS=Cucumis melo OX=3656 GN=LOC103500312 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 1.5e-160
Identity = 285/336 (84.82%), Positives = 305/336 (90.77%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNA--HH 60
           ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPL SDTFPNA  HH
Sbjct: 12  MEGSEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAEVSTCHSPLPSDTFPNAHHHH 71

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDE 120
           HHRNPTQEASRFTLSHYSSSRGSNHG GTDNGEARLIVGRGDGRD +EE E+   G+G+E
Sbjct: 72  HHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGDGRDCEEEEED---GEGNE 131

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           +GYYG+++RGCWK Y TYR SDSNAWICLQLSWRAIFSMGIALLVFY+VT PPSPIISVK
Sbjct: 132 EGYYGKRKRGCWKRYFTYRSSDSNAWICLQLSWRAIFSMGIALLVFYVVTNPPSPIISVK 191

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           VGE++EFMLGEGVDKTGVGTKILTCNCTM++IVDNHSKLFGLHILPPSLHMSFGPLPIAT
Sbjct: 192 VGEIQEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 251

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESG T F LSVG SNK MYGAGR+MED L+SG GLEL I+LNFISNYRVVWK
Sbjct: 252 SQGPRLYAESGRTRFGLSVGTSNKAMYGAGREMEDKLDSGMGLELTIRLNFISNYRVVWK 311

Query: 301 IIRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
            I P FHR V+C L+LGKAYDRKRHT SFNSTC TS
Sbjct: 312 FISPHFHRHVQCLLLLGKAYDRKRHTPSFNSTCFTS 344

BLAST of Spg038883 vs. ExPASy TrEMBL
Match: A0A0A0LD21 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G696870 PE=4 SV=1)

HSP 1 Score: 568.5 bits (1464), Expect = 1.8e-158
Identity = 283/335 (84.48%), Positives = 302/335 (90.15%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNA-HHH 60
           ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPL SDTFPNA HHH
Sbjct: 16  METAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAEVSTCHSPLPSDTFPNAHHHH 75

Query: 61  HRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDED 120
           HRNPTQEASRFTLSHYSSSRGSNHG GTDNGEARLIVGRG+G D +EE EE   G+G+E+
Sbjct: 76  HRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGGDCEEEEEE---GEGNEE 135

Query: 121 GYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKV 180
           GYYG+++RGCWK Y TYR+SDSNAWICLQLSWRAIFSMGIALLVFYIVT PPSPII+VKV
Sbjct: 136 GYYGKRKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIITVKV 195

Query: 181 GEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATS 240
           GE+EEFMLGEGVDKTGVGTKILTCNCTM++IVDNHSKLFGLHILPPSLHMSFGPLPIA S
Sbjct: 196 GEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIAAS 255

Query: 241 QGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKI 300
           QGPR+YAESG T F+LSVG SNK MYGAGRDMED L+SG GLEL I+LNFISNYRVVWK 
Sbjct: 256 QGPRLYAESGRTRFRLSVGTSNKAMYGAGRDMEDKLDSGIGLELTIRLNFISNYRVVWKF 315

Query: 301 IRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
           I P FHR V+C L+L K YDR  HTRSFNSTC TS
Sbjct: 316 ISPHFHRHVQCLLLLRKPYDRNPHTRSFNSTCFTS 347

BLAST of Spg038883 vs. ExPASy TrEMBL
Match: A0A6J1DGR2 (uncharacterized protein LOC111020336 OS=Momordica charantia OX=3673 GN=LOC111020336 PE=4 SV=1)

HSP 1 Score: 565.5 bits (1456), Expect = 1.5e-157
Identity = 288/335 (85.97%), Positives = 304/335 (90.75%), Query Frame = 0

Query: 1   MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNAHHH 60
           MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPLRSDTFP  HHH
Sbjct: 1   MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHH 60

Query: 61  HRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDED 120
           H N TQEASR TLS YSSSR SNHG GTDNGEARLIVGRG+GR+ DEEREEDG   GDE+
Sbjct: 61  HHNATQEASRVTLSRYSSSRESNHGAGTDNGEARLIVGRGNGREGDEEREEDGA--GDEE 120

Query: 121 GYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKV 180
           GYYG+KRRGCWKTY TYR+SDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK+
Sbjct: 121 GYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTMPPSPNISVKM 180

Query: 181 GEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATS 240
           G VEEFMLGEGVDKTGVGTKILTCN TM++ VDN+SKLFGLHILPPSLH+SFGPLPIATS
Sbjct: 181 GGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIATS 240

Query: 241 QGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKI 300
           QG R+YAESGTTTFQLSVG SN+ MYGAGR MEDMLESG GLEL I+LNFISNYRVVWKI
Sbjct: 241 QGARLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVWKI 300

Query: 301 IRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
           IRP F  RVEC+LVLGK YDRKRHTRSFNSTCLTS
Sbjct: 301 IRPHFRHRVECSLVLGKGYDRKRHTRSFNSTCLTS 333

BLAST of Spg038883 vs. ExPASy TrEMBL
Match: A0A6J1F239 (uncharacterized protein LOC111441481 OS=Cucurbita moschata OX=3662 GN=LOC111441481 PE=4 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 1.9e-155
Identity = 277/336 (82.44%), Positives = 303/336 (90.18%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNA--HH 60
           M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPL SDTFPN   HH
Sbjct: 2   MDAAEDQEPVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLPSDTFPNGRRHH 61

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDE 120
           HHRNPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG GDG +  +E+ E+     +E
Sbjct: 62  HHRNPTQEASRFTLSHYSSSCGSNHGGGTDNGEARLMVGGGDGAEEKQEKAEE-----EE 121

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           + YYGRKRRGCWKTY TYR+SDSNAWICLQLSWRA+FSMG+ALLVFYIVT PP P+ISVK
Sbjct: 122 EWYYGRKRRGCWKTYFTYRNSDSNAWICLQLSWRAVFSMGMALLVFYIVTNPPRPVISVK 181

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           V EV+EFMLGEGVDKTGVGTKILTCNCTM++IVDN+SKLF LHILPPSLHMSFGPLPIAT
Sbjct: 182 VREVDEFMLGEGVDKTGVGTKILTCNCTMDVIVDNYSKLFALHILPPSLHMSFGPLPIAT 241

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESGTTTF+L+VGIS KPMYGAGR++ED LESG GLEL I+LNFISNYRVVWK
Sbjct: 242 SQGPRLYAESGTTTFRLNVGISKKPMYGAGREIEDKLESGAGLELTIRLNFISNYRVVWK 301

Query: 301 IIRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
           II+PRFHRRV+C LV+   YDRKRHTR FNSTCLTS
Sbjct: 302 IIKPRFHRRVDCLLVVQNTYDRKRHTRIFNSTCLTS 332

BLAST of Spg038883 vs. ExPASy TrEMBL
Match: A0A6J1J6W9 (uncharacterized protein LOC111481909 OS=Cucurbita maxima OX=3661 GN=LOC111481909 PE=4 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 5.6e-152
Identity = 272/336 (80.95%), Positives = 300/336 (89.29%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNAH--H 60
           M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPL SDTFPN    H
Sbjct: 2   MDAAEDQEPVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGRRPH 61

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDE 120
           HHRN TQEASRFTLSHYSSS GSNHG GTDNGEARL+VG GDG +   E+ E+     +E
Sbjct: 62  HHRNQTQEASRFTLSHYSSSCGSNHGGGTDNGEARLMVGGGDGAEEKREKAEE-----EE 121

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           + YYG+KRRGCWKTY TYR+SD+NAWICLQLSWRA+FSMG+ALLVFYIVT PP PIISV+
Sbjct: 122 EWYYGKKRRGCWKTYFTYRNSDANAWICLQLSWRAVFSMGMALLVFYIVTNPPPPIISVQ 181

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           V EV+EFMLGEGVDKTGVGTKILTCNCTM++IVDN+SKLF LHILPPSLHMSFGPLPIAT
Sbjct: 182 VREVDEFMLGEGVDKTGVGTKILTCNCTMDVIVDNYSKLFALHILPPSLHMSFGPLPIAT 241

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESGTTTF+L+VG S KPMYGAGR++ED LESG GLEL I+LNFISNYRVVWK
Sbjct: 242 SQGPRLYAESGTTTFRLNVGTSKKPMYGAGREIEDKLESGAGLELTIRLNFISNYRVVWK 301

Query: 301 IIRPRFHRRVECTLVLGKAYDRKRHTRSFNSTCLTS 335
           II+P+FHR V+C LV+  AYDRKRHTR FNSTCLTS
Sbjct: 302 IIKPQFHRHVDCLLVVQNAYDRKRHTRIFNSTCLTS 332

BLAST of Spg038883 vs. TAIR 10
Match: AT3G08490.1 (BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein, group 2 (TAIR:AT3G24600.1); Has 161 Blast hits to 158 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 207.6 bits (527), Expect = 1.5e-53
Identity = 101/187 (54.01%), Positives = 134/187 (71.66%), Query Frame = 0

Query: 139 SDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKVGEVEEFMLGEGVDKTGVGT 198
           S+S+ WI LQ+ WR +FS+G+ALLVFYI T+PP P IS ++G   +FML EGVD  GV T
Sbjct: 75  SNSSWWIVLQVGWRFLFSLGVALLVFYIATQPPHPNISFRIGRFNQFMLEEGVDSHGVST 134

Query: 199 KILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAES-GTTTFQLSV 258
           K LT NC+  LI+DN S +FGLHI PPS+   FGPL  A +QGP++Y  S  +TTFQL +
Sbjct: 135 KFLTFNCSTKLIIDNKSNVFGLHIHPPSIKFFFGPLNFAKAQGPKLYGLSHESTTFQLYI 194

Query: 259 GISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKIIRPRFHRRVECTLVLGKA 318
             +N+ MYGAG +M DML S  GL L ++ + IS+YRVVW II P++H +VEC L+L   
Sbjct: 195 ATTNRAMYGAGTEMNDMLLSRAGLPLILRTSIISDYRVVWNIINPKYHHKVECLLLLA-- 254

Query: 319 YDRKRHT 325
            D++RH+
Sbjct: 255 -DKERHS 258

BLAST of Spg038883 vs. TAIR 10
Match: AT3G24600.1 (Late embryogenesis abundant protein, group 2 )

HSP 1 Score: 78.6 bits (192), Expect = 1.1e-14
Identity = 42/160 (26.25%), Positives = 83/160 (51.88%), Query Frame = 0

Query: 163 VFYIVTKPPSPIISVKVGEVEEFMLGEGVDKTGVGTKILTCNCTMNLIVDNHSKLFGLHI 222
           V +  + P SPI+SVK  ++  F  GEG+D+TGV TKIL+ N ++ + +D+ +  FG+H+
Sbjct: 334 VLWGASHPFSPIVSVKSVDIHSFYYGEGIDRTGVATKILSFNSSVKVTIDSPAPYFGIHV 393

Query: 223 LPPSLHMSFGPLPIATSQGPRMYAESGTTTFQL-SVGISNKPMYGAGRDMEDMLESGTGL 282
              +  ++F  L +AT Q    Y    +    +  +  +  P+YGAG  +    + G  +
Sbjct: 394 SSSTFKLTFSALTLATGQLKSYYQPRKSKHISIVKLTGAEVPLYGAGPHLAASDKKGK-V 453

Query: 283 ELRIQLNFISNYRVVWKIIRPRFHRRVECTLVLGKAYDRK 322
            ++++    S   ++ K+++ +    V C+  +  +   K
Sbjct: 454 PVKLEFEIRSRGNLLGKLVKSKHENHVSCSFFISSSKTSK 492

BLAST of Spg038883 vs. TAIR 10
Match: AT5G42860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 60.8 bits (146), Expect = 2.3e-09
Identity = 74/319 (23.20%), Positives = 128/319 (40.13%), Query Frame = 0

Query: 18  AYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNAHHHHRNPTQEASRFTLSHYSS 77
           AY+VQSPS  SH       +   +    SP+ S   P++H         +SRF  S  + 
Sbjct: 25  AYFVQSPSRDSHDGEKTATSFHSTPVLTSPMGSP--PHSH-------SSSSRF--SKING 84

Query: 78  SRGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDEDGYYGRKRRGCWKTYCTYR 137
           S+   H      GE +  +   +G   D +RE++              RR          
Sbjct: 85  SKRKGHA-----GEKQFAMIEEEGLLDDGDREQE-----------ALPRR---------- 144

Query: 138 HSDSNAWICLQLSWRAIFSMGIAL--LVFYIVTKPPSPIISVKVGEVEEFMLGEGVDKTG 197
                   C  L++   FS+  A   L+ Y   KP  P ISVK    E+  +  G D  G
Sbjct: 145 --------CYVLAFIVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGG 204

Query: 198 VGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMY-AESGTTTFQ 257
           +GT ++T N T+ ++  N    FG+H+    + +SF  + I +    + Y +     T  
Sbjct: 205 IGTDMITMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVV 264

Query: 258 LSVGISNKPMYGAGRDM----------EDMLESG-----------TGLELRIQLNFISNY 313
           ++V     P+YG+G  +          +   + G             + +R+     S  
Sbjct: 265 VNVLGDKIPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRA 298

BLAST of Spg038883 vs. TAIR 10
Match: AT1G45688.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast hits to 242 proteins in 39 species: Archae - 0; Bacteria - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses - 17; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 59.3 bits (142), Expect = 6.8e-09
Identity = 81/324 (25.00%), Positives = 134/324 (41.36%), Query Frame = 0

Query: 19  YYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNAHHHHRNPTQEASRFTLSHYSSS 78
           YYVQSPS  SH +         S+   SP+ S    ++     +    +SRF+ S    S
Sbjct: 26  YYVQSPSRDSH-DGEKTATSFHSTPVLSPMGSPPHSHSSMGRHSRESSSSRFSGSLKPGS 85

Query: 79  RGSNHGTGTDNGEARLIVGRGDGRDRDEER----EEDG-GGDGDEDGYYGRKRRGCWKTY 138
           R  N   G+          +G G ++  +     EE+G   DGD DG  G  RR      
Sbjct: 86  RKVNPNDGSKR--------KGHGGEKQWKECAVIEEEGLLDDGDRDG--GVPRR------ 145

Query: 139 CTYRHSDSNAWICLQLSWRAIFSM--GIALLVFYIVTKPPSPIISVKVGEVEEFMLGEGV 198
                       C  L++   F +  G   L+ Y   KP  P I+VK    E   +  G 
Sbjct: 146 ------------CYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQ 205

Query: 199 DKTGVGTKILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTT 258
           D  GVGT ++T N T+ ++  N    FG+H+    + +SF  + I +    + Y    + 
Sbjct: 206 DAGGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSE 265

Query: 259 TFQLSVGISNK-PMYGAGRDM----------EDMLESGTGLEL----------RIQLNFI 313
              L   I  K P+YG+G  +          +   + G  + +           + L+F+
Sbjct: 266 RTVLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFV 320

BLAST of Spg038883 vs. TAIR 10
Match: AT2G41990.1 (CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864); BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 51.6 bits (122), Expect = 1.4e-06
Identity = 69/297 (23.23%), Positives = 121/297 (40.74%), Query Frame = 0

Query: 19  YYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNAHHHHRNPTQEASRFTLSHYSSS 78
           YYVQSPS      + D+   +  S C S + S T P  H++H +P   +   + S +S  
Sbjct: 28  YYVQSPS------NHDVEKMSFGSGC-SLMGSPTHP--HYYHCSPIHHSRESSTSRFSDR 87

Query: 79  RGSNHGTGTDNGEARLIVGRGDGRDRDEEREEDGGGDGDEDGYYGRKRRGCWKTYCTYRH 138
              ++       E R  +  GD +        DGG D D                  +R+
Sbjct: 88  ALLSY---KSIRERRRYINDGDDK-------TDGGDDDD-----------------PFRN 147

Query: 139 SDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKVGEVEEFMLGEGVDKTGVGT 198
                W+ L +    IF   +  L+ +  +K   P ++VK   V +  L  G D +GV T
Sbjct: 148 VRLYVWLLLSV----IFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPT 207

Query: 199 KILTCNCTMNLIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRM-YAESGTTTFQLSV 258
            +L+ N T+ +   N S  F +H+    L + +  L +++ +  +     +G T     V
Sbjct: 208 DMLSLNSTVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVV 267

Query: 259 GISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKIIRPRFHRRVECTLVL 315
                P+YG      D L     L L + +   S   ++ +++  +F+ R+ C+  L
Sbjct: 268 QGHQIPLYGGVSFHLDTL----SLPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTL 280

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name        E-value     Identity (%)   Description
XP_038905771.1    3.3e-167    88.72          uncharacterized protein LOC120091726 [Benincasa hispida]
XP_008461795.2    3.0e-160    84.82          PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo]
XP_004149613.1    3.7e-158    84.48          uncharacterized protein LOC101209149 [Cucumis sativus] >KGN58592.1 hypothetical ...
XP_022152674.1    3.1e-157    85.97          uncharacterized protein LOC111020336 [Momordica charantia]
XP_022934269.1    3.8e-155    82.44          uncharacterized protein LOC111441481 [Cucurbita moschata]

vs. ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A1S3CFE9        1.5e-160    84.82          uncharacterized protein LOC103500312 OS=Cucumis melo OX=3656 GN=LOC103500312 PE=...
A0A0A0LD21        1.8e-158    84.48          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G696870 PE=4 SV=1
A0A6J1DGR2        1.5e-157    85.97          uncharacterized protein LOC111020336 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1F239        1.9e-155    82.44          uncharacterized protein LOC111441481 OS=Cucurbita moschata OX=3662 GN=LOC1114414...
A0A6J1J6W9        5.6e-152    80.95          uncharacterized protein LOC111481909 OS=Cucurbita maxima OX=3661 GN=LOC111481909...

vs. TAIR 10
Match Name        E-value     Identity (%)   Description
AT3G08490.1       1.5e-53     54.01          BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein,...
AT3G24600.1       1.1e-14     26.25          Late embryogenesis abundant protein, group 2
AT5G42860.1       2.3e-09     23.20          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G45688.1       6.8e-09     25.00          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G41990.1       1.4e-06     23.23          CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterP...

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term   IPR Description    Source        Source Term       Source Description                                                          Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite       disorder_prediction                                                         coord: 95..109
None       No IPR available   MOBIDB_LITE   mobidb-lite       disorder_prediction                                                         coord: 27..49
None       No IPR available   MOBIDB_LITE   mobidb-lite       disorder_prediction                                                         coord: 27..118
None       No IPR available   MOBIDB_LITE   mobidb-lite       disorder_prediction                                                         coord: 66..88
None       No IPR available   PANTHER       PTHR31852:SF186   DELTA-LATROINSECTOTOXIN-LT1A PROTEIN                                        coord: 88..332
None       No IPR available   PANTHER       PTHR31852         LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY   coord: 88..332

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Spg038883.1    Spg038883.1   mRNA