Lag0031155 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0031155
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionLate embryogenesis abundant protein, group 2
Locationchr11: 5379232 .. 5382631 (-)
RNA-Seq ExpressionLag0031155
SyntenyLag0031155
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGCCGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCCTCTCCCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCGTCGGCCTGCCACTCGCCTCTCCGCTCCGACACCTTCCCCAACGGCCACCACCACCACCACCACCGCAACCCGACCCAGGAAGCCTCTCGCTTCACTCTCTCTCACTACTCCTCCTCCCGTGGCTCGAACCACGGGACCGGGACCGACAATGGCGAGGCTCGCCTGATTGTCGGTCGCGGCGATGGCCGTGACCGGGACGAGGAGCGGGAGGAAAACGGGGACGGAGACGGGGACGAGGACGGATATTATGGGAGGAAGAGGAGAGGTTGTTGGAAGACTTATTGTACGTATAGGCATTCGGATTCTAATGCATGGATTTGCTTGCAATTGAGTTGGAGGGCAATTTTCAGCATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAAGCCTCCCTCACCAATAATTTCTGTTAAGGTAATTTTTCAATTTTTTTTTCTTCCAAATTTTATTTTATTTTATTTTATTTTTTAAATTTAATATTGGTCAAATATTGAAATATATTTATAAGGATATATATTTTCTTGTATTTGGGTTGTTTGAATGAGTATATAATTTGACCATGTTTAACCATACTTTGATATATATACTATTGGAAATTTATATGTTACATTTGTTTTGGTTTGGTTTCATATAGATTTGTTGTTGATCTTCCATGAGAAAATAAGCTTAGTGAAATGTTTAGCATTAAATATTTATCATTACATTTATGAAAATGTAAGACATTAAATTGTTTCAAAGAAAAGTAGGACTACAATGTTAGAAGTATATTGTTATATTTTTTCCAAAATAAATGGAGGTCTTAACCTTTTCTTTTCTTTAACAAATTTATTACTACCATTCAAAAATAGGATGAATTGGTGCACTTTTCAATTATACCATTTTCATTGTTGTGTATATGTATATGTATATTGAGTAACAACAATTGGGGGTGGAGAATTGAACTATCGTTATTTTAAAATGGTAATTGGTGTCTTTGTCTGCTGAGCTATTTACGGATTGACATTTTTGTTGGGTAAATATTAAAATTGACTAATGATAAAGATTACATATATCATGCAATGTAAACTTATAATCAACCAATTTTAACATTCATCGACAATGAGGAAATAATTGAAAAATCTTAAGACACGAGGAACTAAAAATGGAAAGATTAAAATAATGAAGGCTAAATTAAAAAACATTTAAATATTAGAGACTAAAACATATATTTGACATTTTCCTTGAGCACAGGAAATACACCTGCTAAAAACTTTTTTTTTAATAAAAAAAATTATAATTTTTTTTTTAAATAGTTTCTTCTTTGTTTTTATAATAACAATAAACATATTTCTGTATAGTTTTTCGTAAGACTATATAACAGCCTATAAACTTTCAAACTTTTAATTTTAACTTTTTTATCGCATCAAAATGACTAACAATGTTCAAATCTCCCACTTTCACAATGGTTCTTCGAAAGGAGAAAAAAATTTCCTAACGTAAATATATAATTCAACCATCAAATCAATAATAATTCATGCATTTCAAAGACAAAACTACTTGTTCATGTATAGTTTTAGGGCAAAATTAGTAAAATCTTAGGTTATACACATATATTCGATCGTCTCATTTTGTGCTTTCATAAATAGTTATACTACGACGCGATTTTCAAAATTCAAACCCTCCAAAAGTTGAAGGACACATCTAATTTTTTTTAAGAAGTCTATAACTTAGTCAAATTAAAAAAAAAAAAATGATGCAGGTGGGAGAAGTAGAAGAATTTATGCTTGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATCTTAACATGCAATTGCACAATGAATGTAATTGTGGACAACCACTCTAAGCTCTTTGGCCTTCACATTCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCCATTGCTACTTCACAAGTAAGCCATCTATCATTTTCTTTCTTAAATTCCTTTACTATATATATATTGTGTTTTTTAGTTTTTTTTTCGGGGTCAATTACAAGGTTAGTCCCTGAACTTTCACAGGTTCGTCAATTTGGTTCCTAAACTTTAAGAAGTGTCAAACAAATCTTCGAGTTTTTAATTTTGTGTCCAATAGATTTTTGAACTCCAGTAGAAACTTTCACTTTCGTGTCAAATAGATCCCTCAACTTTCAATTTTGTATCTAATAAGTCATTGATCTATTTGACATTACTTTTTAAAATTCACAAGTCTACTAGATACAAAATTGGCAAAGTAGTGTAAGGTTCCAATAGGCATAAAAGTCAATTTTATATCTAATAGATTGGTTAATTCTTACAAAATTCGAATGTACCAGGGATCTATTAGATGCAAAATTGAAAGTTTAGGAGCCAAATTGACATAACCTTGAAAGTTCAAGGACTAAACTTGTAATTTAACATTTTTTTTTCAATTATCTTTATTAATTAAGATTTTAATTATTCTATACTTTTTTTTTTTTGTTAAGGGTATGGGGACCAAAGTTCTCTGTTGTGGTTAGAAATTAAAAAAAAATAAATAAATAAAAGATTGTGAATTTATAAGATAGAACAATATTAATGCCCCAATAAATAGCAAGATTATTTTTCAATTAAAAAAAAACTTCATGTCCCACACATTTTTTAAAAAAACTTTTCGGATCGAATTCAGGAGTAAACCCGAAGTGTGAACTAAGATTAAAATAGACCATATCATACCCATATCAAGATGCCCCAATAATCTGTTGCATAAGAATGGATTAATTTTTTTTTAAAAAAATCAACTAAACTTTATTAGTTGAGTTGTTATTACTTATTGAGTTGGATGTAAAAGAAACTTTGACGGTTTTTTTTGAAACAGAAACTTTCACGATTTTAGTAACTAGTAAAATCAATTTTTTTTTGACAATATGTTGGAGGTTGGGGGATTCAAACCACAAACCTATTGATCATTAGTAAAAAAAAAAAAAAAAAATCTATACTTAATGTGGGTTTTTGAAATAGGGTCCAAGAATGTATGCTGAGAGTGGAACGACGACGTTTCAATTAAGCGTAGGCATTAGCAATAAGCCAATGTATGGTGCAGGGAGGGACATGGAAGACATGCTTGAATCAGGAACAGGATTGGAGCTTAGAATTCAACTTAATTTCATTTCCAACTATAGGGTAGTTTGGAAAATCATAAGGCCCCGCTTTCATCGCCGTGTCGAATGCTCATTGGTCCTCGGAAAAGCGTACGATAGGAAGCGTCACACCCGATCATTCAATAGCACCTGCCTAACTTCTTGA

mRNA sequence

ATGGAGGCCGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCCTCTCCCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCGTCGGCCTGCCACTCGCCTCTCCGCTCCGACACCTTCCCCAACGGCCACCACCACCACCACCACCGCAACCCGACCCAGGAAGCCTCTCGCTTCACTCTCTCTCACTACTCCTCCTCCCGTGGCTCGAACCACGGGACCGGGACCGACAATGGCGAGGCTCGCCTGATTGTCGGTCGCGGCGATGGCCGTGACCGGGACGAGGAGCGGGAGGAAAACGGGGACGGAGACGGGGACGAGGACGGATATTATGGGAGGAAGAGGAGAGGTTGTTGGAAGACTTATTGTACGTATAGGCATTCGGATTCTAATGCATGGATTTGCTTGCAATTGAGTTGGAGGGCAATTTTCAGCATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAAGCCTCCCTCACCAATAATTTCTGTTAAGGTGGGAGAAGTAGAAGAATTTATGCTTGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATCTTAACATGCAATTGCACAATGAATGTAATTGTGGACAACCACTCTAAGCTCTTTGGCCTTCACATTCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCCATTGCTACTTCACAAGGTCCAAGAATGTATGCTGAGAGTGGAACGACGACGTTTCAATTAAGCGTAGGCATTAGCAATAAGCCAATGTATGGTGCAGGGAGGGACATGGAAGACATGCTTGAATCAGGAACAGGATTGGAGCTTAGAATTCAACTTAATTTCATTTCCAACTATAGGGTAGTTTGGAAAATCATAAGGCCCCGCTTTCATCGCCGTGTCGAATGCTCATTGGTCCTCGGAAAAGCGTACGATAGGAAGCGTCACACCCGATCATTCAATAGCACCTGCCTAACTTCTTGA

Coding sequence (CDS)

ATGGAGGCCGCCGAAGAGCAAGAAGCCGTTCTCTTCCACTCATATCCATGTGCTTATTACGTCCAAAGCCCCTCCACCCTCTCCCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCGTCGGCCTGCCACTCGCCTCTCCGCTCCGACACCTTCCCCAACGGCCACCACCACCACCACCACCGCAACCCGACCCAGGAAGCCTCTCGCTTCACTCTCTCTCACTACTCCTCCTCCCGTGGCTCGAACCACGGGACCGGGACCGACAATGGCGAGGCTCGCCTGATTGTCGGTCGCGGCGATGGCCGTGACCGGGACGAGGAGCGGGAGGAAAACGGGGACGGAGACGGGGACGAGGACGGATATTATGGGAGGAAGAGGAGAGGTTGTTGGAAGACTTATTGTACGTATAGGCATTCGGATTCTAATGCATGGATTTGCTTGCAATTGAGTTGGAGGGCAATTTTCAGCATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAAGCCTCCCTCACCAATAATTTCTGTTAAGGTGGGAGAAGTAGAAGAATTTATGCTTGGGGAAGGAGTGGACAAAACAGGGGTTGGAACTAAGATCTTAACATGCAATTGCACAATGAATGTAATTGTGGACAACCACTCTAAGCTCTTTGGCCTTCACATTCTTCCTCCATCTCTTCATATGTCTTTTGGGCCTCTCCCCATTGCTACTTCACAAGGTCCAAGAATGTATGCTGAGAGTGGAACGACGACGTTTCAATTAAGCGTAGGCATTAGCAATAAGCCAATGTATGGTGCAGGGAGGGACATGGAAGACATGCTTGAATCAGGAACAGGATTGGAGCTTAGAATTCAACTTAATTTCATTTCCAACTATAGGGTAGTTTGGAAAATCATAAGGCCCCGCTTTCATCGCCGTGTCGAATGCTCATTGGTCCTCGGAAAAGCGTACGATAGGAAGCGTCACACCCGATCATTCAATAGCACCTGCCTAACTTCTTGA

Protein sequence

MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHHHHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDEDGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKVGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKIIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS
Homology
BLAST of Lag0031155 vs. NCBI nr
Match: XP_038905771.1 (uncharacterized protein LOC120091726 [Benincasa hispida])

HSP 1 Score: 606.7 bits (1563), Expect = 1.2e-169
Identity = 303/337 (89.91%), Postives = 315/337 (93.47%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNG-HHH 60
           MEAAEEQEAVLFHSYPC+YYVQSPSTLSHANSSDIRNPAESSACHSPL SDTFPNG HHH
Sbjct: 15  MEAAEEQEAVLFHSYPCSYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGRHHH 74

Query: 61  HHHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGD 120
           HHHRNPTQEASRFTLSHYSSSRGSNHG GTDNGE RLIVGRG+GRD +EE+E   D DGD
Sbjct: 75  HHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGETRLIVGRGNGRDCNEEQE--NDEDGD 134

Query: 121 EDGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISV 180
           E+GYYG+K+RGCWK Y TYR+SDSNAWICLQLSWRAIFSMGIALLVFYIVT PPSPIISV
Sbjct: 135 EEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISV 194

Query: 181 KVGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIA 240
           KVGE+EEFMLGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIA
Sbjct: 195 KVGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIA 254

Query: 241 TSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVW 300
           TSQGPR+YAESGTTTF LSVG SNKPMYGAGRDMED LESG GLEL I+LNFISNYRVVW
Sbjct: 255 TSQGPRLYAESGTTTFHLSVGTSNKPMYGAGRDMEDKLESGMGLELTIRLNFISNYRVVW 314

Query: 301 KIIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
           K IRP FHR VEC LVLGKAYDRKRHTRSFNSTCL S
Sbjct: 315 KFIRPHFHRHVECLLVLGKAYDRKRHTRSFNSTCLPS 349

BLAST of Lag0031155 vs. NCBI nr
Match: XP_008461795.2 (PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo])

HSP 1 Score: 585.5 bits (1508), Expect = 2.9e-163
Identity = 288/336 (85.71%), Postives = 306/336 (91.07%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHH 60
           ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPL SDTFPN HHHH
Sbjct: 12  MEGSEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAEVSTCHSPLPSDTFPNAHHHH 71

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDE 120
           HHRNPTQEASRFTLSHYSSSRGSNHG GTDNGEARLIVGRGDGRD +EE E   DG+G+E
Sbjct: 72  HHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGDGRDCEEEEE---DGEGNE 131

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           +GYYG+++RGCWK Y TYR SDSNAWICLQLSWRAIFSMGIALLVFY+VT PPSPIISVK
Sbjct: 132 EGYYGKRKRGCWKRYFTYRSSDSNAWICLQLSWRAIFSMGIALLVFYVVTNPPSPIISVK 191

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           VGE++EFMLGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIAT
Sbjct: 192 VGEIQEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 251

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESG T F LSVG SNK MYGAGR+MED L+SG GLEL I+LNFISNYRVVWK
Sbjct: 252 SQGPRLYAESGRTRFGLSVGTSNKAMYGAGREMEDKLDSGMGLELTIRLNFISNYRVVWK 311

Query: 301 IIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
            I P FHR V+C L+LGKAYDRKRHT SFNSTC TS
Sbjct: 312 FISPHFHRHVQCLLLLGKAYDRKRHTPSFNSTCFTS 344

BLAST of Lag0031155 vs. NCBI nr
Match: XP_004149613.1 (uncharacterized protein LOC101209149 [Cucumis sativus] >KGN58592.1 hypothetical protein Csa_002328 [Cucumis sativus])

HSP 1 Score: 569.7 bits (1467), Expect = 1.7e-158
Identity = 284/336 (84.52%), Postives = 302/336 (89.88%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHH 60
           ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPL SDTFPN  HHH
Sbjct: 16  METAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAEVSTCHSPLPSDTFPNA-HHH 75

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDE 120
           HHRNPTQEASRFTLSHYSSSRGSNHG GTDNGEARLIVGRG+G D +EE EE   G+G+E
Sbjct: 76  HHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGGDCEEEEEE---GEGNE 135

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           +GYYG+++RGCWK Y TYR+SDSNAWICLQLSWRAIFSMGIALLVFYIVT PPSPII+VK
Sbjct: 136 EGYYGKRKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIITVK 195

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           VGE+EEFMLGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIA 
Sbjct: 196 VGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIAA 255

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESG T F+LSVG SNK MYGAGRDMED L+SG GLEL I+LNFISNYRVVWK
Sbjct: 256 SQGPRLYAESGRTRFRLSVGTSNKAMYGAGRDMEDKLDSGIGLELTIRLNFISNYRVVWK 315

Query: 301 IIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
            I P FHR V+C L+L K YDR  HTRSFNSTC TS
Sbjct: 316 FISPHFHRHVQCLLLLRKPYDRNPHTRSFNSTCFTS 347

BLAST of Lag0031155 vs. NCBI nr
Match: XP_022152674.1 (uncharacterized protein LOC111020336 [Momordica charantia])

HSP 1 Score: 566.6 bits (1459), Expect = 1.4e-157
Identity = 292/337 (86.65%), Postives = 306/337 (90.80%), Query Frame = 0

Query: 1   MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHH 60
           MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPLRSDTFP GHHH
Sbjct: 1   MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHH 60

Query: 61  HHHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGD 120
           HH  N TQEASR TLS YSSSR SNHG GTDNGEARLIVGRG+GR+ DEEREE  DG GD
Sbjct: 61  HH--NATQEASRVTLSRYSSSRESNHGAGTDNGEARLIVGRGNGREGDEEREE--DGAGD 120

Query: 121 EDGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISV 180
           E+GYYG+KRRGCWKTY TYR+SDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISV
Sbjct: 121 EEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTMPPSPNISV 180

Query: 181 KVGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIA 240
           K+G VEEFMLGEGVDKTGVGTKILTCN TM+V VDN+SKLFGLHILPPSLH+SFGPLPIA
Sbjct: 181 KMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIA 240

Query: 241 TSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVW 300
           TSQG R+YAESGTTTFQLSVG SN+ MYGAGR MEDMLESG GLEL I+LNFISNYRVVW
Sbjct: 241 TSQGARLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVW 300

Query: 301 KIIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
           KIIRP F  RVECSLVLGK YDRKRHTRSFNSTCLTS
Sbjct: 301 KIIRPHFRHRVECSLVLGKGYDRKRHTRSFNSTCLTS 333

BLAST of Lag0031155 vs. NCBI nr
Match: XP_022934269.1 (uncharacterized protein LOC111441481 [Cucurbita moschata])

HSP 1 Score: 563.9 bits (1452), Expect = 9.2e-157
Identity = 279/336 (83.04%), Postives = 303/336 (90.18%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHH 60
           M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPL SDTFPNG  HH
Sbjct: 2   MDAAEDQEPVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLPSDTFPNGRRHH 61

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDE 120
           HHRNPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG GDG +  +E+ E      +E
Sbjct: 62  HHRNPTQEASRFTLSHYSSSCGSNHGGGTDNGEARLMVGGGDGAEEKQEKAEE-----EE 121

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           + YYGRKRRGCWKTY TYR+SDSNAWICLQLSWRA+FSMG+ALLVFYIVT PP P+ISVK
Sbjct: 122 EWYYGRKRRGCWKTYFTYRNSDSNAWICLQLSWRAVFSMGMALLVFYIVTNPPRPVISVK 181

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           V EV+EFMLGEGVDKTGVGTKILTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIAT
Sbjct: 182 VREVDEFMLGEGVDKTGVGTKILTCNCTMDVIVDNYSKLFALHILPPSLHMSFGPLPIAT 241

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESGTTTF+L+VGIS KPMYGAGR++ED LESG GLEL I+LNFISNYRVVWK
Sbjct: 242 SQGPRLYAESGTTTFRLNVGISKKPMYGAGREIEDKLESGAGLELTIRLNFISNYRVVWK 301

Query: 301 IIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
           II+PRFHRRV+C LV+   YDRKRHTR FNSTCLTS
Sbjct: 302 IIKPRFHRRVDCLLVVQNTYDRKRHTRIFNSTCLTS 332

BLAST of Lag0031155 vs. ExPASy TrEMBL
Match: A0A1S3CFE9 (uncharacterized protein LOC103500312 OS=Cucumis melo OX=3656 GN=LOC103500312 PE=4 SV=1)

HSP 1 Score: 585.5 bits (1508), Expect = 1.4e-163
Identity = 288/336 (85.71%), Postives = 306/336 (91.07%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHH 60
           ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPL SDTFPN HHHH
Sbjct: 12  MEGSEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAEVSTCHSPLPSDTFPNAHHHH 71

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDE 120
           HHRNPTQEASRFTLSHYSSSRGSNHG GTDNGEARLIVGRGDGRD +EE E   DG+G+E
Sbjct: 72  HHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGDGRDCEEEEE---DGEGNE 131

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           +GYYG+++RGCWK Y TYR SDSNAWICLQLSWRAIFSMGIALLVFY+VT PPSPIISVK
Sbjct: 132 EGYYGKRKRGCWKRYFTYRSSDSNAWICLQLSWRAIFSMGIALLVFYVVTNPPSPIISVK 191

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           VGE++EFMLGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIAT
Sbjct: 192 VGEIQEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 251

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESG T F LSVG SNK MYGAGR+MED L+SG GLEL I+LNFISNYRVVWK
Sbjct: 252 SQGPRLYAESGRTRFGLSVGTSNKAMYGAGREMEDKLDSGMGLELTIRLNFISNYRVVWK 311

Query: 301 IIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
            I P FHR V+C L+LGKAYDRKRHT SFNSTC TS
Sbjct: 312 FISPHFHRHVQCLLLLGKAYDRKRHTPSFNSTCFTS 344

BLAST of Lag0031155 vs. ExPASy TrEMBL
Match: A0A0A0LD21 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G696870 PE=4 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 8.1e-159
Identity = 284/336 (84.52%), Postives = 302/336 (89.88%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHH 60
           ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPL SDTFPN  HHH
Sbjct: 16  METAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAEVSTCHSPLPSDTFPNA-HHH 75

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDE 120
           HHRNPTQEASRFTLSHYSSSRGSNHG GTDNGEARLIVGRG+G D +EE EE   G+G+E
Sbjct: 76  HHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGGDCEEEEEE---GEGNE 135

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           +GYYG+++RGCWK Y TYR+SDSNAWICLQLSWRAIFSMGIALLVFYIVT PPSPII+VK
Sbjct: 136 EGYYGKRKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIITVK 195

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           VGE+EEFMLGEGVDKTGVGTKILTCNCTM+VIVDNHSKLFGLHILPPSLHMSFGPLPIA 
Sbjct: 196 VGEIEEFMLGEGVDKTGVGTKILTCNCTMDVIVDNHSKLFGLHILPPSLHMSFGPLPIAA 255

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESG T F+LSVG SNK MYGAGRDMED L+SG GLEL I+LNFISNYRVVWK
Sbjct: 256 SQGPRLYAESGRTRFRLSVGTSNKAMYGAGRDMEDKLDSGIGLELTIRLNFISNYRVVWK 315

Query: 301 IIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
            I P FHR V+C L+L K YDR  HTRSFNSTC TS
Sbjct: 316 FISPHFHRHVQCLLLLRKPYDRNPHTRSFNSTCFTS 347

BLAST of Lag0031155 vs. ExPASy TrEMBL
Match: A0A6J1DGR2 (uncharacterized protein LOC111020336 OS=Momordica charantia OX=3673 GN=LOC111020336 PE=4 SV=1)

HSP 1 Score: 566.6 bits (1459), Expect = 6.8e-158
Identity = 292/337 (86.65%), Postives = 306/337 (90.80%), Query Frame = 0

Query: 1   MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHH 60
           MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPLRSDTFP GHHH
Sbjct: 1   MEAAGEDQEAVLFHSYPCAYYVQSPSTVSHANSSDIRNAAESSACHSPLRSDTFPTGHHH 60

Query: 61  HHHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGD 120
           HH  N TQEASR TLS YSSSR SNHG GTDNGEARLIVGRG+GR+ DEEREE  DG GD
Sbjct: 61  HH--NATQEASRVTLSRYSSSRESNHGAGTDNGEARLIVGRGNGREGDEEREE--DGAGD 120

Query: 121 EDGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISV 180
           E+GYYG+KRRGCWKTY TYR+SDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISV
Sbjct: 121 EEGYYGKKRRGCWKTYFTYRNSDSNAWILLQLSWRAIFSMGIALLVFYIVTMPPSPNISV 180

Query: 181 KVGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIA 240
           K+G VEEFMLGEGVDKTGVGTKILTCN TM+V VDN+SKLFGLHILPPSLH+SFGPLPIA
Sbjct: 181 KMGGVEEFMLGEGVDKTGVGTKILTCNFTMDVTVDNNSKLFGLHILPPSLHISFGPLPIA 240

Query: 241 TSQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVW 300
           TSQG R+YAESGTTTFQLSVG SN+ MYGAGR MEDMLESG GLEL I+LNFISNYRVVW
Sbjct: 241 TSQGARLYAESGTTTFQLSVGTSNRAMYGAGRSMEDMLESGMGLELMIRLNFISNYRVVW 300

Query: 301 KIIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
           KIIRP F  RVECSLVLGK YDRKRHTRSFNSTCLTS
Sbjct: 301 KIIRPHFRHRVECSLVLGKGYDRKRHTRSFNSTCLTS 333

BLAST of Lag0031155 vs. ExPASy TrEMBL
Match: A0A6J1F239 (uncharacterized protein LOC111441481 OS=Cucurbita moschata OX=3662 GN=LOC111441481 PE=4 SV=1)

HSP 1 Score: 563.9 bits (1452), Expect = 4.4e-157
Identity = 279/336 (83.04%), Postives = 303/336 (90.18%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHH 60
           M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPL SDTFPNG  HH
Sbjct: 2   MDAAEDQEPVLFHSYPCAYYVQSPSTLSHANSSDNRNPAESSACHSPLPSDTFPNGRRHH 61

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDE 120
           HHRNPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG GDG +  +E+ E      +E
Sbjct: 62  HHRNPTQEASRFTLSHYSSSCGSNHGGGTDNGEARLMVGGGDGAEEKQEKAEE-----EE 121

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           + YYGRKRRGCWKTY TYR+SDSNAWICLQLSWRA+FSMG+ALLVFYIVT PP P+ISVK
Sbjct: 122 EWYYGRKRRGCWKTYFTYRNSDSNAWICLQLSWRAVFSMGMALLVFYIVTNPPRPVISVK 181

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           V EV+EFMLGEGVDKTGVGTKILTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIAT
Sbjct: 182 VREVDEFMLGEGVDKTGVGTKILTCNCTMDVIVDNYSKLFALHILPPSLHMSFGPLPIAT 241

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESGTTTF+L+VGIS KPMYGAGR++ED LESG GLEL I+LNFISNYRVVWK
Sbjct: 242 SQGPRLYAESGTTTFRLNVGISKKPMYGAGREIEDKLESGAGLELTIRLNFISNYRVVWK 301

Query: 301 IIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
           II+PRFHRRV+C LV+   YDRKRHTR FNSTCLTS
Sbjct: 302 IIKPRFHRRVDCLLVVQNTYDRKRHTRIFNSTCLTS 332

BLAST of Lag0031155 vs. ExPASy TrEMBL
Match: A0A6J1J6W9 (uncharacterized protein LOC111481909 OS=Cucurbita maxima OX=3661 GN=LOC111481909 PE=4 SV=1)

HSP 1 Score: 551.6 bits (1420), Expect = 2.3e-153
Identity = 274/336 (81.55%), Postives = 300/336 (89.29%), Query Frame = 0

Query: 1   MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHH 60
           M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPL SDTFPNG   H
Sbjct: 2   MDAAEDQEPVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNGRRPH 61

Query: 61  HHRNPTQEASRFTLSHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDE 120
           HHRN TQEASRFTLSHYSSS GSNHG GTDNGEARL+VG GDG +   E+ E      +E
Sbjct: 62  HHRNQTQEASRFTLSHYSSSCGSNHGGGTDNGEARLMVGGGDGAEEKREKAEE-----EE 121

Query: 121 DGYYGRKRRGCWKTYCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVK 180
           + YYG+KRRGCWKTY TYR+SD+NAWICLQLSWRA+FSMG+ALLVFYIVT PP PIISV+
Sbjct: 122 EWYYGKKRRGCWKTYFTYRNSDANAWICLQLSWRAVFSMGMALLVFYIVTNPPPPIISVQ 181

Query: 181 VGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIAT 240
           V EV+EFMLGEGVDKTGVGTKILTCNCTM+VIVDN+SKLF LHILPPSLHMSFGPLPIAT
Sbjct: 182 VREVDEFMLGEGVDKTGVGTKILTCNCTMDVIVDNYSKLFALHILPPSLHMSFGPLPIAT 241

Query: 241 SQGPRMYAESGTTTFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWK 300
           SQGPR+YAESGTTTF+L+VG S KPMYGAGR++ED LESG GLEL I+LNFISNYRVVWK
Sbjct: 242 SQGPRLYAESGTTTFRLNVGTSKKPMYGAGREIEDKLESGAGLELTIRLNFISNYRVVWK 301

Query: 301 IIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
           II+P+FHR V+C LV+  AYDRKRHTR FNSTCLTS
Sbjct: 302 IIKPQFHRHVDCLLVVQNAYDRKRHTRIFNSTCLTS 332

BLAST of Lag0031155 vs. TAIR 10
Match: AT3G08490.1 (BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein, group 2 (TAIR:AT3G24600.1); Has 161 Blast hits to 158 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 161; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 206.1 bits (523), Expect = 4.5e-53
Identity = 100/187 (53.48%), Postives = 134/187 (71.66%), Query Frame = 0

Query: 141 SDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKVGEVEEFMLGEGVDKTGVGT 200
           S+S+ WI LQ+ WR +FS+G+ALLVFYI T+PP P IS ++G   +FML EGVD  GV T
Sbjct: 75  SNSSWWIVLQVGWRFLFSLGVALLVFYIATQPPHPNISFRIGRFNQFMLEEGVDSHGVST 134

Query: 201 KILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAES-GTTTFQLSV 260
           K LT NC+  +I+DN S +FGLHI PPS+   FGPL  A +QGP++Y  S  +TTFQL +
Sbjct: 135 KFLTFNCSTKLIIDNKSNVFGLHIHPPSIKFFFGPLNFAKAQGPKLYGLSHESTTFQLYI 194

Query: 261 GISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKIIRPRFHRRVECSLVLGKA 320
             +N+ MYGAG +M DML S  GL L ++ + IS+YRVVW II P++H +VEC L+L   
Sbjct: 195 ATTNRAMYGAGTEMNDMLLSRAGLPLILRTSIISDYRVVWNIINPKYHHKVECLLLLA-- 254

Query: 321 YDRKRHT 327
            D++RH+
Sbjct: 255 -DKERHS 258

BLAST of Lag0031155 vs. TAIR 10
Match: AT3G24600.1 (Late embryogenesis abundant protein, group 2 )

HSP 1 Score: 80.9 bits (198), Expect = 2.2e-15
Identity = 44/160 (27.50%), Postives = 83/160 (51.88%), Query Frame = 0

Query: 165 VFYIVTKPPSPIISVKVGEVEEFMLGEGVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHI 224
           V +  + P SPI+SVK  ++  F  GEG+D+TGV TKIL+ N ++ V +D+ +  FG+H+
Sbjct: 334 VLWGASHPFSPIVSVKSVDIHSFYYGEGIDRTGVATKILSFNSSVKVTIDSPAPYFGIHV 393

Query: 225 LPPSLHMSFGPLPIATSQGPRMYAESGTTTFQL-SVGISNKPMYGAGRDMEDMLESGTGL 284
              +  ++F  L +AT Q    Y    +    +  +  +  P+YGAG  +    + G  +
Sbjct: 394 SSSTFKLTFSALTLATGQLKSYYQPRKSKHISIVKLTGAEVPLYGAGPHLAASDKKGK-V 453

Query: 285 ELRIQLNFISNYRVVWKIIRPRFHRRVECSLVLGKAYDRK 324
            ++++    S   ++ K+++ +    V CS  +  +   K
Sbjct: 454 PVKLEFEIRSRGNLLGKLVKSKHENHVSCSFFISSSKTSK 492

BLAST of Lag0031155 vs. TAIR 10
Match: AT1G45688.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast hits to 242 proteins in 39 species: Archae - 0; Bacteria - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses - 17; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 58.2 bits (139), Expect = 1.5e-08
Identity = 82/326 (25.15%), Postives = 133/326 (40.80%), Query Frame = 0

Query: 19  YYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHHHHRNPTQEASRFTLSHYS 78
           YYVQSPS  SH +         S+   SP+ S   P+ H      +    +SRF+ S   
Sbjct: 26  YYVQSPSRDSH-DGEKTATSFHSTPVLSPMGSP--PHSHSSMGRHSRESSSSRFSGSLKP 85

Query: 79  SSRGSNHGTGTDNGEARLIVGRGDGRDRDEE-----REENGDGDGDEDGYYGRKRRGCWK 138
            SR  N   G+          +G G ++  +      EE    DGD DG  G  RR    
Sbjct: 86  GSRKVNPNDGSKR--------KGHGGEKQWKECAVIEEEGLLDDGDRDG--GVPRR---- 145

Query: 139 TYCTYRHSDSNAWICLQLSWRAIFSM--GIALLVFYIVTKPPSPIISVKVGEVEEFMLGE 198
                         C  L++   F +  G   L+ Y   KP  P I+VK    E   +  
Sbjct: 146 --------------CYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQA 205

Query: 199 GVDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMYAESG 258
           G D  GVGT ++T N T+ ++  N    FG+H+    + +SF  + I +    + Y    
Sbjct: 206 GQDAGGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRK 265

Query: 259 TTTFQLSVGISNK-PMYGAGRDM----------EDMLESGTGLEL----------RIQLN 315
           +    L   I  K P+YG+G  +          +   + G  + +           + L+
Sbjct: 266 SERTVLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLS 320

BLAST of Lag0031155 vs. TAIR 10
Match: AT5G42860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 56.2 bits (134), Expect = 5.8e-08
Identity = 80/346 (23.12%), Postives = 131/346 (37.86%), Query Frame = 0

Query: 18  AYYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHHHHHHRNPTQEASRFTLSHY 77
           AY+VQSPS  SH       +   +    SP+ S                        SH 
Sbjct: 25  AYFVQSPSRDSHDGEKTATSFHSTPVLTSPMGSPPH---------------------SHS 84

Query: 78  SSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENG---DGDGDEDGYYGRKRRGCWKT 137
           SSSR S        G A        G  +    EE G   DGD +++    R        
Sbjct: 85  SSSRFSKINGSKRKGHA--------GEKQFAMIEEEGLLDDGDREQEALPRR-------- 144

Query: 138 YCTYRHSDSNAWICLQLSWRAIFSMGIAL--LVFYIVTKPPSPIISVKVGEVEEFMLGEG 197
                        C  L++   FS+  A   L+ Y   KP  P ISVK    E+  +  G
Sbjct: 145 -------------CYVLAFIVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAG 204

Query: 198 VDKTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRMY-AESG 257
            D  G+GT ++T N T+ ++  N    FG+H+    + +SF  + I +    + Y +   
Sbjct: 205 QDAGGIGTDMITMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKS 264

Query: 258 TTTFQLSVGISNKPMYGAGRDM----------EDMLESG-----------TGLELRIQLN 317
             T  ++V     P+YG+G  +          +   + G             + +R+   
Sbjct: 265 QRTVVVNVLGDKIPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFT 319

Query: 318 FISNYRVVWKIIRPRFHRRVECSLVLGKAYDRKRHTRSFNSTCLTS 337
             S   V+ K+++P+F++R+ C L+  +     +H    N+  +TS
Sbjct: 325 VRSRAYVLGKLVQPKFYKRIVC-LINFEHKKLSKHIPITNNCTVTS 319

BLAST of Lag0031155 vs. TAIR 10
Match: AT2G41990.1 (CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864); BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 55.5 bits (132), Expect = 9.9e-08
Identity = 73/303 (24.09%), Postives = 120/303 (39.60%), Query Frame = 0

Query: 19  YYVQSPSTLSHANSSDIRNPAESSACHSPLRSDTFPNGHH----HHHHRNPTQEASRFTL 78
           YYVQSPS      + D+   +  S C S + S T P+ +H    HH   + T   S   L
Sbjct: 28  YYVQSPS------NHDVEKMSFGSGC-SLMGSPTHPHYYHCSPIHHSRESSTSRFSDRAL 87

Query: 79  SHYSSSRGSNHGTGTDNGEARLIVGRGDGRDRDEEREENGDGDGDEDGYYGRKRRGCWKT 138
             Y S R           E R  +  GD         +  DG  D+D             
Sbjct: 88  LSYKSIR-----------ERRRYINDGD---------DKTDGGDDDD------------- 147

Query: 139 YCTYRHSDSNAWICLQLSWRAIFSMGIALLVFYIVTKPPSPIISVKVGEVEEFMLGEGVD 198
              +R+     W+ L +    IF   +  L+ +  +K   P ++VK   V +  L  G D
Sbjct: 148 --PFRNVRLYVWLLLSV----IFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGND 207

Query: 199 KTGVGTKILTCNCTMNVIVDNHSKLFGLHILPPSLHMSFGPLPIATSQGPRM-YAESGTT 258
            +GV T +L+ N T+ +   N S  F +H+    L + +  L +++ +  +     +G T
Sbjct: 208 LSGVPTDMLSLNSTVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGET 267

Query: 259 TFQLSVGISNKPMYGAGRDMEDMLESGTGLELRIQLNFISNYRVVWKIIRPRFHRRVECS 317
                V     P+YG      D L     L L + +   S   ++ +++  +F+ R+ CS
Sbjct: 268 NVVTVVQGHQIPLYGGVSFHLDTL----SLPLNLTIVLHSKAYILGRLVTSKFYTRIICS 280

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038905771.11.2e-16989.91uncharacterized protein LOC120091726 [Benincasa hispida][more]
XP_008461795.22.9e-16385.71PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo][more]
XP_004149613.11.7e-15884.52uncharacterized protein LOC101209149 [Cucumis sativus] >KGN58592.1 hypothetical ... [more]
XP_022152674.11.4e-15786.65uncharacterized protein LOC111020336 [Momordica charantia][more]
XP_022934269.19.2e-15783.04uncharacterized protein LOC111441481 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A1S3CFE91.4e-16385.71uncharacterized protein LOC103500312 OS=Cucumis melo OX=3656 GN=LOC103500312 PE=... [more]
A0A0A0LD218.1e-15984.52Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G696870 PE=4 SV=1[more]
A0A6J1DGR26.8e-15886.65uncharacterized protein LOC111020336 OS=Momordica charantia OX=3673 GN=LOC111020... [more]
A0A6J1F2394.4e-15783.04uncharacterized protein LOC111441481 OS=Cucurbita moschata OX=3662 GN=LOC1114414... [more]
A0A6J1J6W92.3e-15381.55uncharacterized protein LOC111481909 OS=Cucurbita maxima OX=3661 GN=LOC111481909... [more]
Match NameE-valueIdentityDescription
AT3G08490.14.5e-5353.48BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein,... [more]
AT3G24600.12.2e-1527.50Late embryogenesis abundant protein, group 2 [more]
AT1G45688.11.5e-0825.15unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G42860.15.8e-0823.12unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G41990.19.9e-0824.09CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterP... [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 97..111
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 27..49
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 27..121
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 68..90
NoneNo IPR availablePANTHERPTHR31852:SF186DELTA-LATROINSECTOTOXIN-LT1A PROTEINcoord: 91..334
NoneNo IPR availablePANTHERPTHR31852LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILYcoord: 91..334

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0031155.1Lag0031155.1mRNA