HG10018139 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10018139
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Late embryogenesis abundant protein-related / LEA protein-related protein
Location: Chr04: 795888 .. 798859 (-)
RNA-Seq Expression: HG10018139
Synteny: HG10018139
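The Location field above packs chromosome, start, end, and strand into one string. The following is a minimal sketch of how such a string could be split into its parts; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a string like 'Chr04: 795888 .. 798859 (-)' into its parts.

    Hypothetical helper; the layout is inferred from this record only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return chrom, start, end, strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr04: 795888 .. 798859 (-)")
print(chrom, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers 2972 positions on the minus strand of Chr04.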
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAAGAATCGCCATATTCCTCTTCTTGTTCTTTTTTCTCTTCTCTTCAGCCTTGGTTGAGGGAGCTCCAAATGCCAAGAAAGTTAAATGCAATGACGATAATTATCCTCAATGTTACAAATCAGATCATTATTGTCCTTCTGATTGCCCTCAAACTTGTGTTGTAGATTGTTCATCTTGCCAGCCTGTTTGCACTCCGCCACCGCCTCCTCCGTCACGCAAACTCAAATCTCCGCCACCACCGTACATTTACTCTTCGCCCCCGCCCCCGCCCCCACGTACTTACTCTTCTCCCCCACCACCTCCTTACATTTACTCTTCGCCCCCACCGCCACCTCCCTATATTTACTCTTCTCCCCCACCACCTCCCAAAAGTTACGCTTCTCCTCCACCACCTCCGCCAGCTACGACAGAGCCTTCACCTCCGACTCCTCCAGCATCTCCACCACCGTCGTCCGAGGCGTCGGGGCAAAAGAAAGCTAGGTGTAAGAATAAGAGCTATCCACATTGCTACGGTATGGAGCTAAGTTGTCCGAGTGCTTGTCCTGACCAATGCGAGGTAGACTGTGTTACTTGCAGCCCCGTTTGCAGTAAGTTTTCTCACACCTTTATATTTGTCGGGTCAAAATTACTTTAGAATATGTTTTTAATTATTCAAGAGAAATTTTGATGAAACTATAATTGCATTTAAAAGTAAAATTAAATATTAAATTATTTTTGAGTGATTAAAATTAAGCATGTTTCGAAGTAATTTGAACATTACAAAAATGATTTTAACGATTTTCAAAATTGTTCTCAAATATAAACCTAGTACATTTGTCAAAAAAACGAGAGATCGTTTAAGCAAACAATATGAACTACTACAATAAATCTTTGTAAAATTATGGGATCAAATGATAACCATACCTCCAACCCTGCATCAACAAAGAAAATAAAAATTCTAATTTAATCCAAAATCTTTAAAGTTGTCTTGACAATGCTTCAAGAACCAAATAATTTAGTAGGGGATATGTATCGTGGTTTTAAATGTGGGTGCCATGAAAATGTCGATATCTTAGTTTTATGGAATTATTGATAAAATATCCAATATGAGTAAATATTTCTAAACAAATCTAAATATACTTTTGTTTAACAAAAAGTATTTTAGTTAATGAACAGCCAATCTACCCTTTTCAAATAAGTTTAAATTTATTATTATTTATATTAGATCCACATTGTTTATATTTTTATTATAAGTTAAAATATCATTATTTTTATCCATATACTTTAAGGTTTGTTCAATTCTACTTCCCGTACTTTCAAATGTCCAATTTTACTCCTATACTTTTAATAAATCTTAAATTGAGTCCTTCGAAATTAATCTTGATCAAAATTGGCTAAATAATAATATAATTTTCATCAATGAAATATAATATGTGAAAGAGTTTTCAAATTTTATAGTGAAAAAAAGCTATAAATAACAAAAGTTTTTTTGAAAAATCCACGATAAAAAATCAGCATATAGACTAAATTTAAGATTTCGTGAAAGTACATGGTCTAAAATTGGACATGTGAAAGTGCGGAAAATATAGGGACTAGAATTGAACAAACTTCAAAGTATAGGAACTAAAATGGTATTTAAATTACCTTTATTATATTTTATCTAATAGTATAGAATATATAATGAAAAATCGTTGATATCAAAACCTTGAAGAAATGTTGACATGGATTGGATATTTAAAACCCAAAGGGTAACCACCACTTACAAAAATAATTATACATACACACAAAAGATCTAAAATGTCGGTGTCAACAAAATTAAATATCAGTGTATCAATTTCATAGAAATGTTGATAGAAATATCGATAAGTATTGATAGGTATTTCTTAAAGGATTTCAAAAATAAAAATTTTAAATTATATTTAAATCAATAAATAGATATTGTGCACTTCTTAAATGAGATAGAACACTTGTTATAATAATATATTCATGTTAGGTTGCAATTTGTTGCTAGTTTTT
TAATATTTATTAATTGATTTTGTATGTTATAAAATGAAAATGTCTATTCACCTCTCATATTAATTAATGTCAAACTCATGAACATGTTGAAATGTCAATAAAAATGTCAATATTTGGATTGAAATTTAGTAAGATATGCATACACATGGGTTATACTTATTAAATATTCTTTACTTCACTGTATACACAGATTGCAACCGTCCAGGCTCAGTCTGCCAAGACCCAAAATTTGTTGGAGGAGATGGAATCACTTTCTACTTCCATGGCAAAAAAGACCAAGATTTCTGCATTGTCACTGACTCGAACCTCCACATCAACGCCCACTTCATCGGCCGACGAAATGTCAACATGAAGAGGGACTTCACTTGGGTTCAATCTCTCGGCATCCTCTTTGACTCCCACAAGCTCTTTATAGGTGCTCGAAAAACTGCGACATGGAATGATGCTATCGACCGTCTCTCCGTCTCCCTTGACGACGAAACCATCCTCCTCCCTACTGAGGAGGGTGCCACCTGGAGTAATTCAACCTCGTACAAGGGAATCACCATAACCAGGAGTAGAAACACGAACGCAATCGAGATCGAAGTCTCTAGGAACTTCAAGATCAAAGCCGCCGTGGTCCCGATAACGGAAAAGGAATCAAGGGTCCATAACTATGGGATCACACAAGAGGATTGCTTTGCACATTTGGACTTGAGCTTCAAGTTCTTTGCATTGAGTGGGGATGTGAATGGGGTTTTGGGGCAAACTTATGGTATCAACTATGTGAGCAAGGCCAAGATGGGAGTGGCAATGCCTGTTATTGGTGGTGTCAATGAGTTTGCTTCTTCAAATCTTTTTGCCACGGATTGCCAAGTGGCACGTTTTAGTGGGCAGTTGGATGGAAAAGATGACAGTTCTTTGGATGCTGAAGCCTATGCCAATATGAACTGTGGCAGTGACATGGAAGGTGGAGTTGTTTGCAAAAGATAA

mRNA sequence

ATGGCAAGAATCGCCATATTCCTCTTCTTGTTCTTTTTTCTCTTCTCTTCAGCCTTGGTTGAGGGAGCTCCAAATGCCAAGAAAGTTAAATGCAATGACGATAATTATCCTCAATGTTACAAATCAGATCATTATTGTCCTTCTGATTGCCCTCAAACTTGTGTTGTAGATTGTTCATCTTGCCAGCCTGTTTGCACTCCGCCACCGCCTCCTCCGTCACGCAAACTCAAATCTCCGCCACCACCGTACATTTACTCTTCGCCCCCGCCCCCGCCCCCACGTACTTACTCTTCTCCCCCACCACCTCCTTACATTTACTCTTCGCCCCCACCGCCACCTCCCTATATTTACTCTTCTCCCCCACCACCTCCCAAAAGTTACGCTTCTCCTCCACCACCTCCGCCAGCTACGACAGAGCCTTCACCTCCGACTCCTCCAGCATCTCCACCACCGTCGTCCGAGGCGTCGGGGCAAAAGAAAGCTAGGTGTAAGAATAAGAGCTATCCACATTGCTACGGTATGGAGCTAAGTTGTCCGAGTGCTTGTCCTGACCAATGCGAGGTAGACTGTGTTACTTGCAGCCCCGTTTGCAATTGCAACCGTCCAGGCTCAGTCTGCCAAGACCCAAAATTTGTTGGAGGAGATGGAATCACTTTCTACTTCCATGGCAAAAAAGACCAAGATTTCTGCATTGTCACTGACTCGAACCTCCACATCAACGCCCACTTCATCGGCCGACGAAATGTCAACATGAAGAGGGACTTCACTTGGGTTCAATCTCTCGGCATCCTCTTTGACTCCCACAAGCTCTTTATAGGTGCTCGAAAAACTGCGACATGGAATGATGCTATCGACCGTCTCTCCGTCTCCCTTGACGACGAAACCATCCTCCTCCCTACTGAGGAGGGTGCCACCTGGAGTAATTCAACCTCGTACAAGGGAATCACCATAACCAGGAGTAGAAACACGAACGCAATCGAGATCGAAGTCTCTAGGAACTTCAAGATCAAAGCCGCCGTGGTCCCGATAACGGAAAAGGAATCAAGGGTCCATAACTATGGGATCACACAAGAGGATTGCTTTGCACATTTGGACTTGAGCTTCAAGTTCTTTGCATTGAGTGGGGATGTGAATGGGGTTTTGGGGCAAACTTATGGTATCAACTATGTGAGCAAGGCCAAGATGGGAGTGGCAATGCCTGTTATTGGTGGTGTCAATGAGTTTGCTTCTTCAAATCTTTTTGCCACGGATTGCCAAGTGGCACGTTTTAGTGGGCAGTTGGATGGAAAAGATGACAGTTCTTTGGATGCTGAAGCCTATGCCAATATGAACTGTGGCAGTGACATGGAAGGTGGAGTTGTTTGCAAAAGATAA

Coding sequence (CDS)

ATGGCAAGAATCGCCATATTCCTCTTCTTGTTCTTTTTTCTCTTCTCTTCAGCCTTGGTTGAGGGAGCTCCAAATGCCAAGAAAGTTAAATGCAATGACGATAATTATCCTCAATGTTACAAATCAGATCATTATTGTCCTTCTGATTGCCCTCAAACTTGTGTTGTAGATTGTTCATCTTGCCAGCCTGTTTGCACTCCGCCACCGCCTCCTCCGTCACGCAAACTCAAATCTCCGCCACCACCGTACATTTACTCTTCGCCCCCGCCCCCGCCCCCACGTACTTACTCTTCTCCCCCACCACCTCCTTACATTTACTCTTCGCCCCCACCGCCACCTCCCTATATTTACTCTTCTCCCCCACCACCTCCCAAAAGTTACGCTTCTCCTCCACCACCTCCGCCAGCTACGACAGAGCCTTCACCTCCGACTCCTCCAGCATCTCCACCACCGTCGTCCGAGGCGTCGGGGCAAAAGAAAGCTAGGTGTAAGAATAAGAGCTATCCACATTGCTACGGTATGGAGCTAAGTTGTCCGAGTGCTTGTCCTGACCAATGCGAGGTAGACTGTGTTACTTGCAGCCCCGTTTGCAATTGCAACCGTCCAGGCTCAGTCTGCCAAGACCCAAAATTTGTTGGAGGAGATGGAATCACTTTCTACTTCCATGGCAAAAAAGACCAAGATTTCTGCATTGTCACTGACTCGAACCTCCACATCAACGCCCACTTCATCGGCCGACGAAATGTCAACATGAAGAGGGACTTCACTTGGGTTCAATCTCTCGGCATCCTCTTTGACTCCCACAAGCTCTTTATAGGTGCTCGAAAAACTGCGACATGGAATGATGCTATCGACCGTCTCTCCGTCTCCCTTGACGACGAAACCATCCTCCTCCCTACTGAGGAGGGTGCCACCTGGAGTAATTCAACCTCGTACAAGGGAATCACCATAACCAGGAGTAGAAACACGAACGCAATCGAGATCGAAGTCTCTAGGAACTTCAAGATCAAAGCCGCCGTGGTCCCGATAACGGAAAAGGAATCAAGGGTCCATAACTATGGGATCACACAAGAGGATTGCTTTGCACATTTGGACTTGAGCTTCAAGTTCTTTGCATTGAGTGGGGATGTGAATGGGGTTTTGGGGCAAACTTATGGTATCAACTATGTGAGCAAGGCCAAGATGGGAGTGGCAATGCCTGTTATTGGTGGTGTCAATGAGTTTGCTTCTTCAAATCTTTTTGCCACGGATTGCCAAGTGGCACGTTTTAGTGGGCAGTTGGATGGAAAAGATGACAGTTCTTTGGATGCTGAAGCCTATGCCAATATGAACTGTGGCAGTGACATGGAAGGTGGAGTTGTTTGCAAAAGATAA

Protein sequence

MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSSCQPVCTPPPPPPSRKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPPPPPYIYSSPPPPPKSYASPPPPPPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYPHCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILLPTEEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQEDCFAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDCQVARFSGQLDGKDDSSLDAEAYANMNCGSDMEGGVVCKR
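The protein sequence above corresponds to the listed CDS read codon by codon. As a hedged illustration, the sketch below translates a short stretch of that CDS with the standard genetic code; the `translate` function and the table construction are illustrative assumptions, not the pipeline that produced this record.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 15 nucleotides of the CDS listed in this record.
print(translate("ATGGCAAGAATCGCC"))  # MARIA
# Last 9 nucleotides of the CDS, ending in the TAA stop codon.
print(translate("AAAAGATAA"))  # KR
```

The two fragments reproduce the first five residues (MARIA) and the last two residues (KR) of the protein sequence shown above.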
Homology
BLAST of HG10018139 vs. NCBI nr
Match: XP_038894866.1 (uncharacterized protein LOC120083266 [Benincasa hispida])

HSP 1 Score: 797.0 bits (2057), Expect = 8.7e-227
Identity = 407/459 (88.67%), Positives = 425/459 (92.59%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           MARIAIFLFL F LF SA+VEGAPNAKKVKCNDDNYPQCYKSDHYCP+DCPQTCVVDCSS
Sbjct: 1   MARIAIFLFL-FSLFFSAVVEGAPNAKKVKCNDDNYPQCYKSDHYCPADCPQTCVVDCSS 60

Query: 61  CQPVCTPPPPPPSRKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPPPPPYIYSS- 120
           CQPVCTPPPPPPSRKLKSPPPPYIYSSP          PPPPPYIYSSPPPPPPYIYSS 
Sbjct: 61  CQPVCTPPPPPPSRKLKSPPPPYIYSSP----------PPPPPYIYSSPPPPPPYIYSSP 120

Query: 121 PPPPPKSYASPPPPPPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYPHCYGMELSCP 180
           PPPPPKSYAS PPPPPATT    PTPP SPPPSSE SGQKKARCKN+ YPHCYGMELSCP
Sbjct: 121 PPPPPKSYASSPPPPPATT----PTPPTSPPPSSETSGQKKARCKNRGYPHCYGMELSCP 180

Query: 181 SACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHI 240
           SACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHG+KDQ+FCIVTDSNLHI
Sbjct: 181 SACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGQKDQEFCIVTDSNLHI 240

Query: 241 NAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILLP 300
           NAHFIGRRN++MKRDFTWVQSLGILFDSHKLFIGA+KTATWNDAIDRLS+SLDDETILLP
Sbjct: 241 NAHFIGRRNIDMKRDFTWVQSLGILFDSHKLFIGAQKTATWNDAIDRLSISLDDETILLP 300

Query: 301 TEEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQED 360
           T+EGATWSNST YKG+ ITRSRNTNA+EI+V  NFKIKA VVPITEK+SRVHNYGITQED
Sbjct: 301 TQEGATWSNSTFYKGMVITRSRNTNAVEIKVPGNFKIKAVVVPITEKDSRVHNYGITQED 360

Query: 361 CFAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDCQ 420
           CFAHLDLSFKF+ALSGDVNGVLGQTYG NYVSK KMGVAMPV GG NEFASSNLFATDCQ
Sbjct: 361 CFAHLDLSFKFYALSGDVNGVLGQTYGSNYVSKVKMGVAMPVFGGANEFASSNLFATDCQ 420

Query: 421 VARFSGQLDGKDDSSLDAEAYANMNCGSDME-GGVVCKR 458
           VARFSGQL GKDDSSL+AE +ANM+CGSDME GGVVCKR
Sbjct: 421 VARFSGQLAGKDDSSLEAEVFANMSCGSDMEGGGVVCKR 444
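Each HSP summary line in these reports gives identity and positive counts as fractions of the alignment length, followed by the percentage. The sketch below shows one way to read such a line and recompute the percentages; `parse_hsp_stats` is a hypothetical helper, and the regular expression also tolerates the label spelled "Postives".

```python
import re

def parse_hsp_stats(line):
    """Return {'Identity': (matches, length, percent), 'Positives': (...)}.

    Hypothetical helper for lines such as
    'Identity = 407/459 (88.67%), Positives = 425/459 (92.59%), Query Frame = 0'.
    """
    stats = {}
    for label, count, length in re.findall(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)", line):
        key = "Identity" if label == "Identity" else "Positives"
        count, length = int(count), int(length)
        stats[key] = (count, length, round(100.0 * count / length, 2))
    return stats

line = "Identity = 407/459 (88.67%), Positives = 425/459 (92.59%), Query Frame = 0"
print(parse_hsp_stats(line))
```

For the Benincasa hispida hit above, this recomputes 88.67% identity and 92.59% positives over an alignment length of 459.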

BLAST of HG10018139 vs. NCBI nr
Match: XP_011648458.1 (uncharacterized protein LOC101207483 [Cucumis sativus] >KGN64912.1 hypothetical protein Csa_022857 [Cucumis sativus])

HSP 1 Score: 753.8 bits (1945), Expect = 8.5e-214
Identity = 388/460 (84.35%), Positives = 416/460 (90.43%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           M RIA+   LF FLF S  VEGAP AKKVKCNDDNYPQCYKSDHYCP+DCPQTCVVDCSS
Sbjct: 1   MGRIAL---LFLFLFFSVAVEGAPQAKKVKCNDDNYPQCYKSDHYCPADCPQTCVVDCSS 60

Query: 61  CQPVCTPPPPPPSRKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPPPPPYIYSSP 120
           C+PVC PPPPPP RKLKSPPPPYIYSSPPPPP    S PPPPP +YSS PPPPPYIYSSP
Sbjct: 61  CKPVCNPPPPPP-RKLKSPPPPYIYSSPPPPPYIYSSPPPPPPRVYSS-PPPPPYIYSSP 120

Query: 121 PPPPKSY-ASPPPPPPATTEPSP-PTPPASPPPSSEASGQKKARCKNKSYPHCYGMELSC 180
           PPPP    ASPPPPPP+T+ P+P P+ P SPPPSSE SGQKKARCKN+ YPHCYGMELSC
Sbjct: 121 PPPPPYINASPPPPPPSTSPPTPTPSTPTSPPPSSEGSGQKKARCKNRGYPHCYGMELSC 180

Query: 181 PSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLH 240
           PS+CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLH
Sbjct: 181 PSSCPDHCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLH 240

Query: 241 INAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILL 300
           INAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGA+KTATWNDA DRLSVSLD+ETI+L
Sbjct: 241 INAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGAQKTATWNDATDRLSVSLDNETIIL 300

Query: 301 PTEEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQE 360
           P +EGATWSNSTS KGITITR++NTNA+EI+V  NFKIKA VVPITE +SR+HNYGITQE
Sbjct: 301 PNQEGATWSNSTSNKGITITRTQNTNAVEIDVPGNFKIKAVVVPITEMDSRIHNYGITQE 360

Query: 361 DCFAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDC 420
           DCFAHLDLSFKF+ALSGDVNGVLGQTY  NYVSK KMGVAMPV GG+NEFASSN+FAT+C
Sbjct: 361 DCFAHLDLSFKFYALSGDVNGVLGQTYSSNYVSKVKMGVAMPVFGGLNEFASSNIFATNC 420

Query: 421 QVARFSGQLDGKDDSSLDAEAYAN-MNCGSDMEGGVVCKR 458
           +VARFSG+LD KDDSSL+AE YAN M CGSD+EGGVVCKR
Sbjct: 421 RVARFSGELDEKDDSSLEAEVYANMMRCGSDIEGGVVCKR 455

BLAST of HG10018139 vs. NCBI nr
Match: XP_008461654.1 (PREDICTED: uncharacterized protein LOC103500203 [Cucumis melo] >KAA0038827.1 TGF-beta-activated kinase 1 and MAP3K7-binding protein 3-like [Cucumis melo var. makuwa] >TYK25769.1 TGF-beta-activated kinase 1 and MAP3K7-binding protein 3-like [Cucumis melo var. makuwa])

HSP 1 Score: 738.0 bits (1904), Expect = 4.8e-209
Identity = 381/458 (83.19%), Positives = 402/458 (87.77%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           MARIAI   LF FLF SA VEG PN KKVKC+DDNYPQCYKSD YCP+DCP+TCVVDCSS
Sbjct: 1   MARIAI---LFLFLFFSAAVEGLPNTKKVKCHDDNYPQCYKSDLYCPADCPETCVVDCSS 60

Query: 61  CQPVCTPPPPPPSRKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPPPPPYIYSSP 120
           CQ VC PPPPP  R+LKSPPPPYI           YSSPPPPPYIYSSPPPPPPYIY+SP
Sbjct: 61  CQAVCNPPPPP--RRLKSPPPPYI-----------YSSPPPPPYIYSSPPPPPPYIYASP 120

Query: 121 PPPPKSYASPPPPPPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYPHCYGMELSCPS 180
           PPPP +   P PP P    PS PTPP SPPPSSEASGQKKARCKN+SYPHCYGMELSCPS
Sbjct: 121 PPPPPATLPPSPPTPT---PSTPTPPTSPPPSSEASGQKKARCKNRSYPHCYGMELSCPS 180

Query: 181 ACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN 240
           +CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN
Sbjct: 181 SCPDHCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN 240

Query: 241 AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILLPT 300
           AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGA+KTATWNDAIDRLSVSLDDETILL  
Sbjct: 241 AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGAQKTATWNDAIDRLSVSLDDETILLSN 300

Query: 301 EEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQEDC 360
           +EGATW NSTS K ITITR++NTNA+EIEV  NFKIKA VVPITE +SR+HNYGITQEDC
Sbjct: 301 QEGATWRNSTSNKEITITRTQNTNAVEIEVPGNFKIKAVVVPITEMDSRIHNYGITQEDC 360

Query: 361 FAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDCQV 420
           FAHLDLSFKF+ALSGDVNGVLGQTY  NYVSK KMG AMPV GGVNEFASSN+F+TDCQV
Sbjct: 361 FAHLDLSFKFYALSGDVNGVLGQTYSSNYVSKVKMGAAMPVFGGVNEFASSNIFSTDCQV 420

Query: 421 ARFSGQLDGKDDSSLDAEAYAN-MNCGSDMEGGVVCKR 458
           ARFSG+ DGKDDSSL+AE YA+ M CGSD EGGVVCKR
Sbjct: 421 ARFSGESDGKDDSSLEAEVYASMMRCGSDTEGGVVCKR 439

BLAST of HG10018139 vs. NCBI nr
Match: XP_004139573.2 (uncharacterized protein LOC101207232 [Cucumis sativus] >KGN64910.1 hypothetical protein Csa_022730 [Cucumis sativus])

HSP 1 Score: 718.8 bits (1854), Expect = 3.0e-203
Identity = 374/469 (79.74%), Positives = 411/469 (87.63%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           MARIAIFLF F FLF SA+VEGAP AKKVKC D  +PQCYKS+HYCP+DC +TCVVDCSS
Sbjct: 1   MARIAIFLF-FLFLFLSAVVEGAPKAKKVKCKDKKFPQCYKSEHYCPADCLRTCVVDCSS 60

Query: 61  CQPVCTPPPPPP---------SRKLKSPPPPYIYSSPPPPPPRTYSS-PPPPPYIYSSPP 120
           CQPVCTPPPPPP          RKLKSPPPPYIYSSPPPPPPR YSS PPPPPYIYSS P
Sbjct: 61  CQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSSPPPPPPRIYSSPPPPPPYIYSS-P 120

Query: 121 PPPPYIYSSPPPPPKSYASPPPP-PPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYP 180
           PPPP+IYSSPPPPP +   P PP PPA T PS   PP SPPPSSEASGQKK RCKN+ YP
Sbjct: 121 PPPPHIYSSPPPPPPTTVEPSPPLPPAPTPPSSSPPPLSPPPSSEASGQKKVRCKNRGYP 180

Query: 181 HCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDF 240
           HCYGMELSCPS CP QCEVDCVTCSPVCNCNRPG+VCQDPKF+GGDGITFYFHGK+D+DF
Sbjct: 181 HCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKRDKDF 240

Query: 241 CIVTDSNLHINAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSV 300
           CIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TW+DA DRL +
Sbjct: 241 CIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHQLFISARKTSTWDDANDRLYI 300

Query: 301 SLDDETILLPTEEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESR 360
           SLDDETI+LP +EGATWSNSTSY+GI ITRSR TNA+EIEV  NFKIKA VVPITEKES 
Sbjct: 301 SLDDETIILPNQEGATWSNSTSYEGIAITRSRKTNAVEIEVPGNFKIKAVVVPITEKESM 360

Query: 361 VHNYGITQEDCFAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFA 420
           +H YGITQEDCFAHLDLSFKF+ALSG+VNGVLGQTYG NYVS+AKMGVAMPV+GG  EFA
Sbjct: 361 IHKYGITQEDCFAHLDLSFKFYALSGNVNGVLGQTYGKNYVSRAKMGVAMPVLGGDKEFA 420

Query: 421 SSNLFATDCQVARFSGQLDGKDDSSLDAEAYANMNCGSDMEG-GVVCKR 458
           SS++FATDC+V RF+ ++D K +S ++A AYANM+CGSDM+G GVVCKR
Sbjct: 421 SSSIFATDCEVTRFTKEMDEK-ESYVEAAAYANMSCGSDMDGQGVVCKR 466

BLAST of HG10018139 vs. NCBI nr
Match: XP_008461680.1 (PREDICTED: uncharacterized protein LOC103500222 [Cucumis melo])

HSP 1 Score: 708.0 bits (1826), Expect = 5.3e-200
Identity = 371/468 (79.27%), Positives = 404/468 (86.32%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           MARIAIFLF FFFLF SA+VEG P AKKVKC D  +PQCYKS HYCP DC +TCVVDCSS
Sbjct: 1   MARIAIFLF-FFFLFLSAVVEGVPKAKKVKCKDKKFPQCYKSQHYCPDDCLRTCVVDCSS 60

Query: 61  CQPVCT-PPPPPPS--------RKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPP 120
           CQPVCT PPPPPPS        RKL+SPPPPYIYSSPPPPPPR          +YSSPPP
Sbjct: 61  CQPVCTAPPPPPPSPPPPPPKPRKLRSPPPPYIYSSPPPPPPR----------VYSSPPP 120

Query: 121 PPPYIYSSPPPPPKSYASPPPP-PPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYPH 180
           PPPYIYSSPPPPP +   P PP PP  T PS P PP SPPPSSEASGQKK RCKN+ YPH
Sbjct: 121 PPPYIYSSPPPPPPATVEPSPPLPPTPTPPSSP-PPLSPPPSSEASGQKKVRCKNRGYPH 180

Query: 181 CYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFC 240
           CYGMELSCPS CP QCEVDCVTCSPVCNCNRPG+VCQDPKF+GGDGITFYFHGKKD+DFC
Sbjct: 181 CYGMELSCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDRDFC 240

Query: 241 IVTDSNLHINAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVS 300
           IVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILF SHKLFI ARKT+TW+DA DRL +S
Sbjct: 241 IVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFGSHKLFISARKTSTWDDANDRLYIS 300

Query: 301 LDDETILLPTEEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRV 360
           LDDETILLP +EGATWSNSTSY+GI I+RSR TNA+EIEV  NFKIKA VVPITEKES +
Sbjct: 301 LDDETILLPNQEGATWSNSTSYEGIAISRSRKTNAVEIEVPGNFKIKAVVVPITEKESMI 360

Query: 361 HNYGITQEDCFAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFAS 420
           H YGITQEDCFAHLDLSFKF+ALSG+V+GVLGQTYG NYVS+AKMGVAMPV+GG  EFAS
Sbjct: 361 HKYGITQEDCFAHLDLSFKFYALSGNVSGVLGQTYGNNYVSRAKMGVAMPVLGGDKEFAS 420

Query: 421 SNLFATDCQVARFSGQLDGKDDSSLDAEAYANMNCGSDMEG-GVVCKR 458
           S++FATDC+VARFS +LDGK +SS++A AYANM+CG+DMEG GVVCKR
Sbjct: 421 SSIFATDCEVARFSRELDGK-ESSVEAAAYANMSCGNDMEGQGVVCKR 455

BLAST of HG10018139 vs. ExPASy Swiss-Prot
Match: O65375 (Leucine-rich repeat extensin-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=LRX1 PE=1 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 8.7e-12
Identity = 62/93 (66.67%), Positives = 68/93 (73.12%), Query Frame = 0

Query: 67  PPPPPPSRKLKSPPPPYIYSSP--------PPPPPRTYSSPPPPPYIYSSPPPPPPYIYS 126
           PPPPPPS    SPPPPY+YSSP        PPPPP  YSSPPPPPY+YSS  PPPPY+YS
Sbjct: 457 PPPPPPS---PSPPPPYVYSSPPPPYVYSSPPPPPYVYSSPPPPPYVYSS--PPPPYVYS 516

Query: 127 SPPPPPKSYASPPPPPPATTEPSPPTPPASPPP 152
           S PPPP  Y+SPPPPPP+   P PP P +SPPP
Sbjct: 517 S-PPPPYVYSSPPPPPPS---PPPPCPESSPPP 540

BLAST of HG10018139 vs. ExPASy Swiss-Prot
Match: Q9T0K5 (Leucine-rich repeat extensin-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=LRX3 PE=1 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 2.9e-07
Identity = 63/118 (53.39%), Positives = 67/118 (56.78%), Query Frame = 0

Query: 48  SDCPQTCVVDCSSCQPVCTPPPPPPSRKLK-SPPPPYIYSSPPPPPPRTYSSPPPPPYIY 107
           S  P T V       P   PPPPPP  +    PPPP ++ S PPPPP  YSSPPPPP  Y
Sbjct: 599 SSPPPTPVYSPPPPPPCIEPPPPPPCIEYSPPPPPPVVHYSSPPPPPVYYSSPPPPPVYY 658

Query: 108 SSPPPPPPYIYSSPPPP----------PKSYASPPPPPPATTEPSPPTPPA---SPPP 152
           SSPPPPPP  YSSPPPP          P  Y+SPPPPP A  E SPP  P    SPPP
Sbjct: 659 SSPPPPPPVHYSSPPPPEVHYHSPPPSPVHYSSPPPPPSAPCEESPPPAPVVHHSPPP 716

BLAST of HG10018139 vs. ExPASy Swiss-Prot
Match: Q9M1G9 (Extensin-2 OS=Arabidopsis thaliana OX=3702 GN=EXT2 PE=2 SV=1)

HSP 1 Score: 47.8 bits (112), Expect = 3.9e-04
Identity = 66/152 (43.42%), Positives = 76/152 (50.00%), Query Frame = 0

Query: 64  VCTPPPPP---PSRKL--KSPPPPYIYSSPPP----PPPRTYSSPPPPPYIYSSPP---- 123
           V + PPPP   PS K+  KSPPPPY+YSSPPP    P P+ Y   PPPPY+YSSPP    
Sbjct: 401 VYSSPPPPTYSPSPKVYYKSPPPPYVYSSPPPPYYSPSPKVYYKSPPPPYVYSSPPPPYY 460

Query: 124 ----------PPPPYIYSSPPPP-----PKSYASPPPPPPATTEPSPP-----------T 173
                     PPPPY+YSSPPPP     PK Y   PPPP   + P PP           +
Sbjct: 461 SPSPKVYYKSPPPPYVYSSPPPPYYSPSPKVYYKSPPPPYVYSSPPPPYYSPSPKVYYKS 520

BLAST of HG10018139 vs. ExPASy TrEMBL
Match: A0A0A0LVQ3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G154060 PE=4 SV=1)

HSP 1 Score: 753.8 bits (1945), Expect = 4.1e-214
Identity = 388/460 (84.35%), Positives = 416/460 (90.43%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           M RIA+   LF FLF S  VEGAP AKKVKCNDDNYPQCYKSDHYCP+DCPQTCVVDCSS
Sbjct: 1   MGRIAL---LFLFLFFSVAVEGAPQAKKVKCNDDNYPQCYKSDHYCPADCPQTCVVDCSS 60

Query: 61  CQPVCTPPPPPPSRKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPPPPPYIYSSP 120
           C+PVC PPPPPP RKLKSPPPPYIYSSPPPPP    S PPPPP +YSS PPPPPYIYSSP
Sbjct: 61  CKPVCNPPPPPP-RKLKSPPPPYIYSSPPPPPYIYSSPPPPPPRVYSS-PPPPPYIYSSP 120

Query: 121 PPPPKSY-ASPPPPPPATTEPSP-PTPPASPPPSSEASGQKKARCKNKSYPHCYGMELSC 180
           PPPP    ASPPPPPP+T+ P+P P+ P SPPPSSE SGQKKARCKN+ YPHCYGMELSC
Sbjct: 121 PPPPPYINASPPPPPPSTSPPTPTPSTPTSPPPSSEGSGQKKARCKNRGYPHCYGMELSC 180

Query: 181 PSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLH 240
           PS+CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLH
Sbjct: 181 PSSCPDHCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLH 240

Query: 241 INAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILL 300
           INAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGA+KTATWNDA DRLSVSLD+ETI+L
Sbjct: 241 INAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGAQKTATWNDATDRLSVSLDNETIIL 300

Query: 301 PTEEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQE 360
           P +EGATWSNSTS KGITITR++NTNA+EI+V  NFKIKA VVPITE +SR+HNYGITQE
Sbjct: 301 PNQEGATWSNSTSNKGITITRTQNTNAVEIDVPGNFKIKAVVVPITEMDSRIHNYGITQE 360

Query: 361 DCFAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDC 420
           DCFAHLDLSFKF+ALSGDVNGVLGQTY  NYVSK KMGVAMPV GG+NEFASSN+FAT+C
Sbjct: 361 DCFAHLDLSFKFYALSGDVNGVLGQTYSSNYVSKVKMGVAMPVFGGLNEFASSNIFATNC 420

Query: 421 QVARFSGQLDGKDDSSLDAEAYAN-MNCGSDMEGGVVCKR 458
           +VARFSG+LD KDDSSL+AE YAN M CGSD+EGGVVCKR
Sbjct: 421 RVARFSGELDEKDDSSLEAEVYANMMRCGSDIEGGVVCKR 455

BLAST of HG10018139 vs. ExPASy TrEMBL
Match: A0A5D3DR05 (TGF-beta-activated kinase 1 and MAP3K7-binding protein 3-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold862G00760 PE=4 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 2.3e-209
Identity = 381/458 (83.19%), Positives = 402/458 (87.77%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           MARIAI   LF FLF SA VEG PN KKVKC+DDNYPQCYKSD YCP+DCP+TCVVDCSS
Sbjct: 1   MARIAI---LFLFLFFSAAVEGLPNTKKVKCHDDNYPQCYKSDLYCPADCPETCVVDCSS 60

Query: 61  CQPVCTPPPPPPSRKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPPPPPYIYSSP 120
           CQ VC PPPPP  R+LKSPPPPYI           YSSPPPPPYIYSSPPPPPPYIY+SP
Sbjct: 61  CQAVCNPPPPP--RRLKSPPPPYI-----------YSSPPPPPYIYSSPPPPPPYIYASP 120

Query: 121 PPPPKSYASPPPPPPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYPHCYGMELSCPS 180
           PPPP +   P PP P    PS PTPP SPPPSSEASGQKKARCKN+SYPHCYGMELSCPS
Sbjct: 121 PPPPPATLPPSPPTPT---PSTPTPPTSPPPSSEASGQKKARCKNRSYPHCYGMELSCPS 180

Query: 181 ACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN 240
           +CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN
Sbjct: 181 SCPDHCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN 240

Query: 241 AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILLPT 300
           AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGA+KTATWNDAIDRLSVSLDDETILL  
Sbjct: 241 AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGAQKTATWNDAIDRLSVSLDDETILLSN 300

Query: 301 EEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQEDC 360
           +EGATW NSTS K ITITR++NTNA+EIEV  NFKIKA VVPITE +SR+HNYGITQEDC
Sbjct: 301 QEGATWRNSTSNKEITITRTQNTNAVEIEVPGNFKIKAVVVPITEMDSRIHNYGITQEDC 360

Query: 361 FAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDCQV 420
           FAHLDLSFKF+ALSGDVNGVLGQTY  NYVSK KMG AMPV GGVNEFASSN+F+TDCQV
Sbjct: 361 FAHLDLSFKFYALSGDVNGVLGQTYSSNYVSKVKMGAAMPVFGGVNEFASSNIFSTDCQV 420

Query: 421 ARFSGQLDGKDDSSLDAEAYAN-MNCGSDMEGGVVCKR 458
           ARFSG+ DGKDDSSL+AE YA+ M CGSD EGGVVCKR
Sbjct: 421 ARFSGESDGKDDSSLEAEVYASMMRCGSDTEGGVVCKR 439

BLAST of HG10018139 vs. ExPASy TrEMBL
Match: A0A1S3CF01 (uncharacterized protein LOC103500203 OS=Cucumis melo OX=3656 GN=LOC103500203 PE=4 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 2.3e-209
Identity = 381/458 (83.19%), Positives = 402/458 (87.77%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           MARIAI   LF FLF SA VEG PN KKVKC+DDNYPQCYKSD YCP+DCP+TCVVDCSS
Sbjct: 1   MARIAI---LFLFLFFSAAVEGLPNTKKVKCHDDNYPQCYKSDLYCPADCPETCVVDCSS 60

Query: 61  CQPVCTPPPPPPSRKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPPPPPYIYSSP 120
           CQ VC PPPPP  R+LKSPPPPYI           YSSPPPPPYIYSSPPPPPPYIY+SP
Sbjct: 61  CQAVCNPPPPP--RRLKSPPPPYI-----------YSSPPPPPYIYSSPPPPPPYIYASP 120

Query: 121 PPPPKSYASPPPPPPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYPHCYGMELSCPS 180
           PPPP +   P PP P    PS PTPP SPPPSSEASGQKKARCKN+SYPHCYGMELSCPS
Sbjct: 121 PPPPPATLPPSPPTPT---PSTPTPPTSPPPSSEASGQKKARCKNRSYPHCYGMELSCPS 180

Query: 181 ACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN 240
           +CPD CEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN
Sbjct: 181 SCPDHCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHIN 240

Query: 241 AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILLPT 300
           AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGA+KTATWNDAIDRLSVSLDDETILL  
Sbjct: 241 AHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGAQKTATWNDAIDRLSVSLDDETILLSN 300

Query: 301 EEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQEDC 360
           +EGATW NSTS K ITITR++NTNA+EIEV  NFKIKA VVPITE +SR+HNYGITQEDC
Sbjct: 301 QEGATWRNSTSNKEITITRTQNTNAVEIEVPGNFKIKAVVVPITEMDSRIHNYGITQEDC 360

Query: 361 FAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDCQV 420
           FAHLDLSFKF+ALSGDVNGVLGQTY  NYVSK KMG AMPV GGVNEFASSN+F+TDCQV
Sbjct: 361 FAHLDLSFKFYALSGDVNGVLGQTYSSNYVSKVKMGAAMPVFGGVNEFASSNIFSTDCQV 420

Query: 421 ARFSGQLDGKDDSSLDAEAYAN-MNCGSDMEGGVVCKR 458
           ARFSG+ DGKDDSSL+AE YA+ M CGSD EGGVVCKR
Sbjct: 421 ARFSGESDGKDDSSLEAEVYASMMRCGSDTEGGVVCKR 439

BLAST of HG10018139 vs. ExPASy TrEMBL
Match: A0A0A0LSM1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G153550 PE=4 SV=1)

HSP 1 Score: 718.8 bits (1854), Expect = 1.5e-203
Identity = 374/469 (79.74%), Positives = 411/469 (87.63%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           MARIAIFLF F FLF SA+VEGAP AKKVKC D  +PQCYKS+HYCP+DC +TCVVDCSS
Sbjct: 1   MARIAIFLF-FLFLFLSAVVEGAPKAKKVKCKDKKFPQCYKSEHYCPADCLRTCVVDCSS 60

Query: 61  CQPVCTPPPPPP---------SRKLKSPPPPYIYSSPPPPPPRTYSS-PPPPPYIYSSPP 120
           CQPVCTPPPPPP          RKLKSPPPPYIYSSPPPPPPR YSS PPPPPYIYSS P
Sbjct: 61  CQPVCTPPPPPPPSPPPPPPKPRKLKSPPPPYIYSSPPPPPPRIYSSPPPPPPYIYSS-P 120

Query: 121 PPPPYIYSSPPPPPKSYASPPPP-PPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYP 180
           PPPP+IYSSPPPPP +   P PP PPA T PS   PP SPPPSSEASGQKK RCKN+ YP
Sbjct: 121 PPPPHIYSSPPPPPPTTVEPSPPLPPAPTPPSSSPPPLSPPPSSEASGQKKVRCKNRGYP 180

Query: 181 HCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDF 240
           HCYGMELSCPS CP QCEVDCVTCSPVCNCNRPG+VCQDPKF+GGDGITFYFHGK+D+DF
Sbjct: 181 HCYGMELSCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKRDKDF 240

Query: 241 CIVTDSNLHINAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSV 300
           CIVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILFDSH+LFI ARKT+TW+DA DRL +
Sbjct: 241 CIVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFDSHQLFISARKTSTWDDANDRLYI 300

Query: 301 SLDDETILLPTEEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESR 360
           SLDDETI+LP +EGATWSNSTSY+GI ITRSR TNA+EIEV  NFKIKA VVPITEKES 
Sbjct: 301 SLDDETIILPNQEGATWSNSTSYEGIAITRSRKTNAVEIEVPGNFKIKAVVVPITEKESM 360

Query: 361 VHNYGITQEDCFAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFA 420
           +H YGITQEDCFAHLDLSFKF+ALSG+VNGVLGQTYG NYVS+AKMGVAMPV+GG  EFA
Sbjct: 361 IHKYGITQEDCFAHLDLSFKFYALSGNVNGVLGQTYGKNYVSRAKMGVAMPVLGGDKEFA 420

Query: 421 SSNLFATDCQVARFSGQLDGKDDSSLDAEAYANMNCGSDMEG-GVVCKR 458
           SS++FATDC+V RF+ ++D K +S ++A AYANM+CGSDM+G GVVCKR
Sbjct: 421 SSSIFATDCEVTRFTKEMDEK-ESYVEAAAYANMSCGSDMDGQGVVCKR 466

BLAST of HG10018139 vs. ExPASy TrEMBL
Match: A0A1S3CF51 (uncharacterized protein LOC103500222 OS=Cucumis melo OX=3656 GN=LOC103500222 PE=4 SV=1)

HSP 1 Score: 708.0 bits (1826), Expect = 2.6e-200
Identity = 371/468 (79.27%), Positives = 404/468 (86.32%), Query Frame = 0

Query: 1   MARIAIFLFLFFFLFSSALVEGAPNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSS 60
           MARIAIFLF FFFLF SA+VEG P AKKVKC D  +PQCYKS HYCP DC +TCVVDCSS
Sbjct: 1   MARIAIFLF-FFFLFLSAVVEGVPKAKKVKCKDKKFPQCYKSQHYCPDDCLRTCVVDCSS 60

Query: 61  CQPVCT-PPPPPPS--------RKLKSPPPPYIYSSPPPPPPRTYSSPPPPPYIYSSPPP 120
           CQPVCT PPPPPPS        RKL+SPPPPYIYSSPPPPPPR          +YSSPPP
Sbjct: 61  CQPVCTAPPPPPPSPPPPPPKPRKLRSPPPPYIYSSPPPPPPR----------VYSSPPP 120

Query: 121 PPPYIYSSPPPPPKSYASPPPP-PPATTEPSPPTPPASPPPSSEASGQKKARCKNKSYPH 180
           PPPYIYSSPPPPP +   P PP PP  T PS P PP SPPPSSEASGQKK RCKN+ YPH
Sbjct: 121 PPPYIYSSPPPPPPATVEPSPPLPPTPTPPSSP-PPLSPPPSSEASGQKKVRCKNRGYPH 180

Query: 181 CYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVCQDPKFVGGDGITFYFHGKKDQDFC 240
           CYGMELSCPS CP QCEVDCVTCSPVCNCNRPG+VCQDPKF+GGDGITFYFHGKKD+DFC
Sbjct: 181 CYGMELSCPSDCPSQCEVDCVTCSPVCNCNRPGAVCQDPKFIGGDGITFYFHGKKDRDFC 240

Query: 241 IVTDSNLHINAHFIGRRNVNMKRDFTWVQSLGILFDSHKLFIGARKTATWNDAIDRLSVS 300
           IVTDSNLHINAHFIGRRNV+MKRDFTWVQSLGILF SHKLFI ARKT+TW+DA DRL +S
Sbjct: 241 IVTDSNLHINAHFIGRRNVDMKRDFTWVQSLGILFGSHKLFISARKTSTWDDANDRLYIS 300

Query: 301 LDDETILLPTEEGATWSNSTSYKGITITRSRNTNAIEIEVSRNFKIKAAVVPITEKESRV 360
           LDDETILLP +EGATWSNSTSY+GI I+RSR TNA+EIEV  NFKIKA VVPITEKES +
Sbjct: 301 LDDETILLPNQEGATWSNSTSYEGIAISRSRKTNAVEIEVPGNFKIKAVVVPITEKESMI 360

Query: 361 HNYGITQEDCFAHLDLSFKFFALSGDVNGVLGQTYGINYVSKAKMGVAMPVIGGVNEFAS 420
           H YGITQEDCFAHLDLSFKF+ALSG+V+GVLGQTYG NYVS+AKMGVAMPV+GG  EFAS
Sbjct: 361 HKYGITQEDCFAHLDLSFKFYALSGNVSGVLGQTYGNNYVSRAKMGVAMPVLGGDKEFAS 420

Query: 421 SNLFATDCQVARFSGQLDGKDDSSLDAEAYANMNCGSDMEG-GVVCKR 458
           S++FATDC+VARFS +LDGK +SS++A AYANM+CG+DMEG GVVCKR
Sbjct: 421 SSIFATDCEVARFSRELDGK-ESSVEAAAYANMSCGNDMEGQGVVCKR 455

BLAST of HG10018139 vs. TAIR 10
Match: AT3G19430.1 (late embryogenesis abundant protein-related / LEA protein-related )

HSP 1 Score: 378.3 bits (970), Expect = 9.0e-105
Identity = 236/554 (42.60%), Positives = 304/554 (54.87%), Query Frame = 0

Query: 23  APNAKKVKCNDDNYPQCYKSDHYCPSDCPQTCVVDCSSCQPVCTPP-------------- 82
           A N     C    Y  CY  +H CP  CP +C V+C+SC+P+C PP              
Sbjct: 9   AKNPSHATCKIKKYKHCYNLEHVCPKFCPDSCHVECASCKPICGPPSPGDDGGGDDSGGD 68

Query: 83  --------------PPPPSRKLKSPPPPYIYSSP--------------PPPPPRTYSSPP 142
                         PPPP+  + SP PP     P              PPPP  T S P 
Sbjct: 69  DGGYTPPAPVPPVSPPPPTPSVPSPTPPVSPPPPTPTPSVPSPTPPVSPPPPTPTPSVPS 128

Query: 143 PPPYIYSSPPPPPPYIYS-----SPPPP--------------------PKSYASPPPPPP 202
           P P +   PP P P + S     SPPPP                    P    SPPPP P
Sbjct: 129 PTPPVSPPPPTPTPSVPSPTPPVSPPPPTPTPSVPSPTPPVPTDPMPSPPPPVSPPPPTP 188

Query: 203 ATTEPSPP-----------------------------------------------TPPAS 262
             + PSPP                                               +PP  
Sbjct: 189 TPSVPSPPDVTPTPPTPSVPSPPDVTPTPPTPSVPSPPDVTPTPPTPPSVPTPSGSPPYV 248

Query: 263 PPPS--SEASGQKKARCKNKSYPHCYGMELSCPSACPDQCEVDCVTCSPVCNCNRPGSVC 322
           PPPS   EA+G K+ RCK +  P CYG+E +CP+ CP  C+VDCVTC PVCNC++PGSVC
Sbjct: 249 PPPSDEEEAAGAKRVRCKKQRSP-CYGVEYTCPADCPRSCQVDCVTCKPVCNCDKPGSVC 308

Query: 323 QDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVNMKRDFTWVQSLGILFD 382
           QDP+F+GGDG+TFYFHGKKD +FC+++D NLHINAHFIG+R   M RDFTWVQS+ ILF 
Sbjct: 309 QDPRFIGGDGLTFYFHGKKDSNFCLISDPNLHINAHFIGKRRAGMARDFTWVQSIAILFG 368

Query: 383 SHKLFIGARKTATWNDAIDRLSVSLDDETILLPTEEGATWSNSTS-YKGITITR-SRNTN 442
           +H+L++GA KTATW+D++DR++VS D   I LP  +GA W++S   Y  +++ R + +TN
Sbjct: 369 THRLYVGALKTATWDDSVDRIAVSFDGNVISLPQLDGARWTSSPGVYPEVSVKRVNTDTN 428

Query: 443 AIEIEVSRNFKIKAAVVPITEKESRVHNYGITQEDCFAHLDLSFKFFALSGDVNGVLGQT 458
            +E+EV    KI A VVPIT ++SR+H Y + ++DC AHLDL FKF  LS +V+GVLGQT
Sbjct: 429 NLEVEVEGLLKITARVVPITMEDSRIHGYDVKEDDCLAHLDLGFKFQDLSDNVDGVLGQT 488

BLAST of HG10018139 vs. TAIR 10
Match: AT5G54370.1 (Late embryogenesis abundant (LEA) protein-related )

HSP 1 Score: 255.8 bits (652), Expect = 6.8e-68
Identity = 130/313 (41.53%), Positives = 189/313 (60.38%), Query Frame = 0

Query: 163 CKNKSYPHCYGMELSCPSACPDQ---------CEVDC--VTCSPVC-----NCNRPGSVC 222
           C N  Y  CY   + CP  CP +         C  DC   TC   C     NCNRPGS C
Sbjct: 29  CSN-PYTRCYRKYIRCPEECPSKTAMNSKNKVCYADCDRPTCKSQCRMRKPNCNRPGSAC 88

Query: 223 QDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVNMKRDFTWVQSLGILFD 282
            DP+F+GGDGI FYFHGK +++F +V+DS+L IN  FIG R     RDFTW+Q+LG LF+
Sbjct: 89  YDPRFIGGDGIVFYFHGKSNEEFSLVSDSDLQINGRFIGHRPAGRARDFTWIQALGFLFN 148

Query: 283 SHKLFIGARKTATWNDAIDRLSVSLDDETILLPTEEGATWSNSTSYKGITITRSRNTNAI 342
           S+K  + A KTA+W++ ID L  S D + + +P E  +TW +    K I I R    N++
Sbjct: 149 SNKFSLEAAKTASWDNEIDHLKFSYDGQDLSVPEETLSTWYSPN--KDIKIERVSMRNSV 208

Query: 343 EIEVSRNFKIKAAVVPITEKESRVHNYGITQEDCFAHLDLSFKFFALSGDVNGVLGQTYG 402
            + +    +I   VVP+T+++ R+H+Y +  +DCFAHL++ F+FF LS  V+G+LG+TY 
Sbjct: 209 IVTIKDKAEIMINVVPVTKEDDRIHSYKVPSDDCFAHLEVQFRFFNLSPKVDGILGRTYR 268

Query: 403 INYVSKAKMGVAMPVIGGVNEFASSNLFATDCQVARFSGQLDGKDDSSLDAEAYANMNC- 458
            ++ + AK GVAMPV+GG + F +S+L + DC+   FS +   + DS      YA ++C 
Sbjct: 269 PDFQNPAKPGVAMPVVGGEDSFKTSSLLSNDCKTCIFS-ESQAEIDSVKSEIEYATLDCT 328

BLAST of HG10018139 vs. TAIR 10
Match: AT5G60520.1 (Late embryogenesis abundant (LEA) protein-related )

HSP 1 Score: 246.1 bits (627), Expect = 5.4e-65
Identity = 125/287 (43.55%), Positives = 173/287 (60.28%), Query Frame = 0

Query: 156 SGQKKARCKNKSYPHCYGMELSCPSACPDQ----------CEVDCVT-CSPVC-----NC 215
           SGQ++ +C  +    C    L+CP  CP++          C +DC + C   C     NC
Sbjct: 46  SGQERVQCLARG--SCNQKILTCPKECPERKPKMNKKKKACFIDCSSKCEVTCKWRKANC 105

Query: 216 NRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVNMKRDFTWVQ 275
           N  GS+C DP+FVGGDG+ FYFHG KD +F IV+D NL INAHFIG R     RDFTWVQ
Sbjct: 106 NGYGSLCYDPRFVGGDGVMFYFHGNKDGNFAIVSDENLQINAHFIGTRPAGRTRDFTWVQ 165

Query: 276 SLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILLPTEEGATWSNSTSYKGITITR 335
           +  ++FDSH L I A+K A+W+D++D L V  + E + +PTE  A W      + + + R
Sbjct: 166 AFSVMFDSHNLVIAAKKVASWDDSVDSLVVRWNGEEVEVPTEGEAEWRIDLDEREVIVER 225

Query: 336 SRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQEDCFAHLDLSFKFFALSGDVNG 395
           +   N + + VS   +I   V PI ++E RVH Y + ++D FAHL+  FKFF LS  V G
Sbjct: 226 TDERNNVRVTVSGIVQIDIQVRPIGKEEDRVHKYQLPKDDAFAHLETQFKFFNLSDLVEG 285

Query: 396 VLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDCQVARFSGQ 427
           VLG+TY   YVS  K GV MP++GG +++ + +LF+  C V RF G+
Sbjct: 286 VLGKTYRPGYVSPVKTGVPMPMMGGEDKYQTPSLFSPLCNVCRFQGK 330

BLAST of HG10018139 vs. TAIR 10
Match: AT5G60530.1 (late embryogenesis abundant protein-related / LEA protein-related )

HSP 1 Score: 245.4 bits (625), Expect = 9.1e-65
Identity = 127/288 (44.10%), Positives = 175/288 (60.76%), Query Frame = 0

Query: 156 SGQKKARCKNKSYPHCYGMELSCPSACPDQ----------CEVDCVT-CSPVC-----NC 215
           +GQ++A C+ +    CY   L CP  CP +          C +DC   C   C     NC
Sbjct: 146 TGQEQAMCQGRG--SCYYKTLVCPGECPKRKPTKNKNTKGCFIDCTNKCEATCKWRKTNC 205

Query: 216 NRPGSVCQDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVNMKRDFTWVQ 275
           N  GS+C DP+FVGGDG+ FYFHG K  +F IV+D+NL INAHFIG R V   RDFTWVQ
Sbjct: 206 NGYGSLCYDPRFVGGDGVMFYFHGSKGGNFAIVSDNNLQINAHFIGTRPVGRTRDFTWVQ 265

Query: 276 SLGILFDSHKLFIGARKTATWNDAIDRLSVSLDDETILLPTEEGATWSN-STSYKGITIT 335
           +L ++F++HKL I A +   W++  D  ++  D E I LP +E + W   S   K I I 
Sbjct: 266 ALNVMFENHKLVITANRVNQWDETSDAFTIRYDGELITLPEDEQSEWREISGQKKDIIIE 325

Query: 336 RSRNTNAIEIEVSRNFKIKAAVVPITEKESRVHNYGITQEDCFAHLDLSFKFFALSGDVN 395
           R+   N++ + VS   ++   V PI ++E+RVHNY + Q+D FAHL+  FKF  LS  V 
Sbjct: 326 RTDERNSVRVLVSDLVQMDIRVRPIGKEENRVHNYQLPQDDAFAHLETQFKFLDLSELVE 385

Query: 396 GVLGQTYGINYVSKAKMGVAMPVIGGVNEFASSNLFATDCQVARFSGQ 427
           GVLG+TY  +YVS AK GV MPV+GG +++ + +LF+  C++ RF  Q
Sbjct: 386 GVLGKTYRPDYVSSAKTGVPMPVLGGEDKYQTPSLFSPTCRLCRFKPQ 431

BLAST of HG10018139 vs. TAIR 10
Match: AT4G27400.1 (Late embryogenesis abundant (LEA) protein-related )

HSP 1 Score: 237.7 bits (605), Expect = 1.9e-62
Identity = 121/313 (38.66%), Positives = 177/313 (56.55%), Query Frame = 0

Query: 163 CKNKSYPHCYGMELSCPSACPDQ---------CEVDCV--TCSPVC-----NCNRPGSVC 222
           C   + P C    + CP  CP +         C VDC    C  VC     NC   GS+C
Sbjct: 31  CGGNATPRCQLRYIDCPEECPTEMFPNSQNKICWVDCFKPLCEAVCRAVKPNCESYGSIC 90

Query: 223 QDPKFVGGDGITFYFHGKKDQDFCIVTDSNLHINAHFIGRRNVNMKRDFTWVQSLGILFD 282
            DP+F+GGDGI FYFHGK ++ F IV+D +  INA F G R     RDFTW+Q+LG LF+
Sbjct: 91  LDPRFIGGDGIVFYFHGKSNEHFSIVSDPDFQINARFTGHRPAGRTRDFTWIQALGFLFN 150

Query: 283 SHKLFIGARKTATWNDAIDRLSVSLDDETILLPTEEGATWSNSTSYKGITITRSRNTNAI 342
           SHK  +   K ATW+  +D L  ++D + +++P E  +TW +S   K I I R    N++
Sbjct: 151 SHKFSLETTKVATWDSNLDHLKFTIDGQDLIIPQETLSTWYSSD--KDIKIERLTEKNSV 210

Query: 343 EIEVSRNFKIKAAVVPITEKESRVHNYGITQEDCFAHLDLSFKFFALSGDVNGVLGQTYG 402
            + +    +I   VVP+T+++ R+HNY +  +DCFAH ++ FKF  LS  V+G+LG+TY 
Sbjct: 211 IVTIKDKAEIMVNVVPVTKEDDRIHNYKLPVDDCFAHFEVQFKFINLSPKVDGILGRTYR 270

Query: 403 INYVSKAKMGVAMPVIGGVNEFASSNLFATDCQVARFSGQLDGKDDSSLDAEAYANMNC- 458
            ++ + AK GV MPV+GG + F +S+L +  C+   FS        S      YA ++C 
Sbjct: 271 PDFKNPAKPGVVMPVVGGEDSFRTSSLLSHVCKTCLFSEDPAVASGSVKPKSTYALLDCS 330
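
Each HSP block above opens with a two-line summary (bit score, E-value, identity and positives counts). The following is a minimal Python sketch for pulling those numbers out of the text and recomputing the percentages from the counts. The function name `parse_hsp` and the returned field names are illustrative assumptions, not part of any BLAST tool; the pattern accepts both the "Postives" and "Positives" spellings of the label.

```python
import re

# Pattern for the two-line HSP summary as printed in this report.
# "Posi?tives" matches either spelling of the label.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(text):
    """Return score, E-value and recomputed percentages for the first HSP summary in text."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "alignment_length": length,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


# Summary lines copied from the AT5G54370.1 hit above.
example = (
    "HSP 1 Score: 255.8 bits (652), Expect = 6.8e-68\n"
    "Identity = 130/313 (41.53%), Postives = 189/313 (60.38%), Query Frame = 0"
)
hsp = parse_hsp(example)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages from the counts gives a quick consistency check against the values printed in the report.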

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_038894866.1	8.7e-227	88.67	uncharacterized protein LOC120083266 [Benincasa hispida]
XP_011648458.1	8.5e-214	84.35	uncharacterized protein LOC101207483 [Cucumis sativus] >KGN64912.1 hypothetical ...
XP_008461654.1	4.8e-209	83.19	PREDICTED: uncharacterized protein LOC103500203 [Cucumis melo] >KAA0038827.1 TGF...
XP_004139573.2	3.0e-203	79.74	uncharacterized protein LOC101207232 [Cucumis sativus] >KGN64910.1 hypothetical ...
XP_008461680.1	5.3e-200	79.27	PREDICTED: uncharacterized protein LOC103500222 [Cucumis melo]
Match Name	E-value	Identity (%)	Description
O65375	8.7e-12	66.67	Leucine-rich repeat extensin-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=L...
Q9T0K5	2.9e-07	53.39	Leucine-rich repeat extensin-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=L...
Q9M1G9	3.9e-04	43.42	Extensin-2 OS=Arabidopsis thaliana OX=3702 GN=EXT2 PE=2 SV=1
Match Name	E-value	Identity (%)	Description
A0A0A0LVQ3	4.1e-214	84.35	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G154060 PE=4 SV=1
A0A5D3DR05	2.3e-209	83.19	TGF-beta-activated kinase 1 and MAP3K7-binding protein 3-like OS=Cucumis melo va...
A0A1S3CF01	2.3e-209	83.19	uncharacterized protein LOC103500203 OS=Cucumis melo OX=3656 GN=LOC103500203 PE=...
A0A0A0LSM1	1.5e-203	79.74	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G153550 PE=4 SV=1
A0A1S3CF51	2.6e-200	79.27	uncharacterized protein LOC103500222 OS=Cucumis melo OX=3656 GN=LOC103500222 PE=...
Match Name	E-value	Identity (%)	Description
AT3G19430.1	9.0e-105	42.60	late embryogenesis abundant protein-related / LEA protein-related
AT5G54370.1	6.8e-68	41.53	Late embryogenesis abundant (LEA) protein-related
AT5G60520.1	5.4e-65	43.55	Late embryogenesis abundant (LEA) protein-related
AT5G60530.1	9.1e-65	44.10	late embryogenesis abundant protein-related / LEA protein-related
AT4G27400.1	1.9e-62	38.66	Late embryogenesis abundant (LEA) protein-related
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR009646	Root cap	PFAM	PF06830	Root_cap	coord: 367..423; e-value: 1.2E-26; score: 92.6
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 120..161
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 68..96
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 120..152
None	No IPR available	PANTHER	PTHR31656	ROOT CAP DOMAIN-CONTAINING PROTEIN	coord: 145..457
None	No IPR available	PANTHER	PTHR31656:SF52	ROOT CAP PERIPHERY GENE2	coord: 23..66
None	No IPR available	PANTHER	PTHR31656	ROOT CAP DOMAIN-CONTAINING PROTEIN	coord: 23..66
None	No IPR available	PANTHER	PTHR31656:SF52	ROOT CAP PERIPHERY GENE2	coord: 145..457
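
Several of the `coord:` entries above overlap (for example the mobidb-lite ranges 120..161 and 120..152). As a rough sketch, assuming the coordinates are 1-based inclusive residue positions on the protein, the ranges can be extracted and merged to see how many residues each source covers. The helper names `parse_coords` and `merge_ranges` are hypothetical and exist only for illustration.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(lines):
    """Extract (start, end) pairs from 'coord: start..end' entries."""
    return [(int(a), int(b)) for line in lines for a, b in COORD_RE.findall(line)]


def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# The three mobidb-lite rows from the table above.
mobidb = ["coord: 120..161", "coord: 68..96", "coord: 120..152"]
regions = merge_ranges(parse_coords(mobidb))
covered = sum(end - start + 1 for start, end in regions)
print(regions, covered)
```

If the coordinates turn out to be half-open rather than inclusive, the `+ 1` terms would need to be dropped.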

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10018139.1	HG10018139.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0007186	G protein-coupled receptor signaling pathway
biological_process	GO:0001505	regulation of neurotransmitter levels
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0004969	histamine receptor activity