Lsi05G002020 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi05G002020
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Late embryogenesis abundant hydroxyproline-rich glycoprotein family, putative
Location: chr05 : 2783765 .. 2787372 (-)
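
The Location field gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common GFF-style convention that this page does not state), the span of the feature could be computed as follows. The helper names `parse_location` and `feature_length` are hypothetical and only illustrate the arithmetic.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr05 : 2783765 .. 2787372 (-)'."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def feature_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("chr05 : 2783765 .. 2787372 (-)")
print(chrom, strand, feature_length(start, end))
```

Under that assumption the feature spans 3608 bp on the minus strand of chr05.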
The following sequences are available for this feature:

Gene sequence (with intron)

GCCATACACAGTGTCTCTCTCTACTCTCTACAAACAAAAAAAGAAGCAAAGGATTCACTGAAACAAGAAATCAAAGACAACCATGGAGGAAATGACGTCACGACCTCATGTCATGAACTCTCGCAACACGCAACGTCCTCTTCCACCGCCACCGTCAAGAACACCCGACAACAACAATCACCGTCCTCTTCCGCCTCCACCTTCAAGAGCTCCGCTCAATGTTCACAACACTCACCGTTCTCCTTCATTGCCACCGTCCAGGCCTAATTCCGACTCTCACAACACTCGCTATCCATCTCCGCCGTCGCCGCCTACCTCTCGTCGCCAACATTTTGGTTACGGCACGGCATCATCCTCGTCTTCCTCATCGGCTTCCTTCCGAGGCTGCTGCTGCTGCCTCTGCCTCCTCTTCTTCTTCATCGCTCTCCTCGCTCTCGCTATCGTCCTCGTCATTGTTCTCGCCGTCAAACCTAAAAAGCCTCAATTCGATCTCCAGCGAGTCGGCGTTCAATACATGGGGATAACCGCTCCAAATCTCTTCTCATTGTCTTCCTCTGACGCCGAGACCGCTGCGACAACGGCGACGTCCACCTCCGCATCGTTATCGCTTAACATTCGATTGCTGTTCACGGCGGTGAATCCTAACAAAGTCGGAATAAAGTACGGGAATTCGAGGTTCACAGTGATGTACCGAGGGATTCCGTTAGGGAAAGCGATAGTTCCTGGATTTTACCAAGAGGCACACAGTCAGAGAGAGGTGGAGGCGACGATCGCGGTGGATCGAGTGAATTTGCTTCAGGCGGACGCCGCCGATCTCATCAGAGACGCTTCGTTGAACGATCGAGTAGAACTGAGGGTTTTGGGGGAAGTTGGCGCCAGGATCCGCGTCTTGGATTTTGATTCGCCCGGCGTTCAGGTCAGTTCCCAATTTTTTTTTTTTTTTAATTAAATCAAATTATTTTAATTATTTAATTTTATTCGCTTTTGAATATTATGGAGCACGCACTTTACATTTCTAGATAATTGATTGATGGATTTTAGATTCTTTATAATGAATGAATATATTTTTGGATAAAAAAAAATGTGTGAAGGAATAAAAATTAGATGAAACTGACCCACAAAACAAAAGGTTTAGGGAAGTTCTAGTAAAAAAAAATTCACTTCATAAATTTATTACTTTAAATTATTATTATTATTATTATTATTTTGTGGAGTCCAGTTATTTTTGAAGTTATTGTTATTAATTTTTCACAAGTTATTTAAGAATAACTTTGATTGATTCGTTTTTTTCCCCCATTAGATGTGTGTGTAAATAAAATATTTGTCCAAGGTTTAATTCTAAAATGGATCTAAATTTCTGTTAAGATTATTAAGAAAAAAAATTTATATAATTCATAAGTCTAGATTCGAATTAGAATAGTTGAAAATATTTTTTTTAAAAAAAAAAAATTTAGAGATAAGAATAAAACTGAAAACTAATTTAAATCTATGAAAATCTTTCAGGTCTTCTCAACAAAAGTTGATTAGAAGACTAAATATGCATATACTCTCCAATAAAAGTTTAACCTCAAAAATAAATAATAAAAAGGATATTTTTGTAAGTGATTGTTTGAATTATTAAAATGGAAAGGGTTAAAAATTATTTGTGGTCATTGAACTTTAGAGTAATAATTTAATCCATATACTTTGGTTCGTAACATTTTAGTTATTACTTCAAATATTATAAAAATTTAATCTTTATCATGAATAATTTCATCAAAATAAATGTCAATTTTCAGCATGTGATAATATGATTTTTTTAGTTTATAAATATATGTGGTCCTTAATCAATTATATCAACGATAAATCTTATTAAATCTTATCAATATTTTCATATAAAATATTATAACATTAAAAGTATGATGACTAAATTGTTACCACTCTAAATACTATTTTGTAATATATATTATATATATATGTCCAAATAATTTCCTACTTTAATTAACATAACAGACCGACTG
GGAAAACACATAAGAATACAAAAATAAATACAAAAGTCAACGAAATAATTCAGAATCCCAAAAAAAAATGCAAGCACACGAAGAAGAAAAAAATTGTGATATTATATTTATTTGCTTCTTTTTTCCCTTTTTGGAGGATGATTTTCTTGTTTTTATAAATAAAAAATTGCAAAAATTAACAATCACTAAATAATCATATTAAAAAATTAATAATAAACTATAAAGGAAGGATGTAAAAGATCATAATTAAGGAAAATGAAATTAGACAGAAAAGTAGAAAAAGAAAATTGAGTATTATATGGGCAAAAGCCAAAGAGTAAAAGGTAAAAAAAAAAAATTAAGTGGGAAACCGTTGTTGAAGGTAACGGTGCCGCTTAATTGCTTTACTTTTTCGCGGCGATTGAGGGAGGCGCCAAATTGGTGGGCCATACCGACCGAACAAAACTACTTTAGTTTTCAACCCTTTCACATTTATAGTTTATTCAAAACTACTTTATCCTTTCATCCATCATGTGTTGTCCCTATTCATCTAGGTAATATAATCATTGTTAAAATATCTTTTTTTTAGTTCATATATTTTGGAATTTTTTTCAATTTTAGTCATGTAGTTACGAGCGTCCAATTTTAGTGTATATGTAATTCGTTAAATCTTAAATTTAGTCATTTAGGAGGCGTTTAGTTGTTGTGAAAAAAAGTGAATTGAGTTGAGTTTAGTTAGTAATTATTATAGGTAGAAAGTAATTGGGGGAGTTGACACAAATAACAAAGTGAGAAATGTAAGTTAATGCGAATATAAACCTCTTTAATTATTGGTGAACCAAACTGTGGGTGGGTTTTTTTTACCCACCCAACCCAATTCAACTTGGGTGTCTAGGGGTGTTTGGCCTACCGACTTCATAAGTCGGTGTTAACTAACTCAACTCCACTTACTTCAATAGTGTTCACTATTGAAATTTGCACATTTATTGAGAAATTTTATCTTTTTCTCTTTTGTACATTTATCGATGAACTTTATTGCTTGTAAGTTAAATAATTAAAATTGACATTTTCAAACAACAAACATCAAAGTTGAATAAAATGGTTTAAATAACATGTTAATCAATATATAACATGTTTTTCTCTTTTTGGGTTATAGATTTATTACTCTTGGAAGAATATTTCCAAAGATAAACAAATTATGTCATTATCTACTACTCTTGAATGTTAAATGCTTGATGCATATCATGTATTATCTCTAGCAATCACAATTCATACTTACATATCGTTCTTTCGACGATGCATGCTAATCCTAATTGAAAAGCGACTATATAAGAAATTCGATATGAGCTCTTACTAACGAGTAACGAGTTTGTAGCGAAATATTGTTATTTTAAGGACGAAGAAGAAACTGAAAATGATCGATGTATGACAGGTGTCGGTCGATTGCTCAATAGTGATAAGTCCAAGGAATCAATCTTTGACTTCCAAGCAATGTGGATTTGATGGGTTCAGTTTATGATTTTTTGCCCCCACTTTTTTTTTTTTATATTCTAACATAGTTTCTTCTCTTCCTCACTCTCTATCTCTGCTATTTTATACTCAAAAGAGATTGGAATAATGATGAGGGAG

mRNA sequence

GCCATACACAGTGTCTCTCTCTACTCTCTACAAACAAAAAAAGAAGCAAAGGATTCACTGAAACAAGAAATCAAAGACAACCATGGAGGAAATGACGTCACGACCTCATGTCATGAACTCTCGCAACACGCAACGTCCTCTTCCACCGCCACCGTCAAGAACACCCGACAACAACAATCACCGTCCTCTTCCGCCTCCACCTTCAAGAGCTCCGCTCAATGTTCACAACACTCACCGTTCTCCTTCATTGCCACCGTCCAGGCCTAATTCCGACTCTCACAACACTCGCTATCCATCTCCGCCGTCGCCGCCTACCTCTCGTCGCCAACATTTTGGTTACGGCACGGCATCATCCTCGTCTTCCTCATCGGCTTCCTTCCGAGGCTGCTGCTGCTGCCTCTGCCTCCTCTTCTTCTTCATCGCTCTCCTCGCTCTCGCTATCGTCCTCGTCATTGTTCTCGCCGTCAAACCTAAAAAGCCTCAATTCGATCTCCAGCGAGTCGGCGTTCAATACATGGGGATAACCGCTCCAAATCTCTTCTCATTGTCTTCCTCTGACGCCGAGACCGCTGCGACAACGGCGACGTCCACCTCCGCATCGTTATCGCTTAACATTCGATTGCTGTTCACGGCGGTGAATCCTAACAAAGTCGGAATAAAGTACGGGAATTCGAGGTTCACAGTGATGTACCGAGGGATTCCGTTAGGGAAAGCGATAGTTCCTGGATTTTACCAAGAGGCACACAGTCAGAGAGAGGTGGAGGCGACGATCGCGGTGGATCGAGTGAATTTGCTTCAGGCGGACGCCGCCGATCTCATCAGAGACGCTTCGTTGAACGATCGAGTAGAACTGAGGGTTTTGGGGGAAGTTGGCGCCAGGATCCGCGTCTTGGATTTTGATTCGCCCGGCGTTCAGGTGTCGGTCGATTGCTCAATAGTGATAAGTCCAAGGAATCAATCTTTGACTTCCAAGCAATGTGGATTTGATGGGTTCAGTTTATGATTTTTTGCCCCCACTTTTTTTTTTTTATATTCTAACATAGTTTCTTCTCTTCCTCACTCTCTATCTCTGCTATTTTATACTCAAAAGAGATTGGAATAATGATGAGGGAG

Coding sequence (CDS)

ATGGAGGAAATGACGTCACGACCTCATGTCATGAACTCTCGCAACACGCAACGTCCTCTTCCACCGCCACCGTCAAGAACACCCGACAACAACAATCACCGTCCTCTTCCGCCTCCACCTTCAAGAGCTCCGCTCAATGTTCACAACACTCACCGTTCTCCTTCATTGCCACCGTCCAGGCCTAATTCCGACTCTCACAACACTCGCTATCCATCTCCGCCGTCGCCGCCTACCTCTCGTCGCCAACATTTTGGTTACGGCACGGCATCATCCTCGTCTTCCTCATCGGCTTCCTTCCGAGGCTGCTGCTGCTGCCTCTGCCTCCTCTTCTTCTTCATCGCTCTCCTCGCTCTCGCTATCGTCCTCGTCATTGTTCTCGCCGTCAAACCTAAAAAGCCTCAATTCGATCTCCAGCGAGTCGGCGTTCAATACATGGGGATAACCGCTCCAAATCTCTTCTCATTGTCTTCCTCTGACGCCGAGACCGCTGCGACAACGGCGACGTCCACCTCCGCATCGTTATCGCTTAACATTCGATTGCTGTTCACGGCGGTGAATCCTAACAAAGTCGGAATAAAGTACGGGAATTCGAGGTTCACAGTGATGTACCGAGGGATTCCGTTAGGGAAAGCGATAGTTCCTGGATTTTACCAAGAGGCACACAGTCAGAGAGAGGTGGAGGCGACGATCGCGGTGGATCGAGTGAATTTGCTTCAGGCGGACGCCGCCGATCTCATCAGAGACGCTTCGTTGAACGATCGAGTAGAACTGAGGGTTTTGGGGGAAGTTGGCGCCAGGATCCGCGTCTTGGATTTTGATTCGCCCGGCGTTCAGGTGTCGGTCGATTGCTCAATAGTGATAAGTCCAAGGAATCAATCTTTGACTTCCAAGCAATGTGGATTTGATGGGTTCAGTTTATGA

Protein sequence

MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL
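
The protein sequence above corresponds to the coding sequence read codon by codon. The following is a small sketch of that translation, assuming the standard nuclear genetic code; the function name `translate` is hypothetical, and the two short inputs are the first and last codons copied from the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAGGAAATGACGTCACGA"))  # first seven codons of the CDS
print(translate("GGGTTCAGTTTATGA"))        # last five codons, including the stop
```

The two calls print `MEEMTSR` and `GFSL`, which match the start and end of the protein sequence listed above.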
BLAST of Lsi05G002020 vs. TrEMBL
Match: A0A0A0L9B0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G209460 PE=4 SV=1)

HSP 1 Score: 485.3 bits (1248), Expect = 5.3e-134
Identity = 268/306 (87.58%), Positives = 279/306 (91.18%), Query Frame = 1

Query: 1   MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSR 60
           MEEMTSRP  +N RNTQ PLPPPPSR PDNN+  PLPPPPSRAP N+    RSP  P + 
Sbjct: 1   MEEMTSRPQ-LNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTT 60

Query: 61  PNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFRGCCCCLCLLFFFIALLALAI 120
           PNS++ NTRYPSPPSPP+SRRQHFGYG A    SSS S RGCCCCLCLLF FIALLA+AI
Sbjct: 61  PNSNTRNTRYPSPPSPPSSRRQHFGYGAA----SSSPSLRGCCCCLCLLFSFIALLAVAI 120

Query: 121 VLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRL 180
           VLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSD ETAATT+T TSASLSLNIRL
Sbjct: 121 VLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTST-TSASLSLNIRL 180

Query: 181 LFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQA 240
           LFTAVNPNKVGIKYG+SRFTVMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQA
Sbjct: 181 LFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQA 240

Query: 241 DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG 300
           DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG
Sbjct: 241 DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG 300

Query: 301 FDGFSL 307
           FDGFSL
Sbjct: 301 FDGFSL 300
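
The percentages on the HSP summary line are the match counts divided by the alignment length. As a minimal sketch of that arithmetic (the helper name `percent` is hypothetical), using the counts reported for this hit:

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

identity = percent(268, 306)   # identical residues / alignment length
positives = percent(279, 306)  # identical or conservatively substituted residues
print(identity, positives)
```

This reproduces the 87.58% and 91.18% values shown in the summary line for the Cucumis sativus hit.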

BLAST of Lsi05G002020 vs. TrEMBL
Match: A0A061DZ99_THECC (Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 OS=Theobroma cacao GN=TCM_006430 PE=4 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 1.7e-84
Identity = 187/269 (69.52%), Positives = 210/269 (78.07%), Query Frame = 1

Query: 38  PPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSA 97
           PP + +P   H  H     P +  N + H+    +PP  P  +R H  Y     SSSSSA
Sbjct: 2   PPTNMSPN--HQPHAREMRPTA--NGEHHHRGLTAPP--PRPQRHHPYY---PRSSSSSA 61

Query: 98  SFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSS 157
           SF+GCCCCL LLF F+ALL LA+VL+IVLAVKPKKPQFDLQ+VGVQYMGI+  N    S+
Sbjct: 62  SFKGCCCCLFLLFSFLALLVLAVVLIIVLAVKPKKPQFDLQQVGVQYMGISTSNP---SA 121

Query: 158 SDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFY 217
            D   AA T T T+ASLSL I +LFTAVNPNKVGIKYG SRFTVMYRGIPLGKA VPGF+
Sbjct: 122 FDGAAAAVTTTPTTASLSLTIHMLFTAVNPNKVGIKYGESRFTVMYRGIPLGKAAVPGFF 181

Query: 218 QEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGV 277
           QEAHS R VEATIAVDR NL+QADAADLIRDASLNDRVELRVLG+VGA+IRVLDFDSPGV
Sbjct: 182 QEAHSTRNVEATIAVDRANLMQADAADLIRDASLNDRVELRVLGDVGAKIRVLDFDSPGV 241

Query: 278 QVSVDCSIVISPRNQSLTSKQCGFDGFSL 307
           QVS+DC+IVISPR QSLT KQCGFDG S+
Sbjct: 242 QVSIDCAIVISPRKQSLTYKQCGFDGLSV 258

BLAST of Lsi05G002020 vs. TrEMBL
Match: M5Y567_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010161mg PE=4 SV=1)

HSP 1 Score: 320.5 bits (820), Expect = 2.3e-84
Identity = 189/291 (64.95%), Positives = 216/291 (74.23%), Query Frame = 1

Query: 16  TQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPS 75
           T R  P P      N  HRP  PPP   P +            S P++ +H+  YP    
Sbjct: 2   TSRANPNPTPNGTANGEHRPRGPPPRPPPSS------------SNPHNSNHHPYYP---- 61

Query: 76  PPTSRRQHFGYGTASSSSSSSASFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQF 135
                       T SSSSSSSASF+GCCCCL LLF F+ALL LA+VLVI+LAVKPKKPQF
Sbjct: 62  ------------TTSSSSSSSASFKGCCCCLFLLFSFLALLVLAVVLVIILAVKPKKPQF 121

Query: 136 DLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYG 195
           DLQ+VGVQYMGI +PN    + + A TA      TSASLSL+IR+LF+AVNPNKVGI+YG
Sbjct: 122 DLQQVGVQYMGINSPNP---TPAAAATADPNQNPTSASLSLSIRMLFSAVNPNKVGIRYG 181

Query: 196 NSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRV 255
            SRFTVMYRGIPLGKA VPGF+Q+AH+ R+V ATI+VDRVNLLQADAADLIRDASLNDRV
Sbjct: 182 ESRFTVMYRGIPLGKASVPGFFQDAHTVRQVVATISVDRVNLLQADAADLIRDASLNDRV 241

Query: 256 ELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDGFSL 307
           ELRVLG+VGA+IRVL+FDSPGVQVSVDC+IVISPR QSLT KQCGFDG S+
Sbjct: 242 ELRVLGDVGAKIRVLNFDSPGVQVSVDCAIVISPRKQSLTYKQCGFDGLSV 261

BLAST of Lsi05G002020 vs. TrEMBL
Match: A0A067EH55_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g023930mg PE=4 SV=1)

HSP 1 Score: 318.2 bits (814), Expect = 1.1e-83
Identity = 183/262 (69.85%), Positives = 207/262 (79.01%), Query Frame = 1

Query: 54  PSLPP-SRPNSDSHNTRYPSPPSPPTSRRQ--------HFGYGTASSSSSSSASFRGCCC 113
           P +PP ++PN   H+ R P PP PP  + Q        H  Y   ++SSSSSASFRGCCC
Sbjct: 20  PKMPPQTQPNGTHHHQRRPHPPPPPPLQPQSQYHHHHDHHQY-YPTTSSSSSASFRGCCC 79

Query: 114 CLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAA 173
           CL LLF FIALL LA+VL++ LAVKPKKPQFDLQ+VGVQYMGI+ PN    SS D    +
Sbjct: 80  CLFLLFSFIALLILAVVLIVFLAVKPKKPQFDLQQVGVQYMGISTPN--PTSSVDP---S 139

Query: 174 TTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQR 233
           TT  +TSASLSL I LLFTA NPNKVGIKYG S+FTVMYRGIPLGKA VPGFYQ AHS R
Sbjct: 140 TTIAATSASLSLTIHLLFTAANPNKVGIKYGESKFTVMYRGIPLGKASVPGFYQGAHSVR 199

Query: 234 EVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCS 293
            VEATIAVDR NL+QADAA LI+DASLNDRVELRVLG+V A+IRV++FDSPGVQVSVDC+
Sbjct: 200 NVEATIAVDRANLMQADAASLIKDASLNDRVELRVLGDVSAKIRVMNFDSPGVQVSVDCA 259

Query: 294 IVISPRNQSLTSKQCGFDGFSL 307
           IVISPR QSLT KQCGFDG ++
Sbjct: 260 IVISPRKQSLTYKQCGFDGLTV 275

BLAST of Lsi05G002020 vs. TrEMBL
Match: F6H1R4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0055g00370 PE=4 SV=1)

HSP 1 Score: 317.0 bits (811), Expect = 2.5e-83
Identity = 175/248 (70.56%), Positives = 197/248 (79.44%), Query Frame = 1

Query: 59  SRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFRGCCCCLCLLFFFIALLAL 118
           SR N +  +   P P        QH  +  + S S SSASF+GCCCCL LLF F+ALL L
Sbjct: 3   SRANVNGEHNLRPPPNHHHHPHSQHHSHYQSPSYSPSSASFKGCCCCLFLLFSFLALLVL 62

Query: 119 AIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNI 178
           A+VL+IVLAVKPKKPQFDLQ+VGVQYMGITA       +  +  A +  T TSASLSLNI
Sbjct: 63  AVVLIIVLAVKPKKPQFDLQQVGVQYMGITA-------NPSSTVAGSPPTPTSASLSLNI 122

Query: 179 RLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLL 238
           ++LFTAVNPNKVGIKYG SRFTVMYRGIPLGK +VPGFYQ AHS R+VE T+AVDR NLL
Sbjct: 123 KMLFTAVNPNKVGIKYGESRFTVMYRGIPLGKGVVPGFYQPAHSVRQVETTVAVDRANLL 182

Query: 239 QADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQ 298
           QADAADLI+DASLNDRVELR+LGEVGA+IRVLDF SPGVQVSVDC+IVISPR QSLT KQ
Sbjct: 183 QADAADLIKDASLNDRVELRILGEVGAKIRVLDFTSPGVQVSVDCAIVISPRKQSLTYKQ 242

Query: 299 CGFDGFSL 307
           CGFDG S+
Sbjct: 243 CGFDGLSV 243

BLAST of Lsi05G002020 vs. TAIR10
Match: AT2G01080.1 (AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 278.9 bits (712), Expect = 3.8e-75
Identity = 156/243 (64.20%), Positives = 184/243 (75.72%), Query Frame = 1

Query: 73  PPSPPTSRRQHFGYGTA---------SSSSSSSASFRGCCCCLCLLFFFIALLALAIVLV 132
           PP P +SR    G   A         S SSSSSAS +GCCCCL LLF F+ALL LA+VL+
Sbjct: 2   PPPPSSSRAGLNGDPIAAQNQQPYYRSYSSSSSASLKGCCCCLFLLFAFLALLVLAVVLI 61

Query: 133 IVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRLLFT 192
           ++LAVKPKKPQFDLQ+V V YMGI+ P+                  T+ASLSL IR+LFT
Sbjct: 62  VILAVKPKKPQFDLQQVAVVYMGISNPS-------------AVLDPTTASLSLTIRMLFT 121

Query: 193 AVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQADAA 252
           AVNPNKVGI+YG S FTVMY+G+PLG+A VPGFYQ+AHS + VEATI+VDRVNL+QA AA
Sbjct: 122 AVNPNKVGIRYGESSFTVMYKGMPLGRATVPGFYQDAHSTKNVEATISVDRVNLMQAHAA 181

Query: 253 DLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCGFDG 307
           DL+RDASLNDRVEL V G+VGA+IRV++FDSPGVQVSV+C I ISPR Q+L  KQCGFDG
Sbjct: 182 DLVRDASLNDRVELTVRGDVGAKIRVMNFDSPGVQVSVNCGIGISPRKQALIYKQCGFDG 231

BLAST of Lsi05G002020 vs. TAIR10
Match: AT3G54200.1 (AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family)

HSP 1 Score: 51.6 bits (122), Expect = 9.9e-07
Identity = 59/246 (23.98%), Positives = 100/246 (40.65%), Query Frame = 1

Query: 56  LPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFRGCCCCLCLLFFFIAL 115
           LPP +PN+ S  T+  +  +    RR+                 R C  C+C     I L
Sbjct: 22  LPPPKPNASSMETQSANTGTAKKLRRK-----------------RNCKICICFTILLILL 81

Query: 116 LALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLS 175
           +A+ IV++     KPK+P   +  V V                D   A+         L+
Sbjct: 82  IAIVIVILAFTLFKPKRPTTTIDSVTV----------------DRLQASVNPLLLKVLLN 141

Query: 176 LNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRV 235
           L + +  +  NPN++G  Y +S   + YRG  +G+A +P     A     +  T+ +   
Sbjct: 142 LTLNVDLSLKNPNRIGFSYDSSSALLNYRGQVIGEAPLPANRIAARKTVPLNITLTLMAD 201

Query: 236 NLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLT 295
            LL      L+ D  +   + L    +V  ++ VL      VQ S  C + IS  ++++T
Sbjct: 202 RLL--SETQLLSDV-MAGVIPLNTFVKVTGKVTVLKIFKIKVQSSSSCDLSISVSDRNVT 231

Query: 296 SKQCGF 302
           S+ C +
Sbjct: 262 SQHCKY 231

BLAST of Lsi05G002020 vs. NCBI nr
Match: gi|659112197|ref|XP_008456109.1| (PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis melo])

HSP 1 Score: 491.9 bits (1265), Expect = 8.2e-136
Identity = 269/306 (87.91%), Positives = 283/306 (92.48%), Query Frame = 1

Query: 1   MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSR 60
           MEEMTSRP  +N R+TQ PLPPPPSR P NN+HRPLPPPPSRAP N+H+  RSP  P + 
Sbjct: 1   MEEMTSRPQ-LNPRSTQPPLPPPPSRRPHNNHHRPLPPPPSRAPFNLHSNPRSPPFPSTA 60

Query: 61  PNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFRGCCCCLCLLFFFIALLALAI 120
           PN ++ NTRYPSPPSPP+SRRQHFGYG A    SSS SFRGCCCCLCLLF FIALLA+AI
Sbjct: 61  PNPNTRNTRYPSPPSPPSSRRQHFGYGAA----SSSPSFRGCCCCLCLLFSFIALLAIAI 120

Query: 121 VLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRL 180
           +LVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATT+T TSASLSLNIRL
Sbjct: 121 ILVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTST-TSASLSLNIRL 180

Query: 181 LFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQA 240
           LFTAVNPNKVGIKYG+SRFTVMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQA
Sbjct: 181 LFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQA 240

Query: 241 DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG 300
           DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG
Sbjct: 241 DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG 300

Query: 301 FDGFSL 307
           FDGFSL
Sbjct: 301 FDGFSL 300

BLAST of Lsi05G002020 vs. NCBI nr
Match: gi|449446081|ref|XP_004140800.1| (PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis sativus])

HSP 1 Score: 485.3 bits (1248), Expect = 7.6e-134
Identity = 268/306 (87.58%), Positives = 279/306 (91.18%), Query Frame = 1

Query: 1   MEEMTSRPHVMNSRNTQRPLPPPPSRTPDNNNHRPLPPPPSRAPLNVHNTHRSPSLPPSR 60
           MEEMTSRP  +N RNTQ PLPPPPSR PDNN+  PLPPPPSRAP N+    RSP  P + 
Sbjct: 1   MEEMTSRPQ-LNPRNTQPPLPPPPSRRPDNNHRPPLPPPPSRAPFNLQTNPRSPPFPSTT 60

Query: 61  PNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSASFRGCCCCLCLLFFFIALLALAI 120
           PNS++ NTRYPSPPSPP+SRRQHFGYG A    SSS S RGCCCCLCLLF FIALLA+AI
Sbjct: 61  PNSNTRNTRYPSPPSPPSSRRQHFGYGAA----SSSPSLRGCCCCLCLLFSFIALLAVAI 120

Query: 121 VLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIRL 180
           VLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSD ETAATT+T TSASLSLNIRL
Sbjct: 121 VLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDTETAATTST-TSASLSLNIRL 180

Query: 181 LFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQA 240
           LFTAVNPNKVGIKYG+SRFTVMYRGIPLGKAIVPGFYQEAHS+REVEATIAVDRVNLLQA
Sbjct: 181 LFTAVNPNKVGIKYGDSRFTVMYRGIPLGKAIVPGFYQEAHSEREVEATIAVDRVNLLQA 240

Query: 241 DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG 300
           DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG
Sbjct: 241 DAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQCG 300

Query: 301 FDGFSL 307
           FDGFSL
Sbjct: 301 FDGFSL 300

BLAST of Lsi05G002020 vs. NCBI nr
Match: gi|645223834|ref|XP_008218823.1| (PREDICTED: uncharacterized protein LOC103319102 [Prunus mume])

HSP 1 Score: 324.3 bits (830), Expect = 2.3e-85
Identity = 183/259 (70.66%), Positives = 210/259 (81.08%), Query Frame = 1

Query: 53  SPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFG-----YGTASSSSSSSASFRGCCCCLC 112
           +P+  P+   +  H  R P P  PP+S   H       Y T SSSSS+SASF+GCCCCL 
Sbjct: 6   NPNPTPNGTANGEHRQRGPPPRPPPSSSNPHNSNHHPYYPTTSSSSSNSASFKGCCCCLF 65

Query: 113 LLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTA 172
           LLF F+ALL LA+VLVIVLAVKPKKPQFDLQ+VGVQYMGI +PN    + + A TA    
Sbjct: 66  LLFSFLALLVLAVVLVIVLAVKPKKPQFDLQQVGVQYMGINSPNP---TPAAAATADPNQ 125

Query: 173 TSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVE 232
             TSASLSL+IR+LF+AVNPNKVGI+YG SRFTVMYRGIPLGKA VPGF+Q+AH+ R+V 
Sbjct: 126 NPTSASLSLSIRMLFSAVNPNKVGIRYGESRFTVMYRGIPLGKASVPGFFQDAHTVRQVV 185

Query: 233 ATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVI 292
           ATI+VDRVNLLQADAADLIRDASLNDRVELRVLG+VGA+IRVL+FDSPGVQVSVDC+IVI
Sbjct: 186 ATISVDRVNLLQADAADLIRDASLNDRVELRVLGDVGAKIRVLNFDSPGVQVSVDCAIVI 245

Query: 293 SPRNQSLTSKQCGFDGFSL 307
           SPR QSLT KQCGFDG S+
Sbjct: 246 SPRKQSLTYKQCGFDGLSV 261

BLAST of Lsi05G002020 vs. NCBI nr
Match: gi|658058956|ref|XP_008365290.1| (PREDICTED: uncharacterized protein LOC103428932 [Malus domestica])

HSP 1 Score: 323.2 bits (827), Expect = 5.0e-85
Identity = 181/247 (73.28%), Positives = 200/247 (80.97%), Query Frame = 1

Query: 66  HNTRYPSPPS-----PPTSRRQHFGYGTASS-SSSSSASFRGCCCCLCLLFFFIALLALA 125
           H  R P PPS     P +S   H  Y T SS SSSSSASF+GCCCCL LLF F+ALL LA
Sbjct: 19  HRXRVPPPPSSSSSNPHSSSNHHRYYPTRSSYSSSSSASFKGCCCCLFLLFSFLALLVLA 78

Query: 126 IVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSSSDAETAATTATSTSASLSLNIR 185
           +VLVI+LAVKPKKPQFDLQ+VGVQYMGI +PN          TA      TSASLSLNIR
Sbjct: 79  VVLVIILAVKPKKPQFDLQQVGVQYMGINSPN-------PTATADPNQNPTSASLSLNIR 138

Query: 186 LLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFYQEAHSQREVEATIAVDRVNLLQ 245
           +LF+A NPNKV IKYG SRFTVMYRGIPLGKA +PGFYQ+AH+ R+V ATIAVDRVNLLQ
Sbjct: 139 MLFSAANPNKVXIKYGESRFTVMYRGIPLGKASIPGFYQDAHTVRQVVATIAVDRVNLLQ 198

Query: 246 ADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGVQVSVDCSIVISPRNQSLTSKQC 305
           ADAADL+RDASLNDRVELRVLG+VGA+IRVL+FDSPGVQVSVDC+IVISPR QSLT KQC
Sbjct: 199 ADAADLVRDASLNDRVELRVLGDVGAKIRVLNFDSPGVQVSVDCAIVISPRKQSLTYKQC 258

Query: 306 GFDGFSL 307
           GFDG S+
Sbjct: 259 GFDGLSV 258

BLAST of Lsi05G002020 vs. NCBI nr
Match: gi|590683364|ref|XP_007041580.1| (Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 [Theobroma cacao])

HSP 1 Score: 320.9 bits (821), Expect = 2.5e-84
Identity = 187/269 (69.52%), Positives = 210/269 (78.07%), Query Frame = 1

Query: 38  PPPSRAPLNVHNTHRSPSLPPSRPNSDSHNTRYPSPPSPPTSRRQHFGYGTASSSSSSSA 97
           PP + +P   H  H     P +  N + H+    +PP  P  +R H  Y     SSSSSA
Sbjct: 2   PPTNMSPN--HQPHAREMRPTA--NGEHHHRGLTAPP--PRPQRHHPYY---PRSSSSSA 61

Query: 98  SFRGCCCCLCLLFFFIALLALAIVLVIVLAVKPKKPQFDLQRVGVQYMGITAPNLFSLSS 157
           SF+GCCCCL LLF F+ALL LA+VL+IVLAVKPKKPQFDLQ+VGVQYMGI+  N    S+
Sbjct: 62  SFKGCCCCLFLLFSFLALLVLAVVLIIVLAVKPKKPQFDLQQVGVQYMGISTSNP---SA 121

Query: 158 SDAETAATTATSTSASLSLNIRLLFTAVNPNKVGIKYGNSRFTVMYRGIPLGKAIVPGFY 217
            D   AA T T T+ASLSL I +LFTAVNPNKVGIKYG SRFTVMYRGIPLGKA VPGF+
Sbjct: 122 FDGAAAAVTTTPTTASLSLTIHMLFTAVNPNKVGIKYGESRFTVMYRGIPLGKAAVPGFF 181

Query: 218 QEAHSQREVEATIAVDRVNLLQADAADLIRDASLNDRVELRVLGEVGARIRVLDFDSPGV 277
           QEAHS R VEATIAVDR NL+QADAADLIRDASLNDRVELRVLG+VGA+IRVLDFDSPGV
Sbjct: 182 QEAHSTRNVEATIAVDRANLMQADAADLIRDASLNDRVELRVLGDVGAKIRVLDFDSPGV 241

Query: 278 QVSVDCSIVISPRNQSLTSKQCGFDGFSL 307
           QVS+DC+IVISPR QSLT KQCGFDG S+
Sbjct: 242 QVSIDCAIVISPRKQSLTYKQCGFDGLSV 258

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A0A0L9B0_CUCSA | 5.3e-134 | 87.58 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G209460 PE=4 SV=1
A0A061DZ99_THECC | 1.7e-84 | 69.52 | Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 OS...
M5Y567_PRUPE | 2.3e-84 | 64.95 | Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010161mg PE=4 SV=1
A0A067EH55_CITSI | 1.1e-83 | 69.85 | Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g023930mg PE=4 SV=1
F6H1R4_VITVI | 2.5e-83 | 70.56 | Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0055g00370 PE=4 SV=...

Match Name | E-value | Identity (%) | Description
AT2G01080.1 | 3.8e-75 | 64.20 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...
AT3G54200.1 | 9.9e-07 | 23.98 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein f...

Match Name | E-value | Identity (%) | Description
gi|659112197|ref|XP_008456109.1| | 8.2e-136 | 87.91 | PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis melo]
gi|449446081|ref|XP_004140800.1| | 7.6e-134 | 87.58 | PREDICTED: proline-rich receptor-like protein kinase PERK10 [Cucumis sativus]
gi|645223834|ref|XP_008218823.1| | 2.3e-85 | 70.66 | PREDICTED: uncharacterized protein LOC103319102 [Prunus mume]
gi|658058956|ref|XP_008365290.1| | 5.0e-85 | 73.28 | PREDICTED: uncharacterized protein LOC103428932 [Malus domestica]
gi|590683364|ref|XP_007041580.1| | 2.5e-84 | 69.52 | Late embryogenesis abundant hydroxyproline-rich glycoprotein family isoform 1 [T...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR004864 | LEA_2
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
biological_process | GO:0016310 | phosphorylation
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0005886 | plasma membrane
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0003674 | molecular_function
molecular_function | GO:0016301 | kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lsi05G002020.1 | Lsi05G002020.1 | mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR004864 | Late embryogenesis abundant protein, LEA-14 | PFAM | PF03168 | LEA_2 | coord: 184..283, score: 7.8
None | No IPR available | PANTHER | PTHR31234 | FAMILY NOT NAMED | coord: 32..304, score: 3.6E
None | No IPR available | PANTHER | PTHR31234:SF8 | EXPRESSED PROTEIN | coord: 32..304, score: 3.6E
None | No IPR available | unknown | SSF117070 | LEA14-like | coord: 175..263, score: 7.1
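
The `coord` values in the Alignment column give the region of the protein covered by each domain match. As a rough sketch, assuming these are 1-based, inclusive residue positions on the protein sequence (the page does not say so explicitly), a region could be parsed and cut out as follows. The helper names `parse_coord` and `slice_region` are hypothetical, and the toy string stands in for a real protein sequence.

```python
import re

def parse_coord(text):
    """Extract (start, end) from a field such as 'coord: 184..283'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coord field in: {text!r}")
    return int(m.group(1)), int(m.group(2))

def slice_region(protein, start, end):
    # Assumption: start and end are 1-based and inclusive.
    return protein[start - 1:end]

start, end = parse_coord("coord: 184..283")
print(start, end, end - start + 1)       # span of the PF03168 match
print(slice_region("ABCDEFGHIJ", 3, 5))  # toy sequence for illustration
```

Under that assumption the PF03168 (LEA_2) match covers 100 residues, positions 184 to 283.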